Lead Data Scientist experienced in designing, building, and shipping diverse AI/ML and data engineering solutions, including large-scale IoT streaming analytics and data pipelines, large-scale machine learning, ASR, recommendation systems, image semantics, text semantics, information extraction, information retrieval, ML- and heuristics-based image segmentation, and MLOps with MLflow and Kubeflow, all built for massive scalability and optimization.
Well-versed in designing cloud-native, horizontally scalable distributed system architectures using queuing, consumer groups, and similar patterns; detecting bottlenecks; optimizing infrastructure costs and compute; and creating high-throughput deployment patterns for serving machine learning and deep learning models in production.
Skills:
Past Achievements:
Won a few ML competitions as a solo participant:
-- AIM Identify the author Challenge by Machine Hack (rank 2)
-- ZS Young data scientist challenge 2018 by HackerEarth (rank 3)
-- World data science challenge by Bitgrit (rank 4)
Kaggle Competitions Expert
Competitive Programming:
-- Won 5 medals (2 silver and 3 bronze) on HackerRank
B.Tech Double Gold Medalist in Academics
Writer @ Medium (https://mayank-k-jha.medium.com/)
Speaker - PyData, Kaggle Days, ACM Student Chapter