EPISODE · Jun 10, 2022 · 52 MIN
Fixing Your ML Data Blind Spots // Yash Sheth // MLOps Coffee Sessions #102
from MLOps.community · host Demetrios
MLOps Coffee Sessions #102 with Yash Sheth, Fixing Your ML Data Blindspots, co-hosted by Adam Sroka. Join the Community: https://go.mlops.community/YTJoinInGet the newsletter: https://go.mlops.community/YTNewsletter// AbstractImproving your dataset quality is absolutely critical for effective ML. Finding errors in your datasets is generally a slow, iterative, and painstaking process. Data scientists should be proactively fixing their models’ blind spots by improving their training data. In this talk, Yash discusses how Galileo helps data scientists identify, fix, and track data across the entire ML workflow. // BioCo-founder and VP of Engineering. Prior to starting Galileo, Yash spent the last decade working on Automatic Speech Recognition (ASR) at Google, leading their core speech recognition platform team, which powers speech-to-text across 20+ products at Google in over 80 languages, along with thousands of businesses through their Cloud Speech API. // MLOps Jobs board jobs.mlops.communityMLOps Swag/Merchhttps://mlops-community.myshopify.com/// Related LinksWebsite: https://www.rungalileo.io/Trade-Off: Why Some Things Catch On, and Others book by Kevin Maney:https://www.amazon.com/Trade-Off-Some-Things-Catch-Others/dp/0385525958--------------- ✌️Connect With Us ✌️ -------------Join our Slack community: https://go.mlops.community/slackFollow us on Twitter: @mlopscommunitySign up for the next meetup: https://go.mlops.community/registerCatch all episodes, blogs, newsletters, and more: https://mlops.community/Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/Connect with Adam on LinkedIn: https://www.linkedin.com/in/aesroka/Connect with Yash on LinkedIn: https://www.linkedin.com/in/yash-sheth-72111216/Timestamps:[00:00] Introduction to Yash Sheth[02:53] Takeaways[04:35] Why unstructured data?[06:59] Fitting in the workflow[10:56] Digging into the different pains[18:23] Vision around the democratization of machine learning[24:31] Unstructured data problem[25:49] Galileo handling unified tools[27:21] Calculus for ML[28:45] Gatekeep[29:49] Synthetic data in the unstructured data world of Galileo[33:10] Tips for data scientists who have unstructured data but a small data set[35:00] Benefits of users from Galileo[37:15] Business case for dummies[42:36] War stories[44:49] Rapid-fire questions[50:55] Wrap up
NOW PLAYING
Fixing Your ML Data Blind Spots // Yash Sheth // MLOps Coffee Sessions #102
No transcript for this episode yet
Similar Episodes
Apr 21, 2026 ·13m
Apr 19, 2026 ·16m
Apr 17, 2026 ·13m
Apr 13, 2026 ·11m
Apr 11, 2026 ·16m