Improving AI Through Data Quality episode artwork

EPISODE · Jul 30, 2025 · 26 MIN

Improving AI Through Data Quality

from The Reasoning Show · host Massive Studios

Elliot Shmukler (@eshmu, Co-Founder/CEO @anomalo_hq) talks about the impact of data quality on AI, how unstructured data can be improved, and how monitoring of data lakes can help prevent model drift and give organizations confidence with predictable results.SHOW: 945SHOW TRANSCRIPT: The Cloudcast #945 TranscriptSHOW VIDEO: https://youtube.com/@TheCloudcastNET CLOUD NEWS OF THE WEEK:  http://bit.ly/cloudcast-cnotwNEW TO CLOUD? CHECK OUT OUR OTHER PODCAST:  "CLOUDCAST BASICS"SPONSORS:[DoIT] Visit doit.com (that’s d-o-i-t.com) to unlock intent-aware FinOps at scale with DoiT Cloud Intelligence.[VASION] Vasion Print eliminates the need for print servers by enabling secure, cloud-based printing from any device, anywhere. Get a custom demo to see the difference for yourself.[FCTR] Try FCTR.io (that's F-C-T-R dot io) free for 60 days. Modern security demands modern solutions. Check out Fctr's Tako AI, the first AI agent for Okta, on their websiteSHOW NOTES:Anomalo websiteThe Cloudcast #598 - Data QualitySnowflake invests in AnomaloTopic 1 - Elliot, welcome back! It’s hard to believe it has been 3 years since we spoke! Give everyone a brief introduction.Topic 2 - Here’s the problem I see when it comes to AI adoption today. There isn’t an “off the shelf” AI model with an organization's data built in; that’s impossible. So, you must bring this data, often unstructured, to the model, often with mixed results. Do you agree?Topic 3 - I see data quality in two ways… the quality of the data before ingestion is one way, we want the data to be clean going in. But, we also need a way to detect, mitigate, and do a root cause analysis for quality checks along the way, correct? Give everyone an idea of what this life cycle looks like.Topic 4 - What are you seeing as the barriers to adoption? Is it the tools, the models, the need for RAG pipelines, the lack of data scientists, and AIOps?Topic 5 - We have this crossroads where proprietary data makes an organization unique, but exposing that unique data puts the organization at risk. How much of a factor does this play, and how do you advise organizations around this complex intersectionTopic 6 - There is always this concept of predictable results. This answer should be consistent and repeatable. We’ve seen things like model/data drift and hallucinations hinder this concept, leading to a lack of confidence in the results. How do you advise organizations to tackle this lifecycle management and predictability over time?FEEDBACK?Email: show at the cloudcast dot netBluesky: FEEDBACK?Email: show @ the enterprise ai show dot comeBluesky: @EntAIShow.bsky.socialTwitter/X: @TheEntAIShowInstagram: @TheEntAIShow

Elliot Shmukler (@eshmu, Co-Founder/CEO @anomalo_hq) talks about the impact of data quality on AI, how unstructured data can be improved, and how monitoring of data lakes can help prevent model drift and give organizations confidence with predictable results. SHOW: 945 SHOW TRANSCRIPT: The Cloudcast #945 Transcript SHOW VIDEO: https://youtube.com/@TheCloudcastNET CLOUD NEWS OF THE WEEK: http://bit.ly/cloudcast-cnotw NEW TO CLOUD? CHECK OUT OUR OTHER PODCAST: "CLOUDCAST BASIC...

NOW PLAYING

Improving AI Through Data Quality

0:00 26:27

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The Reasoning Show?

This episode is 26 minutes long.

When was this The Reasoning Show episode published?

This episode was published on July 30, 2025.

What is this episode about?

Elliot Shmukler (@eshmu, Co-Founder/CEO @anomalo_hq) talks about the impact of data quality on AI, how unstructured data can be improved, and how monitoring of data lakes can help prevent model drift and give organizations confidence with...

Can I download this The Reasoning Show episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!