Catalog Attribute Extraction with Multi-Modal LLMs [Instacart]
Episode 102 of the Snacks Weekly on Data Science podcast, hosted by Pan Wu and titled "Catalog Attribute Extraction with Multi-Modal LLMs [Instacart]", was published on September 8, 2025, and runs 10 minutes.
September 8, 2025 · 10m · Snacks Weekly on Data Science
Summary
In this episode, we explore how Instacart tackled the challenge of extracting accurate product attributes at scale. We discuss the solutions in sequence: starting with SQL rules, moving to text-based ML models, and finally arriving at Instacart’s multi-modal LLM platform, PARSE. By blending text and image data and enabling rapid configuration, PARSE demonstrates how modern AI tools can streamline data pipelines, reduce engineering overhead, and deliver better user experiences.
For more details, see Instacart's published tech blog post: https://tech.instacart.com/multi-modal-catalog-attribute-extraction-platform-at-instacart-b9228754a527
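To make the contrast between the rule-based approach and a configurable multi-modal approach concrete, here is a minimal sketch in Python. It is not PARSE's actual interface: `Product`, `AttributeConfig`, `rule_based_extract`, and `build_request` are hypothetical names invented for illustration, a regular expression stands in for a SQL rule, and the example URL is a placeholder. The idea it tries to show is that a keyword rule sees only text, while a per-attribute configuration can assemble a request that carries both text and images to whichever model is used.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Product:
    name: str
    description: str = ""
    image_urls: list[str] = field(default_factory=list)


@dataclass
class AttributeConfig:
    # Hypothetical per-attribute configuration; field names are illustrative only.
    attribute: str
    instructions: str
    allowed_values: list[str]
    use_images: bool = True


def rule_based_extract(product: Product, pattern: str) -> bool:
    """Rule-style baseline: flag the attribute when a keyword pattern matches the text."""
    text = f"{product.name} {product.description}"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None


def build_request(product: Product, config: AttributeConfig) -> dict:
    """Assemble a model-agnostic request that blends text with optional images."""
    prompt = (
        f"Attribute: {config.attribute}\n"
        f"Instructions: {config.instructions}\n"
        f"Allowed values: {', '.join(config.allowed_values)}\n"
        f"Product name: {product.name}\n"
        f"Product description: {product.description}"
    )
    images = list(product.image_urls) if config.use_images else []
    return {"prompt": prompt, "images": images}


# Example: the gluten-free claim appears only on the packaging image, not in the text.
product = Product(
    name="Crunchy Granola",
    description="Oats and honey.",
    image_urls=["https://example.com/front.jpg"],  # placeholder URL
)
config = AttributeConfig(
    attribute="gluten_free",
    instructions="Answer using the packaging claims.",
    allowed_values=["yes", "no", "unknown"],
)

text_only_hit = rule_based_extract(product, r"gluten[- ]free")  # the text rule misses it
request = build_request(product, config)  # carries the image for a multi-modal model
```

In this sketch, adding a new attribute means writing a new `AttributeConfig` rather than new extraction code, which is one plausible reading of the "rapid configuration" the episode describes.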