Unlocking Unstructured Data with LLMs episode artwork

EPISODE · Jul 3, 2025 · 27 MIN

Unlocking Unstructured Data with LLMs

from The Data Exchange with Ben Lorica · host Ben Lorica

Shreya Shankar is a  PhD student at UC Berkeley in the EECS department. This episode explores how Large Language Models (LLMs) are revolutionizing the processing of unstructured enterprise data like text documents and PDFs. It introduces DocETL, a framework using a MapReduce approach with LLMs for semantic extraction, thematic analysis, and summarization at scale.Subscribe to the Gradient Flow Newsletter 📩  https://gradientflow.substack.com/Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon ·  RSS.Detailed show notes - with links to many references - can be found on The Data Exchange web site.

Shreya Shankar is a PhD student at UC Berkeley in the EECS department. This episode explores how Large Language Models (LLMs) are revolutionizing the processing of unstructured enterprise data like text documents and PDFs. It introduces DocETL, a framework using a MapReduce approach with LLMs for semantic extraction, thematic analysis, and summarization at scale. Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/ Subscribe: Apple · Spotify · Overcast · ...

NOW PLAYING

Unlocking Unstructured Data with LLMs

0:00 27:46

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The Data Exchange with Ben Lorica?

This episode is 27 minutes long.

When was this The Data Exchange with Ben Lorica episode published?

This episode was published on July 3, 2025.

What is this episode about?

Shreya Shankar is a  PhD student at UC Berkeley in the EECS department. This episode explores how Large Language Models (LLMs) are revolutionizing the processing of unstructured enterprise data like text documents and PDFs. It introduces DocETL, a...

Can I download this The Data Exchange with Ben Lorica episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!