[BS] When Cameras Learn: The Rise of Video-Language Models episode artwork

EPISODE · Oct 20, 2025 · 9 MIN

[BS] When Cameras Learn: The Rise of Video-Language Models

from The Eighth · host Avraham Raskin

A concise, investigative tour of how security video evolved from passive cctv to intelligent, searchable footage powered by local ai and video-language models. I maps the technical lineage-motion sensing, smart detections, face and plate id, the “AI Key,” and scene-level vlm search-and explain why pattern discovery at scale is the next operational leap for site security and investigations.“video that used to be passive now becomes a searchable narrative.”🎧 listen on spotify, youtube, apple podcasts🔗 more episodes → https://avrahamraskin.com/podcasttl;drsecurity cameras have graduated from passive recorders to active, searchable sensors. video-language models (vlms) and local llm-like agents enable natural-language scene search and condensed pattern visualisations-powerful for investigations but constrained today by compute and edge deployment. the next frontier is real-time, site-wide pattern detection running at the edge.timestamps00:00 | introduction and context00:23 | the evolution: cctv → motion → smart detections01:56 | face detection, license plates, and granular id02:19 | the “ai key”: local llm-style analytics (what it adds)03:35 | video-language models: frame description → search05:03 | practical investigative tools and scene search examples06:03 | pattern discovery: briefcam and condensed timelines07:40 | limitations today: compute, edge, and the next step09:56 | closing thoughts and what’s next

A concise, investigative tour of how security video evolved from passive cctv to intelligent, searchable footage powered by local ai and video-language models. I maps the technical lineage-motion sensing, smart detections, face and plate id, the “AI Key,” and scene-level vlm search-and explain why pattern discovery at scale is the next operational leap for site security and investigations.“video that used to be passive now becomes a searchable narrative.”🎧 listen on spotify, youtube, apple podcasts🔗 more episodes → https://avrahamraskin.com/podcasttl;drsecurity cameras have graduated from passive recorders to active, searchable sensors. video-language models (vlms) and local llm-like agents enable natural-language scene search and condensed pattern visualisations-powerful for investigations but constrained today by compute and edge deployment. the next frontier is real-time, site-wide pattern detection running at the edge.timestamps00:00 | introduction and context00:23 | the evolution: cctv → motion → smart detections01:56 | face detection, license plates, and granular id02:19 | the “ai key”: local llm-style analytics (what it adds)03:35 | video-language models: frame description → search05:03 | practical investigative tools and scene search examples06:03 | pattern discovery: briefcam and condensed timelines07:40 | limitations today: compute, edge, and the next step09:56 | closing thoughts and what’s next

NOW PLAYING

[BS] When Cameras Learn: The Rise of Video-Language Models

0:00 9:59

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The Eighth?

This episode is 9 minutes long.

When was this The Eighth episode published?

This episode was published on October 20, 2025.

What is this episode about?

A concise, investigative tour of how security video evolved from passive cctv to intelligent, searchable footage powered by local ai and video-language models. I maps the technical lineage-motion sensing, smart detections, face and plate id, the “AI...

Can I download this The Eighth episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!