EPISODE · Oct 20, 2025 · 9 MIN
[BS] When Cameras Learn: The Rise of Video-Language Models
from The Eighth · host Avraham Raskin
A concise, investigative tour of how security video evolved from passive cctv to intelligent, searchable footage powered by local ai and video-language models. I maps the technical lineage-motion sensing, smart detections, face and plate id, the “AI Key,” and scene-level vlm search-and explain why pattern discovery at scale is the next operational leap for site security and investigations.“video that used to be passive now becomes a searchable narrative.”🎧 listen on spotify, youtube, apple podcasts🔗 more episodes → https://avrahamraskin.com/podcasttl;drsecurity cameras have graduated from passive recorders to active, searchable sensors. video-language models (vlms) and local llm-like agents enable natural-language scene search and condensed pattern visualisations-powerful for investigations but constrained today by compute and edge deployment. the next frontier is real-time, site-wide pattern detection running at the edge.timestamps00:00 | introduction and context00:23 | the evolution: cctv → motion → smart detections01:56 | face detection, license plates, and granular id02:19 | the “ai key”: local llm-style analytics (what it adds)03:35 | video-language models: frame description → search05:03 | practical investigative tools and scene search examples06:03 | pattern discovery: briefcam and condensed timelines07:40 | limitations today: compute, edge, and the next step09:56 | closing thoughts and what’s next
What this episode covers
A concise, investigative tour of how security video evolved from passive cctv to intelligent, searchable footage powered by local ai and video-language models. I maps the technical lineage-motion sensing, smart detections, face and plate id, the “AI Key,” and scene-level vlm search-and explain why pattern discovery at scale is the next operational leap for site security and investigations.“video that used to be passive now becomes a searchable narrative.”🎧 listen on spotify, youtube, apple podcasts🔗 more episodes → https://avrahamraskin.com/podcasttl;drsecurity cameras have graduated from passive recorders to active, searchable sensors. video-language models (vlms) and local llm-like agents enable natural-language scene search and condensed pattern visualisations-powerful for investigations but constrained today by compute and edge deployment. the next frontier is real-time, site-wide pattern detection running at the edge.timestamps00:00 | introduction and context00:23 | the evolution: cctv → motion → smart detections01:56 | face detection, license plates, and granular id02:19 | the “ai key”: local llm-style analytics (what it adds)03:35 | video-language models: frame description → search05:03 | practical investigative tools and scene search examples06:03 | pattern discovery: briefcam and condensed timelines07:40 | limitations today: compute, edge, and the next step09:56 | closing thoughts and what’s next
NOW PLAYING
[BS] When Cameras Learn: The Rise of Video-Language Models
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Mar 19, 2026 ·34m
Feb 18, 2026 ·11m
Feb 11, 2026 ·45m