PODCAST · technology
The Cisco AI Insights Podcast
by Cisco Podcast Network
Curious about the latest in AI? Cisco’s podcast unpacks artificial intelligence, data science, and machine learning. Hosts Rafael Herrera and Sonia Marques welcome monthly guests, including AI PhDs and experts, better understand AI academic papers and discuss their impact in and out of Cisco. Whether you’re new to AI or an experienced pro, tune in for insights the technology shaping our future.
-
4
EP. 4 - Can AI Pick Stocks by Reading the News?
In this month’s episode of The Cisco AI Insights Podcast, hosts Rafael Herrera and Sónia Marques explore the evolving landscape of sentiment analysis with Cisco machine learning engineer Joan Rossello. The conversation centers on the research paper from Imperial College London, “FinDPO: Financial Sentiment Analysis for Algorithmic Trading Through Preference Optimization of LLMs,” which proposes a new way to train large language models to better interpret nuanced human language. By comparing output quality, this preference-based training improved generalization and helped models interpret complex financial language without relying on simple memorization. Additionally, the discussion explored how sentiment scores derived from model probabilities were used to rank companies based on the strength of their news coverage, yielding promising results. A special thank you to the team that developed this month's paper. If you are interested in reading the paper yourself, please visit this link: https://arxiv.org/abs/2507.18417.
-
3
EP. 3 - Rethinking AI Performance Metrics
In the latest episode of the Cisco AI Insights podcast, hosts Rafael Herrera and Sonia Marques are joined by Dr. Catarina Carvalho, a Cisco leader in machine learning engineering. Together, they unpack the complex academic paper " Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following," developed by researchers from the University of Maryland and the University of Waterloo. As the industry moves toward more reliable multimodal models, traditional pass-or-fail evaluation is no longer sufficient. This paper introduces a hierarchical framework that uses "LLM-as-a-judge" to evaluate outputs across five distinct criteria: visual grounding, logical coherence, factuality, reflection, and conciseness. Dr. Carvalho guides the discussion through the nuances of this "judge of judges" approach, exploring why human alignment remains the gold standard even as we automate evaluation processes. A special thank you to the teams at both The University of Waterloo and The University of Maryland, College Park, for developing this month's paper. If you are interested in reading the paper yourself, please visit this link: https://arxiv.org/pdf/2511.21662.
-
2
EP. 2 - Unlocking Cost-Effective AI with Small Language Models
In the latest episode of the Cisco AI Insights Podcast, hosts Rafael Herrera and Sónia Marques welcome Cisco AI operations engineer James Tidd for a discussion on the world of small language models (SLMs) and the evolution of efficient AI inference. Together, they unravel the complexities behind “Fast Inference from Transformers via Speculative Decoding,” a groundbreaking paper from Google that explores how smaller draft models can speed up large language model predictions while maintaining accuracy.James shares his hands-on experience experimenting with the technique, leveraging knowledge distillation and speculative execution. The trio also discusses the potential of this approach to optimize AI, reduce power consumption and costs, and help businesses of all sizes get more out of existing hardware.A special thank you to Google’s AI team for developing this month's paper. If you are interested in reading the paper yourself, please visit this link: research.google/blog/looking-back…ulative-decoding/.
-
1
EP. 1 - Building Better AI Agents
In this month’s episode of The Cisco AI Insights Podcast, hosts Rafael Herrera and Sónia Marques welcome Giota Antonakaki, a Cisco leader in machine learning engineering, for an illuminating deep dive into a recent academic research shaping AI agent technology. Together, they unpack “The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs”, a paper challenging the common belief that scaling large language models (LLMs) results in only marginal gains. The conversation explores fresh concepts like “long horizon” and “self-conditioning,” revealing how even small improvements in LLM step accuracy can dramatically boost performance in complex, multi-step tasks. Giota breaks down the difference between planning, reasoning, and execution in LLMs, and why reasoning-focused models outperform even the largest standard LLMs for long-running AI agents. We also extend a special thank you to Akshit Sinha and his team of researchers for developing this month's paper. If you are interested in reading the paper yourself, please visit this link: https://arxiv.org/abs/2509.09677.
We're indexing this podcast's transcripts for the first time — this can take a minute or two. We'll show results as soon as they're ready.
No matches for "" in this podcast's transcripts.
No topics indexed yet for this podcast.
Loading reviews...
ABOUT THIS SHOW
Curious about the latest in AI? Cisco’s podcast unpacks artificial intelligence, data science, and machine learning. Hosts Rafael Herrera and Sonia Marques welcome monthly guests, including AI PhDs and experts, better understand AI academic papers and discuss their impact in and out of Cisco. Whether you’re new to AI or an experienced pro, tune in for insights the technology shaping our future.
HOSTED BY
Cisco Podcast Network
CATEGORIES
Loading similar podcasts...