The Cutting Edge of Speech Recognition episode artwork

EPISODE · May 27, 2025 · 29 MIN

The Cutting Edge of Speech Recognition

from Our Digital Life Podcast: A series by IEEE-SPS · host IEEE-SPS

In this episode of the IEEE Signal Processing Society podcast, Dr. Sanjeev Khudanpur, Director of the Center for Language and Speech Processing, Johns Hopkins University interviews Associate Prof. Shinji Watanabe, Language Technologies Institute, Carnegie Mellon University. They talk about the latest research and innovations in speech recognition technologies and their impact across various industries. Shinji WatanabeShinji Watanabe is an Associate Professor at Carnegie Mellon University in Pittsburgh and a leading researcher in speech and language processing. His work spans automatic speech recognition, speech enhancement, spoken language understanding, and machine learning for speech and language processing. He has contributed more than 500 publications to peer-reviewed journals and received several awards, including the best paper award from ISCA Interspeech 2024. In this episode, Associate Prof. Watanabe reflects on the transformative progress in speech recognition over the past decade, highlighting milestones from the adoption of deep neural networks to the rise of large-scale models like OpenAI Whisper. He discusses the ongoing challenges in achieving human-level understanding in complex scenarios such as multi-speaker conversations, accented and multilingual speech, and child or disordered speech. He concludes with thoughts on academia’s enduring role in shaping the field, and how his inspiration is often drawn from science fiction and Japanese animation.

NOW PLAYING

The Cutting Edge of Speech Recognition

0:00 29:05

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

MG Show MG Show The MG Show, hosted by Jeffrey Pedersen and Shannon Townsend, is a leading alternative media platform dedicated to uncovering the truth behind today’s most pressing political issues. Launched in 2019, the show has grown exponentially, offering unfiltered insights, comprehensive research, and real-time analysis. With a commitment to independent journalism and factual integrity, the MG Show empowers its audience with knowledge and encourages active participation in the political discourse. Ask A Spaceman Archives - 365 Days of Astronomy Ask A Spaceman Archives - 365 Days of Astronomy Podcasting Astronomy Every Day of the Year Eat to Live Jenna Fuhrman, Dr. Fuhrman Our health is our most precious gift and smart nutrition can change your life. Each month, join Dr. Fuhrman and his daughter, Jenna Fuhrman as they discuss important topics in the world of nutrition. Eat to Live will change the way you eat and think about food. French Your Way Jessica: Native French teacher founder of French Your Way Boost your French listening skills and test your comprehension with this one of a kind series of podcasts. Get the chance to listen to a real conversation between native speakers talking at normal speed AND customise your learning experience through carefully designed sets of questions (2 levels of difficulty) available for download at www.frenchvoicespodcast.com. All interviews also come with the transcript. French teacher Jessica interviews native speakers of French from around the world who share a bit of their life and passion. Where else would you meet in one same place a French yoga teacher based in Melbourne, a soap manufacturer from Provence, or a couple cycling around the world?

Frequently Asked Questions

How long is this episode of Our Digital Life Podcast: A series by IEEE-SPS?

This episode is 29 minutes long.

When was this Our Digital Life Podcast: A series by IEEE-SPS episode published?

This episode was published on May 27, 2025.

What is this episode about?

In this episode of the IEEE Signal Processing Society podcast, Dr. Sanjeev Khudanpur, Director of the Center for Language and Speech Processing, Johns Hopkins University interviews Associate Prof. Shinji Watanabe, Language Technologies Institute,...

Can I download this Our Digital Life Podcast: A series by IEEE-SPS episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!