Episode 175 - Gemini: A First Look

Episode 175 of the Two Voice Devs podcast, hosted by Mark and Allen, titled "Episode 175 - Gemini: A First Look", was published on December 15, 2023, and runs 41 minutes.

December 15, 2023 · 41m · Two Voice Devs

In this in-depth chat, Allen Firstenberg and Linda Lawton dive into the functionality and potential of Google's newly released Gemini model. Moving from their initial experiences to possibilities for the future, they discuss the Gemini Pro and Gemini Pro Vision models, how to #BuildWithGemini, the model's focus on both text and images, and its faster, more cohesive responses compared to older models. They also cover its potential for multimodal support, its reasoning capabilities, and the challenges they have encountered, and they share ideas on how Gemini could evolve.


00:04 Introduction and Welcome

00:23 Discussing the New Gemini Model

01:33 Comparing Gemini and Bison Models

02:07 Exploring Gemini's Vision Model

03:03 Gemini's Response Quality and Speed

03:53 Gemini's Token Length and Context Window

05:05 Gemini's Pricing and Google AI Studio

05:33 Upcoming Projects and Previews

06:16 Gemini's Role in Code Generation

07:54 Gemini's Model Variants and Limitations

12:01 Creating a Python Desktop App with Gemini

14:07 Gemini's Potential for Assisting the Visually Impaired

18:35 Gemini's Ability to Reason and Count

20:15 Gemini's Multi-Step Reasoning

20:33 Testing Gemini with Multiple Images

21:52 Exploring Image Recognition Capabilities

22:13 Discussing the Limitations of 3D Object Recognition

23:53 Testing Image Recognition with Personal Photos

24:52 Potential Applications of Image Recognition

25:45 Exploring the Multimodal Capabilities of the AI

26:41 Discussing the Challenges of Using the AI in Europe

27:26 Exploring the AQA Model and Its Potential

33:37 Discussing the Future of AI and Image Recognition

37:12 Wishlist for Future AI Capabilities

40:11 Wrapping Up and Looking Forward
