Epsiode 202 - Hosting and Large Language Models episode artwork

EPISODE · Aug 15, 2024 · 27 MIN

Epsiode 202 - Hosting and Large Language Models

from Two Voice Devs · host Mark and Allen

Join Allen Firstenberg and Mark Tucker on Two Voice Devs as they discuss the challenges and solutions of hosting large language models (LLMs). They explore various hosting environments, including Firebase, AWS Amplify, Vertex AI, and Docker/Kubernetes, comparing their strengths and weaknesses. Allen shares his experience with Firebase Cloud Functions and the seamless integration with Google Cloud services, while Mark tackles the complexities of Docker, Kubernetes, and enterprise-level deployment strategies. From managing API keys and credentials to implementing design patterns and best practices, they explore the challenges and solutions for building robust and scalable AI systems. This episode is packed with practical tips for developers, covering topics like: [00:02:00] Firebase Suite of Tools: Learn how Firebase provides a comprehensive platform for hosting LLMs, including real-time databases, cloud storage, cloud functions, and authentication. [00:04:00] Firebase vs. AWS Amplify: Discover the key differences between these two popular serverless platforms and their database options. [00:05:00] Cloud Service Accounts for Security: Allen demonstrates how leveraging cloud service accounts can simplify permission management and enhance security. [00:11:00] Architecture Design and Long-Term Hosting: Allen emphasizes the impor tance of considering future scalability and maintenance when selecting a hosting environment. [00:12:30] Working with Docker and Kubernetes: Mark dives into his experience using Docker containers and Kubernetes for enterprise-level LLM deployment. [00:15:00] Learning Python for LLM Development: Mark shares his experience learning Python for working with LLMs and using libraries like FastAPI for REST API development. [00:17:00] Design Patterns and Best Practices: Allen and Mark discuss the evolving nature of design patterns and their importance in modern software development. [00:20:00] KitOps for Model Deployment: Mark explains how KitOps can be used to separate model deployment from service deployment in a Kubernetes environment. [00:23:00] Docker and Configuration Management: Allen discusses the challenge of configuration management in Docker environments and how to manage changes efficiently. [00:24:00] Enterprise Security and Tooling: Mark explores the use of tools like HashiCorp Console and Vault for managing configurations and secrets in enterprise deployments. [00:26:00] The Importance of Containerization: Allen and Mark reiterate the fundamental role of containers in modern software development. Don't miss this insightful episode of Two Voice Devs, where you'll gain valuable insights and practical tips for hosting and deploying your own LLMs! #AI #Development #Hosting #Cloud #Docker #Kubernetes #Firebase #GoogleCloud #DesignPatterns #TwoVoiceDevs

Join Allen Firstenberg and Mark Tucker on Two Voice Devs as they discuss the challenges and solutions of hosting large language models (LLMs). They explore various hosting environments, including Firebase, AWS Amplify, Vertex AI, and Docker/Kubernetes, comparing their strengths and weaknesses. Allen shares his experience with Firebase Cloud Functions and the seamless integration with Google Cloud services, while Mark tackles the complexities of Docker, Kubernetes, and enterprise-level deployment strategies. From managing API keys and credentials to implementing design patterns and best practices, they explore the challenges and solutions for building robust and scalable AI systems. This episode is packed with practical tips for developers, covering topics like: [00:02:00] Firebase Suite of Tools: Learn how Firebase provides a comprehensive platform for hosting LLMs, including real-time databases, cloud storage, cloud functions, and authentication. [00:04:00] Firebase vs. AWS Amplify: Discover the key differences between these two popular serverless platforms and their database options. [00:05:00] Cloud Service Accounts for Security: Allen demonstrates how leveraging cloud service accounts can simplify permission management and enhance security. [00:11:00] Architecture Design and Long-Term Hosting: Allen emphasizes the impor tance of considering future scalability and maintenance when selecting a hosting environment. [00:12:30] Working with Docker and Kubernetes: Mark dives into his experience using Docker containers and Kubernetes for enterprise-level LLM deployment. [00:15:00] Learning Python for LLM Development: Mark shares his experience learning Python for working with LLMs and using libraries like FastAPI for REST API development. [00:17:00] Design Patterns and Best Practices: Allen and Mark discuss the evolving nature of design patterns and their importance in modern software development. [00:20:00] KitOps for Model Deployment: Mark explains how KitOps can be used to separate model deployment from service deployment in a Kubernetes environment. [00:23:00] Docker and Configuration Management: Allen discusses the challenge of configuration management in Docker environments and how to manage changes efficiently. [00:24:00] Enterprise Security and Tooling: Mark explores the use of tools like HashiCorp Console and Vault for managing configurations and secrets in enterprise deployments. [00:26:00] The Importance of Containerization: Allen and Mark reiterate the fundamental role of containers in modern software development. Don't miss this insightful episode of Two Voice Devs, where you'll gain valuable insights and practical tips for hosting and deploying your own LLMs! #AI #Development #Hosting #Cloud #Docker #Kubernetes #Firebase #GoogleCloud #DesignPatterns #TwoVoiceDevs

NOW PLAYING

Epsiode 202 - Hosting and Large Language Models

0:00 27:16

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

The Small Business Startup School – Business Notes | Financial Literacy | Retail Psychology – For Professionals & Entrepreneurs The Small Business Startup School Inc. Starting or buying a small business? While personal circumstances may vary, business patterns remain timeless. On The Small Business Startup School, we explore strategies, insights, and practical solutions to help entrepreneurs confidently navigate their journey.Hosted by Ola Williams—a retail entrepreneur, fintech founder, and financial coach with over two decades of experience—this podcast marries financial awareness and retail psychology with optimism to deliver actionable takeaways.Join us to learn, grow, and connect as we uncover the keys to business success.Let’s continue to learn together and be encouraged to keep on connecting! 2 Old Ladies Walking Rozee 2 Old Ladies Walking features the journeys, insights, and light conversation between Liz and Rosie, two women of a certain age who live in the Hudson Valley of New York. From pelvic floor challenges and life with young adult children to food, bird calls, fear of “mad lamb” disease, and myriad topics in between, we cover it all while walking on the scenic trails of the northeast, or wherever our travels take us. Join us and have a listen! Radio Maria Kenya Radio Maria Kenya A Christian voice in Kenya and in the World Two Recruiters: Zero Filter Two Recruiters At Two Recruiters: Zero Filter, we're on a mission to demystify the hiring process, share insider tips, and empower you to maneuver through the professional world with confidence. With more than 30 years of combined experience navigating the intricate web of job markets, talent acquisition, and career development, we're here to spill the tea on everything career related. But wait, there’s more! We will dive into many life topics that are interesting to us as well.  Get ready for a rollercoaster of insights, stories, and no-holds-barred advice!Join us for conversations that matter – where work, life, and authenticity collide in the most unexpected and rewarding ways.

Frequently Asked Questions

How long is this episode of Two Voice Devs?

This episode is 27 minutes long.

When was this Two Voice Devs episode published?

This episode was published on August 15, 2024.

What is this episode about?

Join Allen Firstenberg and Mark Tucker on Two Voice Devs as they discuss the challenges and solutions of hosting large language models (LLMs). They explore various hosting environments, including Firebase, AWS Amplify, Vertex AI, and...

Can I download this Two Voice Devs episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!