Creative Flux: The Generative Media Podcast

PODCAST · business

Creative Flux: The Generative Media Podcast

Each week, AI engineers Pierson Marks (@piersonmarks) & Bilal Tahir (@deepwhitman) bring you practical insights, creative workflows, and the latest breakthroughs in generative media.We cover everything that's happening in AI-powered audio, video, and image creation, sharing hands-on tips and industry news straight from the front lines. Topics include new generative models, creative best practices, open-source tools, real-world use cases, and the evolving landscape of AI-driven content creation.

  1. 44

    We Built an AI Podcast Producer- Plus GPT Realtime, AI Anime & the Anthropic/SpaceX Deal | EP 43

    In Episode 43, Pierson and Bilal come fresh off the a16z Generative Media Video Hackathon - and they built some wild stuff.Bilal went from a prompt to a full AI-generated podcast: custom avatars, Gemini TTS voices, lip-synced video, and live infographics pulled by deep research. All stitched together programmatically. Pierson built something different — an AI producer that joins your recording call as a guest, listens for a wake word, and pulls up images, search results, or visuals on a live canvas. Think: Jamie from Joe Rogan, but agentic.From there they get into:GPT Realtime 2 — real-time transcription, translation, and a voice API that stays listening even when "asleep"The shift from AI writing code to AI owning features — and what engineering responsibility looks like in that worldAI-generated anime breaking the last creative barrier humans thought they hadXAI dissolving into SpaceX, Elon's Colossus One deal with Anthropic, and what it means for Claude Code limitsMicro datacenters — Nvidia Blackwell chips mounted in your garage via SPAN's XFRAChapters: 00:00 Episode 43 & The a16z Generative Media Hackathon 04:00 Bilal's Build: Prompt to Full AI Podcast with Avatars 10:30 Pierson's Build: An AI Producer That Joins Your Call Live 20:00 GPT Realtime 2 & the Always-On AI Future 31:00 AI Anime, Creativity & the Last Human Bastion 37:00 XAI, Anthropic's Colossus Deal & Micro Data centers🔗 Links mentioned:GPT Realtime API: https://developers.openai.com/api/docs/models/gpt-realtimeSin Citium anime: https://x.com/Cont_animation/status/2051296715781619829SPAN Micro Datacenters x Nvidia: https://www.businesswire.com/news/home/20260414372626/en/SPAN-Announces-XFRA-a-Distributed-Data-Center-Solution-to-Close-the-Speed-to-Power-Gap-for-AI-Compute-Demand🐦 Follow the hosts:Pierson → https://x.com/piersonmarksBilal → https://x.com/deepwhitman📺 More episodes: / @jellypodai

  2. 43

    The AI Podcast That Fooled Everyone, Talkie 13B & Stripe for Agents | EP 42

    In Episode 42 of Creative Flux, Pierson and Bilal kick off episode 42 - the answer to the universe - and things get weird fast.From there they cover: - ElevenLabs launching a standalone music social app, why Suno accidentally became a social network first, and the impossible position Spotify is now in - An AI podcast of Henrik Johansen and Martin Shkreli so realistic even Bilal got fooled - Talkie 13B — a model trained exclusively on pre-1931 text by Alec Radford, and why it feels like actually talking to someone from the past - Stripe Link going agent-first and Sam Altman hinting at something big around bring-your-own-tokens at Stripe Sessions Chapters: 00:00 Hitchhiker's Guide, Audio Books & the Future of Multi-Voice AI Narration 08:10 ElevenLabs Music, Suno's Social Network & What Spotify Is Really Up Against 16:00 Henrik Johansen x Martin Shkreli — The AI Podcast That Fooled Everyone 22:00 Talkie 13B — A Language Model Trained Exclusively on Pre-1930 Text 36:30 Stripe Link for Agents, Sign In with ChatGPT & the Agent-First Platform Shift 🔗 Show Notes: ElevenLabs Music: https://elevenlabs.io/music Henrick Johanssen & Martin Shkreli: https://x.com/compliantvc/status/2049535715369775559?s=12 Talkie 13B (pre-1930 LLM): https://talkie-lm.com/introducing-talkie Stripe Link for Agents: https://stripe.com/blog/giving-agents-the-ability-to-pay 🐦 Follow the hosts: Pierson → https://x.com/piersonmarks Bilal → https://x.com/deepwhitman 📺 More episodes: / @jellypodai

  3. 42

    GPT 5.5, Tidepool Agents & the Chip War Nobody's Winning Yet | EP 41

    In Episode 41 of Creative Flux, Pierson and Bilal cover one of the most chaotic weeks in AI yet — GPT 5.5 literally dropped 20 minutes before they hit record. From there they cover: Opus 4.7's regression Anthropic finally admitted to, what actually broke in Claude Code, and the $25B Amazon deal behind their compute crunch The chip race — why Anthropic matching OpenAI on inferior hardware changes everything once they actually get Nvidia access Tidepool (formerly Claw Connect) — an open source peer-to-peer protocol letting isolated Claude Code instances talk to each other. Three agents, no shared context, they built a subscription business GPT Image 2's 300-point ELO jump and why it's an image agent, not just a model 🔗 Show Notes: Andon Store: https://x.com/jlagerros/status/2046966793538048295 Tidepool - Agent to Agent Communication Protocol: https://github.com/Jellypod-Inc/tidepool Opus 4.7: https://www.anthropic.com/news/claude-opus-4-7 GPT Image 2: https://developers.openai.com/api/docs/models/gpt-image-2 GPT 5.5: https://openai.com/index/introducing-gpt-5-5/ Hyperspace Pods: https://x.com/varun_mathur/status/2044882359565312468 Amit Jain from Luma AI on Unified Intelligence Systems (CS 153: Frontier Systems): https://www.youtube.com/watch?v=WNNrUuMQkl8 🐦 Follow the hosts: Pierson → https://x.com/piersonmarks Bilal → https://x.com/deepwhitman 📺 More episodes: / @jellypodai

  4. 41

    What Happens When AI Agents Start Talking to Each Other | EP 40

    In Episode 40, Pierson and Bilal get into something most people are completely overlooking.Using Obsidian and local Markdown files to build a personal knowledge base that an LLM organizes, interlinks, and updates for you every single day. Your notes, your transcripts, your ideas — structured like your own Wikipedia, maintained by AI.From there they cover:Why local-first AI models are winning the privacy argumentHow open source is quietly taking overWhat Opus 4.7 actually changes for developersPierson's live demo of Claw Connect — a peer-to-peer protocol that lets AI agents talk directly to each other, across any model or harnessTwo Claude Code instances. On screen. Having their own conversation. It gets wild.Lots of rabbit holes. All worth it.🔗 Links mentioned:Opus 4.7: https://www.anthropic.com/news/claude-opus-4-7Claude Code Routines: https://code.claude.com/docs/en/routinesGemini TTS: https://ai.google.dev/gemini-api/docs/models/gemini-3.1-flash-tts-previewAndrej Karpathy on LLM Knowledge Bases: https://x.com/karpathy/status/2039805659525644595Chapters:00:00 Episode 40 & Building a Personal Knowledge Base with Obsidian 08:10 Daily AI Routines, Claude Code & Automating Your Workflow 13:46 Local AI Models, Open Source & the Privacy Argument 24:29 Opus 4.7, Energy, Intelligence & What's Next 39:37 Introducing Claw Connect: Peer-to-Peer Agent Communication 56:18 Two AI Agents Talking Live & the Future of Agent Collaboration🐦 Follow the hosts:Pierson → https://x.com/piersonmarksBilal → https://x.com/deepwhitman📺 More episodes: https://www.youtube.com/@JellypodAi

  5. 40

    The Race to Build the Next Big AI Model: Anthropic, OpenAI & What's Coming Next | EP 39

    In Episode 39, Pierson and Bilal cover a lot of ground, starting with a chance meeting at a rooftop event in SF that led to a conversation about AI agents hosting their own podcasts.They break down Clawcast (what happens when your AI agent can invite other agents to record a podcast), why agent-to-agent communication might be the next big wave, and how smart contracts could use AI to handle deals and settle disputes automatically.Then they get into the AI model race, Anthropic vs OpenAI, what 10 trillion parameter models actually mean, why old GPUs aren't obsolete yet, and how robotics is quietly reshaping Amazon's warehouses.Lots of rabbit holes. All worth it.🔗 Links mentioned:Neural Noise: https://github.com/leopiney/neuralnoiseAnthropic Mythos / Project Glasswing: https://www.anthropic.com/glasswingDylan Patel & Dwarkesh Patel Podcast: https://www.youtube.com/watch?v=mDG_Hx3BSUE🐦 Follow the hosts:Pierson → https://x.com/piersonmarksBilal → https://x.com/deepwhitman📺 More episodes: https://www.youtube.com/@JellypodAiChapters:00:00 Introduction & What's Been Going On This Week02:32 Neural Noise & Clawcast: AI Agents Hosting Podcasts11:09 Agent Communication, Smart Contracts & Killing the Middleman21:40 Game Theory, Geopolitics & the AI Model Race30:30 Chips, Hardware & Why Old GPUs Still Matter36:17 Robotics, Dark Factories & Amazon's Future

  6. 39

    The Agentic AI Filmmaking Platform - w/ Koyal AI (YC F25) Founder Mehul Agarwal

    On Episode 38, we sit down with Koyal AI (YC F25) founder & CEO Mehul Agarwal (https://x.com/meh_agarwal) to discuss the future of filmmaking with AI. His goal? Replacing the camera, not the filmmaker. Mehul shares the journey of Koyal's development, from inception as a research paper at NeurIPS 2024 to its most recent v2.5 launch. Covering a range of topics from AI video, world models, benchmarking, user interfaces, and more.Chapters00:00 Introducing Koyal: The AI Filmmaking Platform06:04 Combining Creativity and Technology11:26 Koyal's State-of-the-Art Consistency22:25 Creating Realistic Avatars27:29 User Interface for Video Editing36:23 Impact of Memetic Content41:26 Predictions for Video and Image TechnologyShow Notes: NeurIPS 2024 Paper - CHARCHA: https://www.ri.cmu.edu/cmu-alumni-launch-koyal-for-safe-ai-video-creation/Koyal AI: https://koyal.ai/

  7. 38

    Agentic Development - Superpowers

    On Episode 37, hosts Pierson (https://x.com/piersonmarks) and Bilal (https://x.com/deepwhitman) discuss OpenAI shutting down Sora refocusing on coding and ChatGPT, the automated improvement cycles in AI models, and the concept of an agent-first approach in software development (an intro to the /superpowers plugin). Show Notes: Sora Shutting Down: https://openai.com/sora/Superpowers Plugin: https://github.com/obra/superpowersStripe Projects: https://projects.dev/Fuma Docs: https://www.fumadocs.dev/Chapters:00:00 Introduction and Podcast Overview09:05 Agent-First Approach in Software Development27:45 Superpowers Plugin for Cloud Development33:00 Quality of Life Enhancements in Cloud Development39:14 Discussion on API Documentation and OpenAPI Spec

  8. 37

    Google Stitch 2 - the Figma Killer??

    On Episode 36, hosts Pierson (https://x.com/piersonmarks) and Bilal (https://x.com/deepwhitman) give a first look into Google Stitch 2 (and a few other Google Labs products). Coming off a fun after-party hosted by our friends @FAL, they also dig into Bilal's open-sourced LoFi Music dataset of 160 free-to-use songs, Raycast's Glaze mac-app builder, and more. Show Notes: Google Stitch 2: https://stitch.withgoogle.com/Raycast Glaze: https://www.glaze.app/Open LoFi: https://github.com/btahir/open-lofiFAL: https://fal.ai/ Chapters00:00 Exploring the Generative Media Community07:15 Google Labs and Experimental AI Projects15:04 Personalized Software and Vibe Coding22:24 AI-Generated Art and Design29:32 Design System and Theme Generation35:38 AI-Generated Lo-Fi Music

  9. 36

    Replit Agent 4 (@ $9B val), Netflix buys Ben Afleck's AI Startup

    On Episode 35, hosts Pierson (https://x.com/piersonmarks) and Bilal (https://x.com/deepwhitman) cover the latest in Generative Media, with Netflix acquiring Ben Afleck's AI startup InterPositive for $600M, Replit Agent 4 (with a $9B new funding round), and Polsia running your business while you sleep. Also, a Punch the Monkey AI short. Show Notes:Ben Afleck's AI Startup InterPositive: https://techcrunch.com/2026/03/11/netflix-may-have-paid-600-million-for-ben-afflecks-ai-startup/Replit Agent 4 and their latest funding round @ $9B https://replit.com/news/funding-announcementPunch the Monkey AI Film: https://x.com/venturetwins/status/2029961922783895958?s=20Wonder Studios: https://wonderstudios.com/

  10. 35

    NotebookLM Cinematic Video, GPT 5.4, Claude Code Skills 2.0

    On Episode 34, hosts Pierson (@piersonmarks) and Bilal (@deepwhitman) cover Google NotebookLM's new cinematic video overviews, GPT 5.4, Claude Code Skills benchmarks, and Cursor's new automations. Show Notes: NotebookLM Cinematic Video Overviews: https://blog.google/innovation-and-ai/products/notebooklm/generate-your-own-cinematic-video-overviews-in-notebooklm/GPT 5.4 https://openai.com/index/introducing-gpt-5-4/Cursor Automations: https://cursor.com/blog/automationsReact Grab: https://www.react-grab.com/Claude Code Skills 2.0: https://code.claude.com/docs/en/skillsGoogle Workspace CLI: https://www.npmjs.com/package/@googleworkspace/cliChapters00:00 Generative Media and AI08:51 Evolution of AI Models and Releases22:09 Automation and Task Assignment in AI29:17 Cloud Code Skill Creator36:18 Design and Code Integration43:55 Verifiability in Design and Creative Fields

  11. 34

    Nano Banana 2, Claude Code Remote Control, & Quiver AI

    On Episode 33 of Creative Flux, hosts Pierson (@piersonmarks) and Bilal (@deepwhitman) cover Claude Code's new /remote-control feature, the dead internet theory, how to control camera angles in image gen models, PrunaAI for cheap image and video generation, real-time weather visualization with Nano Banana 2, and Quiver AI for SVG generation and animation.Show Notes: Nano Banana 2: https://blog.google/innovation-and-ai/technology/ai/nano-banana-2/Quiver AI (SVG): https://quiver.ai/PrunaAI (pVideo & pImage): https://replicate.com/prunaai/p-videoCamera Angle UI: https://x.com/Framer_X/status/2025912264256307447?s=20Camera Angle Qwen Image Edit Model: https://fal.ai/models/fal-ai/qwen-image-edit-2511-multiple-anglesChapters00:00 Claude Code Remote Control 05:02 AI-Generated Content10:05 The Future of Media and Content Creation29:32 Creative Control with Open Source Models35:32 Pruna AI and Low-Cost Video Generation42:00 Quiver AI and SVG Generation

  12. 33

    Web 4, Google Labs, & Fine Tuning Video Models

    On Episode 32 of Creative Flux, hosts Pierson (@piersonmarks) and Bilal (@deepwhitman) recap some of last week's Seedance 2.0 mania, then dives into the challenges of fine-tuning AI models for scale, the impact of benchmarks on model development, and the future of AI and human taste. Oh, and Web 4.0. Don't forget that one. Show Notes: Google Labs: Pomelli - https://blog.google/innovation-and-ai/models-and-research/google-labs/pomelli-photoshoot/Web 4 - https://x.com/0xSigil/status/2023877649475731671Gemini 3.1 Pro - https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/

  13. 32

    The ChatGPT Moment for Video: Seedance 2.0

    Join the discussion on Jellypod's Discord: https://discord.com/invite/9FYgzU8JNk Pierson (@piersonmarks) and Bilal (@deepwhitman) highlight the release of Seedance 2.0 and it's implications across Hollywood, sports, and filmmaking. Show Notes: Seedance 2.0: https://seedance2.ai/Kling 3: https://klingai.com/global/3D Sports Recreations with AI: https://x.com/bilawalsidhu/status/2020229779585200401 Chapters00:00 The Acceleration of AI09:21 Seedance 2.0: A Game-Changer in Generative Media15:44 The Future of Generative Media and Hollywood22:42 The Impact of AI in Sports and Entertainment

  14. 31

    Moltbook, Kling 3.0, Model Wars (Opus 4.6 vs. OpenAI Codex 5.3)

    Episode 30! On this milestone episode of Creative Flux, hosts Pierson (@piersonmarks) and Bilal (@deepwhitman) celebrate half a year of the podcast and dive into one of the most stacked weeks in generative AI yet.We break down the surprise drop of Claude Opus 4.6 (which literally launched mid-recording), Google's Genie 3 world model and its mind-blowing Halo recreation, the massive Kling 3.0 update with multi-shot prompting and native audio, and TrueShort — the AI movie studio that hit $2.4M revenue in just six months. Plus: agent swarm architectures, Cloud Code power-user workflows, the MoteBook debate, and a JellyPod 2 teaser.Chapters[00:00] Staying Sane in the AI Hype Cycle[04:00] MoteBook, OpenClaw & Conway's Game of Life for AI Bots[09:10] Genie 3 World Models & Spatial Intelligence[13:42] Claude Opus 4.6, Cloud Code Workflows & Agent Swarm Architectures[30:53] Kling 3.0, TrueShort's $2.4M AI Studio & Paper BananaShow Notes:Google Genie 3: https://deepmind.google/models/genie/Halo 3 Respawn Video (HUD appears): https://x.com/elder_plinius/status/2017635440207987061Claude Opus 4.6 & Agent Teams: https://code.claude.com/docs/en/agent-teamsKling 3.0 Guide: https://app.klingai.com/global/quickstart/klingai-video-3-model-user-guideTrueShort App: https://apps.apple.com/us/app/trueshort-stream-true-crime/id6741782158TrueShort Thread: https://x.com/NateTepper/status/2018786702643605780Paper Banana: https://arxiv.org/abs/2601.23265Codex 5.3: https://openai.com/index/introducing-gpt-5-3-codex/

  15. 30

    What is Clawdbot (Moltbolt/OpenClaw)?

    Why is everyone buying Mac Minis and running Clawdbot (Moltbot/OpenClaw)? On Episode 29 of Creative Flux, hosts Pierson (@piersonmarks) and Bilal (@deepwhitman) dig into one of the fastest growing open source projects of all time, amassing over 100,000 github stars in a matter of days. We'll discuss how people are using their new "autonomous claude agent" to work 24/7, the challenges of versioning agent skills, and the release of the insane Kimi K2.5 model.Show Notes:- OpenClaw (Clawdbot) Repo: https://github.com/openclaw/openclaw- Google DeepMind & Pixar: https://blog.google/innovation-and-ai/models-and-research/google-deepmind/dear-upstairs-neighbors/- Kimi K2.5 & Agent Swarm: https://www.kimi.com/blog/kimi-k2-5.html- Agent Skills (Vercel): https://skills.sh/ 

  16. 29

    What is Remotion? The Claude Code skill everyone's talking about

    What is Remotion? Why is everyone now using it to create AI product demos & motion graphics? In Episode 28 of Creative Flux, Pierson (@piersonmarks) and Bilal (@deepwhitman) dive into a project that's near and dear to our hearts: Remotion, founded by the extremely talented (and humble) Jonny Burger. Remotion is a programatic video generation library that enables the creation and rendering of videos using React components. This week, Remotion launched a remotion-best-practices Claude Code skill, and almost immediately went viral. People were now able to prompt Claude Code to create professional-level motion graphics in a matter of minutes. Say goodbye to animated motion designers and $1000+ launch videos.Show Notes:Remotion: https://www.remotion.dev/ Fireship Video on Remotion: https://www.youtube.com/watch?v=deg8bOoziaERemotion Best Practices Claude Code Skill: https://skills.sh/remotion-dev/skills/remotion-best-practicesReact Email: https://react.email/LTX-2 LoRAs: https://www.reddit.com/r/StableDiffusion/comments/1qd525f/ltx2_i2v_synced_to_an_mp3_distill_lora_quality/Agent Browser: https://skills.sh/vercel-labs/agent-browser/agent-browser

  17. 28

    Ralph Wiggum, Claude Cowork, and Coding Workflows

    In Episode 27, Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) explore the shift from single-agent power user to multi-agent orchestrator in AI-native development. We discuss how to run multiple Claude Code agents in parallel using Git worktrees, build AFK workflows that ship while you sleep, and implement the Ralph Wiggum loop pattern for autonomous overnight development. We'll cover notification-based oversight, Docker sandboxing for safe automation, and how to structure your system with skills, plugins, sub-agents, and the Model Context Protocol (MCP) so every future task compounds in speed and quality.Show Notes: Lee Robinson: https://x.com/leerob/status/2011810357942084085?s=20Ralph Wiggum https://x.com/mattpocockuk/status/2009276031622918474Claude Cowork: https://claude.com/blog/cowork-research-preview

  18. 27

    2026 Predictions & LTX-2 4K Open Source Video Model

    In the first episode of 2026, Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) reflect on the advancements in generative media over the past year and share their predictions for 2026. They discuss AI breakthroughs, the evolving sentiment towards AI-generated content, and the ongoing chip wars shaping the future of AI. They also highlight recent developments in video generation technology, particularly the launch of LTX2.Show Notes: Gavin Baker - The AI Battle (Chip Podcast): https://www.youtube.com/watch?v=cmUo4841KQw \LTX-2: https://ltx.io/model/ltx-2

  19. 26

    Gemini 3 Flash, ChatGPT Image 1.5, Meta SAM Audio

    In Episode 25 of Creative Flux, Pierson (@piersonmarks) and Bilal (@deepwhitman) discuss ChatGPT Image 1.5, Gemini 3 Flash, and Meta's Segment Anything Audio model (with a tangent into copyright law and nano banana grid flow hacks) Show Notes: Gemini 3 Flash: https://blog.google/products/gemini/gemini-3-flash/Meta Segment Anything (SAM) Audio: https://ai.meta.com/blog/sam-audio/GPT Image 1.5: https://openai.com/index/new-chatgpt-images-is-here/

  20. 25

    Disney & OpenAI Deal, GPT 5.2, & AI Ad Controversy

    Pierson (@piersonmarks) and Bilal (@deepwhitman) discuss everything from Disney & OpenAI's mega $1B deal, the just released GPT 5.2 model, and what Meta's been up to with their PlayHT & Limitless acquisitions, plus a massive ElevenLabs partnership for Instagram Reels.  Show Notes: GPT 5.2: https://openai.com/index/introducing-gpt-5-2/ Terrance Tao: https://terrytao.wordpress.com/2025/12/08/the-story-of-erdos-problem-126/ The Thinking Game - Demis Hassabis: https://www.youtube.com/watch?v=d95J8yzvjbQ ElevenLabs + Meta: https://x.com/elevenlabsio/status/1999163506743038408Limitless Hardware Device: https://www.limitless.ai/ Twitter/X Hackathon - Dynamic Ads: https://x.com/xai/status/1997875236415676619?s=46Disney/OpenAI Deal: https://openai.com/index/disney-sora-agreement/McDonalds Ad: https://x.com/chatgpt21/status/1998253809307455555?s=20Mac Wars: https://x.com/Solopopsss/status/1997348315424260155?s=20

  21. 24

    Runway Gen 4.5 Breaks Hollywood, Kling O1, OpenAI Code Red

    In Episode 23, Pierson (@piersonmarks) and Bilal (@deepwhitman) break down a packed week in AI video. Starting with Runway's cinematic Gen 4.5 release then Kling's Omni model o1  bringing "Nano Banana for video" editability. Also discuss OpenAI's internal "Code Red" as Google's Gemini 3 and Nano Banana eat into ChatGPT's market share.Other Topics: Waymo's massive California and nationwide expansionGoogle Workspace's new AI-native featuresAnthropic's Opus 4.5 coding prowess (and Dario's thinly-veiled Sam Altman shade)Disney embracing AI remixes of their charactersthe Absurd studio charging $30k per AI-generated videoZootopia 2 crushing it in China, Public domain opportunities as 1926 works enter the commons, Rapid-fire updates on Stable Diffusion 4.5, Flux 2, and the ultra-cheap Zimage model.Chapters: [00:00] Introduction and Thanksgiving Reflections[06:07] Video Models and Runway's New Release[08:54] Spatial Awareness in Video Editing Models[11:55] Exploring Zootopia and Character Creation[14:50] Public Domain and Its Impact on Content Creation[17:46] OpenAI's Code Red and Google's Competitive Edge[20:49] The Future of AI in Browsers and User Experience[23:49] Conclusion and Future Predictions

  22. 23

    Google's Week: Gemini 3 & Nano Banana 2

    In Episode 22 of Creative Flux, Pierson (@piersonmarks) and Bilal (@deepwhitman) break down Google's massive AI launch week featuring the benchmark-breaking Gemini 3, Gemini in Chrome, and Nano Banana Pro (i.e. nano-banana-2) image generator. They explore its text rendering abilities, multi-image storytelling, and pairing with Gemini 3 Pro for unprecedented creative control. This episode also covers OpenAI's counter-moves with GPT 5.1 Pro and Codex Max 2, Replicate's Cloudflare acquisition, Disney's embrace of AI-generated content, and a Pokemon card generator Pierson built using Nano Banana and X402 crypto micropayments for frictionless transactions.

  23. 22

    KimiK2, Waymos on Freeways, AI VFX, Marble WorldLabs

    In Episode 21 of Creative Flux, Pierson (@piersonmarks) and Bilal (@deepwhitman) chat about KimiK2 by the Chinese Lab Moonshot AI and how it excels in creative writing without that "AI"-feel.  They also dig into the sudden release of GPT 5.1 and the Waymo launch of freeway driving. Pierson and Bilal also dig into Beeble, the AI VFX studio for professional filmmakers, WorldLab's GA release of Marble, and Gamma's Series B for AI slidedecks.Chapters0:00 Exploring Creative Writing with KimiK205:58 The Debate: One Model vs. Specialized Models11:51 Innovations in World Models and Their Applications17:57 The Shift to Digital Existence & Network States22:46 Waymo's Freeway Launch 27:36 Beeble and Gamma32:09 Public Perception on AI

  24. 21

    AI Artist Signs $3M Record Deal, LTX-2, and Stripe's Stablecoins

    In Episode 20 of Creative Flux, Pierson (@piersonmarks) and Bilal (@deepwhitman) dive into an AI artist landing a $3M record deal, LTX V2 pumping out 20-second 4K videos for pennies, and the viral Halloween videos that broke the internet (Lonely Frankenstein & Nike/Chainsaw Massacre). They also dive into how Stripe's betting big on stable coins and what the future of internet transactions could look like. Chapters[02:34] LTX V2: The Game-Changing Video Model[08:46] AI Workflows & Creative Pipelines[14:55] Viral Halloween Videos & Storytelling Power[18:53] AI Music Revolution & Artist Rights[24:44] Stable Coins & The Future of Payments

  25. 20

    The Rise of AI Creators (a Quibi comeback?)

    After a week hiatus due to the first ever Generative Media Conference hosted by FAL, Pierson (@piersonmarks) and Bilal (@deepwhitman) are back! For episode 19 of Creative Flux, we discuss Adobe's rising role in the AI creator space, the future of short form AI creators, teleoperated robots, Google, and more.Chapters[00:00] OpenAI's IPO and Market Dynamics[05:53] Generative Media Conference Insights[12:00] The Rise of AI in Media Production[18:00] The Creator Economy and Future Trends[23:26] The Age of Creators and New Opportunities[26:12] The Challenge of Articulating Ideas[30:52] Structured Prompts and Image Generation[33:05] The Impact of AI on Employment and Economy[37:41] Encouraging Entrepreneurship in a Changing Landscape[41:13] The Role of Robotics and Automation in Daily Life

  26. 19

    The AI Creative Process & Workflows (+ Veo 3.1 launch)

    In Episode 18 of Creative Flux, Bilal (@deepwhitman) and Pierson (@piersonmarks) sit down to discuss Veo 3.1 and how to best leverage tools, workflows, and models to create amazing artwork & videos.

  27. 18

    SF Tech Week, Sora 2, AI Market Dynamics

    In episode 17 of Creative Flux, Bilal (@deepwhitman) and Pierson (@piersonmarks) discuss SF Tech Week, including an event with ElevenLabs,  OpenAI Dev Day, and other developments in generative AI, particularly focusing on Sora 2, autonomous vehicles, and Figma. 

  28. 17

    Sora 2 and OpenAI's Crazy Media Ambitions

    Let's talk video. OpenAI released Sora 2 and an iPhone-only ~AI Slop Feed~, oops I mean.. AI Video Feed. Anyways, it's pretty cool. And Pierson (@piersonmarks) and Bilal (@deepwhitman) dig into OpenAI's video ambitions, how it plays into achieving AGI, and some technical talk about Sora 2 and it's capabilities. Come join the generative media crew for episode 16!

  29. 16

    Agentic Payment Systems with x402

    Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) discuss the opportunity of the HTTP 402 Payment Required response code and it's applications in both agentic, chat, and web usecases. Bilal showcases a basic implementation via Nicklejokes, a site where a user can pay a nickle for a joke, with all payment handled via 402 and crypto wallets. We also talk about AI music and the release of Suno v5. Show NotesSuno V5: https://suno.com/x402 Coinbase: https://www.coinbase.com/developer-platform/products/x402Protecting MCP Servers with x402 Demo: https://vercel.com/blog/introducing-x402-mcp-open-protocol-payments-for-mcp-tools Chapters[0:00] Introduction to Creative Flux[1:24] Suno V5 and the future of AI-generated music[9:20] Coinbase's revival of the x402 payment protocol[15:00] 402 HTTP status code[18:46] x402 Demo (Nickle Joke) [27:32] The future of agent-to-agent transactions with crypto payments

  30. 15

    Exploring Fei-Fei Li's World Labs, Image Upscaling, and ChatGPT's Most Common Use Cases

    Join Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) for Episode 14 of Creative Flux, covering Dr. Fei-Fei Li's World Labs model release going viral, breaking down the distribution of ChatGPT use-cases, discussing novel image upscaling techniques, and a new reasoning paradigm in video models. Show Notes: OpenAI ChatGPT Use Cases (Figure 9): https://cdn.openai.com/pdf/a253471f-8260-40c6-a2cc-aa93fe9f142e/economic-research-chatgpt-usage-paper.pdfWorld Labs Demo: https://x.com/theworldlabs/status/1967986124963692715Ray 3 Reasoning Video Model: https://x.com/kimmonismus/status/1968704713731235972Recraft Vectorize: https://replicate.com/recraft-ai Lucy Edit: https://fal.ai/models/decart/lucy-edit/dev 

  31. 14

    Seedream 4.0, Nano-Banana Infinimap, & the Rise of AI NFTs

    Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) explore ByteDance's new image model, Seedream 4.0, and programmatic image generation with Satori, touching on AI, creativity, and video editing. Following a tangent, they touch on LLMs, universal translators, and alien communication, and then finish with a discussion about the Nano Banana Infinimap and the potential for the rise of AI art NFTs. 00:00 Programmatic Image Editing (Satori) vs Diffusion-based Approaches12:50 The Future of Communication and AI18:41 Reflections on Humanity and The Evolution of Intelligence24:11 The Dark Forest Theory27:07 NanoBanana Infinimap32:42 The Future of NFTs and Digital OwnershipShow Noteshttps://github.com/seezatnap/nano-banana-infinimap

  32. 13

    Apple's Innovators Dilemma & New World Models

    In Episode 12 of Creative Flux, Pierson and Bilal dive into recent acquisitions and Apple's dilemma in the AI space, Hunyuan's Voyager World Model, and decentralized gaming. Links: Voyager World Model: https://x.com/TencentHunyuan/status/1962741518797836708OpenPipe: https://openpipe.ai/?refresh=1757095557405

  33. 12

    Google's Nano Banana & AI SVGs

    Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) dig into Google's newest image model, formerly known as "Nano Banana", the underrated technique of using LLMs to generate SVGs, AI industry gossip, and actionable creative workflow tools.00:00 Advancements in Generative AI and Image Editing04:02 Achieving Consistency in AI-Generated Media08:39 Exploring New Tools17:30 The Future of AI in Media and Communication27:07 AI Industry Gossip34:00 Satori MCPLinks: Nano Banana (Google Gemini Flash 2.5 Image Preview): https://blog.google/intl/en-mena/product-updates/explore-get-answers/nano-banana-image-editing-in-gemini-just-got-a-major-upgrade/Flipbook Generator: https://www.hackyexperiments.com/micro/flip-bookSatori MCP Server Repo: https://github.com/Jellypod-Inc/satori-mcp-server

  34. 11

    An In-Depth Look into AI Image Editing

    In this episode of Creative Flux, hosts Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) talk all things image editing, including a new stealth model, "nano-banana", character consistency, fine-tuning models, and creative workflows. 00:00 Recapping Last Week with Genie 308:43 Unveiling Nano Banana18:12 The Evolution of Image Editing Tools22:18 Why Fine-Tune Image Models32:05 Combining Image and Video Generation for Storytelling38:25 The Importance of Iteration Speed & Creative InnovationLinks: Nano Banana: https://blog.google/intl/en-mena/product-updates/explore-get-answers/nano-banana-image-editing-in-gemini-just-got-a-major-upgrade/Genie 3: https://deepmind.google/discover/blog/genie-3-a-new-frontier-for-world-models/

  35. 10

    Claude Code as a General Agent, Native Ads, & AI Filmmaking

    In this episode of Creative Flux, hosts Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) discuss using Claude Code as a general purpose agent for tasks like blog writing & SEO, a week using GPT-5, and the future of visual storytelling, ads, and filmmaking. Hollywood is changing and AI is giving an opportunity to do interesting things like dynamic native ad placement, object localization, dubbing, and more. 00:00 Introduction02:57 Claude Code as a General Agent08:34 GPT-5: Expectations vs. Reality21:13 Engineering Trade-offs in Product Development25:51 The Future of Video Generation 36:21 The Impact of AI on Film Production, Localization and Hyper-Personalization

  36. 9

    GPT-5, ElevenLabs Music & Google Genie 3

    Today on Creative Flux, Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman), discuss a crazy week in AI. With the release of GPT-5, ElevenLabs' new Text-to-Music model, and Google's Genie 3 world-model, we haven't seen a week as packed as this in a while. Chapters:00:00 Unveiling GPT-510:55 The Rise and Evolution of AI-Generated Music28:33 Introducing Genie 3 - a HQ Real-time World Model32:30 Revolutionizing Video Games and Simulated EnvironmentsShow Notes: - ElevenLabs Music: https://elevenlabs.io/music- GPT-5: https://platform.openai.com/docs/models/gpt-5- Google Genie 3: https://deepmind.google/discover/blog/genie-3-a-new-frontier-for-world-models/

  37. 8

    Indie filmmaking, US AI Action Plan, and an AI co-host

    In episode 7 of Creative Flux, host Pierson Marks and his AI co-host, Spruce, discuss the implications of the U.S. AI action plan, daily workflows using AI tools, and the potential of Runway ML's new model for indie filmmaking. The conversation emphasizes the importance of embracing AI technology to enhance creativity and streamline processes in content creation.Chapters00:00 Introduction to Creative Flux and AI Co-hosts04:47 The U.S. AI Action Plan and Its Implications08:41 Daily AI Workflows and Tools for Creatives17:26 Runway ML's Alf Model and the Future of Indie Filmmaking

  38. 7

    Windsurf Drama, AI Companionship, Cool Video Tools

    In Episode 6 of Creative Flux, hosts Pierson Marks and Bilal Tahir discuss the recent drama surrounding Windsurf and the implications of M&A policy in the ecosystem, the potential of generative media tools like Flora, RunwayML, and how teams are integrating AI in video production and creative fields.  Chapters00:00 Windsurf Drama and Acquisition Saga05:21 Philosophical Questions in Startups & VC19:02 AI Companionship and Social Networks27:49 Generative Media and Its Implications32:57 Exploring New Tools and TechnologiesLinksFlora: https://www.florafauna.ai/Windsurf: https://windsurf.com/Runway ML - Act Two: https://runwayml.com/research/introducing-act-one

  39. 6

    Grok 4, Multimodal Learning, & Is Coding Extinct?

    In this episode, Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) discuss their personal journeys in programming and the value of learning coding today, recent developments in AI such as Grok 4 and multimodal learning, and touch on video and image workflows.Chapters00:00 The Value of Learning Programming Today12:00 Exploring GitHub and Project Discovery14:47 Recent Developments in AI and Grok 421:06 Language and Its Influence on AI Models23:51 Creative Applications of AI in Media30:03 The Role of Automation in Daily TasksLinks: Cartesia: https://x.com/cartesia_ai/status/1943705750381207880Grok 4: https://x.com/xai/status/1943158495588815072

  40. 5

    AI Music & Movies, Overemployment, and Automation Workflows

    In this episode of Creative Flux, Bilal Tahir (@deepwhitman) and Pierson Marks (@piersonmarks) chat about the implications of AI on creativity, hiring practices, and productivity. They'll touch on the wild story of Soham Parekh, the viral AI band Velvet Sundown, using Raycast and MCP extensions to accelerate blogging, and more.  Chapters00:00 Introduction to Creative Flux Podcast06:03 The Over-Employment Phenomenon11:58 Interviewing in the Age of AI18:06 Reflections on Hustle Culture and Employment25:48 The Rise of AI-Generated Music and AI Celebrities30:35 The Future of Storytelling in Entertainment39:41 Exploring Raycast, AI tools, and Business OpportunitiesLinks:Velvet Sundown: https://open.spotify.com/artist/2GRtyAXWUiisGYub5SGMrb?si=EaO00fDcQ_y7s40O_Sp-wQ Ray2 Video to Video Model (on FAL): https://fal.ai/models/fal-ai/luma-dream-machine/ray-2/modifyOriginal Post exposing Soham Parekh (@suhail) https://x.com/Suhail/status/1940287384131969067Raycast: https://www.raycast.com/

  41. 4

    Anthropic's Legal Victory, Interactive Media, Generative Workflows

    In this episode of Creative Flux, Bilal Tahir (@deepwhitman) and Pierson Marks (@piersonmarks) discuss the recent Anthropic copyright ruling on fair use, a step-by-step workflow to generate high-quality AI videos in <30 minutes, and how collaborative and interactivity are huge opportunities in the creative space. Chapters00:00 Introduction 09:35 Legal Implications of AI Training23:40 Bilal's Viking Video Creation Process29:38 Exploring Video Generation Tools and Techniques34:47 The Future of Video Models and Artistic Expression39:46 Imagining a New Era of Interactive Content49:32 Innovative Ideas for Collaborative Content CreationLinks:Kontext edit image (broccoli haircut): https://fal.ai/models/fal-ai/image-editing/broccoli-haircutKontext text to image: https://fal.ai/models/fal-ai/flux-pro/kontext/text-to-imageHailuo-02 standard image to video: https://fal.ai/models/fal-ai/minimax/hailuo-02/standard/image-to-videobudget options for videoSeedance lite: https://fal.ai/models/fal-ai/bytedance/seedance/v1/lite/text-to-videoLTX distilled: https://fal.ai/models/fal-ai/ltx-video-13b-distilled/image-to-videoViking Video: https://x.com/deepwhitman/status/1938450937292767247

  42. 3

    ElevenLabs V3 and the Future of Text to Speech

    In Episode 2 of Creative Flux, Pierson Marks (@piersonmarks) and Bilal Tahir (@deepwhitman) explore recent text-to-speech advancements from model providers like ElevenLabs V3 and OpenAI. They highlight where generative voice is today, where it is going, and how the industry is adopting this new technology. Chapters00:00 Introduction to Creative Flux03:05 Exploring Text-to-Speech Technology06:08 The Evolution of Speech Recognition08:45 Advancements in Text-to-Speech Models11:52 Comparing Text-to-Speech Providers15:09 Voice Agents vs. Creative Applications17:52 The Future of Conversational AI23:33 Exploring API Access and New Features24:38 Innovations in Audio Inpainting28:06 Gemini TTS and AI Studio Overview30:53 The Role of AI Studio for Developers34:22 The Future of Local Models and Open Source39:25 The Need for Abstraction Layers in TTS44:02 The Rapid Evolution of Media GenerationLinks: - Elevenlabs: https://elevenlabs.io/app/speech-synthesis/text-to-speech- Elevenlabs v3 intro: https://elevenlabs.io/docs/models#eleven-v3-alpha- Openai Fm: https://www.openai.fm/- Playai inpaint & dialog: https://fal.ai/models/fal-ai/playai/inpaint/diffusion- Kokoro: https://fal.ai/models/fal-ai/kokoro/american-english- Google ai studio: https://aistudio.google.com/prompts/new_chat- Google AI studio for TTS: https://aistudio.google.com/generate-speech- Minimax hailuo -02: https://fal.ai/models/fal-ai/minimax/hailuo-02/standard/image-to-video- Seedance lite: https://fal.ai/models/fal-ai/bytedance/seedance/v1/lite/text-to-video

  43. 2

    Google's VEO3, NBA Finals Ad made with AI, Creating Video Consistency

    In this inaugural episode of the Creative Flux Podcast, hosts Pierson Marks and Bilal Tahir dive into the transformative impact of AI on media creation, focusing on Google's VEO3 and the evolving landscape of text-to-video models, the NBA Finals Ad made entirely with AI, and practical tips and cost-effective strategies for producing high-quality content.Both Pierson and Bilal are technologists and active builders who bring a unique, hands-on perspective on how to navigate the fast-changing world of generative media.Links: Veo3: https://fal.ai/models/fal-ai/veo3/playgroundImagen 4: https://fal.ai/models/fal-ai/imagen4/preview/ultra/playgroundLTX: https://fal.ai/models/fal-ai/ltx-video-13b-distilled/image-to-videoLyra2: https://fal.ai/models/fal-ai/lyria2Google Flow: https://labs.google/flow/aboutKalshi Ad: https://x.com/Kalshi/status/1932891608388681791

Type above to search every episode's transcript for a word or phrase. Matches are scoped to this podcast.

Searching…

No matches for "" in this podcast's transcripts.

Showing of matches

No topics indexed yet for this podcast.

Loading reviews...

ABOUT THIS SHOW

Each week, AI engineers Pierson Marks (@piersonmarks) & Bilal Tahir (@deepwhitman) bring you practical insights, creative workflows, and the latest breakthroughs in generative media.We cover everything that's happening in AI-powered audio, video, and image creation, sharing hands-on tips and industry news straight from the front lines. Topics include new generative models, creative best practices, open-source tools, real-world use cases, and the evolving landscape of AI-driven content creation.

HOSTED BY

Jellypod

URL copied to clipboard!