PODCAST · business
AI News Today | Julian Goldie Podcast
by Julian Goldie
Latest Podcast
-
402
Agent OS + Obsidian + Free APIs + Agent Teams!
Agent OS Updates + Community Q&A: Hermes, GLM 5.2, Memory, SEO Pipeline, and Model PicksThis episode answers recent community questions about the Agent Operating System (Agent OS), which centralizes and orchestrates multiple AI agents in one place. It covers recent updates including a Hermes lead generation tool, Mixture of Agents testing, an auto-updating memory system using Obsidian, a new GLM Code section to use GLM 5.2 with agent harnesses like Claude Code, NotebookLM short video generation and research import, an expanded SEO content pipeline with OpenSEO, and plans to add Fable 5 as a default CLI when restored. The host advises staying focused with a “Focus Protocol,” recommends Agent OS for managing multiple tools, shares guidance on Docker and GitHub/data concerns, compares models (preferring Opus 4.8, GLM 5.2 over Sonnet 5), suggests SEO stacks for local businesses, and highlights community wins, customization examples, and how to join AI Profit Bomb for training, support, and the full Agent OS.00:00 Agent OS Updates01:51 Focus Protocol03:21 Why Use Agent OS04:49 Docker and GitHub06:47 Model News and Picks07:52 SEO Side Gig Stack08:58 Cheaper Models Setup09:32 Community Wins Workflows11:45 Free Models OwlAlpha12:33 Themes and Customization13:41 Best Memory System14:48 Kanban Orchestration15:44 Ollama vs Hermes16:27 More Memory Advice17:22 Custom Desks Example18:30 Join the Community
-
401
China’s NEW Meituan LongCat 2.0 Tested!
LongCat 2.0 (Open Source) Tested: Benchmarks, Games, and GLM 5.2 ComparisonThe episode covers the official release of LongCat 2.0, an open-source Chinese agentic model revealed as the model behind the AoAlpha free API, with features like Sparse Attention, Zero Compute Experts, and MIPD. The host reviews benchmark claims (including Terminal Bench 2.1 and SWE-Bench Pro comparisons versus GPT-5.5 and Opus 4.8) and shares hands-on tests building game demos such as Dragon Realm, a Skyrim-style open world, and VoxelCraft, noting mixed results and frequent bugs. Access issues are mentioned, including difficulty using the API without a Chinese setup, so the model is tested via the website chat. A key point is that LongCat was trained on China’s Meituan chips without NVIDIA. Overall, GLM 5.2 is judged stronger in side-by-side game benchmarks, and the host promotes the AI Profit Boardroom and Agent OS setup.00:00 LongCat 2.0 Launch00:36 Benchmarks and API Hurdles01:38 Game Demos Dragon Realm02:23 Goldy Bench Verdict02:43 Trained Without NVIDIA03:32 How to Use It03:51 Eval Results vs GPT04:17 GLM 5.2 Showdown06:13 Final Take and Recommendation06:35 Agent OS and Boardroom Plug07:37 Wrap Up
-
400
New NotebookLM Video Update is INSANE!
NotebookLM Just Added 60-Second Vertical AI Video Overviews (Coming Free Soon)The script covers NotebookLM’s new feature for generating 60-second vertical short video overviews, now rolling out to Google AI Ultra and Pro subscribers and expected to reach free users soon. The creator demonstrates examples and explains that each video is generated from a specific NotebookLM notebook’s research, producing AI images, voiceover, and editing in a hands-off workflow, especially when connected via MCP to an agent operating system. They compare the short-video outputs with longer NotebookLM videos (more slideshow-like) and with alternatives like Open Montage (more cinematic) and a separate Video Agent (preferred for educational videos). Despite video quality being below human-made content, they highlight NotebookLM’s strength as a research-and-learning tool and its one-click outputs (audio, videos, slide decks, mind maps, infographics, flashcards, quizzes, tables, reports). The episode ends by promoting the AI Profit Boardroom for setup guides, trainings, and coaching.00:00 NotebookLM Shorts Update00:46 What The Shorts Look Like01:24 Inside Agent OS Integration02:42 Quality Check And Tradeoffs03:00 OpenMontage Comparison04:24 Video Agent Alternative04:54 NotebookLM One Click Content Suite05:56 Shorts Vs Long Form Videos06:45 Learning And Speed Benefits08:09 Which Tool To Choose08:49 Join AI Profit Boardroom09:27 Community Training And Wrap Up
-
399
Claude Sonnet 5 VS GLM 5.2: Who Wins?
Claude Sonnet 5 vs GLM 5.2: One-Shot Coding Showdown (Games, Benchmarks, Cost & Agent OS)The video compares Claude Sonnet 5 and GLM 5.2 side by side using one-shot builds (dungeon crawler, raycaster maze, multiple games, a website UI, and a Web OS), noting Sonnet 5 sometimes looks smoother but often feels basic or lacks gameplay, while GLM 5.2 is frequently more interesting and polished, though it can be buggy in some tests where Sonnet 5 wins. It also reviews benchmarks (CursorBench and GaudiBench/Goldy Bench), stating Sonnet 5 scores higher than GLM 5.2 on CursorBench but Opus 4.8 outperforms Sonnet 5, and “Fable 5” leads overall and is expected to return within 24 hours. The creator highlights pricing differences (Sonnet 5 far more expensive than GLM 5.2), GLM 5.2’s open-source and OAuth/agentic compatibility, demonstrates plugging GLM into Claude Code via Agent OS, and promotes the Agent OS and AI Profit Boarding community with tutorials, coaching calls, and support.00:00 One Shot Showdown00:18 Dungeon Crawler Test01:07 Raycaster Maze Faceoff01:52 CursorBench Rankings02:28 Opus vs Sonnet vs Fable03:05 Pricing and Agent OS Setup04:05 GaudiBench and Context Window04:53 More Builds Mixed Results07:11 Games and Visual Quality07:59 Web UI and Web OS09:08 Final Verdict and Leaderboards10:19 Dont Chase Models10:34 Agent OS Offer and Wrap Up
-
398
Fable 5 is BACK!
Fable 5 Is Coming Back Tomorrow: Export Controls Lifted, Global Access Returns (Plus New Safeguards)Anthropic announced that Claude Fable 5 will be redeployed globally starting July 1 after U.S. export controls imposed June 12 shut down access to Fable 5 and Mythos 5 for everyone due to immediate compliance needs. The controls were lifted June 30, with Fable 5 returning across Claude Platform, Claude AI, Claude Code, and Claude Cowork for Pro, Max, Team, and select Enterprise plans, included for up to 50% of weekly usage limits before shifting to paid usage credits after July 7. Mythos 5 access is being restored only to certain U.S. organizations via Project Glasswing following June 26 approval. The shutdown followed a report that Amazon researchers found a method to bypass Fable 5 safeguards; Anthropic says the bypass is now blocked in over 99% of cases and that U.S. Commerce testing agreed the new safeguards are extraordinarily strong, alongside plans for deeper government collaboration and pre-release evaluations.00:00 Fable 5 Returns00:45 Global Rollout Details01:13 Usage Limits and Credits01:54 Mythos 5 Partial Restore02:32 Why It Was Shut Down03:46 Timeline and Big Picture06:19 Amazon Bypass Explained08:06 New Safeguards Breakdown09:11 Industry and Government Frameworks10:05 Wrap Up and Community Plug
-
397
Hermes Mixture of Agents is ABSURD!
Hermes Mixture of Agents (MOA): Combine Claude + GPT with an Aggregator to Beat Frontier ModelsThe script explains Hermes’ Mixture of Agents system, which lets you combine multiple models (e.g., Claude Opus 4.8 and GPT-5.5) into a panel and choose an “aggregator” model to fuse their outputs, treating the mix as one virtual model. The presenter demos results like generating a Windows-style OS and building games, showing side-by-side comparisons where the mixture outperforms a single model, and notes they tested 42 builds viewable on Goldie Bench. It describes how MOA 2.0 works under the hood (private analysis by models, then an aggregator writes the final answer and runs tools), switching mixes with /MOA, and claims of benchmark gains (8% over Opus 4.8, 11% over GPT-5.5). Downsides include slower runtime, API reliance, and technical setup, which they simplify via their Agent OS dashboard, also featuring Fusion and Sakana-style panels, automations, memory, and access via their paid community with tutorials and coaching.00:00 Mixture of Agents Intro00:33 How the Panel Works01:27 Bench Tests and Demos02:13 Side by Side Game Results03:23 Tradeoffs and Limitations04:02 Making MOA Easy to Use04:45 MOA 2.0 Explained05:28 Fusion and the Bigger Trend06:34 Stop Chasing Models07:10 Three Systems and Why They Win07:37 How to Use It Today08:19 Agent OS and Community Pitch09:50 Wrap Up and Goodbye
-
396
Claude Sonnet 5 is HERE!
Claude Sonnet 5 Review: More Expensive, Worse Than Opus 4.8? (Benchmarks & Agent Tests)The video reviews Anthropic’s newly released Claude Sonnet 5, described as more agentic and capable of planning and tool use, but argues it underperforms Opus 4.8 on benchmarks (including agentic coding) while costing more. The creator shares Goldy Bench examples Sonnet 5 generated (a ray caster maze, a broken galaxy orbit test, a synthwave background, and a crypt game), noting some outputs look good but others fail. Side-by-side comparisons show mixed results versus GLM 5.2, with GLM succeeding on tasks Sonnet 5 fails, and tweets highlight negative reception focused on poor token efficiency and pricing. The recommendation is to keep using Opus 4.8, expect Fable 5 soon, and focus on building flexible agent systems that can swap models in and out.00:00 Sonnet 5 Launch00:30 Benchmarks vs Opus01:39 Goldy Bench Demos02:53 GLM 5.2 Comparisons04:00 Backlash and Pricing05:57 Fugu Ultra Showdown07:20 Why Release This08:00 Focus on Systems09:11 Agent OS Pitch09:48 Final Verdict
-
395
Fable 5 is back in 24 hours!
Export Controls Lifted: Fable 5 & Mythos 5 Returning Tomorrow (Official Anthropic Update)The episode reports that U.S. export controls on Anthropic’s Fable 5 and Mythos 5 have been lifted, citing a tweet and Anthropic’s official June 30, 2026 announcement that access will begin being restored tomorrow (likely within 24–48 hours). It notes Commerce Secretary Howard Lutnick’s statement about working with Anthropic over two weeks to analyze and approve Fable 5, and mentions a related comment from White House Chief of Staff Susie Wiles. The host recaps the timeline: access was cut on June 16 due to flagged security concerns, reports suggested a return by late June, and the White House/Commerce/Anthropic announcements arrived rapidly on July 1. They also comment that there’s no news on GPT 5.6, argue Fable 5 is a major step up from Sonnet 5, and emphasize building resilient “agent operating systems” that can swap models and avoid workflow disruption, promoting the AI Profit Boardroom and its Agent OS, training, and coaching.00:00 Export Controls Lifted01:12 Global Access Returns01:29 Sonnet vs Fable01:55 Why This Matters02:30 Official Statements03:41 How It Unfolded04:12 Previous Shutdown Timeline04:54 GPT Updates Check05:28 Focus on Systems06:39 Diversify Your Workflow07:00 Agent OS Pitch07:37 Join the Community
-
394
-
393
-
392
-
391
Hermes Agent Kanban Swarms are INSANE!
Huge Hermes Agent Update: Build Scalable Multi-Agent TeamsDiscover the massive new update to Hermes agent that fixes database concurrency and unlocks scalable multi-agent Kanban boards. Learn how to build a Content Factory that runs multiple agents in parallel without freezing to automate complex SEO and content workflows. This update effectively removes the cap on multi-agent collaboration for faster results.00:00 - 00:00 - Intro: Hermes Kanban Update00:40 - How to Update Hermes01:03 - Scaling Multi-Agent Parallelism02:03 - Fixing Database Concurrency03:26 - Building a Content Factory05:07 - End-to-End SEO Automation06:24 - Local Models and Token Costs07:11 - Accessing the Agent OS System
-
390
-
389
NEW Grok 4.5 DESTROYS Opus?
Grok 4.5 (V9) Announced: Private Beta, Opus Comparisons, and Why Systems Beat Model HypeGrok 4.5 has been announced as a new update based on a 1.5 trillion V9 foundation model, with Cursor data included for supplemental training, and is currently in private beta at SpaceX and Tesla. Early evaluations suggest performance close to or possibly exceeding Opus 4.8, and Elon Musk frames V9 as a solid workhorse in the same league as Opus. The script discusses timelines and release expectations (including a 42% July estimate and past teaser-to-launch patterns that often land within two weeks), while warning against hype and “Opus killer” claims. It highlights practical advantages of using Grok via OAuth with an existing Twitter subscription versus expensive API usage, showcases builds made with Grok Build, and argues that users should focus on owning robust agent systems (Agent OS, Hermes workflows, benchmarking, memory, and automation) rather than chasing gated or inaccessible frontier models.00:00 Grok 4.5 Announced01:07 Why It Matters01:39 What We Built02:03 Beta Status and Hype03:09 Release Timeline Clues04:21 Models Are Getting Gated05:03 Focus on Systems07:32 Local Benchmarks and Models08:00 Two Week Release Pattern08:47 Agent OS Demo09:37 Join the Community11:24 Final Takeaways
-
388
NEW Claude Agentic OS is INSANE!
I Built a Claude Agent Operating System (Mission Control for AI Agents & Workflows)The video showcases a custom “Claude agent operating system” built as a mission control dashboard that unifies AI agents, CLIs, and one-click workflows for tasks like pulling trending news (Hermes Oracle), generating SEO and social content, images/thumbnails, AI avatar videos, and even music, with daily auto-organization and a personal “memory galaxy” that keeps context updated across workflows. The creator argues this is faster, cheaper (via free/cheap APIs, local models), and more customizable than using Claude directly, and highlights rapid iteration by integrating new models quickly and auto-documenting tests in a website with model comparisons. The system also supports orchestrating multiple AI agents (e.g., executive/CTO roles) via Paperclip, and access is offered through the AI Profit Boardroom community with tutorials, updates, coaching calls, and member success stories.00:00 Agent OS Overview00:52 One Click Workflows01:36 Why Not Use Claude03:12 Build Fast Document Everything04:17 Orchestrating AI Teams06:04 Personalization Memory Galaxy06:36 Community Proof Results08:29 Daily Building Mindset09:53 Join The Profit Boardroom10:36 Tutorials Coaching Wrap Up11:41 Final Goodbye
-
387
Hermes Agent 2.0 is INSANE!
Hermes Mixture of Agents 2.0: Beat Frontier Models with a Panel of AI Agents (Agent OS Demo + Benchmarks)Julian demonstrates a new Hermes Mixture of Agents 2.0 setup that stacks multiple models as a panel of agents inside Hermes Agent, aggregating one prompt across several models into a stronger final output. He says he tested 42 builds with the same prompts and shows a leaderboard where systems like Fusion and Hermes Mixture of Agents rank above single frontier models, with side-by-side comparisons on Goldie Bench versus Claude Opus 4.8. He explains how the Agent OS simplifies running Mixture of Agents without terminal commands, saves outputs in a workspace, and includes tools like chat, talk mode, voice-controlled Hermes Jarvis, and Hermes Oracle. The key message is “don’t chase the model, build the system,” noting you can mix cheaper, free, or local models to outperform top single models, and he points viewers to the AI Profit Boardroom for access, tutorials, and coaching.00:00 Why This System Wins01:03 How Mixture Works02:16 Leaderboard Benchmarks02:50 Opus Comparison Demo04:01 Setup and Agent OS04:39 Built In Agent Tools05:23 System Over Models06:50 More Builds and Costs07:58 Get Agent OS Access08:30 Community Proof and Wrap
-
386
NEW Chinese AI DESTROYS Mythos?
China’s Open GLM 5.2 Matches Anthropic Mythos in Cybersecurity—Why Systems Matter More Than ModelsThe script discusses a Wall Street Journal–reported claim that China’s ZAI model GLM 5.2 matches or beats Anthropic’s highly restricted Claude Mythos on cybersecurity benchmarks like discovering and exploiting software vulnerabilities, despite Mythos being gated to a small vetted group and Fable 5 being taken down and limited by the US government. It highlights that GLM 5.2 is open-weight, free to download, usable locally, and reportedly about a quarter of the cost per token via API, alongside another Chinese entrant, 360 Security, shipping a cyber AI stack. The speaker argues this weakens US labs’ “model moat,” predicts rapid model turnover, and advises focusing on building an agentic operating system with memory, agents, and workflows that can swap in any model. The episode ends by promoting the AI Profit Boardroom and its Agent OS training and community.00:00 China Matches Mythos01:31 Open Weights Shockwave02:44 What WSJ Reported03:49 Copying Backdoor Rumors04:15 China 360 Enters04:53 Moats And Pricing05:52 Build Systems Not Models06:47 Why Mythos Was Locked08:14 Gating Versus Shipping08:43 New Era Of Pullbacks09:37 Outcome First Workflow10:48 Agent OS In Action12:18 Never Feel Behind13:08 Get The Agent OS13:42 Community And Coaching14:26 Testimonials And Wrap Up
-
385
NEW MiniMax + Hermes App Builder Is INSANE!
One-Sentence App Builder: Idea → Plan → Approve → Shipped in Minutes (Hermes Agent + Agent OS)The video demonstrates an app builder system inside an Agent OS using Hermes Agent to turn a single sentence idea into a working single-page app with minimal clicks. The workflow is: enter an idea, agents classify it and draft a plan, you approve or deny, and a coder agent builds, tests, and fixes bugs before shipping the app to a gallery with live previews, starring, deleting, and rebuilding options. A group team chat can generate personalized app ideas by having multiple models (e.g., Claude, Hermes, Gemini) collaborate while reading an Obsidian “memory galaxy” vault that agents continually update and reference. The system also integrates Minimax for a balance of speed and intelligence and a flat-rate coding plan without token concerns.00:00 One Click App Builder00:56 Team Chat Idea Engine02:09 Approve Ship Gallery02:41 Minimax Speed Pricing03:48 Why Multi Model Wins04:29 Obsidian Memory Galaxy05:31 Rebuild Fix Iterate07:24 Step By Step Workflow07:56 Meditation App Demo08:30 Get Agent OS Access09:05 Boardroom Community Tour09:58 Final Call To Action
-
384
Hermes Agent OS is INSANE!
Hermes Agent OS: Mission Control Dashboard for Voice Agents, News, Studio, Outreach & MoE IntelligenceThe video walks through the Hermes Agent Operating System, a UI-based “mission control” dashboard that organizes Hermes Agent and its tools into multiple mini apps to make it more powerful than using Hermes in desktop or terminal form. It demonstrates voice and chat control (including browser actions), building apps with live previews, and running multiple tabs such as Hermes Oracle to pull trending AI news and auto-draft/publish blog and social content to WordPress. The system also includes a Studio for image/video/voice generation via Minimax or Groq, lead generation and outreach campaigns, a mixture-of-experts panel combining models like Claude Opus 4.8 and GPT 5.5, a workspace to save outputs by API, and a “memory galaxy” that syncs context into an Obsidian vault. It emphasizes owning resilient systems over relying on gated models.00:00 Hermes Agent OS Overview00:36 Voice Agent Interface01:43 Browser Control and App Builds02:22 Hermes Oracle News Engine03:19 Studio Media Generation03:51 Profiles and Live Talk04:20 Lead Gen and Outreach05:16 Mixture of Experts Panel06:17 Workspace and Memory Galaxy07:14 System First Philosophy07:55 Automation Boards and Workflows09:13 Model Integrations and OpenClaw10:06 Get the Full System10:57 FAQs and Customization11:46 Testimonials and Wrap Up
-
383
Qwable 5 27B: New FREE + Local + Open Source!
Qwable 27B Coder: Best Local Coding Model Yet? (Benchmarks, Demos & How to Run on Mac)The video demos Qwable 27B Coder, a new open-source local coding model on Hugging Face (updated end of June) built on a Qwen 3.6 27B base, showing projects made locally like a polished animated landing page and working games. It’s run on a Mac Studio (Apple M4 Max, 36GB) using Apple MLX (not available via Ollama) and is integrated into an agent operating system to generate live previews and save builds in a workspace. The presenter says Qwable tops their local leaderboards (Goldie Bench/Cody Bench comparisons) and outperforms recently tested local models like Gemma 4 12B Coder, Quifos 9B, and Onif 1.0 on the same tasks, though it runs noticeably slower than smaller models such as Quifos 9B.00:00 Meet Qwable 27B00:56 Demos Landing Pages01:23 Model Specs Setup02:04 Benchmarks Versus Locals02:40 Running On Mac03:03 Agent OS Live Previews04:24 Speed Tradeoffs05:00 Real Task Comparisons05:49 Local AI Is Accelerating06:05 Agent OS Features Tour06:40 Community Courses Pitch07:28 Wrap Up Thanks
-
382
Agent OS: Voice Agent + Lead Machine + Free AI
Agent OS Q&A: Using Codex, Claude Code, Local LLMs (Ollama), GLM 5.2, Updates, Memory & Team SetupJulian answers recent community questions about his Agent Operating System, a system for linking and automating AI agents with tools like voice control (Hermes Jarvis), Memory Galaxy, Kanban orchestration, and a local agent engine. He explains multiple ways to use a Codex subscription inside Agent OS (Hermes, image generation via Codex OAuth, the Codex tab, and a new Codex plugin in Claude Code), emphasizes focusing on repeatable systems over prompts/models, and shares thoughts on Qwen 3.6 vs Qwen Agent World. He covers connecting Codex/Claude Code to local models via configuration or Ollama, highlights GLM 5.2’s strong performance and open weights, and advises on simplifying Agent OS for teams with hidden UI sections and SOPs. He also suggests Agent OS video automation/Remotion for SaaS promo videos, recommends GLM 5.2 for Ollama-first setups, explains how to update Agent OS via the latest zip/update file, and describes using an Obsidian vault as shared memory for individuals and teams.00:00 Agent OS Overview00:48 Using Codex Inside02:24 Systems Over Prompts04:01 Qwen Agent World04:37 Local LLM Connections05:16 GLM 5.2 Benchmarks07:08 Sharing With Teams09:14 AI Video Creation10:37 Best Ollama Model11:48 Updating Agent OS12:27 Memory Setup Choices13:06 Team Shared Brain14:24 Wrap Up And Invite
-
381
Ornith-1.0 is INSANE (FREE + Local + Open Source)!
Ornif 1.0: Free Self-Learning Local Coding Model (Runs on Ollama + Works with Hermes Agents)The video tests Ornif 1.0, a new free self-learning local open coding model from Deep Reinforce, showing benchmark comparisons where it outperforms models like Gemma 4 31B and even compares well against larger flagship options. The model can be downloaded via Ollama with a simple command and is integrated into the creator’s agent operating system, including Hermes profiles for agentic workflows. Ornif’s key idea is “self-scaffolding”: it writes its own plans/instructions, generates multiple solution rollouts, gets graded, and improves both planning and coding through reinforcement learning. The creator highlights a gallery of apps, games, tools, and visual outputs made with Ornif, plus local and frontier leaderboards, and demonstrates local offline systems like a Hermes engine, agent Kanban orchestration, and local chat, emphasizing local AI benefits like being free, private, fast, and improving.00:00 Meet Ornif 1.000:34 Setup With Ollama00:45 What Ornif Is01:04 Real Build Examples01:51 Benchmark Reality Check02:11 Self Improving Framework03:02 Reinforcement Learning Loop03:52 Self Scaffolding Explained05:31 Local Leaderboards Demos06:21 Agent OS Integrations07:32 Model Comparisons Use Cases08:05 Join The Community09:05 Final Thanks
-
380
Qwythos 9B is INSANE (FREE + Local + Open Source)!
Qwythos 9B on Ollama: A Free, Private Claude-Style Local Model (1M Context?)The video tests Qwythos 9B, a Claude-style creative reasoning model built on a Qwen 3.5 9B base and available on Ollama, showing how to install and run it locally on a Mac for free with no cloud or token costs. The creator wires it into their Agent OS local engine and demonstrates it building several small apps (to-do list, digital clock, Snake game, landing page, calculator), noting it can produce surprisingly solid designs but can also be glitchy or messy. While it advertises a one million token context window, the practical window depends on available RAM and Ollama settings, and it may cut off or load smaller than expected. They compare it with other local models like Qwikboss and Ornif 9B (with Ornif 1.0 seemingly outperforming) and promote the AI Profit Boardroom and Agent OS for running local models and agents.00:00 Meet Qwythos 9B00:26 Install via Ollama00:43 Agent OS Integration01:39 Apps Built Locally02:29 Speed and Comparisons03:26 Why It Works04:20 Million Token Reality04:54 Model Sizes and Quantization05:36 Pros and Cons06:30 Offline Wrap Up07:09 Plug Into Any Agent07:40 AI Profit Boardroom Pitch08:07 Agent OS Feature Tour08:57 Community and Coaching09:24 Final Thanks
-
379
Agent OS Just Changed AI Forever…
Agent Operating System Q&A: Model Restrictions, Local AI, Hermes Jarvis, SEO & Video AutomationThe episode answers community questions about the Agent Operating System and how to get the most from it, highlighting Hermes Oracle for news and SEO automation, Hermes Jarvis as a voice-activated agent, and managing all agents and client instances from one interface using separate profiles. It argues that gated frontier models like GPT 5.6 previews and the removal of Fable 5 matter less than building robust systems that can swap models in and out, including using alternatives like Fusion or Sakana Fugu and open-weight local options such as GLM 5.2. The host discusses a high-volume posting strategy with strict quality control, rising demand for local model setups for privacy and cost, automating video production via a video agent and avatar workflow training, using Paperclip for fact-checking and multi-agent teams, Windows support, SEO article generation and deployment, UI improvements to the Kanban board, and examples of AI-assisted game creation, while promoting the AI Profit Boardroom for daily Q&A, tutorials, coaching calls, and the Agent OS zip file.00:00 Agent OS Overview01:13 Model Access Concerns03:16 Posting Strategy Growth04:41 Local Models for Clients05:36 Managing Client Agents07:10 Automating Video Creation07:57 Fact Checking Workflow08:36 Windows Install Support08:54 Hermes Jarvis Voice Agent10:09 Paperclip Social Team11:57 Systems Over Models12:37 Hermes Desktop vs OS13:28 Using GLM in Claude Code15:21 SEO Blog Automation16:15 AI Game Development Demos18:09 Kanban Board Upgrade19:58 Join the Boardroom21:02 Easy Setup Testimonials22:14 Final Wrap Up
-
378
Hermes Mixture of Agents DESTROYS Fable 5?
Hermes Agent Mixture of Agents (MoA) Presets: Build a Model Panel to Beat Gated Frontier ModelsHermes Agent has released Mixture of Agents (MoA) Presets, allowing multiple AI models to run in parallel and be aggregated into a single stronger answer, similar to systems like Fusion and Sakana Fugu. The script explains this as a workaround for gated or limited-preview frontier models (e.g., GPT 5.6 and Claude Fable 5) by improving performance through a “panel of experts” approach rather than relying on one model. It introduces Hermes Bench, noting claimed reference MoA scores 8% higher than Opus and 11% higher than GPT, and highlights top performance from an Opus 4.8 + GPT-5.5 aggregator. Viewers are shown how to update Hermes, select MoA presets via terminal and dashboard, configure via desktop app or config.yaml, and switch presets with provider commands.00:00 Hermes MoA Presets00:43 Why Models Are Gated02:09 Update and Enable MoA04:06 Commands and Config04:49 Top Hermes Bench Combos05:04 Panel Beats Genius05:47 Fusion and Alternatives06:35 Build Systems Not Models07:34 Agent OS Workflow08:42 Old Way vs New Way09:39 AI Profit Boardroom10:58 Wrap Up and Links
-
377
-
376
NEW GPT 5.6 is INSANE!
GPT-5.6 Sol, Terra & Luna: Limited Preview, Benchmarks, Access (and Why Systems Beat Models)The episode breaks down OpenAI’s GPT-5.6 release, explaining that it’s currently a limited preview available only to about 20 partner organizations via Codex and the API at the request of the US government, with general access hoped for in the coming weeks and possibly restricted by country. It introduces the three new models—Sol (frontier, with Sol Ultra and Sol), Terra (balanced, competitive with GPT-5.5 at roughly half the cost), and Luna (fast, affordable for high-volume work)—and highlights benchmark claims such as Sol Ultra’s 91.9 on Terminal Bench 2.1 while noting other tests where competitors still lead. The speaker argues that chasing new model launches is a mistake and recommends building a model-proof “agent operating system” with a swap layer, multiple models on tap, task routing, and owned memory so models can be swapped in and out with minimal effort, then promotes access to their Agent OS and community.00:00 GPT 5.6 Overview00:23 Preview Access Limits00:42 Meet Sol Terra Luna01:45 Release Timing Uncertainty03:12 Benchmarks Breakdown05:27 Stop Chasing Models07:54 Model Proof System08:16 Swap Layer Routing Memory10:43 Common Objections Answered12:36 Agent OS Offer13:22 Community Tour14:19 Final Call To Actionx
-
375
Codex + Claude Code + Agent OS Is INSANE!
Agent OS Q&A: Using Codex, Claude Code, Local LLMs, GLM 5.2, Team Setups, Updates & AI Video EditingJulian answers recent community questions about his Agent Operating System, a system for linking and orchestrating AI agents with features like voice control via Hermes Jarvis, a Memory Galaxy connected to an Obsidian vault, Kanban orchestration, and a local agent engine. He explains multiple ways to use a Codex subscription inside Agent OS (Codex tab, Hermes with GPT‑4.5 and image generation, OpenCloude integration, and a new Codex plugin inside Claude Code), and emphasizes building repeatable systems rather than relying on prompts or specific models. He discusses Qwen 3.6 vs Qwen Agent World, options for connecting Codex/Claude Code to local models via config changes or Ollama, and why GLM 5.2 performs strongly versus other models while being openly available. He also covers simplifying Agent OS for teams with hidden UI sections and SOPs, faster AI video assembly via Agent OS/Remotion workflows, how to update Agent OS via the latest zip and update file, and creating a shared company “brain” with a shared Obsidian vault.00:00 Agent OS Overview00:48 Using Codex Inside OS02:24 Systems Over Prompts04:00 Qwen Agent World Talk04:37 Local LLM Connections05:16 Why GLM 5.2 Wins07:08 Sharing OS With Team09:14 Fast SaaS Video Creation10:37 Best Ollama Model Setup11:48 Updating Agent OS Builds12:27 Memory Galaxy Workflow13:06 Team Profiles Shared Brain14:24 Wrap Up Join Boardroom
-
374
Mythos 5 + GPT 5.6 are HERE but…
GPT 5.6 (Sol/Terra/Luna) & Claude Mythos/Fable 5 Are Gated—Here’s How to Beat Frontier Models Without AccessThe script covers two announcements: OpenAI’s GPT 5.6 models Sol, Terra, and Luna are released only in a limited US government–requested preview to a small group of trusted partners, with no confirmed public release date, and Anthropic’s Claude Mythos 5 and Fable 5 are also gated to about 100 companies via a deal referenced from CNBC. The speaker argues that instead of waiting for inaccessible frontier models, users can reach similar or better results using multi-model systems: Hermes Agents’ Mixture of Agents, Sakana Fugu, and Fusion, which combine multiple models and use judging/fusion to improve outputs and benchmark performance (including results shown on Goldy Bench). The episode promotes the AI Profit Boardroom community for access to the Agent OS, tutorials, updates, and coaching.00:00 Two Big AI Updates00:11 GPT 5.6 Preview Release01:11 Sol Terra Luna Breakdown01:33 Why Models Are Gated02:10 Claude Mythos Fable News03:03 Best Workaround Systems03:12 Hermes Mixture of Agents05:34 Sakana Fugu Benchmarks06:28 Fusion Beats Single Models09:23 Cost Local Model Options10:13 Stop Chasing Models11:32 Recap and Access Limits12:08 AI Profit Boardroom Tour13:42 Final Thanks
-
373
GPT-5.6 is HERE!
GPT-5.6 May Be Government-Gated: What It Means (and Why Systems Matter More Than Models)The script discusses reports from Axios and The Information claiming the Trump administration asked OpenAI to stagger GPT-5.6’s release, limiting access to government-approved partners during a preview period due to security and cybersecurity concerns, with two federal offices vetting and approving customers one by one and only a hoped-for wider rollout later. It links this to the “Fable 5” situation where a powerful model was taken down after a few days, suggesting a new reality where frontier US models may be delayed or never broadly released. The speaker weighs pros (safety and vetting) and cons (US competitiveness as Chinese open-weight models like GLM rapidly improve and remain widely available), then advises building workflows as model-agnostic systems so models can be swapped without disrupting work, and promotes an agent operating system offered via AI Profit Ballroom.00:00 GPT 5.6 Restricted00:52 Fable 5 Fallout01:19 Government Approval Queue03:31 Who Gets Access04:17 Global Competition Risk06:18 Build Model Agnostic Systems07:39 Automation Examples09:09 Agent OS Pitch10:01 Local Private Options10:42 Wrap Up and Outlook
-
372
NEW Claude Agent OS is INSANE!
My AI Agent Operating System (Mission Control to Automate SEO, Content, Outreach & Builds)The speaker demos an “agent operating system” dashboard that centralizes and orchestrates multiple AI agents to automate building projects, generating and publishing content, and running workflows. The system includes a group chat for CLIs, an idea-to-implementation pipeline, and tools like Hermes Oracle to pull trending X news, draft posts, and publish SEO blog content directly to WordPress. Hermes Jarvis enables voice-controlled building with transcripts and wake-word mode, while an email outreach SaaS module finds, enriches, validates leads, drafts campaigns, and manages inbox/sent and sending limits. The OS also features model panels like Sakana Fugu and Fusion for multi-model “team” outputs, a workspace to store all builds, a “memory galaxy” knowledge graph that auto-logs activity, and automations for Google Search Console keyword research, avatar videos, and media generation, with an option to get the setup via the AI Profit Boardroom community.00:00 Agent OS Overview00:24 Mission Control Dashboard01:30 Hermes Oracle Trends02:45 Personalized SEO Publishing03:39 Hermes Jarvis Voice04:17 Email Outreach SaaS05:33 Fugu Boardroom Models06:48 Local Engine Fusion08:07 Build Process Benchmarks09:02 Memory Galaxy Graph10:42 Automating Daily Workflows13:34 Community Setup Offer14:54 Final Wrap Up
-
371
New Fable 5 LEAKS: Coming Back Soon?
Fable 5 Returning Soon? New Claude Code, Bedrock Catalog & AWS Doc Leaks ExplainedThe episode reviews recent leaks suggesting Anthropic’s Claude Fable 5 may return after being released on June 9 and pulled three days later, with new signals appearing June 24–25. It cites three “checkable trails” moving in the same timeframe: Claude Code v2.1.190 string changes (removing “purchased separately” and adding a weekly-usage message that implies subscription inclusion), Fable 5 reappearing as a live listing in the Amazon Bedrock model catalog, and AWS Bedrock docs/model cards showing the model lifecycle as active with Bedrock IDs for both “Anthropic Claude Fable 5” and “global” versions. The host notes nothing is confirmed, raises the question of US-only availability, and estimates a possible return by early July/within seven days. The video ends by emphasizing building flexible systems (Agent OS) that can swap models quickly and promoting the AI Profit Boardroom where updates, guides, coaching calls, and community support are provided.00:00 Fable 5 Return Rumors00:44 What We Know So Far01:24 Timeline of Events02:38 Claude Code String Clues03:21 Subscription Plan Implications03:56 Amazon Bedrock Listing05:13 AWS Docs Model Cards05:44 Regions and Release Window06:25 Build Systems Not Models07:23 Agent OS and Boardroom Pitch07:57 Wrap Up and Link
-
370
Hermes OS is INSANE! 🤯
Inside My Hermes Agent OS: Oracle, Jarvis Voice Control, Outreach SaaS, Memory Galaxy & One‑Click Content AutomationThe script demos a custom “Agent OS” built around Hermes Agent, showing how it segments multiple agents and workflows into one system. It highlights Hermes Oracle for pulling trending news and generating social posts or SEO-optimized WordPress articles in one click with scheduled daily refreshes; Hermes Jarvis, a voice-activated mode that can run real-time actions like opening websites and providing daily briefings; and a new Outreach Agent that finds leads, enriches/validates emails, manages campaigns, inbox/sent items, dashboards, and suggested outreach sequences using API keys and sending caps. It also covers goal mode with autonomous looping and QC, a connected Memory Galaxy for logged/interlinked knowledge, a Kanban board for multi-profile agent orchestration, NotebookLM asset syncing via MCP, video/SEO engines, Paperclip for team-based agent orgs, and an idea-to-implementation pipeline shared via the AI Profit Boardroom with tutorials, updates, community support, coaching calls, and testimonials.00:00 Agent OS Overview00:18 Hermes Oracle News02:02 Jarvis Voice Control03:06 Outreach Email Engine05:08 Goal Mode Autonomy05:41 Memory Galaxy Brain06:23 Kanban Team Workflow07:06 NotebookLM Asset Sync07:37 Video SEO Loop Tools08:35 MCP Workspace Studio08:59 Paperclip Agent Orgs09:22 Idea To Shipping09:43 Get The Setup10:23 Boardroom Tutorials Support11:17 Build Vs Customize11:41 Testimonials And Wrap
-
369
Hermes Agent: How to Automate Lead Generation!
Hermes Lead Machine: 1-Click Lead Gen + Enriched Outreach Pipeline (Find, Verify, Score, Write, Send)Julian demonstrates the experimental “Hermes Lead Machine,” a Hermes Agent workflow that generates and enriches leads in one click using the Hunter API. Users can either paste an existing list to enrich or describe the exact prospects they want in plain English (e.g., SEO agencies interested in link-building) and the system finds companies and people, captures domains and emails, enriches company details, verifies deliverability to reduce bounces, scores and filters leads (0–100) for fit, and segments them by status (new, enriched, valid, contacted, replied). It then writes personalized openers and outreach emails and can send through a dedicated inbox, all managed in a dashboard that feels like a SaaS tool. He contrasts this with manual, duct-taped outreach and explains setup via Agent OS/AppHub, API key, and email configuration, plus community support and coaching calls.00:00 Hermes Lead Machine Intro00:13 Lead Finder Dashboard Tour00:38 Hunter API Find and Enrich Demo01:30 Segmentation and Workflow Benefits02:35 Find Verify Write Pipeline03:05 Old Outreach vs New Automation04:26 Why Anyone Can Use It05:07 Six Parts Explained07:21 Setup and Common Questions08:12 Agent OS Offer and Community09:25 Wrap Up and Next Steps
-
368
New Hermes Agent Pet Update!
Hermes Agent Sidekick Pets Update: Setup, Commands, and 3,000+ Pixel CompanionsHermes Agent has introduced the Sidekick system, adding an animated pixel pet that reflects an agent’s status at a glance—idle, thinking, running tools, waiting, done, or failed—across the CLI, TUI, desktop app, and the Agent OS dashboard. The pet acts as a personality-driven status indicator to improve peripheral awareness versus spinners and log lines, helping users quickly notice completion or failures. Setup involves updating Hermes, browsing the Pet Dex gallery with `Hermes pets list`, installing a pet (e.g., `Hermes pets install Boba select`), adjusting size with `Hermes pet scale`, swapping anytime, or disabling instantly with `Hermes pets off`; users can choose from nearly 3,000 pets or submit their own, and the sprite cannot affect code or files. The episode also promotes the AI Profit Boardroom for the full Agent OS, additional agents (Oracle, Jarvis, outreach/lead gen), trainings, coaching calls, and support.00:00 Meet the Sidekick Pet00:45 How Pet Poses Work01:41 Why It Matters02:04 Where It Shows Up02:45 Update and Install03:29 Customize or Disable04:01 Gimmick or Useful05:12 Quick Recap05:43 Get the Full Agent OS
-
367
NEW Hermes Email Agent is INSANE!
Hermes Outreach Engine: Build an AI Email Agent for Lead Gen, Verification & Automated CampaignsThe script demonstrates an experimental “Hermes Outreach Engine” built inside a Hermes-based agent operating system to automate email outreach for link building and SEO. It shows an Outreach tab with dashboards for leads, validation, sendability, activity logs, lead generation (finding companies, enriching to real contacts, adding notes, hiding/skipping oversized companies), and campaign management (drafts, sent, bounces, audience selection, previews, draft-mode approvals, pausing/stopping campaigns, and reply tracking). The workflow is described as six parts: define targets, enrich, verify emails via Hunter, filter out large companies, send from a separate agent inbox, and track results. Setup uses Claude Desktop to build the tool and connects Hunter API, a separate inbox, Himalaya for inbox reading/replying, and Google Workspace API, with optional OAuth. The episode ends by promoting access to the agent OS and community training in the AI Profit Boardroom.00:00 Hermes Email Agent Intro00:24 Outreach Dashboard Tour01:19 Lead Generation Workflow02:06 Campaign Builder Overview02:44 APIs and Inbox Setup03:29 Six Step Outreach Engine05:23 Automation and Safety Controls06:38 How It Was Built07:40 Cold Email Objections08:42 Key Takeaways Recap09:17 Agent OS All In One Hub09:42 Join the AI Profit Boardroom10:06 Community Training and Coaching10:51 Final Wrap Up
-
366
Claude: NEW AI Operating System is INSANE!
I Built an AI Agent Operating System (Hermes Oracle + Jarvis + Sakana Fugu)The script showcases an AI “agent operating system” built using Claude Desktop, featuring Hermes Oracle (a web-scanning, breaking-news updater that ranks headlines, links to Twitter posts, and can generate content and automate social media using Groq OAuth) and Hermes Jarvis (a real-time, voice-controlled assistant with conversation history, daily briefings, and a wall mode). It also demos Sakana Fugu Ultra, described as highly capable and topping the creator’s benchmark site, Goldy Bench, including examples of generated interactive content. Additional tools include Google Search Console keyword/page optimization, a one-click video editor that researches topics and creates videos with B-roll and an AI avatar, Memory Galaxy for organizing new context, and a looping agent with a “judge” that iterates until quality targets are met. The creator promotes daily updates, tutorials, files, and community access via the AI Profit Boardroom.00:00 AI Agent OS Reveal00:13 Hermes Oracle News Scanner01:12 Hermes Jarvis Voice Assistant02:37 Sakana Fugu Builds Anything03:32 Benchmarks and Frontier Tech04:57 SEO and Video Automation Tools05:54 Memory Galaxy Knowledge Hub06:45 Infinite Loop Quality System07:42 Agents Replacing Human Work08:59 Get the System and Community10:15 Final Wrap Up
-
365
I Built A Real JARVIS With Claude
Hermes Agent OS Q&A: Local vs VPS, Obsidian Memory Vault, and Automating Content in One PromptThe speaker showcases an “agent operating system” built around Hermes, including a real-time voice-activated Jarvis mode, Hermes Oracle for web/news research and content drafting with source links, and a connected “memory galaxy” second brain powered by an Obsidian vault. They explain they run the system locally (not on a VPS) for security, while relying mostly on cloud models via APIs/CLIs/OAuth (e.g., ChatGPT Realtime API, Groq OAuth, Claude via CLI), noting local models typically require high-end RTX GPUs. They describe hosting files locally and in the cloud while keeping the database (Obsidian) stored locally and auto-updated by agents. The episode answers viewer questions, demonstrates automated SEO/article deployment and social content creation from trending news, emphasizes focusing on systems over specific models, and invites viewers to get the ready-made Agent OS inside the AI Profit Boardroom with tutorials, community help, and coaching calls.00:00 Agent OS Overview01:13 Local vs VPS Setup04:40 Automated Content Engine06:27 Skill Level and Mindset
-
364
Fugu vs Fusion: The Fable 5 Test
Sakana Fugu Ultra vs OpenRouter Fusion: Side-by-Side Tests (Beating Fable 5?)The video compares two new “panel” APIs—Sakana’s Fugu (Ultra and Mini) and OpenRouter Fusion—both claiming benchmark performance above Fable 5, by testing them side by side on visual/code generations like a landing page, raycaster maze, living spiral galaxy, inner solar system, and other simulations. In these examples, Fugu Ultra consistently produces the best-looking, smoothest outputs, Fusion is generally second, and Fugu Mini is faster but more variable and sometimes buggy. The script notes practical tradeoffs: Sakana/Fugu is cheaper with a flat plan but is heavily limited by daily generation/time caps and token limits, while Fusion is easier to use via API but costs per token. Both are one-prompt systems with long waits, and both are integrated into the creator’s Agent operating system and AI Profit Boardroom community.00:00 Sakana vs Fusion Intro00:32 How Panel APIs Work01:35 Landing Page Showdown02:17 Maze and Galaxy Tests03:42 Solar System Quality Gap04:10 Limits Pricing and Workflow06:09 Fusion Open World Example07:07 Fugu Mini Results08:00 Liquid Simulation Comparison08:55 Final Rankings and Tradeoffs09:35 Agent OS and Community Plug11:12 Wrap Up
-
363
New FREE Hermes Computer Use Agents!
Hermes Agent Computer Use Update: Background Desktop Control Now on Windows, Linux & macOSHermes Agent’s “computer use” feature has a new update adding Windows and Linux support and improving macOS, allowing Hermes Jarvis to control a computer by reading the screen and clicking/typing in the background without moving your real cursor or interrupting your work. The script demos opening Obsidian, websites, Google, and writing a note via real-time mode and a more powerful agent mode that can run tasks in the background. It explains the “Hermes Takeover Engine” workflow, emphasizes cross-OS support, and notes you can use different AI “brains” (including local/free options) rather than being locked to one model. Safety is highlighted through approval prompts before potentially destructive actions, plus a stop button. The episode contrasts this with slower, screen-taking tools like Claude Computer Use and promotes the AI Profit Boardroom/Agent OS for setup, training, and support.00:00 Update Overview00:16 Live Demo Basics00:37 Agent Mode Tasks01:18 Voice Interaction Demo01:43 Takeover Engine Explained02:57 Five Step Quiet Control03:43 Models Setup Commands05:45 Agent OS Pitch06:20 Safety Approvals06:52 Why Hermes Beats Others08:40 Key Takeaways09:07 Join And Wrap Up
-
362
Nobody Writes Prompts Anymore (Do THIS Instead)
Stop Prompting, Start Loop Engineering: Build Self-Iterating AI Agent SystemsThe script explains why “loop engineering” is replacing manual prompting: instead of iterating with an AI yourself, you define what “done” looks like, let an agent act, and use an independent judge model to verify results and repeat until the goal is met. It cites industry figures and examples (including Anthropic and NVIDIA’s GTC keynote) to argue that better loops, not better prompts, will power future self-iterating agent systems. The speaker describes a doer/judge loop structure and shows four implementations inside an “agent operating system”: a Fusion loop with configurable rounds and separate builder/judge models, an agent Kanban board with planner/builder/reviewer roles, a Fusion “boardroom” combining multiple model outputs via a judge, and a Sakana/Fugu council approach. It also describes an Obsidian-based memory system and promotes the AI Profit Boardroom community and resources.00:00 Why Loops Beat Prompts00:48 What Loop Engineering Means03:39 Prompting vs Looping04:44 Doer and Judge Framework05:37 Fusion Loop Walkthrough06:26 Agent Kanban Loops07:21 Fusion Boardroom Method08:31 Sakana Fugu Council09:05 Agent OS Bundle Overview10:01 Memory Feedback Loop11:26 Getting Started and Proof12:01 Join the Community12:35 Final Wrap Up
-
361
NEW Agent OS is INSANE! (Runs 3 Businesses FREE!)
Agent OS Q&A: Fixing Kanban Blockers, Orchestrating Agents, Paperclip Paths, and Model SetupsThe episode answers recent questions about building and using an agent operating system (Agent OS) to organize multiple AI agents (e.g., Hermes, Jarvis) in one place, automate pipelines from idea to implementation, and deploy outputs like SEO content. It demonstrates using a Kanban board to triage tasks and complete an SEO keyword research report, then explains ways to resolve blocked tasks (commenting, moving cards, asking Hermes to sync and self-unblock, or using Claude to improve the system). It outlines options for agent orchestration via group chat, pipelines, and Paperclip teams, and recommends Agent OS over Hermes Workspace due to sync issues. The script also covers OpenRouter Fusion for high-stakes one-shot answers, local vs VPS hosting for Hermes, switching from local models to CLI/API, enforcing Paperclip output paths via working directory, simplifying Agent OS for clients, and integrating Obsidian memory and NotebookLM into the workflow.00:00 Agent OS Overview01:24 Kanban Swarm Setup02:06 Fixing Blocked Tasks04:15 Orchestrating Agents Together05:13 Why Workspace Sync Fails06:05 OpenRouter Fusion Explained08:07 Where to Run Hermes08:24 Game Studio Model Fixes09:18 Paperclip Output Paths10:18 Client Ready Agent OS11:46 Custom Upgrades Showcase13:31 Best Model Stack Choices14:15 Obsidian Shared Memory16:09 NotebookLM MCP Workflow
-
360
Hermes Agent OS Runs Itself On Loops
Hermes Agent OS: Build Your Own Autonomous AI System (No Code)Discover how to transform standard AI models into a powerful Agentic Operating System featuring voice control, autonomous loops, and specialized agents. Learn why prompting is dead and how loop engineering allows non-technical users to build complex video, music, and SEO workflows automatically.00:00 - Intro: The Hermes Agent OS00:13 - Real-Time Voice Control & Jarvis01:24 - Studio & Personal AI Butler01:49 - Goal Mode: Why Prompting is Dead04:10 - Specialized Music & Video Agents06:24 - Orchestrating Agent Organizations08:31 - Fully Automated SEO Loops10:16 - How to Build it Without Code
-
359
Fugu: NEW Japanese AI DESTROYS Fable 5?
Fugu Ultra vs Fable 5 & Fusion: New Multi-Agent Panel Model Benchmarks (GoldyBench)The video reviews Sakana.ai’s newly released Fugu Ultra, a multi-agent panel API that runs prompts across multiple models in parallel and fuses results via a judge, aiming for Fable 5-level outputs comparable to Fable and Mythos. The presenter shows one-shot examples (websites, animated galaxy, inner solar system), explains integration into their Agent OS alongside Fusion, and compares speed and reliability: Fugu is faster for latency while Fugu Ultra is much slower but optimized for benchmark performance. Benchmarks shown indicate Sakana outperforming Fable on several tests (e.g., Terminal Bench 2.1, GPQA Diamond, Live Code Bench), and it beat Opus 4.8 on most creations except a voxel game that failed due to 16K token truncation after a long wait. They also note strict API limits that can block usage for hours, regional access issues, and recommend using multiple models in an agentic OS for parallel work and fallbacks, with Fusion remaining top for their outputs.00:00 Fugu Ultra Arrives00:32 One Shot Demos01:19 How Multi Agent Fusion Works02:13 Agent OS Integration03:16 Benchmarks And Strengths03:57 Tradeoffs Limits And Truncation05:18 Leaderboard And Fusion Examples05:59 Speed Vs Smooth Workflow07:22 Pricing Access And Availability09:16 Best Setup And Wrap Up10:21 Community And Final Outro
-
358
-
357
-
356
-
355
-
354
-
353
We're indexing this podcast's transcripts for the first time — this can take a minute or two. We'll show results as soon as they're ready.
No matches for "" in this podcast's transcripts.
No topics indexed yet for this podcast.
Loading reviews...
Loading similar podcasts...