AI前沿:从数学推理到记忆注入 episode artwork

EPISODE · May 2, 2025 · 9 MIN

AI前沿:从数学推理到记忆注入

from AI可可AI生活

本期播客精华汇总1. Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in MathPhi-4-Mini-Reasoning:探索小型数学推理语言模型的极限通过四阶段训练(大规模蒸馏、微调、偏好优化、强化学习),仅38亿参数的Phi-4-Mini-Reasoning在数学推理上超越70亿-80亿参数模型,揭示小模型需“量体裁衣”的训练策略,反直觉地发现朴素高质量数据可能有害。2. ParamΔ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost直接权重混合的 ParamΔ:零成本训练后的大型语言模型ParamΔ通过简单权重差值加法,将后训练能力零成本迁移到新基座模型,性能达官方版的95%,为...去小宇宙查看完整单集简介在小宇宙查看该单集文稿

NOW PLAYING

AI前沿:从数学推理到记忆注入

0:00 9:29

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

いろはにマネーの「ながら学習」 IrohaniMoney この番組では、インターン生2人が、金融、経済、投資関連の気になる情報を分かりやすくお伝えしていきます。インターン生の会話を「ながら聴き」する感覚で一緒に勉強していきましょう!ご意見箱フォーム:https://forms.gle/TTGaVP2TJksNMKJo7ぜひお便りや感想をお待ちしています!公式X:https://x.com/irohanimoney番組のハッシュタグは「#いろはにながら」です。番組への感想をお待ちしています!いろはにマネー:https://www.bridge-salon.jp/money/姉妹サイト:https://kabu.bridge-salon.jp/姉妹サイト:https://bridge-salon.jp/(株)インベストメントブリッジ運営 AI Erik's Podcast Audio Erik Conn The AI News Podcast where we talk AI. CISO Perspectives (public) N2K Networks This season on CISO Perspectives, host Kim Jones explores some of the challenges of leading through uncertainty. We explore the complexity of the changing nature of regulation and working with the federal government, the evolution of privacy and fraud, and how emerging technologies like AI and quantum computing are changing cyber. When you don’t know what questions to ask, you’re afraid to ask, or don’t know who to ask, CISO Perspectives provides the foundation for learning in this brave new world. AI Generated - EDU Video Podcast Magnus Lian Explore how video tools and AI are transforming education with Magnus Sæternes Lian, Senior Engineer at NTNU and founder of ReadyMedia. This podcast dives into the latest video technologies, real-world use cases, and actionable insights for educators and tech enthusiasts. Created using cutting-edge AI tools like GoogleLM and ElevenLabs, all content is verified for accuracy. Discover practical solutions and stay ahead in the evolving landscape of educational technology!

Frequently Asked Questions

How long is this episode of AI可可AI生活?

This episode is 9 minutes long.

When was this AI可可AI生活 episode published?

This episode was published on May 2, 2025.

What is this episode about?

本期播客精华汇总1. Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in MathPhi-4-Mini-Reasoning:探索小型数学推理语言模型的极限通过四阶段训练(大规模蒸馏、微调、偏好优化、强化学习),仅38亿参数的Phi-4-Mini-Reasoning在数学推理上超越70亿-80亿参数模型,揭示小模型需“量体裁衣”的训练策略,反直觉地发现朴素高质量数据可能有害。2....

Can I download this AI可可AI生活 episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!