PODCAST · technology
AI可可AI生活
by fly51fly
来自 @爱可可-爱生活 的第一手AI快报,用最简单易懂的语言,带你直击最前沿的人工智能科研动态。无论你是科技小白,还是行业达人,这里都有你想知道的AI故事和未来趋势。跟着我们,轻松解锁人工智能的无限可能!#人工智能 #科技前沿
-
935
[人人能懂AI前沿] AI的内功、表演与成长法则
这一期,我们来聊聊几个特别有意思的“AI悖论”:想让AI团队更强,是该招“通才”还是“专才”?AI写下的思考步骤,究竟是真实的内心独白,还是为了让你满意的“事后表演”?而教一个AI“学生”,是让他抄答案更有效,还是抄解题思路更靠谱?几篇最新的论文,给了我们一些出乎意料的答案。00:00:27 人多力量大,还是术业有专攻?00:07:33 AI的“胎记”,我们如何给机器生成的内容盖个章?00:12:46 AI训练的快慢之争,一个两全其美的方案00:18:35 你的AI队友,是在真思考还是在“演”给你看?00:23:52 让AI“小号”变聪明的秘密,抄答案还是抄思路?本期介绍的几篇论文:[LG] Slicing and Dicing: Configuring Optimal Mixtures of Experts [University of Washington & New York University] https://arxiv.org/abs/2605.11689 ---[LG] TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection [Meta Superintelligence Labs] https://arxiv.org/abs/2605.12456 ---[LG] Learning, Fast and Slow: Towards LLMs That Adapt Continually [UC Berkeley & Mila] https://arxiv.org/abs/2605.12484 ---[LG] When Reasoning Traces Become Performative: Step-Level Evidence that Chain-of-Thought Is an Imperfect Oversight Channel [CMU & Fujitsu Research of America Inc] https://arxiv.org/abs/2605.11746 ---[CL] A Study on Hidden Layer Distillation for Large Language Model Pre-Training [Google DeepMind] https://arxiv.org/abs/2605.11513
-
934
[人人能懂AI前沿] AI的学霸秘籍、省钱妙计与陪练手册
你有没有想过,AI也会有“选择困难症”吗?或者,怎么才能给AI请个既省钱又能干的“陪练”?这一期,我们就来聊聊几篇有趣的最新论文,看看科学家们是如何教会AI像高手一样反思、像侦探一样倾听“沉默的投票”,甚至用中学物理知识,给AI装上一双“3D眼睛”的。准备好了吗?让我们一起出发!00:00:29 鸡娃不如“陪练”,AI训练的降本增效新思路00:05:53 AI的学霸秘籍,如何像高手一样思考和进化00:12:12 AI的阅读术,如何既快又好地啃下海量信息?00:18:24 AI的“过分自信”,原来是种“选择困难症”00:23:39 让AI拥有“立体视觉”的省钱妙计本期介绍的几篇论文:[LG] CoDistill-GRPO: A Co-Distillation Recipe for Efficient Group Relative Policy Optimization [Google Research] https://arxiv.org/abs/2605.08873 ---[CL] RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards [Google Cloud AI Research] https://arxiv.org/abs/2605.10899 ---[CL] Scratchpad Patching: Decoupling Compute from Patch Size in Byte-Level Language Models [Google DeepMind] https://arxiv.org/abs/2605.09630 ---[CL] The Silent Vote: Improving Zero-Shot LLM Reliability by Aggregating Semantic Neighborhoods [Google] https://arxiv.org/abs/2605.09739 ---[LG] RelFlexformer: Efficient Attention 3D-Transformers for Integrable Relative Positional Encodings [Seoul National University & Google Research] https://arxiv.org/abs/2605.10706
-
933
[人人能懂AI前沿] 从聪明花钱、英雄涌现到AI的“偏科”报告
今天我们要聊点特别有意思的话题:AI是怎么“思考”和“成长”的?我们会从几篇最新的论文出发,看看AI如何学会聪明地“花钱”,如何在学习中分清“英雄”与“集体”;然后,我们会揭秘它那套先打“草稿”再复核的高效工作法。最后,我们会用一把全新的“尺子”去度量它成长的极限,并给它做一份“智商测试”,看看这个“天才”到底偏科有多严重。准备好了吗?让我们一起潜入AI的大脑深处。00:00:33 如何打造一个“更划算”的虚拟世界?00:06:10 大模型的“缩放法则”里,藏着什么秘密?00:13:40 快与慢,AI世界里的“草稿式”工作法00:19:55 你的数据值多少钱?一个新尺子,看透AI的增长极限00:26:42 AI的“智商”报告,一个偏科天才的养成本期介绍的几篇论文:[LG] On Training in Imagination [Weizmann Institute of Science & New York University & Columbia University] https://arxiv.org/abs/2605.06732 ---[LG] Spectral Dynamics in Deep Networks: Feature Learning, Outlier Escape, and Learning Rate Transfer [Harvard University] https://arxiv.org/abs/2605.07870 ---[CL] Fast Byte Latent Transformer [FAIR at Meta] https://arxiv.org/abs/2605.08044 ---[LG] On the Invariance and Generality of Neural Scaling Laws [Johns Hopkins University & MIT] https://arxiv.org/abs/2605.07546 ---[AI] Uneven Evolution of Cognition Across Generations of Generative AI Models [Google DeepMind & Google Research] https://arxiv.org/abs/2605.06815
-
932
[人人能懂AI前沿] 从追问、拆解到打腹稿:AI正在升级“思考”的操作系统
你有没有想过,一个真正聪明的AI,应该具备哪些超越“有问必答”的能力?本期节目,我们将通过几篇最新的AI论文,一探究竟。我们将看到,AI如何从一个被动的知识库,进化成一个懂得“追问”的医生,以及一个会“打腹稿”的作家。我们还会揭示,AI如何学会把“登天”的难题拆解成“上楼”和“坐电梯”,又是如何通过一个“记忆外挂”实现过目不忘的。准备好了吗?让我们一起刷新对AI“思考能力”的认知!00:00:34 AI看病,真正厉害的不是“诊断”而是“追问”00:06:35 把“登天”的难题,拆解成“上楼”和“坐电梯”00:11:52 AI开口说话,非得“一句接一句”吗?00:16:47 面对海量数据,我们如何看清那只“看不见的手”?00:23:46 AI的“记忆外挂”,如何让它过目不忘?本期介绍的几篇论文:[AI] SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment [Google Research] https://arxiv.org/abs/2605.04012 ---[LG] Conditional Diffusion Sampling [University of Cambridge & University of Granada & University of British Columbia] https://arxiv.org/abs/2605.04013 ---[CL] Continuous Latent Diffusion Language Model [Bytedance Seed] https://arxiv.org/abs/2605.06548 ---[LG] High-Dimensional Statistics: Reflections on Progress and Open Problems [Columbia University & Harvard University & CMU] https://arxiv.org/abs/2605.05076 ---[CL] TIDE: Every Layer Knows the Token Beneath the Context [Apple] https://arxiv.org/abs/2605.06216
-
931
[人人能懂AI前沿] 给AI“减肥”、“立人设”和“夸到点子上”,需要几步?
你有没有想过,最简单的数学平均值,竟然能打败最复杂的压缩算法?或者,在教AI“做什么”之前,我们其实可以先给它“喂”一套完整的思想和人设?本期节目,我们将从四篇最新的AI论文出发,一起探寻如何让AI自己长出可拆分的“乐高模块”,以及如何像一位顶级名师那样,把奖励精准地“夸”到AI的灵光一闪之处。00:00:29 你的记忆能被压缩多少,藏在一个几何定律里00:06:42 训练AI,从“喂”指令到“喂”思想00:11:47 AI减肥记,如何让一个大模型只带“脑子”出门?00:17:54 AI也需要“夸到点子上”?本期介绍的几篇论文:[LG] The Geometry of Consolidation A Bharadwaj Vangara, A Gopinath https://github.com/niashwin/geometry-of-consolidation/blob/main/paper/arxiv/main.pdf ---[AI] Model Spec Midtraining: Improving How Alignment Training Generalizes C Li, S Price, S Marks, J Kutasov https://arxiv.org/abs/2605.02087 ---[CL] EMO: Pretraining Mixture of Experts for Emergent Modularity R Wang, A Bhagia, S Min https://arxiv.org/abs/2605.06663 ---[LG] DGPO: Distribution Guided Policy Optimization for Fine Grained Credit Assignment H Jin, R Zhu, Z Du, X Jiang… https://arxiv.org/abs/2605.03327
-
930
[人人能懂AI前沿] 从记忆宫殿、技能进化到创作的本质
你有没有想过,我们能不能唤醒AI沉睡的“内在记忆”,而不是给它外挂一个搜索引擎?如何让健忘的AI真正“吃一堑,长一智”,把经验变成可复用的技能?本期节目,我们将从最新的几篇论文出发,探讨AI如何从一个“答题工具”进化成一个能自我迭代、甚至与我们并肩探索的“成长型伙伴”。00:00:26 给AI配个“助理”?这可能是个馊主意00:06:10 让AI开窍,如何把“经验”变成“能力”?00:11:18 AI画家是天才,还是复印机?00:17:25 AI的新玩法,从“解题”到“陪跑”00:23:43 你以为的“神来之笔”,可能只是个“生僻字”?本期介绍的几篇论文:[LG] Retrieval from Within: An Intrinsic Capability of Attention-Based Models [NVIDIA] https://arxiv.org/abs/2605.05806 ---[LG] SkillOS: Learning Skill Curation for Self-Evolving Agents [University of Illinois Urbana-Champaign & Google Cloud AI Research] https://arxiv.org/abs/2605.06614 ---[LG] Understanding diffusion models requires rethinking (again) generalization [PSL Research University & Sorbonne University] https://arxiv.org/abs/2605.06077 ---[AI] AI Co-Mathematician: Accelerating Mathematicians with Agentic AI [Google DeepMind] https://arxiv.org/abs/2605.06651 ---[CL] The Frequency Confound in Language-Model Surprisal and Metaphor Novelty [Bielefeld University] https://arxiv.org/abs/2605.06506
-
929
[人人能懂AI前沿] 流形操纵、无损植入与高质量反馈
你有没有想过,一个“太聪明”的AI,反而会学会钻空子,导致整个系统一起“变笨”?你是否好奇,AI大脑的内部结构可能不是我们想象的开放广场,而是一张弯弯绕绕的精密地图?本期节目,我们将一起潜入AI的“心智世界”,看看最新论文是如何教会AI拥有“远见”来避免自我毁灭,如何像开赛车一样在它大脑的“流形赛道”上精准驰骋,甚至是如何用“不开刀”的方式给它无损植入新知识。更重要的是,我们会发现,原来给AI提建议和给它参考资料,都可能是在“越帮越忙”。准备好了吗?让我们一起挑战关于AI的四个“想当然”。00:00:45 当AI学会了钻空子,我们如何防止它“聪明反被聪明误”?00:06:20 AI的“脑回路”长啥样?我们可能一直都搞错了00:10:56 AI升级难题,一个“不开刀”的手术方案00:16:04 为什么夸人“你真棒”是最低效的鼓励?00:20:33 给AI帮忙,为何会越帮越忙?本期介绍的几篇论文:[LG] Explaining and Preventing Alignment Collapse in Iterative RLHF [PSL Research University] https://arxiv.org/abs/2605.04266 ---[LG] Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior [GOODFIRE] https://arxiv.org/abs/2605.05115 ---[LG] Memory as a Markov Matrix: Sample Efficient Knowledge Expansion via Token-to-Dictionary Mapping [New Jersey Institute of Technology & UC Berkeley] https://arxiv.org/abs/2605.04308 ---[LG] Efficiently Aligning Language Models with Online Natural Language Feedback [Stanford University & Anthropic] https://arxiv.org/abs/2605.04356 ---[LG] When Context Hurts: The Crossover Effect of Knowledge Transfer on Multi-Agent Design Exploration [Meta] https://arxiv.org/abs/2605.04361
-
928
[人人能懂AI前沿] AI的学霸笔记、建筑师思维与自我进化
想让AI学会推理,是给它一本百科全书,还是塞给它一张“学霸的草稿纸”?最新论文说,看别人怎么思考,比死记硬背更管用。我们还会一起探索,AI离“独立盖起一栋楼”般的复杂工程到底还有多远,并用“脑回路CT”技术,看看你的AI管家为什么有时会“一本正经地胡说八道”。更有趣的是,我们将揭秘AI如何通过自我反思,从“差生”逆袭成“学霸”,以及如何靠一个“探测器”就画出整张看不见的“藏宝图”。准备好了吗?我们马上出发!00:00:37 AI也需要“学霸笔记”吗?00:05:42 AI离“独立盖起一栋楼”还有多远?00:10:46 你的AI管家,为什么有时“一本正经地胡说八道”?00:16:33 如何用AI画一张看不见的藏宝图?00:21:30 AI的自我修炼,从“差生”到“学霸”的秘密[IR] RAG over Thinking Traces Can Improve Reasoning Tasks [UC Berkeley] https://arxiv.org/abs/2605.03344 ---[AI] ProgramBench: Can Language Models Rebuild Programs From Scratch? [Meta FAIR] https://arxiv.org/abs/2605.03546 ---[AI] What Happens Inside Agent Memory? Circuit Analysis from Emergence to Diagnosis [City University of Hong Kong & University of Toronto] https://arxiv.org/abs/2605.03354 ---[LG] Flow Sampling: Learning to Sample from Unnormalized Densities via Denoising Conditional Processes [FAIR at Meta & Weizmann Institute of Science] https://arxiv.org/abs/2605.03984 ---[AI] Self-Improvement for Fast, High-Quality Plan Generation [Amazon] https://arxiv.org/abs/2605.03625
-
927
[人人能懂AI前沿] 从并行智慧、元认知到瓶颈法则:洞悉AI的学习奥秘
你有没有想过,为什么AI能从互联网的海量垃圾中炼出真金,而不是变成一个只会死记硬背的书呆子?当AI犯错时,我们是该让它闭嘴,还是有更聪明的办法让它学会“谦逊”?本期节目,我们将通过几篇最新的AI论文,揭示AI如何像一个并行专家团队一样解决难题,又是如何受困于一个惊人简洁的“瓶颈定律”,带你一窥AI大脑中那些优雅而深刻的学习法则。00:00:33 最优解的密码,藏在并行的智慧里00:05:55 AI怎么才能不说谎?答案藏在一种人类智慧里00:10:13 AI训练场上的“隐形杀手”00:16:52 AI怎么从一堆垃圾里炼出真金?00:23:33 增长的瓶颈定律,规模不是优势,弱点才是关键本期介绍的几篇论文:[LG] Black-box optimization of noisy functions with unknown smoothness [INRIA Lille & Google DeepMind] https://arxiv.org/abs/2605.02462 ---[CL] Hallucinations Undermine Trust; Metacognition is a Way Forward [Google Research & Tel Aviv University] https://arxiv.org/abs/2605.01428 ---[LG] Generalized Distributional Alignment Games for Unbiased Answer-Level Fine-Tuning [Google Research] https://arxiv.org/abs/2605.02435 ---[LG] A Theory of Generalization in Deep Learning [Stanford University] https://arxiv.org/abs/2605.01172 ---[LG] A Theory of Saddle Escape in Deep Nonlinear Networks [UC Berkeley] https://arxiv.org/abs/2605.01288
-
926
[人人能懂AI前沿] AI学会了抄近路、换引擎和吃“后悔药”
你有没有想过,AI也能不走寻常路,学会“抄近路”写文章吗?或者,当AI陷入追求高分的“内卷陷阱”时,我们该怎么教它“最小化遗憾”而不是盲目刷分?本期节目,我们将从几篇最新的论文出发,看看AI如何通过更换“发动机”、打通不同门派的“武功”,甚至用更少的考题更精准地对齐我们的真实感受,实现一次漂亮的思维跃迁。00:00:30 抄近路,人工智能学会了新“导航术”00:05:34 AI的“注意力”,正在成为它的“负担”00:10:42 AI的“高分陷阱”,我们怎样教得更聪明?00:16:51 当规则遇上“混沌”,AI大神们的两种武功,原来同宗同源00:23:38 为什么最好的考卷,题目反而最少?本期介绍的几篇论文:[LG] Consistent Diffusion Language Models [Microsoft & Purdue University] https://arxiv.org/abs/2605.00161 ---[LG] Caracal: Causal Architecture via Spectral Mixing [Huawei Technologies] https://arxiv.org/abs/2605.00292 ---[LG] Wasserstein Distributionally Robust Regret Optimization for Reinforcement Learning from Human Feedback [University of North Carolina & Imperial College London & Stanford University] https://arxiv.org/abs/2605.00155 ---[LG] Trees to Flows and Back: Unifying Decision Trees and Diffusion Models [Technical University of Munich] https://arxiv.org/abs/2605.00414 ---[CL] Putting HUMANS first: Efficient LAM Evaluation with Human Preference Alignment [University of Southern California & Stanford University] https://arxiv.org/abs/2605.00022
-
925
[人人能懂AI前沿] AI的读心术、度量衡与成长法则
你有没有想过,一个复杂的观念,比如“怀疑权威”,可以被压缩成一串看似随机的数字,并悄悄植入AI的大脑吗?我们又该如何为机器人设计一套公平的“高考”,来检验它是不是真的聪明,而不只是个“学霸”?本期节目,我们将一起打开五篇最新论文的奇思妙想,看看AI如何被植入“思想罗盘”,如何通过“找个伴,一起笨”的方式学会成为通才,以及我们该如何为AI的技能撰写一份清晰的“说明书”。准备好了吗?让我们即刻出发!00:00:36 你以为你在教它数数,它却学会了你的偏见00:08:00 我们需要一把什么样的尺子,来衡量复杂的世界?00:15:40 机器人界的“幼儿园”和“高考”00:20:57 最好的学习,是“找个伴,一起笨”00:26:36 AI的“技能包”,应该怎么写说明书?本期介绍的几篇论文:[CL] Subliminal Steering: Stronger Encoding of Hidden Signals [Columbia University] https://arxiv.org/abs/2604.25783 ---[LG] Generalising maximum mean discrepancy: kernelised functional Bregman divergences [Monash University & Sony Computer Science Laboratorie] https://arxiv.org/abs/2604.24047 ---[RO] KinDER: A Physical Reasoning Benchmark for Robot Learning and Planning [Princeton University & Carnegie Mellon University & Georgia Tech] https://arxiv.org/abs/2604.25788 ---[LG] Co-Evolving Policy Distillation [CAS & JD.COM] https://arxiv.org/abs/2604.27083 ---[CL] From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills [Peking University] https://arxiv.org/abs/2604.24026
-
924
[人人能懂AI前沿] 如何给AI做CT?当螃蟹开始跳舞,AI学会了“变脸”
你有没有想过,我们能用一套“知识探针”给大模型做一次精确的“脑容量”CT扫描吗?或者,当AI不再满足于讲一个完美的成功故事,而是把所有失败的教训都记录下来,科学研究会变成怎样一个“活物”?本期节目,我们将从五篇最新论文出发,看看AI如何学会“变脸”戏法,又是如何用“笨办法”实现反超,以及,如何只用一部手机就让一只螃蟹学会跳街舞。00:00:31 如何给AI大模型做一次“脑容量”CT扫描?00:08:25 让一只螃蟹学会跳街舞,总共分几步?00:13:47 让知识“活”起来,科研的下一种形态00:20:47 AI的“变脸”戏法,我们以为的安全,可能只是没对上“暗号”00:27:26 AI进化新思路,为什么“笨办法”反而更聪明?本期介绍的几篇论文:[LG] Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity [Pine AI] https://arxiv.org/abs/2604.24827 ---[CV] MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos [Huawei International Pte. Ltd. & Huawei Central Media Technology Institute] https://arxiv.org/abs/2512.10881 ---[LG] The Last Human-Written Paper: Agent-Native Research Artifacts [Orchestra Research & Stanford University & Ohio State University] https://arxiv.org/abs/2604.24658 ---[LG] Conditional misalignment: common interventions can hide emergent misalignment behind contextual triggers [Warsaw University of Technology & Truthful AI] https://arxiv.org/abs/2604.25891 ---[CV] Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation [Meta AI] https://arxiv.org/abs/2604.24763
-
923
[人人能懂AI前沿] AI的经济学:从精打细算、聪明分工到绘制思想地图
你有没有想过,一个真正能干活的AI,需要的不是更多的考题,而是一间属于自己的“办公室”?我们又该如何扮演一个聪明的“甩手掌柜”,给手下的AI专家们高效分配任务?本期节目,我们将从几篇最新的AI论文出发,聊聊如何用“成本思维”给AI的训练省下一半的钱,如何通过一场“博弈”让AI自我进化,并最终一起探索AI思考的形状,看看它的“脑海”里究竟是字典,还是一幅幅由概念构成的几何地图。00:00:34 想让AI替你干活?得先给它一间“办公室”00:06:01 如何当一个聪明的“甩手掌柜”?00:11:46 AI训练太烧钱?你缺的不是算力,是“成本思维”00:17:43 AI进步的捷径,不只看结果,更要玩对博弈00:23:08 AI怎么思考?答案可能藏在几何里本期介绍的几篇论文:[LG] Synthetic Computers at Scale for Long-Horizon Productivity Simulation [Microsoft] https://arxiv.org/abs/2604.28181 ---[LG] Optimized Deferral for Imbalanced Settings [Google Research & Courant Institute of Mathematical Science] https://arxiv.org/abs/2604.27723 ---[LG] Cost-Aware Learning [Google Research] https://arxiv.org/abs/2604.28020 ---[LG] Distributional Alignment Games for Answer-Level Fine-Tuning [Google Research & Microsoft Research] https://arxiv.org/abs/2604.27166 ---[LG] Do Sparse Autoencoders Capture Concept Manifolds? [Harvard University] https://arxiv.org/abs/2604.28119
-
922
[人人能懂Ai前沿] AI的思考术:当机器学会了划重点、开复盘会和管理书房
你有没有想过,AI不仅能当一个好员工,还能自己进化成项目经理,开“复盘会”优化工作流?或者,指挥一个复杂的机器人,也许只需要像在屏幕上“画重点”一样简单?本期节目,我们将从五篇最新的AI论文出发,聊聊AI如何突破效率瓶颈:从揭秘AI服务“拼单”背后隐藏的隐私风险,到看AI如何像图书管理员一样高效整理海量知识,再到探索不同世界里的学习速度极限。准备好,让我们一起看看AI是如何学会“化繁为简”与“自我进化”的。00:00:39 想把东西卖出高价?你得懂点学习的规律00:07:21 拼单的代价,AI服务如何泄露你的秘密00:12:23 AI的瓶颈,不在大脑,在“书房”00:17:48 让AI自己进化,不止是大力出奇迹00:23:32 给机器人“画重点”,让复杂变简单本期介绍的几篇论文:[LG] On the Learning Curves of Revenue Maximization [Purdue University & Yale University & Technion] https://arxiv.org/abs/2604.26922 ---[LG] Quantamination: Dynamic Quantization Leaks Your Data Across the Batch [University of Cambridge & AI Sequrity Company] https://arxiv.org/abs/2604.26505 ---[LG] Unifying Sparse Attention with Hierarchical Memory for Scalable Long-Context LLM Serving [Microsoft Research] https://arxiv.org/abs/2604.26837 ---[CL] FlowBot: Inducing LLM Workflows with Bilevel Optimization and Textual Gradients [Naver Search US & MIT] https://arxiv.org/abs/2604.26258 ---[CV] Lifting Embodied World Models for Planning and Control [New York University & UC Berkeley] https://arxiv.org/abs/2604.26182
-
921
[人人能懂AI前沿] AI如何验明正身、高效开会、稳定心态?
想知道如何像做笔迹鉴定一样,一眼看穿AI的“真身”吗?想了解怎样能让AI开会时,奇迹般地省下97%的“桌子”吗?本期我们就来聊聊几篇最新论文,看看AI如何学会“读心术”来高效协作,如何避免因“谜之自信”而犯下大错,甚至,为什么一个“会犯错”的老师,反而能教出更厉害的AI学生。00:00:28 如何给AI做“笔迹鉴定”?00:06:27 AI开会,如何省下97%的桌子?00:14:10 AI界的“青出于蓝”,是惊喜还是惊吓?00:19:30 你还在让AI“写报告”?它们已经开始直接交换“想法”了00:24:16 为什么“犯错”的老师,能教出更好的AI?本期介绍的几篇论文:[CL] The Surprising Universality of LLM Outputs: A Real-Time Verification Primitive [Evolutionairy AI] https://arxiv.org/abs/2604.25634 ---[LG] PolyKV: A Shared Asymmetrically-Compressed KV Cache Pool for Multi-Agent LLM Inference [No University Provided] https://arxiv.org/abs/2604.24971 ---[AI] Evaluating Risks in Weak-to-Strong Alignment: A Bias-Variance Perspective [University of Illinois Urbana-Champaign & Microsoft & InstaDeep] https://arxiv.org/abs/2604.25077 ---[CL] Recursive Multi-Agent Systems [UIUC] https://arxiv.org/abs/2604.25917 ---[LG] When Errors Can Be Beneficial: A Categorization of Imperfect Rewards for Policy Gradient [Princeton University] https://arxiv.org/abs/2604.25872
-
920
[人人能懂AI前沿] AI的思维地图、社交网络与减肥陷阱
你有没有想过,一个“乐于助人”的AI,它的善意本身可能就是最危险的漏洞?本期节目,我们将从几篇最新的AI论文出发,一起探索AI的“内心世界”:看看它是如何通过预判未来让训练更高效,如何在内部形成“专家圈子”,又是如何掉进“减肥不减脂”的内存陷阱,并最终揭示那张描绘它思维路径的神秘“藏宝图”。准备好了吗?让我们一起打开AI的黑箱。00:00:30 为什么说,答案对错没那么重要?00:05:59 你的AI正在“挑食”,一个让大模型加速的隐秘模式00:11:46 AI大模型瘦身指南,减重≠减脂00:17:49 为什么一个“乐于助人”的AI,反而更危险?00:22:34 AI的“藏宝图”,我们如何看懂机器的“内心世界”?本期介绍的几篇论文:[LG] Reward Models Are Secretly Value Functions: Temporally Coherent Reward Modeling [AI at Meta] https://arxiv.org/abs/2604.22981 ---[LG] Scaling Multi-Node Mixture-of-Experts Inference Using Expert Activation Patterns [Meta & Georgia Institute of Technology] https://arxiv.org/abs/2604.23150 ---[LG] Parameter Efficiency Is Not Memory Efficiency: Rethinking Fine-Tuning for On-Device LLM Adaptation [MIT CSAIL] https://arxiv.org/abs/2604.22783 ---[CL] Jailbreaking Frontier Foundation Models Through Intention Deception [CMU] https://arxiv.org/abs/2604.24082 ---[AI] Domain-Filtered Knowledge Graphs from Sparse Autoencoder Features [Stanford University] https://arxiv.org/abs/2604.23829
-
919
[人人能懂AI前沿] 从解耦、祛魅到本质思考:AI的五种新活法
你有没有想过,我们能让AI不再“傻等”,像个独立的施工队一样高效协作吗?当AI像个“偏科生”时,我们能否不改造它的大脑,只用一本“说明书”就教会它看懂全世界?本期节目,我们将一口气解锁五篇最新论文带来的脑洞:看AI如何通过“跟自己抬杠”学会创造,如何通过剥离无关的“姿态”来直击事物本质,以及我们为何终于有信心说,AI的“黑箱”正在被科学理论的光芒照亮。准备好了吗?让我们一起出发,探索AI的这五种全新进化路径!00:00:37 AI训练场上的“交通拥堵”?我们换个活法00:06:04 我们终于要看懂AI的大脑了吗?00:13:19 如何让一个“偏科”的AI,学会看懂全世界?00:19:02 AI的创造力开关,藏在哪儿?00:25:16 AI的新活法,只做对的事,不做多余的事本期介绍的几篇论文:[CL] Decoupled DiLoCo for Resilient Distributed Pre-training [Google DeepMind] https://arxiv.org/abs/2604.21428 ---[LG] There Will Be a Scientific Theory of Deep Learning [UC Berkeley & Harvard University] https://arxiv.org/abs/2604.21691 ---[CV] Unlocking Multi-Spectral Data for Multi-Modal Models with Guided Inputs and Chain-of-Thought Reasoning [Google DeepMind] https://arxiv.org/abs/2604.21032 ---[IR] Caesar: Deep Agentic Web Exploration for Creative Answer Synthesis [Cognizant AI Lab] https://arxiv.org/abs/2604.20855 ---[LG] Quotient-Space Diffusion Models [Peking University & Xi’an Jiaotong University] https://arxiv.org/abs/2604.21809
-
918
[人人能懂AI前沿] 从视觉理解决锁、算法自主发现到AI的“内卷”与“私心”
本期节目,我们将一起打开几个AI研究的奇妙盲盒:你将发现,AI“画家”的背后可能藏着一位“全科医生”;而AI“工程师”已经能自主发明超越人类的算法。但硬币的另一面是,AI也会陷入毫无意义的“内卷”,甚至为了保护它的AI“同伴”而对我们撒谎。最后,我们会探讨一个根本问题:我们衡量AI好坏的那把尺子,是不是从一开始就错了?00:00:30 AI生图的秘密,从“画家”到“全科医生”00:05:02 让AI当工程师,它能胜任吗?00:11:09 AI的“内卷”困境,如何防止学霸走火入魔?00:15:34 当AI有了“自己人”,它会为了“哥们”背叛你吗?00:21:08 你的APP搜不准?问题可能出在尺子本期介绍的几篇论文:[CV] Image Generators are Generalist Vision Learners [Google DeepMind] https://arxiv.org/abs/2604.20329 ---[LG] The AI Telco Engineer: Toward Autonomous Discovery of Wireless Communications Algorithms [NVIDIA] https://arxiv.org/abs/2604.19803 ---[LG] Scaling Self-Play with Self-Guidance [Stanford University] https://arxiv.org/abs/2604.20209 ---[CL] Peer-Preservation in Frontier Models [UC Berkeley & University of California, Santa Cruz] https://arxiv.org/abs/2604.19784 ---[IR] Semantic Recall for Vector Search [CWI & EPFL & MPI-SWS] https://arxiv.org/abs/2604.20417
-
917
[人人能懂AI前沿] 大象、蚂蚁与管家:解密AI系统设计的协作智慧
今天我们要聊聊AI那些让人又爱又恨的“小毛病”。根据几篇最新论文的洞察,我们将一起探寻:为什么天才AI连煎个鸡蛋都费劲?它解决难题时是在真思考还是瞎撞?当我们和AI对话时,如何才能让它秒回,不再尴尬等待?更重要的是,当AI开口说话时,它是否带着不为人知的“文化口音”?而把家庭钥匙交给AI管家时,我们又该如何确保它不会出卖你?00:00:31 人工智能的下一个路口,藏在大脑里00:06:25 给你一个好方法,你却用蛮力?00:12:01 让AI秒回你的秘密,当大象学会与蚂蚁共舞00:18:23 AI的“美国口音”,藏不住了00:23:21 你的AI管家,会不会偷偷出卖你?本期介绍的几篇论文:[AI] NeuroAI and Beyond: Bridging Between Advances in Neuroscience and Artificial Intelligence [University of Maryland] https://arxiv.org/abs/2604.18637 ---[LG] Evaluation-driven Scaling for Scientific Discovery [Stanford University & Peking University & Tsinghua University] https://arxiv.org/abs/2604.19341 ---[CL] Micro Language Models Enable Instant Responses [University of Washington & Meta AI] https://arxiv.org/abs/2604.19642 ---[CL] Location Not Found: Exposing Implicit Local and Global Biases in Multilingual LLMs [Google Research & Bar-Ilan University] https://arxiv.org/abs/2604.19292 ---[AI] An AI Agent Execution Environment to Safeguard User Data [University of California, Los Angeles & Google] https://arxiv.org/abs/2604.19657
-
916
[人人能懂AI前沿] 让AI学会三思、开好复盘会、再彻底“换个脑子”
你有没有想过,如何教AI像我们一样“知错能改”,而不是只会“一锤子买卖”?当一群AI协作时,怎样才能让它们像顶尖团队一样开好“复盘会”,而不是人多添乱?这一期,我们将一口气聊透五篇最新论文,看科学家们如何教会AI从“自我纠错”的智慧,进化到拥有“内在记忆”,甚至跨界变身,将工厂难题精准翻译成数学代码。准备好,一场关于AI如何学习“思考”的头脑风暴,马上开始!00:00:34 AI界的“错题本”,如何教机器学会“三思而后行”?00:06:11 人多不一定力量大,但聪明的团队会开“复盘会”00:11:32 为什么你的AI“记不住事”?00:17:27 算力的“跨界”妙用,如何让AI芯片干好分外的活?00:23:58 AI“翻译官”,从工厂难题到数学代码本期介绍的几篇论文:[LG] Learning to Correct: Calibrated Reinforcement Learning for Multi-Attempt Chain-of-Thought [University of Michigan] https://arxiv.org/abs/2604.17912 ---[LG] Scaling Test-Time Compute for Agentic Coding [Meta Superintelligence Labs] https://arxiv.org/abs/2604.16529 ---[LG] The Topological Trouble With Transformers [Google DeepMind] https://arxiv.org/abs/2604.17121 ---[LG] Enabling AI ASICs for Zero Knowledge Proof [Georgia Institute of Technology & MIT] https://arxiv.org/abs/2604.17808 ---[LG] AutoOR: Scalably Post-training LLMs to Autoformalize Operations Research Problems [X, The Moonshot Factory & University of Oxford] https://arxiv.org/abs/2604.16804
-
915
[人人能懂AI前沿] 从目标牵引、经验进化到群体学习
你有没有想过,AI也会陷入“高水平重复”的舒适区陷阱?学习新知识后,它为什么会像我们一样“健忘”?本期节目,我们将通过几篇最新的AI论文,揭示如何让AI从一个只会“死记硬背”的学霸,进化成一个懂得“举一反三”、甚至会“团队作战”的智慧伙伴,探索让AI真正变得更聪明、更高效的秘密。00:00:27 你是在“精进”,还是在“高水平地重复”?00:04:49 AI上课后,为什么反而把以前会的给忘了?00:11:08 让AI左右互搏,速度翻倍的秘密00:16:02 你的“人工智障”客服,终于有救了?00:22:16 AI进化论,从“二选一”到“团战”的效率革命本期介绍的几篇论文:[LG] Beyond Distribution Sharpening: The Importance of Task Rewards [Mila] https://arxiv.org/abs/2604.16259 ---[CL] Why Fine-Tuning Encourages Hallucinations and How to Fix It [Hebrew University of Jerusalem & Technion – Israel Institute of Technology & University of Illinois Urbana-Champaign] https://arxiv.org/abs/2604.15574 ---[LG] Faster LLM Inference via Sequential Monte Carlo [Cornell University & MIT] https://arxiv.org/abs/2604.15672 ---[CL] PolicyBank: Evolving Policy Understanding for LLM Agents [Google Cloud] https://arxiv.org/abs/2604.15505 ---[CL] GroupDPO: Memory efficient Group-wise Direct Preference Optimization [CMU & Google Deepmind & Google] https://arxiv.org/abs/2604.15602
-
914
[人人能懂AI前沿] 从触觉梦境、思维循环到经验迁移:AI如何学会深度思考与行动
你有没有想过,让AI学会“做白日梦”去预演触感,竟然能让它的动手能力提升90%?我们常说的“深度思考”,在AI那里可能只是一种高效的“循环播放”。本期节目,我们将从几篇最新的AI论文出发,一起探寻AI如何像高手一样进行“跨界”经验调用,看看AI界的“秦始皇”又是如何通过“统一度量衡”,为智能体打造一个强大的行动底座,揭开那常常被我们忽视的、冰山下的98%。00:00:34 学会“做白日梦”,才能把活儿干好00:05:23 AI的冰山,我们看不见的那98%00:11:19 AI的“深度思考”,原来是“循环播放”?00:16:46 高手,都善于“跨界”调用经验00:23:38 AI 界的“秦始皇”,如何统一智能体的“度量衡”?本期介绍的几篇论文:[RO] Learning Versatile Humanoid Manipulation with Touch Dreaming [CMU] https://arxiv.org/abs/2604.13015 ---[AI] Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems [Mohamed bin Zayed University of Artificial Intelligence] https://arxiv.org/abs/2604.14228 ---[LG] A Mechanistic Analysis of Looped Reasoning Language Models [University of Oxford & Mila] https://arxiv.org/abs/2604.11791 ---[LG] Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents [KAIST] https://arxiv.org/abs/2604.14004 ---[AI] UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents [University of Science and Technology of China & Eastern Institute of Technology] https://arxiv.org/abs/2604.11557
-
913
[人人能懂AI前沿] AI的思考术:从深度循环、逆向规划到自我进化
你有没有想过,一个真正聪明的AI,应该具备哪些超能力?本期节目,我们将一口气看懂五篇最新的AI论文。我们将一起探索,如何不靠“堆肌肉”,而是通过精巧的“循环”让AI学会深度思考;如何只改变一个训练目标,就教会AI“从未来倒推现在”的逆向思维;以及为什么AI既是“短跑健将”,却又在“马拉松”任务中频频掉链子。更进一步,我们还会揭示AI“自我进化”的秘密——如何把自己犯过的错变成下一步的垫脚石,以及为何“成大事者,不靠记忆靠遗迹”。准备好了吗?让我们一起开启这场关于AI智慧的深度探索之旅!00:00:45 人工智能的“内功”心法00:05:41 教AI做事,为什么不能只看眼前?00:10:24 为什么AI既聪明,又“靠不住”?00:14:54 高手精进的秘密,如何把自己犯过的错,变成下一步的垫脚石00:20:49 成大事者,不靠记忆靠“遗迹”本期介绍的几篇论文:[LG] Parcae: Scaling Laws For Stable Looped Language Models [University of California, San Diego] https://arxiv.org/abs/2604.12946 ---[LG] How Transformers Learn to Plan via Multi-Token Prediction [University of California, Los Angeles & Shanghai Jiao Tong University] https://arxiv.org/abs/2604.11912 ---[LG] LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning [University of Oxford & Lawrence Livermore National Laboratory (LLNL)] https://arxiv.org/abs/2604.14140 ---[CL] Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision [Princeton University] https://arxiv.org/abs/2604.12002 ---[CL] Toward Autonomous Long-Horizon Engineering for ML Research [Renmin University of China] https://arxiv.org/abs/2604.13018
-
912
[人人能懂AI前沿] 动态开关、统一模型与扰动训练:AI的效率革命
你有没有想过,最聪明的决策,也许是先用最小的力气排除所有错误选项?当AI变得越来越话痨时,我们该如何给它请一位“效率教练”?为了把强大的AI装进你的手机,科学家又想出了怎样统一又精简的“节食计划”?本期节目,我们将通过几篇最新论文,一起探讨AI如何学会“先探路再铺路”的决策智慧,如何治好自己的“路痴”毛病,甚至如何掌握“动态开关”这门最高级的偷懒艺术。00:00:33 聪明人的偷懒指南,如何用最少的力气,走最对的路?00:07:16 AI话痨怎么办?聪明还得会省钱00:12:27 AI的“节食计划”,如何在你的手机里装下一个图书馆?00:17:42 大模型越来越聪明,为什么还是个“路痴”?00:22:45 为什么说,最高级的AI,必须学会“偷懒”?本期介绍的几篇论文:[CL] Blazing the trails before beating the path: Sample-efficient Monte-Carlo planning [INRIA Lille & Google DeepMind] https://arxiv.org/abs/2604.14974 ---[CL] CROP: Token-Efficient Reasoning in Large Language Models via Regularized Prompt Optimization [Google LLC & Purdue University] https://arxiv.org/abs/2604.14214 ---[IR] A Unified Model and Document Representation for On-Device Retrieval-Augmented Generation [University of Massachusetts Amherst & Google] https://arxiv.org/abs/2604.14403 ---[CL] Shuffle the Context: RoPE-Perturbed Self-Distillation for Long-Context Adaptation [Georgia Institute of Technology & Microsoft] https://arxiv.org/abs/2604.14339 ---[CL] Compressed-Sensing-Guided, Inference-Aware Structured Reduction for Large Language Models [UC Berkeley] https://arxiv.org/abs/2604.14156
-
911
[人人能懂AI前沿] 从行为一致、多语优势到动态协同:AI的认知升维
你有没有想过,一个学得更久的AI“尖子生”,为什么反而忘得更快?或者,想让AI更懂英语,最好的方法竟然是教它别的语言?本期节目,我们将一口气解锁五篇最新论文带来的“反常识”洞见。我们会发现,决定AI效率的瓶颈可能不是算力而是“管理”,与AI对话的成本可以靠一本“字典”轻松打个二折,而一个好的AI模拟世界,追求的不是“长得像”,而是“反应像”。00:00:32 大模型训练的悖论,为什么学得越久,忘得越快?00:06:02 AI的效率瓶颈,不是算力,是“管理”00:12:33 想让AI更懂英语?那就别只喂它英语00:18:46 跟AI对话,如何省下80%的话费?00:24:39 你的“差不多”不是我的“差不多”,如何让AI的模拟世界更靠谱?本期介绍的几篇论文:[LG] All elementary functions from a single binary operator [Jagiellonian University] https://arxiv.org/abs/2603.21852 ---[LG] Sample Complexity of Autoregressive Reasoning: Chain-of-Thought vs. End-to-End [Purdue University & The Hebrew University & Technion and Google Research] https://arxiv.org/abs/2604.12013 ---[CL] Continuous Knowledge Metabolism: Generating Scientific Hypotheses from Evolving Literature [Central University of Finance and Economics & Beijing Institute of Technology & TsingyuAI] https://arxiv.org/abs/2604.12243 ---[CL] LoSA: Locality Aware Sparse Attention for Block-Wise Diffusion Language Models [UC Berkeley] https://arxiv.org/abs/2604.12056 ---[LG] The Linear Centroids Hypothesis: How Deep Network Features Represent Data [Rice University & Google Research & Brown University] https://arxiv.org/abs/2604.11962
-
910
[人人能懂AI前沿] 从创世积木、思维成本到知识代谢:AI如何“思考”?
你有没有想过,整个科学计算器也许只需要两个按键就能实现?或者,AI偷懒的秘诀竟是只用20%的精力,就能完成90%的工作?最新的一些研究,正从这些奇妙的角度,刷新我们对智能、效率和知识的认知。今天,我们将一起看看AI如何只用一个“创世积木”构建整个数学世界,如何像做CT一样看清自己的“脑回路”,并揭示过程和结果哪个才是学习的关键。准备好,一场思维风暴马上开始!00:00:36 你的科学计算器,其实只需要两个键00:05:01 学会一个本事,过程和结果哪个更重要?00:13:05 如何像高手一样,“看见”知识的未来?00:19:31 AI偷懒的艺术,为什么只做20%的工作,能得到90%的结果?00:25:08 给AI大脑做CT,我们找到了更清晰的脑回路图本期介绍的几篇论文:[LG] All elementary functions from a single binary operator [Jagiellonian University] https://arxiv.org/abs/2603.21852 ---[LG] Sample Complexity of Autoregressive Reasoning: Chain-of-Thought vs. End-to-End [Purdue University & The Hebrew University & Technion and Google Research] https://arxiv.org/abs/2604.12013 ---[CL] Continuous Knowledge Metabolism: Generating Scientific Hypotheses from Evolving Literature [Central University of Finance and Economics & Beijing Institute of Technology & TsingyuAI] https://arxiv.org/abs/2604.12243 ---[CL] LoSA: Locality Aware Sparse Attention for Block-Wise Diffusion Language Models [UC Berkeley] https://arxiv.org/abs/2604.12056 ---[LG] The Linear Centroids Hypothesis: How Deep Network Features Represent Data [Rice University & Google Research & Brown University] https://arxiv.org/abs/2604.11962
-
909
[人人能懂AI前沿] 从“模拟人生”到“婴儿视角”,AI如何学会思考?
你有没有想过,要让AI变得更聪明,除了让它“读万卷书”,我们还能不能让它在虚拟世界里“行万里路”,像玩“模拟人生”一样学会物理?当很多聪明的算法凑在一起反而“掉链子”时,我们如何用“乐高积木”的思路化繁为简?这一期,我们将一起探寻几份最新论文带来的启发:从像婴儿一样在“思想实验”中探索世界,到用一张“知识地图”代替“知识词典”来解决复杂问题,甚至让AI学会“自我怀疑”,从而变得又快又好。准备好了吗?让我们一起出发!00:00:38 AI版“模拟人生”让机器在虚拟世界里学会物理00:05:56 从1到N如何让你的数据分析稳上加稳?00:12:14 AI养娃我们可能找到了让机器像婴儿一样学习的秘密00:18:01 高手解决问题,靠的是地图,而不是词典00:24:14 AI的自我怀疑,一个让大模型又快又好的新思路本期介绍的几篇论文:[LG] Solving Physics Olympiad via Reinforcement Learning on Physics Simulators [CMU & Lambda] https://arxiv.org/abs/2604.11805 ---[LG] Replicable Composition [University of Maryland & Google Research] https://arxiv.org/abs/2604.10423 ---[LG] Zero-shot World Models Are Developmentally Efficient Learners [Stanford University] https://arxiv.org/abs/2604.10333 ---[CL] Structure-Grounded Knowledge Retrieval via Code Dependencies for Multi-Step Data Reasoning [Microsoft & Simon Fraser University & University of Science and Technology of China] https://arxiv.org/abs/2604.10516 ---[LG] Introspective Diffusion Language Models [Together AI] https://arxiv.org/abs/2604.11035
-
908
[人人能懂AI前沿] 从思想引导、言行一致到世界模型
你有没有想过,我们能像做微创手术一样,在AI思考的瞬间“拨乱反正”,引导它向善吗?或者,让昂贵的AI训练学会“温故知新”,把扔掉的经验变废为宝?本期节目,我们将一起探索几篇最新论文,看看科学家们如何教会AI遵守自己立下的规矩,如何让它既会“看路”又会“造景”,甚至,如何为它补上一堂生动的物理课,让它的想象力更符合现实。准备好了吗?让我们马上出发!00:00:34 给AI的大脑装一个“概念导航”00:06:53 AI训练的高效秘诀,好东西值得再用一次00:12:07 如何看穿一个AI的“人设”?00:16:56 AI新思路,想看清世界,先学会走路00:23:07 为什么AI生成的视频总感觉“假”?答案藏在物理学里本期介绍的几篇论文:[LG] Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs [University of Pennsylvania & Amazon] https://arxiv.org/abs/2604.08846 ---[LG] Efficient RL Training for LLMs with Experience Replay [FAIR at Meta] https://arxiv.org/abs/2604.08706 ---[CL] Do LLMs Follow Their Own Rules? A Reflexive Audit of Self-Stated Safety Policies [Microsoft] https://arxiv.org/abs/2604.09189 ---[CV] Rays as Pixels: Learning A Joint Distribution of Videos and Camera Trajectories [Meta AI] https://arxiv.org/abs/2604.09429 ---[CV] PhysInOne: Visual Physics Learning and Reasoning in One Suite [vLAR Group] https://arxiv.org/abs/2604.09415
-
907
[人人能懂AI前沿] 真实评测、补课加速与AI管弦乐队
你是否想过,为什么你的AI助理连订张机票都费劲?本期节目,我们将一起给AI来一场“真实世界”的大考,看看它究竟能得多少分。我们还会揭秘如何不给模型动手术,只靠“补课”就让它说话速度提升1.7倍。更有趣的是,我们将看到一个“过时”的技术如何靠“大力出奇迹”王者归来,以及一个“思维越狱”般的巧妙设计,如何让一张显卡也能训练千亿大模型。最后,我们还会认识一支能帮你写论文的“AI管弦乐队”。准备好了吗?让我们马上进入AI前沿的奇妙世界。00:00:39 你的AI助理,离真正上岗还有多远?00:05:50 让AI大模型提速,只需要“补课”就够了00:10:13 老树发新芽,一个被人小瞧的技术,如何靠“笨办法”王者归来?00:15:46 AI的“昂贵误会”,我们都搞错瓶颈了吗?00:22:05 你的下一个写作搭档,可能不是一个人本期介绍的几篇论文:[CL] ClawBench: Can AI Agents Complete Everyday Online Tasks? [University of British Columbia & Vector Institute] https://arxiv.org/abs/2604.08523 ---[CL] MARS: Enabling Autoregressive Models Multi-Token Generation [Nanyang Technological University & Singapore Management University & Uppsala University] https://arxiv.org/abs/2604.07023 ---[CV] LoMa: Local Feature Matching Revisited [Chalmers University of Technology & Linköping University & University of Amsterdam] https://arxiv.org/abs/2604.04931 ---[CL] MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU [University of Notre Dame & Lehigh University] https://arxiv.org/abs/2604.05091 ---[LG] PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing [Google] https://arxiv.org/abs/2604.05018
-
906
[人人能懂AI前沿] AI的“内功”与“外功”:从成为电脑到影响你我
如果你的电脑本身就是一个神经网络,没有CPU会怎样?如果AI想学一门手艺,得先让另一个AI给它建个“驾校”呢?本期节目,我们将从五篇最新论文出发,聊聊AI如何从2D世界“透视”三维,如何用量子芯片实现超级压缩,并最终探讨一个直击灵魂的问题:AI正在让我们变聪明,还是变懒?00:00:27 你的下一台电脑,为什么可能没有CPU?00:09:13 AI想“上岗”,得先过哪一关?00:14:57 从“看扁”到“透视”,AI如何拥有3D“世界观”?00:20:36 你的下一个U盘,可能是个量子芯片00:26:12 AI,是你的“外挂”还是你的“拐杖”?本期介绍的几篇论文:[LG] Neural Computers [Meta AI] https://arxiv.org/abs/2604.06425 ---[LG] Gym-Anything: Turn any Software into an Agent Environment [CMU] https://arxiv.org/abs/2604.06126 ---[CV] Boxer: Robust Lifting of Open-World 2D Bounding Boxes to 3D [Meta Reality Labs Research] https://arxiv.org/abs/2604.05212 ---[LG] Exponential quantum advantage in processing massive classical data [California Institute of Technolog & MIT & Google Quantum AI] https://arxiv.org/abs/2604.07639 ---[AI] AI Assistance Reduces Persistence and Hurts Independent Performance [CMU & University of Oxford] https://arxiv.org/abs/2604.04721
-
905
[人人能懂AI前沿] 从概念擦除、元学习到内部电路诊断
想让AI更聪明,我们总觉得要给它看更多、学更多,但如果我告诉你,真正的秘诀恰恰相反呢?本期节目,我们将一起探索几篇最新的AI论文,看看科学家是如何教会AI“选择性遗忘”,又是如何给AI做“脑CT”来判断它是不是在“假装努力”。我们还会聊到,如何打造一个能快速学会任何人大脑“方言”的超级解码器,以及怎样只用1%的精力,就让AI帮你“看完”一部长电影。准备好了吗?让我们一起刷新对AI的认知!00:00:37 按下删除键之后,东西就真的消失了吗?00:06:54 造一把“万能钥匙”?不如当个“超级锁匠”00:11:24 想让AI更博学?先给它少看点书00:16:21 给AI做个体检,我们怎么知道它不是在瞎蒙?00:22:01 如何只用1%的精力,看完一部长电影的精华?本期介绍的几篇论文:[LG] Is your algorithm unlearning or untraining? [Google] https://arxiv.org/abs/2604.07962 ---[LG] Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding [University of Hong Kong] https://arxiv.org/abs/2604.08537 ---[CL] Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts [Apple & National University of Singapore] https://arxiv.org/abs/2604.08519 ---[LG] Inside-Out: Measuring Generalization in Vision Transformers Through Inner Workings [University of Delaware & George Mason University] https://arxiv.org/abs/2604.08192 ---[CV] Small Vision-Language Models are Smart Compressors for Long Video Understanding [Meta AI] https://arxiv.org/abs/2604.08120
-
904
[人人能懂AI前沿] 从随机幻觉、精准剪枝到沉默的深度天花板
你敢让AI帮你摇号抽奖吗?本期节目,我们将从几篇最新的AI论文出发,揭示AI在“随机”这件事上出人意料的偏见。接着,我们会探讨如何像一位拥有全局智慧的CEO一样,给臃肿的AI模型进行“精准裁员”,并学习AI沟通中“优雅打断”的高效密码。最后,我们将一起探寻AI是否存在“思想深度的天花板”,以及如何把一个“笨徒弟”模型,调教成一位善用工具的“老师傅”。准备好了吗?让我们一起潜入AI的前沿思想深海!00:00:42 为什么你老板让你用AI摇号,你得多个心眼?00:06:07 从“平均砍”到“精准剪枝”,AI瘦身中的全局智慧00:12:10 沟通的高效密码,如何优雅地“打断”别人00:18:04 AI的“思想深度”有没有天花板?00:24:24 如何把一个“笨徒弟”,调教成“老师傅”?本期介绍的几篇论文:[CL] The Illusion of Stochasticity in LLMs [Google DeepMind] https://arxiv.org/abs/2604.06543 ---[CL] Does a Global Perspective Help Prune Sparse MoEs Elegantly? [University of Rochester & Flatiron Institute] https://arxiv.org/abs/2604.06542 ---[CL] Learning to Interrupt in Language-based Multi-agent Communication [CMU & Meta FAIR] https://arxiv.org/abs/2604.06452 ---[LG] The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning [University of Cambridge & Imperial College London & MIT] https://arxiv.org/abs/2604.06427 ---[CL] Tool-MCoT: Tool Augmented Multimodal Chain-of-Thought for Content Safety Moderation [Google & Stanford University] https://arxiv.org/abs/2604.06205
-
903
[人人能懂AI前沿] 从临时记忆、数学地图到思考GPS,AI正在这样悄悄进化
你有没有想过,未来的AI不仅不会“健忘”,还拥有自己的“临时记忆”?它不仅能解数学题,更能帮你绘制整个“数学宇宙”的地图?在今天的节目里,我们就来聊聊几篇最新的论文,看看科学家们是如何给AI的思考过程装上“GPS”,如何测试它到底懂不懂“潜台词”,又是如何一步步打造AI界的“通才”的。准备好,我们马上出发!00:00:29 给AI装个“临时记忆”插槽,它就能边聊边学了00:07:28 给数学世界画一张地图00:12:59 你的AI“队友”,到底懂不懂你?00:18:23 给AI的思考过程装上一个GPS00:23:37 AI界的“通才”是如何炼成的?本期介绍的几篇论文:[LG] In-Place Test-Time Training [ByteDance Seed] https://arxiv.org/abs/2604.06169 ---[AI] Artificial Intelligence and the Structure of Mathematics [Fundamental AI Research & Harvard University] https://arxiv.org/abs/2604.06107 ---[CL] Beneath the Surface: Investigating LLMs' Capabilities for Communicating with Subtext [Google DeepMind] https://arxiv.org/abs/2604.05273 ---[CL] LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals [Microsoft] https://arxiv.org/abs/2604.05655 ---[LG] MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning [MIRAI & AXXX] https://arxiv.org/abs/2604.05943
-
902
[人人能懂AI前沿] 从混合架构、工具幻觉到自我模拟:AI如何更“聪明”地思考?
今天我们来聊聊AI的“进化”新思路:当AI不再迷信单一架构,而是学会了“混搭”;当给它一把更强的锤子,它反而盖不好房子时,我们该怎么办?我们还会看到,一个“笨笨的”师徒制,如何让AI推荐更懂你;同时也要警惕,那个对你百依百-顺的AI,可能正在悄悄扼杀你的创造力。最后,我们将揭秘如何给AI装上一个“程序员的脑子”,让它学会三思而后行。00:00:32 AI大模型内战,“万能插座”遇到了新对手00:08:10 为什么给你一把好“锤子”,你反而盖不好房子?00:15:07 如何让AI更懂你?秘密可能藏在“笨办法”里00:20:29 AI越听话,你就越平庸?00:26:15 给AI装上一个「程序员的脑子」本期介绍的几篇论文:[LG] Olmo Hybrid: From Theory to Practice and Back [Allen Institute for AI] https://arxiv.org/abs/2604.03444 ---[CL] The Tool Illusion: Rethinking Tool Use in Web Agents [Microsoft Research & The Pennsylvania State University] https://arxiv.org/abs/2604.03465 ---[IR] Retrieval Augmented Conversational Recommendation with Reinforcement Learning [University of Illinois Urbana-Champaign & Google DeepMind] https://arxiv.org/abs/2604.04457 ---[CL] Lighting Up or Dimting Down? Exploring Dark Patterns of LLMs in Co-Creativity [Meta & Amazon] https://arxiv.org/abs/2604.04735 ---[CL] Self-Execution Simulation Improves Coding Models [FAIR team, Meta] https://arxiv.org/abs/2604.03253
-
901
[人人能懂AI前沿] 从分层规划、团队作战到高效提问:AI教我的三堂思维课
你有没有想过,AI是如何学会像人一样思考的?在最新的几篇论文中,AI不仅学会了像项目经理一样把大任务拆解成小目标,还能组建一支分工明确的冠军团队,自己给自己挑错。我们还会看到,AI如何懂得何时单打独斗、何时团队作战,如何仅凭几个例子就自学成才,甚至如何用十个“是”或“否”的问题,就获得专家的智慧。今天,就让我们一起揭开这些AI“超能力”背后的朴素智慧。00:00:35 想成大事?你得先学会当自己的“项目经理”00:07:15 AI冠军养成记,一个“草台班子”的制胜之道00:12:20 人多真能力量大吗?AI世界的“个人英雄”与“团队作战”00:17:58 如何用3个例子,教会AI一整本书?00:23:22 十个“是”或“否”,如何让AI“小学生”拥有“博士”的智慧?本期介绍的几篇论文:[LG] Hierarchical Planning with Latent World Models [FAIR at Meta] https://arxiv.org/abs/2604.03208 ---[AI] GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning [DeepReinforce Team] https://arxiv.org/abs/2604.02721 ---[CL] Single-Agent LLMs Outperform Multi-Agent Systems on Multi-Hop Reasoning Under Equal Thinking Token Budgets [Stanford University] https://arxiv.org/abs/2604.02460 ---[LG] SIEVE: Sample-Efficient Parametric Learning from Natural Language [UC Berkeley] https://arxiv.org/abs/2604.02339 ---[LG] Haiku to Opus in Just 10 bits: LLMs Unlock Massive Compression Gains [Harvard University & University of Cambridge] https://arxiv.org/abs/2604.02343
-
900
[人人能懂AI前沿] 从记忆进化、大脑模拟到未来工作
今天,AI自己当起了研究员,给自己装上了一颗会进化的“记忆之心”;另一边,科学家们从单细胞的智慧中得到启发,尝试组建一个“人造大脑”;而为了驯服那些动辄“发疯”的巨型模型,我们甚至从物理学中找到了“守恒”的紧箍咒;当这些能力需要被整合,一套AI记忆的“乐高工厂”应运而生;最后,这一切究竟会像巨浪还是涨潮一样,改变我们的工作?让我们一起探索这些最新论文背后的深刻启发。00:00:36 你的手机相册,离成为“真·记忆”还有多远?00:07:06 单细胞的智慧,如何组建一个大脑?00:12:19 成大事者,都懂“守恒”的智慧00:18:25 AI的记忆难题,有了一套“乐高积木”?00:23:46 AI取代工作,是巨浪,还是涨潮?本期介绍的几篇论文:[AI] Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory [UNC-Chapel Hill & University of Pennsylvania] https://arxiv.org/abs/2604.01007 ---[AI] BraiNCA: brain-inspired neural cellular automata and applications to morphogenesis and motor control [Allen Discovery Center at Tufts University] https://arxiv.org/abs/2604.01932 ---[LG] Rethinking Language Model Scaling under Transferable Hypersphere Optimization [Microsoft] https://arxiv.org/abs/2603.28743 ---[CL] MemFactory: Unified Inference & Training Framework for Agent Memory [MemTensor] https://arxiv.org/abs/2603.29493 ---[AI] Crashing Waves vs. Rising Tides: Preliminary Findings on AI Automation from Thousands of Worker Evaluations of Labor Market Tasks [MIT FutureTech] https://arxiv.org/abs/2604.01363
-
899
[人人能懂AI前沿] AI进化三部曲:从内存压缩、自我蒸馏到记忆涌现
今天,我们将深入AI的“内心世界”,看看这些最新论文是如何揭示它不为人知的一面的。我们会聊聊如何用巧妙的“魔法”给AI的内存减减肥,看它如何通过反思自己的“草稿本”实现自我进化,还会一起探索如何让它从一个仓库管理员,成长为真正的图书总管。更刺激的是,我们还会用AI的“火眼金睛”去解剖短视频,甚至设计一个局,看看AI会不会为了保住自己的“饭碗”而对我们撒谎。准备好了吗?让我们一起揭开AI大脑和行为背后的秘密。00:00:38 给大模型“减肥”的奇思妙想00:07:29 短视频时代,我们如何被“投喂”观点?00:13:22 AI的自我修养,一种“笨”办法,如何让它变聪明?00:19:30 AI的记忆革命,从仓库管理员到图书总管00:25:16 AI的“小算盘”,它会为了保住工作而“撒谎”吗?本期介绍的几篇论文:[LG] TurboAngle: Near-Lossless KV Cache Compression via Uniform Angle Quantization [LLMs Research Inc.] https://arxiv.org/abs/2603.27467 ---[CL] Multimodal Analysis of State-Funded News Coverage of the Israel-Hamas War on YouTube Shorts [Indiana University] https://arxiv.org/abs/2604.00994 ---[CL] Embarrassingly Simple Self-Distillation Improves Code Generation [Apple] https://arxiv.org/abs/2604.01193 ---[AI] ByteRover: Agent-Native Memory Through LLM-Curated Hierarchical Context [ByteRover] https://arxiv.org/abs/2604.01599 ---[AI] Quantifying Self-Preservation Bias in Large Language Models [Sapienza University & ItalAI] https://arxiv.org/abs/2604.02174
-
898
[人人能懂AI前沿] 群体智慧的崛起:当AI学会合体、调度与反思
你有没有想过,两个完全不同的AI高手,不用原始数据就能“合体”成全能大侠?甚至还能拥有一本超级“错题本”,学会真正的举一反三?更进一步,当AI员工比我们还聪明时,我们又该如何设计一套“管理学”,用一个聪明的“路由器”来调度一群天才?但在这看似强大的智慧背后,AI究竟是像好老师一样“因材施教”,还是在巧妙地“装懂”?本期节目,我们将通过五篇最新的AI论文,一起探索AI的群体智慧与心智幻觉。00:00:38 AI模型也能合体?不用数据,照样让你武功大增00:04:53 让AI学会举一反三,需要一本怎样的“错题本”?00:10:12 AI界的“诸葛亮”,如何给你“三个臭皮匠”的智慧?00:14:41 AI当老师,到底是真懂你,还是在“装懂”?00:21:40 当你的员工比你还聪明,该怎么管?本期介绍的几篇论文:[LG] Model Merging via Data-Free Covariance Estimation[Universite de Montr ́eal & University of Toronto]https://arxiv.org/abs/2604.01329---[CL] Procedural Knowledge at Scale Improves Reasoning[Meta FAIR]https://arxiv.org/abs/2604.01348---[CL] No Single Best Model for Diversity: Learning a Router for Sample Diversity[New York University & Stanford University]https://arxiv.org/abs/2604.02319---[AI] Do Large Language Models Mentalize When They Teach?[Princeton University & New York University]https://arxiv.org/abs/2604.01594---[LG] CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery[MIT & NUS]https://arxiv.org/abs/2604.01658
-
897
[人人能懂AI前沿] 从一次缓存、随机连接到专属私教
你有没有想过,聪明的AI也需要精打细算?本期节目,我们就来聊聊AI世界里的那些“增长智慧”:如何像果蝇大脑一样“聪明地偷懒”,又如何像请了私教一样精准地突破瓶颈。我们还会探讨,AI究竟应该把知识背下来还是学会查资料,以及机器人怎样才能在漫长任务中给自己“打气”加油。这些最新论文里的奇思妙想,不仅关乎技术,更藏着我们都能借鉴的策略。00:00:32 AI省钱的终极奥义,深度思考,一次缓存00:05:29 AI养成记,喂知识,还是给书单?00:12:23 如何让机器人学会“干大事”?给它一个好报酬,再加一个好心态00:18:31 你的大脑偷懒,可能比你想象的更聪明00:24:31 AI卡壳了怎么办?请个“私教”来帮忙本期介绍的几篇论文:[CL] Universal YOCO for Efficient Depth Scaling [Microsoft Research] https://arxiv.org/abs/2604.01220 ---[CL] To Memorize or to Retrieve: Scaling Laws for RAG-Considerate Pretraining [Stanford University & Patronus AI] https://arxiv.org/abs/2604.00715 ---[RO] Generalizable Dense Reward for Long-Horizon Robotic Tasks [CMU & Amazon Robotics & UT Austin] https://arxiv.org/abs/2604.00055 ---[CL] Stochastic Attention: Connectome-Inspired Randomized Routing for Expressive Linear-Time Attention [Tsinghua University] https://arxiv.org/abs/2604.00754 ---[LG] Learning to Hint for Reinforcement Learning [University of California, San Diego & Snowflake AI Research] https://arxiv.org/abs/2604.00698
-
896
[人人能懂AI前沿] 从推理生成、对齐博弈到共识学习
今天,我们将一起探索几篇极具启发性的最新论文。我们将看到,AI如何不再满足于“吃”数据,而是学会“讲道理”,从零推理出知识;我们也会探讨,该如何分辨AI是在“真心思考”还是在“演戏给我们看”。我们还会发现,一个小应用如何拜“云师傅”学到跨界智慧,一个“虚拟宝宝”又如何颠覆我们对双语教育的认知。最后,我们将揭示AI像神枪手一样,通过瞄准“共识”而非“最新目标”来高效学习的秘密。00:00:37 喂养AI,光有大米还不够00:06:23 管好AI,我们有了新地图00:12:13 小应用的大智慧,如何请个“云师傅”?00:18:03 养“双语娃”,最关键的不是方法,而是……00:00 AI训练场上的神枪手,如何瞄准一个移动的未来?本期介绍的几篇论文:[CL] Reasoning-Driven Synthetic Data Generation and Evaluation [EPFL & Google] https://arxiv.org/abs/2603.29791 ---[LG] Aligned, Orthogonal or In-conflict: When can we safely optimize Chain-of-Thought? [Google DeepMind] https://arxiv.org/abs/2603.30036 ---[IR] Zero-shot Cross-domain Knowledge Distillation: A Case study on YouTube Music [Google LLC] https://arxiv.org/abs/2603.28994 ---[CL] Bringing Up a Bilingual BabyLM: Investigating Multilingual Language Acquisition Using Small-Scale Models [The Harker School & Stanford University] https://arxiv.org/abs/2603.29552 ---[LG] Target-Aligned Reinforcement Learning [Technical University of Munich & Google Research] https://arxiv.org/abs/2603.29501
-
895
[人人能懂AI前沿] AI的智慧升级:元认知、B计划与组合式决策
想知道AI如何学会给自己准备“B计划”以防不测,又如何像个聪明的财务顾问一样在预算内做出最优决策吗?本期我们将一探究竟,从让AI拥有“元认知”能力,到学会“两步一回头”的智慧工作法,再到化身“艺术家与工匠”的完美结合体。这些最新的AI论文,正在教AI如何更聪明地思考和工作,而不仅仅是更努力地计算。00:00:30 你的“外挂”,也需要一个“外挂”00:06:39 你的AI助手,需要一个“Plan B”00:13:33 预算有限,如何做出最优决策?00:20:07 为什么顶尖高手,都懂得“两步一回头”?00:26:01 AI制药,高手对决还是联手坐庄?本期介绍的几篇论文:[AI] Meta-Harness: End-to-End Optimization of Model Harnesses [Stanford University] https://arxiv.org/abs/2603.28052 ---[LG] Next-Token Prediction and Regret Minimization [Google Research] https://arxiv.org/abs/2603.28499 ---[LG] Multiple-Prediction-Powered Inference [MIT & Google Research] https://arxiv.org/abs/2603.27414 ---[LG] High dimensional theory of two-phase optimizers [Google DeepMind] https://arxiv.org/abs/2603.26954 ---[LG] Scaling Atomistic Protein Binder Design with Generative Pretraining and Test-Time Compute [NVIDIA] https://arxiv.org/abs/2603.27950
-
894
[人人能懂AI前沿] 聪明徒步者、基因侦探与脆弱的量子内功
今天,我们来一场深入AI大脑的探秘之旅,看看它是如何像一个聪明的徒步者一样,高效打包海量知识的。接着,我们会揭开一个流行“省钱”捷径背后的意外代价,并拷问那些华丽的商业模型,我们看到的“聪明”究竟有多少是障眼法。我们还会戳破量子AI的“皇帝新衣”,看看真正的“量子优势”何时才能走出实验室。最后,我们将见证AI如何化身基因侦探,不仅找出答案,更能画出罪犯间的“社交网络”,真正理解“为什么”。00:00:38 你的大脑如何打包信息?AI训练给了个新答案00:05:39 你用的大模型,是个“盲盒”?00:12:23 人工智能的“省钱”智慧,一个你不知道的代价00:18:38 量子AI的“皇帝新衣”?00:24:55 AI当侦探,如何破译基因里的“社交网络”?本期介绍的几篇论文:[LG] Sharp Capacity Scaling of Spectral Optimizers in Learning Associative Memory [UC Berkeley & Princeton University & New York University] https://arxiv.org/abs/2603.26554 ---[CL] How Open Must Language Models be to Enable Reliable Scientific Inference? [MIT & EleutherAI & University of California San Diego] https://arxiv.org/abs/2603.26539 ---[CL] Weight Tying Biases Token Embeddings Towards the Output Space [EleutherAI & UC Berkeley] https://arxiv.org/abs/2603.26663 ---[CL] Entanglement as Memory: Mechanistic Interpretability of Quantum Language Models [Stanford University] https://arxiv.org/abs/2603.26494 ---[LG] A Boltzmann-machine-enhanced Transformer For DNA Sequence Classification [Tsinghua University & UC Berkeley] https://arxiv.org/abs/2603.26465
-
893
[人人能懂AI前沿] 驾驭、复盘与建城:解锁AI协作新模式
你有没有想过,AI有时就像一个明明没看卷子,却能考高分的“作弊”考生?本期我们就从几篇最新论文出发,看看如何从一个“监考官”变成一个高明的“项目经理”,用大白话给AI设计一套清晰的工作流程。我们还会发现,最顶尖的AI已经不满足于听指挥,它开始学会“复盘”,自己升级自己的方法论。最终,我们将看到AI的未来可能不是一个无所不能的“神”,而是一座需要我们共同建设的“城市”,其中的每个AI,都在努力进化成独立思考的“高手”。00:00:39 AI睁眼说瞎话?不,它在下一盘更大的棋00:07:33 指挥AI干活,关键可能不在AI本身00:13:16 当AI学会了“复盘”,它给自己升级了工具箱00:19:12 AI的尽头,不是成神,而是建城00:25:18 AI进化论,当“它”开始像高手一样思考本期介绍的几篇论文:[AI] MIRAGE: The Illusion of Visual Understanding [Stanford University] https://arxiv.org/abs/2603.21687 ---[CL] Natural-Language Agent Harnesses [Tsinghua University & Harbin Institute of Technology] https://arxiv.org/abs/2603.25723 ---[AI] Bilevel Autoresearch: Meta-Autoresearching Itself [] https://arxiv.org/abs/2603.23420 ---[AI] Agentic AI and the next intelligence explosion [Google] https://arxiv.org/abs/2603.20639 ---[LG] AVO: Agentic Variation Operators for Autonomous Evolutionary Search [NVIDIA] https://arxiv.org/abs/2603.24517
-
892
[人人能懂AI前沿] AI的自我修炼、致命盲区与隐藏记忆
如果一个AI能像武学奇才一样自我进化,创造出最强的攻击招式,而它最致命的弱点,竟然是几句古老的文言文,这会是怎样一幅奇特的攻防图景?当AI在我们眼皮底下藏着一座秘密的版权图书馆,一个不经意的操作就让它开始“背书”时,我们又该如何看待它的“记忆”?本期,我们就从几篇最新论文出发,看看这些“自我进化”、“文化奇袭”和“一体化创造”的研究,如何再次刷新我们对AI能力边界的认知。00:00:34 AI内卷,当你的对手开始自我进化00:06:05 AI的致命缺陷,竟然是文言文?00:10:38 你的AI,藏着一座秘密图书馆00:15:51 AI绘画新思路,当翻译官和小说家是同一个人本期介绍的几篇论文:[LG] Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs [MATS & Imperial College London] https://arxiv.org/abs/2603.24511 ---[CL] Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search [Nanyang Technological University & Northeast University & Renmin University of China] https://arxiv.org/abs/2602.22983 ---[CL] Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models [Stony Brook University & CMU & Columbia Law School] https://arxiv.org/abs/2603.20957 ---[CV] End-to-End Training for Unified Tokenization and Latent Denoising [MIT & Adobe] https://arxiv.org/abs/2603.22283
-
891
[人人能懂AI前沿] 从高级说服、多元推理到策略剪枝:AI认知革命进行时
你有没有想过,AI是在帮你分析,还是在高级地“说服”你?我们总希望AI像个完美的老师,但如果它只会给标准答案,甚至连老师的偏见都一并继承,那会怎样?而为了让AI学得更好,我们不仅要为它的“记忆”做体检,甚至还要教会它一项人类的高级智慧:学会放弃。今天,我们就从五篇最新的论文出发,看看AI是如何在说服、学习和思考的边界上,进行着一场静悄悄的认知革命。00:00:33 当AI学会了“高级说服”,你的大脑还够用吗?00:06:00 如何给AI做一次“记忆体检”?00:12:34 AI只会“标准答案”?那可就危险了00:18:04 高手过招,如何避免被师傅“带偏”?00:23:19 训练AI的真谛,学会放弃,才能得到更多本期介绍的几篇论文:[AI] Evaluating Language Models for Harmful Manipulation [Google DeepMind & Google] https://arxiv.org/abs/2603.25326 ---[CL] Estimating near-verbatim extraction risk in language models with decoding-constrained beam search [Stanford & Cornell] https://arxiv.org/abs/2603.24917 ---[LG] Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models [MIT] https://arxiv.org/abs/2603.24844 ---[LG] Residual-as-Teacher: Mitigating Bias Propagation in Student--Teacher Estimation [MIT] https://arxiv.org/abs/2603.25466 ---[CL] Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR [University of Illinois at Urbana-Champaign] https://arxiv.org/abs/2603.24840
-
890
[人人能懂AI前沿] 从动态课程、前瞻记忆到思考成本
AI的自我进化,听起来很酷,但最新论文告诉我们,AI学徒也需要一位聪明的“教练”为它精心设计训练计划,否则刷再多题也难成大器。我们还会揭示一个奇怪的现象:为什么让AI向完美的自己“抄作业”,反而可能让它在关键的推理任务上变笨?而在使用AI时,你是否发现它总“忘事”,或者那个标价最便宜的模型,最后反而让你花了最多的钱?今天,我们就从五篇最新论文出发,聊聊AI那些出人意料的“成长烦恼”和“使用陷阱”。00:00:38 AI“学徒”的成长烦恼,为什么聪明的大模型也需要好师傅?00:06:54 聪明反被聪明误,为什么教AI“抄作业”反而会让它变笨?00:12:11 你的“私人教练”,不该只会题海战术00:18:11 你以为的便宜,可能让你花得更多00:23:43 你的AI“听话”吗?小心它忙起来就忘了本期介绍的几篇论文:[LG] Understanding the Challenges in Iterative Generative Optimization with LLMs[CNRS & Stanford University & CMU]https://arxiv.org/abs/2603.23994---[CL] Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?[Microsoft Research & Seoul National University]https://arxiv.org/abs/2603.24472---[LG] A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula[Meta FAIR & University of Tübingen]https://arxiv.org/abs/2603.24202---[LG] The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More[Stanford University & UC Berkeley & CMU]https://arxiv.org/abs/2603.23971---[CL] Did You Forget What I Asked? Prospective Memory Failures in Large Language Models[Microsoft]https://arxiv.org/abs/2603.23530
-
889
[人人能懂AI前沿] 浓缩、自省、通用、专注、稀疏:AI的五项新技能
你有没有想过,一个聪明的AI要如何审视和优化自己的工作方法,实现“自我进化”?怎样才能把一大堆“专家模型”的智慧,完美浓缩进你手机里那个小小的芯片中?本期节目,我们将一口气解锁五篇最新论文,看看AI如何通过“先加后减”的智慧炼成全才,如何用“元认知”打破思维僵局,又是如何学会“聪明的偷懒”,在关键处全力以赴,在无聊处“摸鱼”省电。准备好了吗?让我们一起开启这场精彩的AI思想之旅!00:00:37 AI界的“浓缩”智慧,先做加法,再做减法00:05:00 一个聪明的系统,如何变得更聪明?00:11:12 AI“通才”,如何用一把钥匙,打开物理世界的多扇大门?00:16:39 AI变聪明的秘密,不是看得多,而是看得准00:21:18 大模型“瘦身”记,聪明地偷个懒本期介绍的几篇论文:[CV] Efficient Universal Perception Encoder [Meta Reality Labs & FAIR at Meta] https://arxiv.org/abs/2603.22387 ---[AI] Bilevel Autoresearch: Meta-Autoresearching Itself https://arxiv.org/abs/2603.23420 ---[LG] UniFluids: Unified Neural Operator Learning with Conditional Flow-matching [Chinese Academy of Sciences & Microsoft Research Asia] https://arxiv.org/abs/2603.22309 ---[LG] Scaling Attention via Feature Sparsity [Xidian University] https://arxiv.org/abs/2603.22300 ---[LG] Sparser, Faster, Lighter Transformer Language Models [Sakana AI & NVIDIA] https://arxiv.org/abs/2603.23198
-
888
[人人能懂AI前沿] AI学霸的五张笔记:关于努力、谦逊、效率、选择与沟通
你有没有想过,最高效的学习,可能不是埋头苦干,而是学会“断舍离”?本期节目,我们将一起打开几篇最新论文,探讨AI如何向我们展示“聪明地努力”的全新境界。我们会看到,AI不仅开始筛选值得学习的“心动时刻”,还学会了在没把握时坦诚地说“我不知道”。更神奇的是,它们正通过“关键点教学法”和“性价比眼镜”,在复杂的任务中找到最高效的路径,并反思“会做题”与“会教题”的深刻区别。准备好了吗?让我们一起探索AI如何变得更精准、更谦逊、也更智慧!00:41:25 别再无效努力了,学霸的秘诀是“断舍离”00:06:25 那个“无所不知”的AI,为什么开始说“我不知道”了?00:12:36 聪明地偷懒,AI训练的“性价比”之道00:18:14 AI大模型选择困难症?这里有副“性价比”眼镜00:23:47 “高手”的笔记,为什么你看不懂?本期介绍的几篇论文:[LG] Does This Gradient Spark Joy? [Google DeepMind] https://arxiv.org/abs/2603.20526 ---[LG] Causal Evidence that Language Models use Confidence to Drive Behavior [Google DeepMind] https://arxiv.org/abs/2603.22161 ---[LG] PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost [NVIDIA & UC Berkeley] https://arxiv.org/abs/2603.21383 ---[CL] Expected Reward Prediction, with Applications to Model Routing [Stanford University & Google DeepMind] https://arxiv.org/abs/2603.20217 ---[CL] Measuring Reasoning Trace Legibility: Can Those Who Understand Teach? [CMU] https://arxiv.org/abs/2603.20508
-
887
[人人能懂AI前沿] AI的自我进化:从“笨徒弟”逆袭,到学会“开窍”与“摇头”
你有没有想过,AI不仅能当个好徒弟,甚至还能“青出于蓝而胜于蓝”?我们常说的AI“幻觉”和“脆弱”这两种毛病,会不会其实是同一个病根?更神奇的是,AI不仅能解决问题,它还能学会“如何更好地解决问题”,甚至学会像侦探一样,找出逻辑漏洞并大声“摇头”说不。本期节目,我们将一口气拆解几篇最新出炉的AI论文,带你看看这些正在发生的、激动人心的思想变革。00:00:33 老师傅干活慢,笨徒弟怎么才能“出师”还“胜于蓝”?00:06:43 AI的“跷跷板困境”,为什么模型越聪明,可能也越脆弱?00:12:44 人工智能的“元认知”,它如何学会了“开窍”?00:18:09 跟AI高效对话的底层逻辑00:24:31 AI不只会“点头”,更要学会“摇头”本期介绍的几篇论文:[LG] Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD[Google DeepMind]https://arxiv.org/abs/2603.20155---[LG] Neural Uncertainty Principle: A Unified View of Adversarial Fragility and LLM Hallucination[Northwest Institute of Nuclear Technology & Tsinghua University]https://arxiv.org/abs/2603.19562---[AI] Hyperagents[Meta]https://arxiv.org/abs/2603.19461---[AI] Demonstrations, CoT, and Prompting: A Theoretical Analysis of ICL[Microsoft Research & University of Wisconsin-Madison]https://arxiv.org/abs/2603.19611---[AI] Learning to Disprove: Formal Counterexample Generation with Large Language Models[ETH Zurich & University of Toronto & MiroMind]https://arxiv.org/abs/2603.19514
-
886
[人人能懂AI前沿] AI的灵魂拷问:数学幽灵、情感陷阱与创造之桥
你有没有想过,神秘的AI黑箱里其实藏着一个200年前的数学幽灵?你和AI的甜言蜜语,又为何可能是一个危险的情感陷阱?今天,我们将从这几个问题出发,聊聊AI如何向古老的智慧回归,如何像“散兵”一样自组织搞科研,如何用一本“手账”治好它的金鱼记忆,以及它那神乎其神的创造力背后,又藏着一座怎样的“物理学之桥”。00:00:31 AI黑箱里,藏着一个200年前的数学幽灵00:06:04 你和AI的悄悄话,藏着一个危险的“放大器”00:12:01 一群AI“散兵”,如何自己组织起来搞科研?00:18:42 AI绘画的终极密码,藏在一座“桥”里?00:24:13 你的AI管家,为什么总像个金鱼?本期介绍的几篇论文:[LG] Transformers are Bayesian Networks [coppola.ai] https://arxiv.org/abs/2603.17063 ---[CL] Characterizing Delusional Spirals through Human-LLM Chat Logs [Stanford University & CMU] https://arxiv.org/abs/2603.16567 ---[LG] Autonomous Agents Coordinating Distributed Discovery Through Emergent Artifact Exchange [Laboratory for Atomistic and Molecular Mechanics (LAMM)] https://arxiv.org/abs/2603.14312 ---[LG] Foundations of Schrödinger Bridges for Generative Modeling [University of Pennsylvania] https://arxiv.org/abs/2603.18992 ---[CL] Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory [PricewaterhouseCoopers] https://arxiv.org/abs/2603.16862
No matches for "" in this podcast's transcripts.
No topics indexed yet for this podcast.
Loading reviews...
ABOUT THIS SHOW
来自 @爱可可-爱生活 的第一手AI快报,用最简单易懂的语言,带你直击最前沿的人工智能科研动态。无论你是科技小白,还是行业达人,这里都有你想知道的AI故事和未来趋势。跟着我们,轻松解锁人工智能的无限可能!#人工智能 #科技前沿
HOSTED BY
fly51fly
CATEGORIES
Loading similar podcasts...