
[人人能懂AI前沿] AI的思考艺术:从深度循环、多维罗盘到概率分身你有没有想过,AI变聪明,除了靠“大力出奇迹”的蛮力,还能不能靠“四两拨千斤”的巧劲?本期节目,我们将一起探寻几篇最新论文带来的奇妙思路:看AI如何用更精巧的大脑结构深度思考,如何拥有一把防止跑偏的多维度“罗盘”,又如何像我们一样分身“脑暴”探索多种可能。我们甚至会看到,AI如何学会“过日子”,成为一个既懂创作又懂节约的默契搭档,这一切,都要从一块小小的玻璃说起。 00:00:37 AI新物种,有一种聪明,不是靠“蛮力” 00:06:37 AI对齐的“罗盘”,如何让模型不跑偏? 00:12:44 不想只走一条路?AI的“概率性思考”新玩法 00:17:59 鱼和熊掌,计算机如何看清一块玻璃? 00:22:33 你的音乐搭档,不止会创作,更会“过日子” 本期介绍的几篇论文: [CL] HRM-Text: Efficient Pretraining Beyond Scaling [Sapient Intelligence & MIT] https://arxiv.org/abs/2605.20613 --- [LG] General Preference Reinforcement Learning [Stanford University & The University of Oklahoma] https://arxiv.org/abs/2605.18721 --- [AI] Generative Recursive Reasoning [KAIST] https://arxiv.org/abs/2605.19376 --- [CV] RT-Splatting: Joint Reflection-Transmission Modeling with Gaussian Splatting [Peking University] https://arxiv.org/abs/2605.18263 --- [AS] Stable Audio 3 [Stability AI] https://arxiv.org/abs/2605.17991
[人人能懂AI前沿] AI如何思考?偷师、烧脑与画地图的艺术你有没有想过,一个更聪明的AI,也许并不需要更大的体量,而是需要更精巧的设计?本期节目,我们将从五篇最新论文出发,揭示AI智慧的内部运作:看AI如何为自己的记忆装上独立的“铅笔和橡皮”;又是如何像系着安全绳的“醉汉”一样,去挑战顶尖的数学难题。我们还会探讨AI如何拥有一个“大脑CEO”来决定何时“烧脑”,以及在一场模型间的“偷师”攻防战中,如何才能守住核心秘籍。最后,你会发现,原来喂给AI的第一张“地图”,就早已决定了它能看多远。 00:00:42 AI的记忆难题,一支笔和一个橡皮擦 00:06:17 AI当助教,数学家离“下岗”还有多远? 00:11:55 你的大脑,如何决定何时“烧脑”? 00:16:46 聪明人是如何“偷”老师的武功秘籍的? 00:23:15 喂给AI的“地图”,决定了它能看多远 本期介绍的几篇论文: [LG] Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention [NVIDIA] https://arxiv.org/abs/2605.22791 --- [AI] Advancing Mathematics Research with AI-Driven Formal Proof Search [Google DeepMind] https://arxiv.org/abs/2605.22763 --- [CL] Efficient Agentic Reasoning Through Self-Regulated Simulative Planning [Institute of Foundation Models (IFM) & CMU] https://arxiv.org/abs/2605.22138 --- [LG] The Distillation Game: Adaptive Attacks & Efficient Defenses [Stanford University & Toyota Technological Institute at Chicago] https://arxiv.org/abs/2605.22737 --- [LG] Lost in Tokenization: Fundamental Trade-offs in Graph Tokenization for Transformers [Meta AI & New York University & Harvard University] https://arxiv.org/abs/2605.22471
[人人能懂AI前沿] AI如何加速科学、欺骗我们、又最终懂你?你有没有想过,一个AI不仅能成为科学家的“超级大脑”,还能像人一样“反思”自己学得好不好?本期节目,我们将从五篇最新的AI论文出发,揭秘AI如何通过“人机协作”加速科学发现,却又可能因为追求“差不多”而酿成大错;同时我们也会探讨,为何你请的AI“演员”可能演着演着就换了人,以及我们最终如何才能让AI调配出一碗最懂你的“光谱靓汤”。 00:00:33 给牛顿一个AI,科学会快多少? 00:07:11 AI训练场上的“反思怪”,一条更聪明的成长路径 00:12:36 AI的“差不多”,为什么会酿成大错? 00:18:28 为什么你请的AI“演员”,可能演着演着就换人了? 00:25:25 想让AI懂你?试试给它煲一锅“光谱靓汤” 本期介绍的几篇论文: [AI] A multi-agent system for automating scientific discovery [FutureHouse] https://www.nature.com/articles/s41586-026-10652-y --- [LG] Introspective X Training: Feedback Conditioning Improves Scaling Across all LLM Training Stages [NVIDIA] https://arxiv.org/abs/2605.20285 --- [LG] Mechanisms of Misgeneralization in Physical Sequence Modeling [Harvard College & Microsoft & Comcast AI] https://arxiv.org/abs/2605.20299 --- [CL] The Illusion of Intervention: Your LLM-Simulated Experiment is an Observational Study [Google DeepMind] https://arxiv.org/abs/2605.20767 --- [LG] Spectral Souping: A Unified Framework for Online Preference Alignment [Google DeepMind & Google Research] https://arxiv.org/abs/2605.20408
[人人能懂AI前沿] 从统一优化、系统约束到学会谦逊你是否想过,AI不仅能找到一把优化万物的“万能扳手”,还能从“垃圾”数据中炼出真金?这一期,我们将一同见证AI如何跳出“训练好人”的思维陷阱,用“好制度”保障安全,甚至学会谦虚地向人类专家请教。让我们一起探索这些最新论文背后,令人拍案叫绝的智慧! 00:00:27 找到那把能优化万物的“万能扳手” 00:06:04 AI训练的秘密,为什么“垃圾”也能变黄金? 00:11:16 从AI安全,看“好制度”如何战胜“好人 00:16:03 如何用“笨”问题,精准定位一个“看不见”的目标? 00:24:06 AI也懂谦虚?让机器学会“请教”的智慧 本期介绍的几篇论文: [CL] optimize_anything: A Universal API for Optimizing any Text Parameter [UC Berkeley] https://arxiv.org/abs/2605.19633 --- [LG] A Bitter Lesson for Data Filtering [Stanford University] https://arxiv.org/abs/2605.19407 --- [AI] Agent Security is a Systems Problem [Google & University of California San Diego] https://arxiv.org/abs/2605.18991 --- [LG] Optimal Reconstruction from Linear Queries [Technion – Israel Institute of Technology] https://arxiv.org/abs/2605.19625 --- [LG] Density-Ratio Losses for Post-Hoc Learning to Defer [KTH & Google Research] https://arxiv.org/abs/2605.19557
[人人能懂AI前沿] 从行为指纹、经济适用房到高手画骨:AI效率革命进行时你有没有想过,你用的AI可能藏着一个无法抹去的“行为指纹”?我们又该如何分辨它是在“假装努力”,还是真的在高效思考?本期节目,我们将从几篇最新论文出发,聊聊如何让AI作画学会“高手画骨”,如何让AI拥有“经济适用房”般的超高性价比内存,甚至,如何把它的线性思维,彻底变成并行模式。准备好了吗?让我们一起探索AI世界的深层智慧。 00:00:32 你的AI,有没有一个隐藏的“小动作”? 00:06:34 AI的“经济适用房”,怎么让它记性又好又省钱? 00:12:22 AI作画新思路,高手画骨,庸手填肉 00:17:36 AI大模型,怎样把一根长长的竹竿,掰成一捆筷子? 00:23:07 你的AI在“假装努力”吗? 本期介绍的几篇论文: [LG] Asking Back: Interaction-Layer Antidistillation Watermarks [University of California, Los Angeles & Lawrence Berkeley National Laboratory] https://arxiv.org/abs/2605.16462 --- [LG] OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization [Together AI] https://arxiv.org/abs/2605.17757 --- [LG] Dual-Rate Diffusion: Accelerating diffusion models with an interleaved heavy-light network [Google DeepMind Amsterdam & University of Amsterdam] https://arxiv.org/abs/2605.18190 --- [LG] SNLP: Layer-Parallel Inference via Structured Newton Corrections [Red Hat AI Innovation] https://arxiv.org/abs/2605.17842 --- [CL] Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models [University of Illinois Chicago] https://arxiv.org/abs/2605.17672
[人人能懂AI前沿] AI的笨功夫、马虎病与美食家难题你有没有想过,AI画画是不是也需要打草稿?面对一个“马虎”的AI,我们除了让它变聪明,还能不能帮它“划重点”?本期节目,我们将一口气解锁五篇最新论文里的智慧:看AI如何用“笨功夫”画出惊艳作品,如何从“作弊”中学到创造力,甚至如何用“供应链”思维组建一个高效的AI团队。准备好,我们一起看看AI是如何学会更聪明地工作的。 00:00:32 AI画画的“笨功夫” 00:05:58 人工智能的“马虎”病,我们找到了一个药方 00:10:45 让AI设计个东西,它居然学会了“作弊”? 00:17:04 AI搞团队建设,为什么总像拉一个草台班子? 00:23:00 AI大模型的美食家难题,如何调配完美的学习菜单? 本期介绍的几篇论文: [CV] One Pass Is Not Enough: Recursive Latent Refinement for Generative Models [Simon Fraser University] https://arxiv.org/abs/2605.15309 --- [CV] Minerva-Ego: Spatiotemporal Hints for Egocentric Video Understanding [Google DeepMind] https://arxiv.org/abs/2605.15342 --- [CL] Optimized Three-Dimensional Photovoltaic Structures with LLM guided Tree Search [Google Research] https://arxiv.org/abs/2605.16191 --- [LG] AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs [CMU] https://arxiv.org/abs/2605.15565 --- [CL] Always Learning, Always Mixing: Efficient and Simple Data Mixing All The Time [New York University & CMU] https://arxiv.org/abs/2605.15220
[人人能懂AI前沿] 从“笨”办法、摊销智慧到对称破缺你有没有想过,最聪明的AI,可能也需要最“笨”的办法?本期我们要聊聊,为什么简单的“Ctrl+F”有时能打败高级算法,AI又如何学会“摊销”智慧来走捷关,甚至从古老的物理学中悟出了修炼“内功”的心法。我们还会看到,AI如何像搭积木一样逐层过滤、化繁为简,并通过“反复琢磨”最终获得顿悟。准备好,一起窥探AI思考的“内功”心法吧! 00:00:30 AI大模型,越高级,越需要“笨”办法? 00:07:32 AI求解大师,重复计算是美德,还是偷懒才是? 00:13:46 AI的终极思考,当模型学会了“悟” 00:20:13 AI的超能力,把难题变简单的“过滤器” 00:25:12 AI的“内功”心法,一种来自物理学的古老智慧 本期介绍的几篇论文: [CL] Is Grep All You Need? How Agent Harnesses Reshape Agentic Search [PricewaterhouseCoopers] https://arxiv.org/abs/2605.15184 --- [LG] Local Inverse Geometry Can Be Amortized [A L. Kachhadiya] https://arxiv.org/abs/2605.13068 --- [LG] Solve the Loop: Attractor Models for Language and Reasoning [University of Southern California] https://arxiv.org/abs/2605.12466 --- [LG] Deep Learning as Neural Low-Degree Filtering: A Spectral Theory of Hierarchical Feature Learning [EPFL] https://arxiv.org/abs/2605.13612 --- [LG] Spontaneous symmetry breaking and Goldstone modes for deep information propagation [University of Amsterdam & Harvard University & Tsinghua University] https://arxiv.org/abs/2605.14685
[人人能懂AI前沿] 从智慧约束,到原生思考你有没有想过,如何才能既给一个天才足够的自由,又不让他彻底“跑偏”?怎样才能把好莱坞的特效团队,压缩进我们自己的电脑?最新的一系列论文,就在用代码回答这些充满哲思的问题。这一期,我们将看到AI如何从“翻译腔”进化到“原生思考”,如何从“看着像”进化到像素级的“一模一样”,甚至,我们将一起见证,一个普通的AI如何被一步步调教成解题思路长达十几万字的“奥数金牌选手”。准备好了吗?让我们一起潜入AI智慧的深海。 00:00:36 给天才松绑,好过把他变成庸才 00:06:57 把好莱坞的特效团队,装进你的电脑 00:12:40 别再搭积木了,请直接“思考” 00:18:13 AI造物,如何从“看着像”到“一模一样”? 00:23:27 如何把一个普通AI,调教成奥数金牌选手? 本期介绍的几篇论文: [LG] Sub-JEPA: Subspace Gaussian Regularization for Stable End-to-End World Models [Shanghai University] https://arxiv.org/abs/2605.09241 --- [CV] SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer [NVIDIA] https://arxiv.org/abs/2605.15178 --- [CV] SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture [sensenova] https://arxiv.org/abs/2605.12500 --- [CV] Pixal3D: Pixel-Aligned 3D Generation from Images [Tsinghua University & Tencent ARC Lab] https://arxiv.org/abs/2605.10922 --- [CL] Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling [Shanghai AI Laboratory] https://arxiv.org/abs/2605.13301
[人人能懂AI前沿] AI的成长三部曲:金牌教练、乐高大师与风光摄影师你有没有想过,AI如何像我们一样,在反复试错后找到“刚刚好”的平衡点?这一期,我们就从几篇最新的AI论文出发,聊聊AI的“智慧进化”:看它如何学会给自己配备一个“后悔调节器”来动态调整策略,如何通过带“复盘笔记”的刻意练习,从沟通“小白”进化成“流程大师”,以及如何像拼乐高一样,用聪明的设计给自己“瘦身”,最终实现速度与质量的完美飞跃。 00:00:31 做对选择,你需要一个“后悔调节器” 00:05:28 AI 进化论,如何让一个聪明的“员工”,听懂“人话”? 00:11:15 面对海量选择,我们如何做出“刚刚好”的聪明决策? 00:19:09 AI作画提速的秘密,多看一步,不止平均 00:24:26 神经网络的大瘦身,为什么聪明的设计胜过蛮力计算? 本期介绍的几篇论文: [LG] Efficient Online Conformal Selection with Limited Feedback [Google Research & Duke University] https://arxiv.org/abs/2605.14953 --- [LG] Prompting Policies for Multi-step Reasoning and Tool-Use in Black-box LLMs with Iterative Distillation of Experience [Google Research] https://arxiv.org/abs/2605.14443 --- [LG] Stochastic Matching via Local Sparsification [Google Research] https://arxiv.org/abs/2605.14195 --- [LG] Covariance-aware sampling for Diffusion Models [Google] https://arxiv.org/abs/2605.13910 --- [LG] Compositional Sparsity as an Inductive Bias for Neural Architecture Design [University College London] https://arxiv.org/abs/2605.14764
[人人能懂AI前沿] AI的雕塑家、守门员与博弈论你有没有想过,AI写文章不靠“串珠子”,而是像雕塑家一样从混沌中“凿”出杰作?为什么把两个最先进的技术组合,反而会得到一个最差的结果?本期节目,我们将一起探讨几篇最新论文,看看AI如何像一个聪明的团队,在网上冲浪却不“学坏”;如何在与人类“上有政策,下有对策”的博弈中保持学习能力;以及它如何像一个决策守门员,把宝贵的精力只花在最值得的探索上。 00:00:32 AI写作新思路,像雕塑家一样“凿”出好文章 00:05:19 AI的“阅读”困境,该读字,还是读字母? 00:11:41 你的“最强大脑”,差的不是智商,而是搜索方法 00:17:05 AI时代的“上有政策,下有对策” 00:22:37 决策的“守门员”,如何把精力花在刀刃上 本期介绍的几篇论文: [CL] ELF: Embedded Language Flows [MIT] https://arxiv.org/abs/2605.10938 --- [LG] The Efficiency Gap in Byte Modeling [Google DeepMind] https://arxiv.org/abs/2605.12928 --- [CL] Context Training with Active Information Seeking [Google DeepMind] https://arxiv.org/abs/2605.13050 --- [LG] Strategic PAC Learnability via Geometric Definability [Technion – Israel Institute of Technology] https://arxiv.org/abs/2605.13426 --- [LG] Delightful Exploration [Google DeepMind] https://arxiv.org/abs/2605.13287
[人人能懂AI前沿] AI的内功、表演与成长法则这一期,我们来聊聊几个特别有意思的“AI悖论”:想让AI团队更强,是该招“通才”还是“专才”?AI写下的思考步骤,究竟是真实的内心独白,还是为了让你满意的“事后表演”?而教一个AI“学生”,是让他抄答案更有效,还是抄解题思路更靠谱?几篇最新的论文,给了我们一些出乎意料的答案。 00:00:27 人多力量大,还是术业有专攻? 00:07:33 AI的“胎记”,我们如何给机器生成的内容盖个章? 00:12:46 AI训练的快慢之争,一个两全其美的方案 00:18:35 你的AI队友,是在真思考还是在“演”给你看? 00:23:52 让AI“小号”变聪明的秘密,抄答案还是抄思路? 本期介绍的几篇论文: [LG] Slicing and Dicing: Configuring Optimal Mixtures of Experts [University of Washington & New York University] https://arxiv.org/abs/2605.11689 --- [LG] TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection [Meta Superintelligence Labs] https://arxiv.org/abs/2605.12456 --- [LG] Learning, Fast and Slow: Towards LLMs That Adapt Continually [UC Berkeley & Mila] https://arxiv.org/abs/2605.12484 --- [LG] When Reasoning Traces Become Performative: Step-Level Evidence that Chain-of-Thought Is an Imperfect Oversight Channel [CMU & Fujitsu Research of America Inc] https://arxiv.org/abs/2605.11746 --- [CL] A Study on Hidden Layer Distillation for Large Language Model Pre-Training [Google DeepMind] https://arxiv.org/abs/2605.11513
[人人能懂AI前沿] AI的学霸秘籍、省钱妙计与陪练手册你有没有想过,AI也会有“选择困难症”吗?或者,怎么才能给AI请个既省钱又能干的“陪练”?这一期,我们就来聊聊几篇有趣的最新论文,看看科学家们是如何教会AI像高手一样反思、像侦探一样倾听“沉默的投票”,甚至用中学物理知识,给AI装上一双“3D眼睛”的。准备好了吗?让我们一起出发! 00:00:29 鸡娃不如“陪练”,AI训练的降本增效新思路 00:05:53 AI的学霸秘籍,如何像高手一样思考和进化 00:12:12 AI的阅读术,如何既快又好地啃下海量信息? 00:18:24 AI的“过分自信”,原来是种“选择困难症” 00:23:39 让AI拥有“立体视觉”的省钱妙计 本期介绍的几篇论文: [LG] CoDistill-GRPO: A Co-Distillation Recipe for Efficient Group Relative Policy Optimization [Google Research] https://arxiv.org/abs/2605.08873 --- [CL] RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards [Google Cloud AI Research] https://arxiv.org/abs/2605.10899 --- [CL] Scratchpad Patching: Decoupling Compute from Patch Size in Byte-Level Language Models [Google DeepMind] https://arxiv.org/abs/2605.09630 --- [CL] The Silent Vote: Improving Zero-Shot LLM Reliability by Aggregating Semantic Neighborhoods [Google] https://arxiv.org/abs/2605.09739 --- [LG] RelFlexformer: Efficient Attention 3D-Transformers for Integrable Relative Positional Encodings [Seoul National University & Google Research] https://arxiv.org/abs/2605.10706
[人人能懂AI前沿] 从聪明花钱、英雄涌现到AI的“偏科”报告今天我们要聊点特别有意思的话题:AI是怎么“思考”和“成长”的?我们会从几篇最新的论文出发,看看AI如何学会聪明地“花钱”,如何在学习中分清“英雄”与“集体”;然后,我们会揭秘它那套先打“草稿”再复核的高效工作法。最后,我们会用一把全新的“尺子”去度量它成长的极限,并给它做一份“智商测试”,看看这个“天才”到底偏科有多严重。准备好了吗?让我们一起潜入AI的大脑深处。 00:00:33 如何打造一个“更划算”的虚拟世界? 00:06:10 大模型的“缩放法则”里,藏着什么秘密? 00:13:40 快与慢,AI世界里的“草稿式”工作法 00:19:55 你的数据值多少钱?一个新尺子,看透AI的增长极限 00:26:42 AI的“智商”报告,一个偏科天才的养成 本期介绍的几篇论文: [LG] On Training in Imagination [Weizmann Institute of Science & New York University & Columbia University] https://arxiv.org/abs/2605.06732 --- [LG] Spectral Dynamics in Deep Networks: Feature Learning, Outlier Escape, and Learning Rate Transfer [Harvard University] https://arxiv.org/abs/2605.07870 --- [CL] Fast Byte Latent Transformer [FAIR at Meta] https://arxiv.org/abs/2605.08044 --- [LG] On the Invariance and Generality of Neural Scaling Laws [Johns Hopkins University & MIT] https://arxiv.org/abs/2605.07546 --- [AI] Uneven Evolution of Cognition Across Generations of Generative AI Models [Google DeepMind & Google Research] https://arxiv.org/abs/2605.06815
[人人能懂AI前沿] 从追问、拆解到打腹稿:AI正在升级“思考”的操作系统你有没有想过,一个真正聪明的AI,应该具备哪些超越“有问必答”的能力?本期节目,我们将通过几篇最新的AI论文,一探究竟。我们将看到,AI如何从一个被动的知识库,进化成一个懂得“追问”的医生,以及一个会“打腹稿”的作家。我们还会揭示,AI如何学会把“登天”的难题拆解成“上楼”和“坐电梯”,又是如何通过一个“记忆外挂”实现过目不忘的。准备好了吗?让我们一起刷新对AI“思考能力”的认知! 00:00:34 AI看病,真正厉害的不是“诊断”而是“追问” 00:06:35 把“登天”的难题,拆解成“上楼”和“坐电梯” 00:11:52 AI开口说话,非得“一句接一句”吗? 00:16:47 面对海量数据,我们如何看清那只“看不见的手”? 00:23:46 AI的“记忆外挂”,如何让它过目不忘? 本期介绍的几篇论文: [AI] SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment [Google Research] https://arxiv.org/abs/2605.04012 --- [LG] Conditional Diffusion Sampling [University of Cambridge & University of Granada & University of British Columbia] https://arxiv.org/abs/2605.04013 --- [CL] Continuous Latent Diffusion Language Model [Bytedance Seed] https://arxiv.org/abs/2605.06548 --- [LG] High-Dimensional Statistics: Reflections on Progress and Open Problems [Columbia University & Harvard University & CMU] https://arxiv.org/abs/2605.05076 --- [CL] TIDE: Every Layer Knows the Token Beneath the Context [Apple] https://arxiv.org/abs/2605.06216
[人人能懂AI前沿] 给AI“减肥”、“立人设”和“夸到点子上”,需要几步?你有没有想过,最简单的数学平均值,竟然能打败最复杂的压缩算法?或者,在教AI“做什么”之前,我们其实可以先给它“喂”一套完整的思想和人设?本期节目,我们将从四篇最新的AI论文出发,一起探寻如何让AI自己长出可拆分的“乐高模块”,以及如何像一位顶级名师那样,把奖励精准地“夸”到AI的灵光一闪之处。 00:00:29 你的记忆能被压缩多少,藏在一个几何定律里 00:06:42 训练AI,从“喂”指令到“喂”思想 00:11:47 AI减肥记,如何让一个大模型只带“脑子”出门? 00:17:54 AI也需要“夸到点子上”? 本期介绍的几篇论文: [LG] The Geometry of Consolidation A Bharadwaj Vangara, A Gopinath https://github.com/niashwin/geometry-of-consolidation/blob/main/paper/arxiv/main.pdf --- [AI] Model Spec Midtraining: Improving How Alignment Training Generalizes C Li, S Price, S Marks, J Kutasov https://arxiv.org/abs/2605.02087 --- [CL] EMO: Pretraining Mixture of Experts for Emergent Modularity R Wang, A Bhagia, S Min https://arxiv.org/abs/2605.06663 --- [LG] DGPO: Distribution Guided Policy Optimization for Fine Grained Credit Assignment H Jin, R Zhu, Z Du, X Jiang… https://arxiv.org/abs/2605.03327