2026.02.12 | Sparse MoE Rivals GPT-5; GENIUS Measures Fluid Intelligence

12 min

【Sponsor】

Listen to AI每周谈 on your commute. Every week, AI每周谈 recaps the previous week's major AI news.

Link: 🔗www.xiaoyuzhoufm.com

【Contents】

The 15 papers covered in this episode are:

00:28 ⚡ Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

01:06 🧠 GENIUS: Generative Fluid Intelligence Evaluation Suite

01:46 🤖 PhyCritic: Multimodal Critic Models for Physical AI

02:18 ⚙ ASA: Training-Free Representation Engineering for Tool-Calling Agents

02:59 🧠 When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

03:38 🧮 Towards Autonomous Mathematics Research

04:15 🎬 TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

05:12 🧠 G-LNS: Generative Large Neighborhood Search for LLM-Based Automatic Heuristic Design

06:02 ⚙ FeatureBench: Benchmarking Agentic Coding for Complex Feature Development

06:44 🧑 DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

07:28 🚀 ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression

08:27 📈 Online Causal Kalman Filtering for Stable and Effective Policy Optimization

09:24 🧠 Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

10:06 🗣 Ex-Omni: Enabling 3D Facial Animation Generation for Omni-modal Large Language Models

10:47 🔄 Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

【Follow Us】

You can also find us on the following platforms for more than what the podcast covers:

Xiaohongshu (小红书): AI速递