【Sponsor】
Listen to AI每周谈 (AI Weekly Talk) on your commute. Every week it recaps the previous week's major AI news.
Link: 🔗 www.xiaoyuzhoufm.com
【Contents】
The 15 papers in this episode are:
00:28 ⚡ Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
01:06 🧠 GENIUS: Generative Fluid Intelligence Evaluation Suite
01:46 🤖 PhyCritic: Multimodal Critic Models for Physical AI
02:18 ⚙ ASA: Training-Free Representation Engineering for Tool-Calling Agents
02:59 🧠 When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning
03:38 🧮 Towards Autonomous Mathematics Research
04:15 🎬 TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions
05:12 🧠 G-LNS: Generative Large Neighborhood Search for LLM-Based Automatic Heuristic Design
06:02 ⚙ FeatureBench: Benchmarking Agentic Coding for Complex Feature Development
06:44 🧑 DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning
07:28 🚀 ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression
08:27 📈 Online Causal Kalman Filtering for Stable and Effective Policy Optimization
09:24 🧠 Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models
10:06 🗣 Ex-Omni: Enabling 3D Facial Animation Generation for Omni-modal Large Language Models
10:47 🔄 Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

【Follow Us】
You can also find us on the following platform for more information beyond the podcast:
Xiaohongshu (小红书): AI速递
