【赞助商】
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗www.xiaoyuzhoufm.com
【目录】
本期的 15 篇论文如下:
00:31 🔍 From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models(从盲点到增益:诊断驱动的迭代训练用于大型多模态模型)
01:16 🌍 The Trinity of Consistency as a Defining Principle for General World Models(一致性三位一体:作为通用世界模型定义原则)
01:49 🧭 MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios(MobilityBench:一个用于评估现实世界移动场景中路径规划智能体的基准)
02:52 🧠 OmniGAIA: Towards Native Omni-Modal AI Agents(OmniGAIA:迈向原生全模态人工智能体)
03:44 🔍 Imagination Helps Visual Reasoning, But Not Yet in Latent Space(想象力助力视觉推理,但尚未在潜在空间中实现)
04:26 🧠 Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization(基于混合在线与离线策略优化的探索性记忆增强大语言模型智能体)
05:26 🛡 AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning(AgentDropoutV2:通过测试时修正或拒绝剪枝优化多智能体系统中的信息流)
06:18 🔍 Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization(多搜索,少思考:重新思考长视野智能搜索的效率与泛化性)
06:54 🩺 MediX-R1: Open Ended Medical Reinforcement Learning(MediX-R1:开放式医学强化学习框架)
07:42 ⚡ Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling(基于条件引导调度的混合数据-流水线并行加速扩散模型)
08:43 🤖 EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents(EmbodMocap:面向具身智能体的野外4D人-场景重建)
09:41 🎮 AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games(AI游戏商店:通过人类游戏对机器通用智能进行可扩展、开放式评估)
10:26 🚶 Causal Motion Diffusion Models for Autoregressive Motion Generation(因果运动扩散模型用于自回归运动生成)
11:09 ⚡ veScale-FSDP: Flexible and High-Performance FSDP at Scale(veScale-FSDP:大规模灵活且高性能的FSDP)
11:51 🚗 Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving(面向可泛化端到端自动驾驶的风险感知世界模型预测控制)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
