2026.02.27 | Diagnosis-Driven Remedial Training Overtakes 72B; Trinity of Consistency Stumps World Models - HuggingFace Daily AI Papers Digest

【Sponsor】

Listen to AI每周谈 (AI Weekly Talk) on your commute. Each week it recaps the previous week's major AI news.

【Contents】

The 15 papers in this episode are:

00:31 🔍 From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

01:16 🌍 The Trinity of Consistency as a Defining Principle for General World Models

01:49 🧭 MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios

02:52 🧠 OmniGAIA: Towards Native Omni-Modal AI Agents

03:44 🔍 Imagination Helps Visual Reasoning, But Not Yet in Latent Space

04:26 🧠 Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

05:26 🛡 AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning

06:18 🔍 Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

06:54 🩺 MediX-R1: Open Ended Medical Reinforcement Learning

07:42 ⚡ Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling

08:43 🤖 EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents

09:41 🎮 AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games

10:26 🚶 Causal Motion Diffusion Models for Autoregressive Motion Generation

11:09 ⚡ veScale-FSDP: Flexible and High-Performance FSDP at Scale

11:51 🚗 Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving

【Follow Us】

You can also find us on the following platforms for more information beyond the podcast:

Xiaohongshu (小红书): AI速递