2026.04.03 | DataFlex让数据像乐高；潜在空间成AI新地图 - HuggingFace 每日AI论文速递

【赞助商】

通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事

【目录】

本期的 15 篇论文如下：

00:41 🔄 DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models（DataFlex：面向大语言模型数据中心化动态训练的统一框架）

01:48 🧠 The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook（潜在空间：基础、演进、机制、能力与展望）

02:45 🧠 SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization（SKILL0：用于技能内化的上下文智能体强化学习）

03:22 🎮 Generative World Renderer（生成式世界渲染器）

04:09 👁 EgoSim: Egocentric World Simulator for Embodied Interaction Generation（EgoSim：面向具身交互生成的第一人称世界模拟器）

05:24 🧠 LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model（LatentUM：通过潜在空间统一模型释放交错跨模态推理的潜力）

06:06 🧠 Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory（Omni-SimpleMem：基于自主研究引导的终身多模态智能体记忆发现）

06:47 🚗 UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving（UniDriveVLA：统一自动驾驶中的理解、感知与动作规划）

07:35 🎯 Steerable Visual Representations（可操控的视觉表示）

08:12 🎬 VOID: Video Object and Interaction Deletion（VOID：视频对象与交互删除）

09:06 🤖 Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time（探究自主编码代理在真实项目中的贡献：活动模式与代码随时间的变化）

09:47 🚀 ASI-Evolve: AI Accelerates AI（ASI-Evolve：人工智能加速人工智能发展）

10:50 🎭 Tex3D: Objects as Attack Surfaces via Adversarial 3D Textures for Vision-Language-Action Models（Tex3D：通过对抗性3D纹理将物体作为视觉-语言-动作模型的攻击面）

11:36 🤖 GPA: Learning GUI Process Automation from Demonstrations（GPA：通过演示学习图形用户界面流程自动化）

12:24 🔍 VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification（VideoZeroBench：通过时空证据验证探究视频多模态大语言模型的极限）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递