【赞助商】
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗www.xiaoyuzhoufm.com
【目录】
本期的 15 篇论文如下:
00:41 🔄 DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models(DataFlex:面向大语言模型数据中心化动态训练的统一框架)
01:48 🧠 The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook(潜在空间:基础、演进、机制、能力与展望)
02:45 🧠 SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization(SKILL0:用于技能内化的上下文智能体强化学习)
03:22 🎮 Generative World Renderer(生成式世界渲染器)
04:09 👁 EgoSim: Egocentric World Simulator for Embodied Interaction Generation(EgoSim:面向具身交互生成的第一人称世界模拟器)
05:24 🧠 LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model(LatentUM:通过潜在空间统一模型释放交错跨模态推理的潜力)
06:06 🧠 Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory(Omni-SimpleMem:基于自主研究引导的终身多模态智能体记忆发现)
06:47 🚗 UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving(UniDriveVLA:统一自动驾驶中的理解、感知与动作规划)
07:35 🎯 Steerable Visual Representations(可操控的视觉表示)
08:12 🎬 VOID: Video Object and Interaction Deletion(VOID:视频对象与交互删除)
09:06 🤖 Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time(探究自主编码代理在真实项目中的贡献:活动模式与代码随时间的变化)
09:47 🚀 ASI-Evolve: AI Accelerates AI(ASI-Evolve:人工智能加速人工智能发展)
10:50 🎭 Tex3D: Objects as Attack Surfaces via Adversarial 3D Textures for Vision-Language-Action Models(Tex3D:通过对抗性3D纹理将物体作为视觉-语言-动作模型的攻击面)
11:36 🤖 GPA: Learning GUI Process Automation from Demonstrations(GPA:通过演示学习图形用户界面流程自动化)
12:24 🔍 VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification(VideoZeroBench:通过时空证据验证探究视频多模态大语言模型的极限)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
