本期的 15 篇论文如下:
00:22 📊 DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle(DAComp:跨全数据智能生命周期的数据智能体基准测试)
01:07 🤖 Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length(实时虚拟化身:基于无限时长的流式实时音频驱动化身生成)
01:42 🤖 Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction(Nex-N1:基于统一生态系统大规模环境构建训练的智能体模型)
02:24 🤖 ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning(ARM-Thinker:通过智能体工具使用与视觉推理增强多模态生成奖励模型)
02:54 🎬 Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation(奖励强制:通过奖励分布匹配蒸馏实现高效流式视频生成)
03:42 🚀 Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion(语义先行:通过异步潜在扩散协调语义与纹理建模)
04:25 🔧 PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing(PaperDebugger:一种基于插件的多智能体系统,用于编辑器内的学术写作、审阅与编辑)
04:56 🌍 DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling(DynamicVerse:一个物理感知的多模态4D世界建模框架)
05:47 🌀 4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer(4DLangVGGT:基于Transformer的4D语言-视觉几何接地统一框架)
06:15 🔍 UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers(UltraImage:重新思考图像扩散变换器中的分辨率外推)
07:02 🎨 DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation(DraCo:以草稿作为思维链实现文本到图像预览与稀有概念生成)
07:51 ❄ Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting(Splannequin:基于双重检测溅射的单目人体模型挑战视频冻结)
08:34 🤖 SIMA 2: A Generalist Embodied Agent for Virtual Worlds(SIMA 2:面向虚拟世界的通用具身智能体)
09:05 🧮 Model-Based and Sample-Efficient AI-Assisted Math Discovery in Sphere Packing(基于模型与样本高效的AI辅助数学发现:球体填充问题研究)
09:47 🧭 SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization(SeeNav-Agent:通过视觉提示与步级策略优化增强视觉语言导航)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
