2025.11.13 | 原神数据炼成7B通用AI;零训练轨迹秒变视频遥控器

2025.11.13 | 原神数据炼成7B通用AI;零训练轨迹秒变视频遥控器

6分钟 ·
播放数139
·
评论数0

本期的 9 篇论文如下:

00:19 🌍 Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds(Lumine:在3D开放世界中打造通才智能体的开源方案)

00:54 🎬 Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising(Time-to-Move:无需训练的双时钟去噪运动控制视频生成)

01:31 ⚡ TiDAR: Think in Diffusion, Talk in Autoregression(TiDAR:扩散式思考,自回归式表达)

02:15 🔄 LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls(LoopTool:闭合数据-训练循环,铸就鲁棒LLM工具调用)

02:51 🤖 WMPO: World Model-based Policy Optimization for Vision-Language-Action Models(基于世界模型的视觉-语言-动作策略优化)

03:33 🖥 WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation(WebVIA:可交互可验证的网页端视觉-语言智能体UI代码生成框架)

04:19 🎯 Toward the Frontiers of Reliable Diffusion Sampling via Adversarial Sinkhorn Attention Guidance(迈向对抗式Sinkhorn注意力引导的可靠扩散采样新前沿)

04:55 🤖 Agentic Refactoring: An Empirical Study of AI Coding Agents(智能体重构:AI编程智能体的大规模实证研究)

05:31 🛡 Stemming Hallucination in Language Models Using a Licensing Oracle(利用许可证预言机遏制语言模型幻觉)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递