【目录】
本期的 15 篇论文如下:
[] 🧠 COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation(COLLEAGUE.SKILL:通过专家知识蒸馏实现自动化AI技能生成)
[] 🧠 Representation Forcing for Bottleneck-Free Unified Multimodal Models(表示强制:无瓶颈统一多模态模型)
[] 🎙 SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue(SwanVoice:面向独白与对话的表现力丰富长文本零样本语音合成)
[] 🔍 LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards(长迹强化学习:利用评分奖励从搜索代理轨迹中学习长上下文推理)
[] 🎧 Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer(面向流式同步空间音频生成的自回归扩散Transformer)
[] 🖼 GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration(GGT-100K:面向通用真实世界图像恢复的生成式真实标签)
[] 🎤 Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios(多样化场景下长篇语音生成的综合基准测试)
[] 🛋 Function2Scene: 3D Indoor Scene Layout from Functional Specifications(从功能规格到场景:基于功能说明的3D室内布局生成)
[] 🎥 SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer(SANA-Streaming:基于混合扩散Transformer的实时流式视频编辑)
[] 🧠 Task-Focused Memorization for Multimodal Agents(面向多模态智能体的任务聚焦记忆机制)
[] 🤖 Exploring Autonomous Agentic Data Engineering for Model Specialization(探索面向模型专业化的自主代理数据工程)
[] 🎓 Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation(并非所有分歧都是可学习的:在线策略蒸馏中的令牌可教性)
[] 🧩 dMoE: dLLMs with Learnable Block Experts(dMoE:具有可学习块级专家机制的扩散大语言模型)
[] 🛠 Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents(恢复策略诱导的错误:面向鲁棒GUI智能体的基准测试与轨迹合成)
[] 🛡 From Prompt Injection to Persistent Control: Defending Agentic Harness Against Trojan Backdoors(从提示注入到持久控制:防御智能体框架中的特洛伊后门)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
【赞助商】
OpenClaw快报
每天五分钟,听听 OpenClaw 快报,带你了解最新动态和业内讨论
传送门 www.xiaoyuzhoufm.com
