2026.06.01 | 知识蒸馏炼技能;表示强制破瓶颈

2026.06.01 | 知识蒸馏炼技能;表示强制破瓶颈

15分钟 ·
播放数91
·
评论数0

【目录】
本期的 15 篇论文如下:

[00:30] 🧠 COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation(COLLEAGUE.SKILL:通过专家知识蒸馏实现自动化AI技能生成)
[01:17] 🧠 Representation Forcing for Bottleneck-Free Unified Multimodal Models(表示强制:无瓶颈统一多模态模型)
[02:07] 🎙 SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue(SwanVoice:面向独白与对话的表现力丰富长文本零样本语音合成)
[02:58] 🔍 LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards(长迹强化学习:利用评分奖励从搜索代理轨迹中学习长上下文推理)
[03:59] 🎧 Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer(面向流式同步空间音频生成的自回归扩散Transformer)
[04:48] 🖼 GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration(GGT-100K:面向通用真实世界图像恢复的生成式真实标签)
[05:39] 🎤 Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios(多样化场景下长篇语音生成的综合基准测试)
[06:46] 🛋 Function2Scene: 3D Indoor Scene Layout from Functional Specifications(从功能规格到场景:基于功能说明的3D室内布局生成)
[07:36] 🎥 SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer(SANA-Streaming:基于混合扩散Transformer的实时流式视频编辑)
[08:29] 🧠 Task-Focused Memorization for Multimodal Agents(面向多模态智能体的任务聚焦记忆机制)
[09:30] 🤖 Exploring Autonomous Agentic Data Engineering for Model Specialization(探索面向模型专业化的自主代理数据工程)
[10:15] 🎓 Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation(并非所有分歧都是可学习的:在线策略蒸馏中的令牌可教性)
[11:10] 🧩 dMoE: dLLMs with Learnable Block Experts(dMoE:具有可学习块级专家机制的扩散大语言模型)
[12:12] 🛠 Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents(恢复策略诱导的错误:面向鲁棒GUI智能体的基准测试与轨迹合成)
[13:07] 🛡 From Prompt Injection to Persistent Control: Defending Agentic Harness Against Trojan Backdoors(从提示注入到持久控制:防御智能体框架中的特洛伊后门)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递

【赞助商】
OpenClaw快报
每天五分钟,听听 OpenClaw 快报,带你了解最新动态和业内讨论
传送门 www.xiaoyuzhoufm.com