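2025.08.04 | Variable-length denoising for diffusion language models: efficient and resource-saving; PixNerd image diffusion: efficient and high quality.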

6 minutes

The 11 papers in this episode:

00:22 🔄 Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

00:44 🎨 PixNerd: Pixel Neural Field Diffusion

01:11 💡 SWE-Exp: Experience-Driven Software Issue Resolution

01:38 🔍 Multimodal Referring Segmentation: A Survey

01:59 🧠 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

02:40 🤖 SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution

03:05 ⚖ Learning an Efficient Multi-Turn Dialogue Evaluator from Multiple Judges

03:33 🤯 Investigating Hallucination in Conversations for Low Resource Languages

04:00 🧭 IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation

04:30 🎧 SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation

04:55 🎮 Multi-Agent Game Generation and Evaluation via Audio-Visual Recordings

[Follow Us]

You can also find us on the following platforms for more than what the podcast covers:

Xiaohongshu (小红书): AI速递