2025.06.12 | 自信微调提升模型表现；视频生成模型高效优化。 - HuggingFace 每日AI论文速递

本期的 13 篇论文如下：

00:23 🧠 Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models（自信即全部：基于语言模型的小样本强化学习微调）

01:07 🎬 Seedance 1.0: Exploring the Boundaries of Video Generation Models（Seedance 1.0：探索视频生成模型的边界）

01:50 🥽 PlayerOne: Egocentric World Simulator（PlayerOne：以自我为中心的真实世界模拟器）

02:30 🎬 Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation（用于实时交互视频生成的自回归对抗后训练）

03:15 🤖 ComfyUI-R1: Exploring Reasoning Models for Workflow Generation（ComfyUI-R1：探索用于工作流生成的推理模型）

03:48 🧠 SeerAttention-R: Sparse Attention Adaptation for Long Reasoning（SeerAttention-R：用于长程推理的稀疏注意力自适应）

04:25 🧪 SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner（SWE-Flow：以测试驱动的方式合成软件工程数据）

05:10 🎶 Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation（自回归 vs. 流匹配：文本到音乐生成建模范式的比较研究）

05:52 🎭 InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions（InterActHuman：基于布局对齐音频条件的多概念人物动画）

06:34 🤖 SAFE: Multitask Failure Detection for Vision-Language-Action Models（SAFE：视觉-语言-动作模型的多任务失败检测）

07:14 🧠 Reparameterized LLM Training via Orthogonal Equivalence Transformation（基于正交等价变换的重参数化LLM训练）

07:56 👁 MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis（MIRAGE：用于全面视网膜OCT图像分析的多模态基础模型与基准）

08:39 🌱 Branched Schrödinger Bridge Matching（分支薛定谔桥匹配）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递