The 13 papers in this episode:
00:23 🧠 Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
01:07 🎬 Seedance 1.0: Exploring the Boundaries of Video Generation Models
01:50 🥽 PlayerOne: Egocentric World Simulator
02:30 🎬 Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation
03:15 🤖 ComfyUI-R1: Exploring Reasoning Models for Workflow Generation
03:48 🧠 SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
04:25 🧪 SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner
05:10 🎶 Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation
05:52 🎭 InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions
06:34 🤖 SAFE: Multitask Failure Detection for Vision-Language-Action Models
07:14 🧠 Reparameterized LLM Training via Orthogonal Equivalence Transformation
07:56 👁 MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis
08:39 🌱 Branched Schrödinger Bridge Matching

[Follow Us]
You can also find us on the following platforms for more than what the podcast covers:
Xiaohongshu (小红书): AI速递
