2025.10.07 | 论文秒变演讲；Video-LMM后训练突破 - HuggingFace 每日AI论文速递

本期的 15 篇论文如下：

00:21 🎬 Paper2Video: Automatic Video Generation from Scientific Papers（论文自动生成学术演讲视频）

00:55 🎬 Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models（Video-LMM后训练：深入剖析大型多模态模型的视频推理）

01:38 🎬 VChain: Chain-of-Visual-Thought for Reasoning in Video Generation（VChain：面向视频生成推理的视觉思维链）

02:14 👻 Imperceptible Jailbreaking against Large Language Models（针对大语言模型的隐形越狱攻击）

02:56 🌳 MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information（MITS：基于点互信息的树搜索增强大模型推理）

03:30 🧬 Hybrid Architectures for Language Models: Systematic Analysis and Design Insights（语言模型混合架构：系统剖析与设计洞见）

04:07 📊 Factuality Matters: When Image Generation and Editing Meet Structured Visuals（事实至关重要：当图像生成与编辑遇上结构化视觉）

04:59 🔄 Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models（反应式Transformer：事件驱动的实时有状态对话模型）

05:55 ⚖ Judging with Confidence: Calibrating Autoraters to Preference Distributions（置信评判：将自动评分器校准到偏好分布）

06:44 🎯 Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training（Reinforce-Ada：面向Reinforce风格LLM训练的自适应采样框架）

07:27 📏 Optimal Scaling Needs Optimal Norm（最优扩放需要最优范数）

07:51 🔬 Code4MeV2: a Research-oriented Code-completion Platform（Code4MeV2：面向研究的代码补全平台）

08:31 🪞 Self-Reflective Generation at Test Time（测试时自反思生成）

09:15 🔄 SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs（SwiReasoning：在显式与潜空间之间切换思维，实现帕累托更优的推理大模型）

10:00 👀 Watch and Learn: Learning to Use Computers from Online Videos（观看与学习：从在线视频中学习使用计算机）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递