2025.07.04 | WebSailor提升LLM推理能力；LangScene-X优化3D场景重建。 - HuggingFace 每日AI论文速递

本期的 15 篇论文如下：

00:22 🧭 WebSailor: Navigating Super-human Reasoning for Web Agent（WebSailor：为Web Agent导航超人推理）

00:59 🖼 LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion（LangScene-X：通过TriMap视频扩散重建可泛化的3D语言嵌入场景）

01:44 🧬 IntFold: A Controllable Foundation Model for General and Specialized Biomolecular Structure Prediction（IntFold：用于通用和专用生物分子结构预测的可控基础模型）

02:35 👂 Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback（倾听内心的声音：通过中间特征反馈对齐ControlNet训练）

03:17 🤝 Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy（Skywork-Reward-V2：通过人机协同扩展偏好数据标注）

04:00 🖼 Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers（基于图像的多模态推理：基础、方法与未来前沿）

04:38 🧠 Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving（布尔巴基：用于定理证明的自生成和目标条件MDP）

05:12 🧠 Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search（解耦规划与执行：一种用于深度搜索的分层推理框架）

05:47 💡 Fast and Simplex: 2-Simplicial Attention in Triton（快速且简明：Triton中的2-单形注意力机制）

06:33 🧐 Can LLMs Identify Critical Limitations within Scientific Research? A Systematic Evaluation on AI Research Papers（大型语言模型能否识别科学研究中的关键局限性？人工智能研究论文的系统性评估）

07:16 🧩 Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models（选择与合并：面向具有大型语言模型的可适应和可扩展的命名实体识别）

08:12 🤖 Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs（自校正基准：揭示并解决大型语言模型中的自校正盲点）

08:51 💡 Energy-Based Transformers are Scalable Learners and Thinkers（基于能量的Transformer是可扩展的学习者和思考者）

09:33 ⚙ AsyncFlow: An Asynchronous Streaming RL Framework for Efficient LLM Post-Training（AsyncFlow：用于高效大语言模型后训练的异步流式强化学习框架）

10:16 🚀 ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention（ZeCO：线性注意力机制的零通信开销序列并行）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递