【赞助商】
OpenClaw快报
每天五分钟,听听 OpenClaw 快报,带你了解最新动态和业内讨论
传送门 www.xiaoyuzhoufm.com
【目录】
本期的 15 篇论文如下:
[] 🧠 EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments(EvoArena:在动态环境中追踪记忆演化以实现鲁棒的LLM智能体)
[] 🧠 SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning(SpatialClaw:重新思考面向智能体空间推理的动作接口)
[] 🔍 FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents(FORT-Searcher:为训练深度搜索代理合成抗捷径搜索任务)
[] 🛠 Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?(鲁棒-U1:多模态大语言模型能否自我恢复受损视觉内容以实现鲁棒理解?)
[] 🔄 InterleaveThinker: Reinforcing Agentic Interleaved Generation(交错思考者:强化智能交错生成)
[] 🧮 MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling(MaxProof:利用生成-验证强化学习与群体级测试时扩展实现数学证明的规模化)
[] 🧠 MiniMax Sparse Attention(MiniMax稀疏注意力机制)
[] 🖥 WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces(WeaveBench:面向混合界面计算机使用代理的长期、真实世界基准)
[] 🔬 LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories(LabVLA:在科学实验室中落地视觉-语言-动作模型)
[] 🦾 HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers(HYDRA-X:具备整体视觉分词器的原生统一多模态模型)
[] 🧩 N-GRPO: Embedding-Level Neighbor Mixing for Enhanced Policy Optimization(N-GRPO:基于嵌入级邻居混合的增强策略优化)
[] 🔬 EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery(EurekAgent:智能体环境工程是自主科学发现所需的一切)
[] 🏃 VideoMDM: Towards 3D Human Motion Generation From 2D Supervision(VideoMDM:从二维监督迈向三维人体运动生成)
[] 🔍 Where, What, Why, and Importance: Structured Defect Grounding for Text-to-Image Feedback(位置、类型、原因与重要性:面向文本到图像反馈的结构化缺陷定位)
[] 🔀 Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with On-Policy Reinforcement Learning(揭秘隐藏状态循环:基于在线强化学习的可切换潜在推理)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
