【目录】
本期的 15 篇论文如下:
[] 🧠 PhysBrain 1.0 Technical Report(PhysBrain 1.0 技术报告)
[] 🔍 CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence(CiteVQA:为可信文档智能建立证据归因基准)
[] 🤖 MMSkills: Towards Multimodal Skills for General Visual Agents(MMSkills:面向通用视觉智能体的多模态技能)
[] 👗 FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization(FashionChameleon:面向实时且交互式的人体-服装视频定制)
[] 🦾 DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo(DexJoCo:面向任务型灵巧操作的MuJoCo基准测试与工具包)
[] 🔮 Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation(学会预见:揭示在线策略蒸馏的解锁效率)
[] 🖼 InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation(InsightTok:改进自回归图像生成中离散标记化的文本和人脸保真度)
[] 🧠 Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding(通过协作式逐步多教师解码蒸馏长链思维推理)
[] ⚡ Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization(Flash-GRPO:基于单步策略优化的高效视频扩散对齐方法)
[] 🧭 Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR(超越舒适区的助推:用于RLVR的高效策略引导探索)
[] 🎮 ReactiveGWM: Steering NPC in Reactive Game World Models(反应式游戏世界模型:在反应性游戏世界中操控非玩家角色)
[] ⚖ Hölder Policy Optimisation(赫尔德策略优化)
[] 🧠 Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution(Solvita:通过智能体进化增强大型语言模型在竞赛编程中的能力)
[] 🌐 CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage(CM-EVS:用于完整场景覆盖的稀疏全景RGB-D-姿态数据)
[] 🎯 PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control(PAGER:弥合点精确几何GUI控制中的语义-执行鸿沟)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
