2025.12.12 | RL sets a new record in 3D generation; AI wins Olympiad silver - HuggingFace Daily AI Paper Digest

The 15 papers in this episode are:

00:25 🤖 Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation

01:01 🧠 Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

01:36 🚀 T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

02:18 🔍 OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

03:04 🏆 Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

04:06 🎬 MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

04:46 🔬 From Macro to Micro: Benchmarking Microscopic Spatial Intelligence on Molecules via Vision-Language Models

05:22 🧠 Thinking with Images via Self-Calling Agent

06:08 🧩 VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction

06:48 🤖 Evaluating Gemini Robotics Policies in a Veo World Simulator

07:30 🚀 Stronger Normalization-Free Transformers

08:05 📊 The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality

08:36 🎬 Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task

09:14 🌀 MoRel: Long-Range Flicker-Free 4D Motion Modeling via Anchor Relay-based Bidirectional Blending with Hierarchical Densification

09:50 🤖 Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale

[Follow Us]

You can also find us on the following platforms for more information beyond the podcast:

Xiaohongshu: AI速递