The 15 papers in this episode are:
00:24 💡 Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models
01:02 🤖 System Prompt Optimization with Meta-Learning
01:47 🤖 EnerVerse-AC: Envisioning Embodied Environments with Action Condition
02:29 🧠 The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think
03:17 🤖 EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models
03:57 🖼 End-to-End Vision Tokenizer Tuning
04:34 📈 WorldPM: Scaling Human Preference Modeling
05:13 🤖 MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering
06:01 🧩 Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning
06:43 🎨 Style Customization of Text-to-Vector Generation with Image Diffusion Priors
07:25 🧠 J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning
08:07 👉 PointArena: Probing Multimodal Grounding Through Language-Guided Pointing
08:47 🖼 Depth Anything with Any Prior
09:29 🖼 OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
10:14 🚀 Parallel Scaling Law for Language Models

[Follow Us]
You can also find us on the following platforms for more information beyond the podcast itself.
Xiaohongshu: AI速递