The 11 papers in this episode:
00:25 🧠 rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
01:06 🧠 URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics
01:45 🧠 Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought
02:25 🔬 Agent Laboratory: Using LLM Agents as Research Assistants
03:02 🔬 LLM4SR: A Survey on Large Language Models for Scientific Research
03:44 🔍 GeAR: Generation Augmented Retrieval
04:22 🤖 InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection
05:02 🐦 Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation
05:41 🖼 SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
06:17 🧠 DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization
06:55 🌳 EpiCoder: Encompassing Diversity and Complexity in Code Generation

[Follow Us]
You can also find us on the following platform for more information beyond the podcast:
Xiaohongshu: AI速递
