【Contents】
The 10 papers in this episode are:
[] TOP1 (🔥626) | 🏆 GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning
[] TOP2 (🔥501) | 📈 Adam's Law: Textual Frequency Law on Large Language Models
[] TOP3 (🔥364) | 🔄 DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models
[] TOP4 (🔥350) | 🧠 FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
[] TOP5 (🔥341) | 🚁 CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence
[] TOP6 (🔥323) | 🧠 Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability
[] TOP7 (🔥289) | 🧬 SkillClaw: Let Skills Evolve Collectively with Agentic Evolver
[] TOP8 (🔥261) | 🤖 ClawBench: Can AI Agents Complete Everyday Online Tasks?
[] TOP9 (🔥252) | 🔄 Recursive Multi-Agent Systems
[] TOP10 (🔥249) | 👗 Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items

【Follow Us】
You can also find us on the following platforms for more information beyond the podcast:
Xiaohongshu: AI速递

