本期的 11 篇论文如下:
00:20 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models(GLM-4.5:智能体、推理与编程(ARC)基础模型)
00:47 👕 Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off(Voost:一种统一且可扩展的双向虚拟试穿与试脱扩散Transformer)
01:11 🎯 InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization(InfiGUI-G1:通过自适应探索策略优化推进 GUI 元素定位能力)
01:34 🧠 Memp: Exploring Agent Procedural Memory(Memp:探索智能体程序性记忆)
02:03 ✂ Pruning the Unsurprising: Efficient Code Reasoning via First-Token Surprisal(剪枝非关键信息:基于首令牌惊奇度的高效代码推理)
02:29 🪄 GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing(GENIE:用于神经辐射场交互式编辑的高斯编码)
02:50 📚 Adapting Vision-Language Models Without Labels: A Comprehensive Survey(无标签视觉-语言模型适应:一项全面综述)
03:15 🌍 MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs(MELLA:弥合低资源语言多模态大语言模型的语言能力与文化扎根性)
03:37 🧱 MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh(MeshLLM:赋能大型语言模型逐步理解和生成3D网格)
04:02 🎯 UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding(UI-AGILE:以有效强化学习和精准推断时定位提升图形用户界面智能体)
04:30 ✨ LightSwitch: Multi-view Relighting with Material-guided Diffusion(光开关:基于材料引导扩散的多视角重照明)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
