2025.08.11 | GLM-4.5统一智能体推理编程；Voost高保真虚拟试穿试脱 - HuggingFace 每日AI论文速递

本期的 11 篇论文如下：

00:20 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models（GLM-4.5：智能体、推理与编程（ARC）基础模型）

00:47 👕 Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off（Voost：一种统一且可扩展的双向虚拟试穿与试脱扩散Transformer）

01:11 🎯 InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization（InfiGUI-G1：通过自适应探索策略优化推进 GUI 元素定位能力）

01:34 🧠 Memp: Exploring Agent Procedural Memory（Memp：探索智能体程序性记忆）

02:03 ✂ Pruning the Unsurprising: Efficient Code Reasoning via First-Token Surprisal（剪枝非关键信息：基于首令牌惊奇度的高效代码推理）

02:29 🪄 GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing（GENIE：用于神经辐射场交互式编辑的高斯编码）

02:50 📚 Adapting Vision-Language Models Without Labels: A Comprehensive Survey（无标签视觉-语言模型适应：一项全面综述）

03:15 🌍 MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs（MELLA：弥合低资源语言多模态大语言模型的语言能力与文化扎根性）

03:37 🧱 MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh（MeshLLM：赋能大型语言模型逐步理解和生成3D网格）

04:02 🎯 UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding（UI-AGILE：以有效强化学习和精准推断时定位提升图形用户界面智能体）

04:30 ✨ LightSwitch: Multi-view Relighting with Material-guided Diffusion（光开关：基于材料引导扩散的多视角重照明）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递