2026.02.20 | 砍95%注意力画质反升;边压缩边生成FID 1.4

2026.02.20 | 砍95%注意力画质反升;边压缩边生成FID 1.4

13分钟 ·
播放数112
·
评论数0

【赞助商】

通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事

传送门 🔗www.xiaoyuzhoufm.com

【目录】

本期的 15 篇论文如下:

00:31 ⚡ SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning(SpargeAttention2:通过混合Top-k+Top-p掩码与蒸馏微调实现可训练的稀疏注意力)

01:27 🧠 Unified Latents (UL): How to train your latents(统一隐变量(UL):如何训练你的隐变量)

02:05 🤖 Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents(Mobile-Agent-v3.5:多平台基础图形用户界面智能体)

02:58 🚗 "What Are You Doing?": Effects of Intermediate Feedback from Agentic LLM In-Car Assistants During Multi-Step Processing(“你在做什么?”:多步骤处理过程中来自具身化LLM车载助手的中间反馈效果研究)

03:45 ⚠ Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5(前沿人工智能风险管理框架实践:风险分析技术报告 v1.5)

04:40 ⚡ DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers(DDiT:面向高效扩散变换器的动态补丁调度)

05:39 🧠 Arcee Trinity Large Technical Report(Arcee Trinity 大型技术报告)

06:23 🖥 Computer-Using World Model(计算机使用世界模型)

07:20 🔬 ArXiv-to-Model: A Practical Study of Scientific LM Training(ArXiv到模型:科学语言模型训练的实践研究)

07:59 🧬 Discovering Multiagent Learning Algorithms with Large Language Models(利用大语言模型发现多智能体学习算法)

08:42 🖐 TactAlign: Human-to-Robot Policy Transfer via Tactile Alignment(TactAlign:通过触觉对齐实现人机策略迁移)

09:24 🤖 FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment(FRAPPE:通过多未来表示对齐将世界建模注入通用策略)

10:08 🧠 World Models for Policy Refinement in StarCraft II(用于《星际争霸II》策略优化的世界模型)

10:46 ⚡ 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy(2Mamba2Furious:线性复杂度,媲美准确度)

11:19 🤿 StereoAdapter-2: Globally Structure-Consistent Underwater Stereo Depth Estimation(StereoAdapter-2:全局结构一致的水下立体深度估计)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递