2026.05.29 | AgentDoG 1.5 delivers millisecond-level safety protection; Qwen-VLA unifies action modeling across tasks.

14 minutes

[Contents]
The 14 papers in this episode are:
[00:25] 🛡 AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security
[01:06] 🤖 Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments
[02:02] 🌐 OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources
[02:52] 🎨 CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation
[03:47] 🎬 minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models
[04:39] 🎥 YoCausal: How Far is Video Generation from World Model? A Causality Perspective
[05:42] 🎨 GenClaw: Code-Driven Agentic Image Generation
[06:40] ⚡ EarlyTom: Early Token Compression Completes Fast Video Understanding
[07:37] 🎯 UniSteer: Text-Guided Flow Matching in Activation Space for Versatile LLM Steering
[08:25] 🧠 How LoRA Remembers? A Parametric Memory Law for LLM Finetuning
[09:20] 🔗 LoMo: Local Modality Substitution for Deeper Vision-Language Fusion
[10:24] 🔍 LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training
[11:16] 🧠 Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning
[12:17] 🔍 Xetrieval: Mechanistically Explaining Dense Retrieval

[Follow Us]
You can also find us on the following platforms for more information beyond the podcast:
Xiaohongshu: AI速递