2026.01.28 | AgentDoG筑护栏诊断风险根源;AdaReasoner排工具小模型逆袭

2026.01.28 | AgentDoG筑护栏诊断风险根源;AdaReasoner排工具小模型逆袭

12分钟 ·
播放数106
·
评论数0

【赞助商】

通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事

传送门 🔗www.xiaoyuzhoufm.com

【目录】

本期的 14 篇论文如下:

00:30 🛡 AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security(AgentDoG:面向AI智能体安全与安全的诊断性护栏框架)

01:21 🧩 AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning(AdaReasoner:面向迭代式视觉推理的动态工具编排)

02:11 🤖 A Pragmatic VLA Foundation Model(一个实用的VLA基础模型)

02:56 🧠 Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models(视觉生成通过多模态世界模型解锁类人推理)

03:39 🌍 World Craft: Agentic Framework to Create Visualizable Worlds via Text(World Craft:通过文本创建可视化世界的智能体框架)

04:26 🧠 AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking(AVMeme 考试:针对大语言模型情境与文化知识与思维能力的多模态多语言多文化基准测试)

05:08 🌲 FABLE: Forest-Based Adaptive Bi-Path LLM-Enhanced Retrieval for Multi-Document Reasoning(FABLE:基于森林的自适应双路径LLM增强检索用于多文档推理)

05:55 🛡 TriPlay-RL: Tri-Role Self-Play Reinforcement Learning for LLM Safety Alignment(TriPlay-RL:面向大语言模型安全对齐的三角色自博弈强化学习)

06:44 🎯 Selective Steering: Norm-Preserving Control Through Discriminative Layer Selection(选择性导向:通过判别性层选择实现规范保持的控制)

07:17 ⚡ Revisiting Parameter Server in LLM Post-Training(重新审视大语言模型后训练中的参数服务器范式)

08:00 🧠 Post-LayerNorm Is Back: Stable, ExpressivE, and Deep(后层归一化回归:稳定、高表达且深度的Transformer架构)

08:38 🧬 GPCR-Filter: a deep learning framework for efficient and precise GPCR modulator discovery(GPCR-Filter:用于高效精准GPCR调节剂发现的深度学习框架)

09:39 ⚠ HalluCitation Matters: Revealing the Impact of Hallucinated References with 300 Hallucinated Papers in ACL Conferences(幻觉引用问题:基于ACL会议中300篇幻觉论文揭示其影响)

10:38 📊 Benchmarks Saturate When The Model Gets Smarter Than The Judge(当模型比评估者更聪明时,基准测试趋于饱和)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递