【赞助商】
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗www.xiaoyuzhoufm.com
【目录】
本期的 15 篇论文如下:
00:30 🚀 TAPS: Task Aware Proposal Distributions for Speculative Sampling(TAPS:面向推测采样的任务感知提议分布)
01:11 🔬 Towards a Medical AI Scientist(迈向医学AI科学家)
02:03 🔍 Gen-Searcher: Reinforcing Agentic Search for Image Generation(Gen-Searcher:强化图像生成的代理搜索)
02:43 ⚠ Emergent Social Intelligence Risks in Generative Multi-Agent Systems(生成式多智能体系统中的涌现社会智能风险)
03:22 ⚙ EpochX: Building the Infrastructure for an Emergent Agent Civilization(EpochX:构建涌现性智能体文明的基础设施)
04:01 📊 GEditBench v2: A Human-Aligned Benchmark for General Image Editing(GEditBench v2:一个面向人类对齐的通用图像编辑基准)
05:00 🧠 On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models(论令牌的困境:用于大型视觉语言模型持续学习的、具有漂移感知令牌分配能力的动态混合专家模型)
05:56 🔬 PRBench: End-to-end Paper Reproduction in Physics Research(PRBench:物理学研究中的端到端论文复现基准)
06:37 🧠 Make Geometry Matter for Spatial Reasoning(让几何信息在空间推理中发挥作用)
07:28 🖼 ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks(ImagenWorld:基于可解释人类评估对开放世界任务进行图像生成模型的压力测试)
08:18 🎨 On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers(基于上下文空间即时排斥的扩散变换器多样性增强研究)
09:11 🧠 MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences(MuSEAgent:一种具备状态化经验的多模态推理智能体)
09:55 ⚡ Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization(Kernel-Smith:进化式内核优化的统一方案)
10:55 🎯 ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning(ResAdapt:面向高效多模态推理的自适应分辨率)
12:07 🔍 Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design(Marco DeepResearch:通过以验证为中心的设计解锁高效深度研究智能体)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
