【赞助商】
OpenClaw快报
每天五分钟,听听 OpenClaw 快报,带你了解最新动态和业内讨论
传送门 www.xiaoyuzhoufm.com
【目录】
本期的 15 篇论文如下:
[] 🎬 LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editing(LiveEdit:迈向基于实时扩散的流式视频编辑)
[] 🧠 Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent(扩展智能体视野而非参数规模:以35B智能体达到万亿参数级性能)
[] 🤔 Agentic Abstention: Do Agents Know When to Stop Instead of Act?(智能体式弃权:智能体知道何时该停止而非行动吗?)
[] 💻 TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents(TUA-Bench:面向通用终端操作代理的基准测试)
[] 🗿 Trimming the Long-Tail of Visual World Modeling Evaluation(修剪视觉世界模型评估中的长尾分布)
[] 🧠 Video-MME-Logical: A Controlled Diagnostic Benchmark for Video Temporal-Logical Reasoning(视频MME-逻辑:一个用于视频时序逻辑推理的受控诊断基准)
[] 📊 Beyond IID: How General Are Tabular Foundation Models, Really?(超越独立同分布:表格基础模型的泛化能力究竟如何?)
[] 🏭 AsyncOPD: How Stale Can On-Policy Distillation Be?(异步OPD:策略蒸馏可以容忍多旧的数据?)
[] 🧠 ReFreeKV: Towards Threshold-Free KV Cache Compression(ReFreeKV:迈向无阈值KV缓存压缩)
[] 📱 Monte Carlo Energy Aggregation for Mobile 3D Gaussian Splatting(面向移动端三维高斯泼溅的蒙特卡洛能量聚合方法)
[] 🔧 TACO: Tool-Augmented Credit Optimization for Agentic Tool Use(工具增强信用优化:面向智能体工具使用的GRPO变体)
[] 🎥 Bridging VideoQA and Video-Guided Agentic Tasks via Generalized Keyframe Extraction(通过广义关键帧提取桥接视频问答与视频引导的智能体任务)
[] 🔍 Interleaved Speech Language Models Latently Work In Text(交错式语音语言模型在文本中潜在地工作)
[] 🤖 OSWorld2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks(OSWorld2.0:面向长时间跨度的真实世界任务的计算机使用智能体基准测试)
[] 🌍 DreamForge-World 0.1 Preview: A Low-Compute Real-Time Controllable World Model(DreamForge-World 0.1 预览版:一种低计算量、实时可控的世界模型)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
