本期的 9 篇论文如下:
00:17 🚀 Diffusion Language Models are Super Data Learners(扩散语言模型是超级数据学习者)
01:06 🎬 UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions(统一音视频生成的不对称跨模态交互方法)
01:42 🧩 LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation(LEGO-Eval:面向具身3D环境合成工具增强细粒度评测)
02:25 📊 Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning(Orion-MSP:面向表格上下文学习的多尺度稀疏注意力机制)
03:15 📊 TabTune: A Unified Library for Inference and Fine-Tuning Tabular Foundation Models(TabTune:面向表格基础模型推理与微调的一站式统一库)
03:46 🦾 Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects(Kinematify:开放词汇的高自由度关节物体合成)
04:30 🧠 MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity(MME-CC:一项面向多模态认知能力的挑战性评测基准)
05:06 📈 LiveTradeBench: Seeking Real-World Alpha with Large Language Models(LiveTradeBench:用大模型在真实市场里挖掘超额收益)
05:55 🔍 Let Multimodal Embedders Learn When to Augment Query via Adaptive Query Augmentation(多模态嵌入器自适应决定何时增强查询的所罗门方法)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
