2024.12.31 每日AI论文 | 解释性指令提升视觉任务泛化，多模态模型优化医学影像泛化。 - HuggingFace 每日AI论文速递

本期的 10 篇论文如下：

00:25 🔍 Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization（解释性指令：迈向统一视觉任务理解与零样本泛化）

01:13 🧠 On the Compositional Generalization of Multimodal LLMs for Medical Imaging（多模态大语言模型在医学影像中的组合泛化研究）

02:02 ⚙ Efficiently Serving LLM Reasoning Programs with Certaindex（高效服务LLM推理程序的Certaindex系统）

02:44 🎨 Edicho: Consistent Image Editing in the Wild（Edicho：在野外图像中的一致性编辑）

03:22 🎵 TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization（TangoFlux：基于流匹配和CLAP排序偏好优化的超快速且忠实文本到音频生成）

04:04 🎥 Bringing Objects to Life: 4D generation from 3D objects（赋予物体生命：从3D物体生成4D内容）

04:47 🧠 Facilitating large language model Russian adaptation with Learned Embedding Propagation（通过学习嵌入传播促进大语言模型的俄语适应）

05:25 🤖 HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation（HumanEval Pro与MBPP Pro：评估大语言模型在自调用代码生成上的表现）

06:12 🤖 Training Software Engineering Agents and Verifiers with SWE-Gym（使用SWE-Gym训练软件工程代理与验证器）

06:52 🧠 OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System（OneKE：基于Docker化模式引导的LLM代理知识提取系统）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递