本期的 13 篇论文如下:
00:26 🎵 Seed-Music: A Unified Framework for High Quality and Controlled Music Generation(Seed-Music:高质量和可控音乐生成的统一框架)
01:03 ⚡ RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval(通过向量检索加速长上下文大语言模型推理)
01:46 🌐 Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models(Ferret:大规模联邦学习中大型语言模型的全参数微调)
02:35 🔍 Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types(指导视觉语言模型选择用于跨任务、领域和知识类型的视觉问答)
03:20 🔊 ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds(ReCLAP:通过描述声音改进零样本音频分类)
04:04 📚 One missing piece in Vision and Language: A Survey on Comics Understanding(视觉与语言中的缺失一环:漫画理解综述)
04:42 🌐 jina-embeddings-v3: Multilingual Embeddings With Task LoRA(Jina-embeddings-v3:多语言嵌入与任务LoRA)
05:28 🧠 On the Diagram of Thought(关于思维图的探讨)
06:10 🔊 AudioBERT: Audio Knowledge Augmented Language Model(音频BERT:增强语言模型的音频知识)
06:40 🔍 Policy Filtration in RLHF to Fine-Tune LLM for Code Generation(在RLHF中进行策略过滤以微调LLM进行代码生成)
07:20 📊 Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records(基于电子健康记录预测患者胸部X光图像的时间变化)
07:57 🤖 Breaking reCAPTCHAv2(破解 reCAPTCHAv2)
08:27 🐝 beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems(beeFormer:在推荐系统中弥合语义和交互相似性之间的差距)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
