本期的 10 篇论文如下:
00:25 🖼 MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models(多图像增强直接偏好优化:大型视觉语言模型)
01:09 🌍 WorldSimBench: Towards Video Generation Models as World Simulators(世界模拟器:迈向视频生成模型作为世界模拟器)
01:47 🌊 Scaling Diffusion Language Models via Adaptation from Autoregressive Models(通过自回归模型适应扩展扩散语言模型)
02:20 📱 Lightweight Neural App Control(轻量级神经应用控制)
03:01 🏠 ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding(ARKit标签制造者:室内3D场景理解的新尺度)
03:47 🖼 Scalable Ranked Preference Optimization for Text-to-Image Generation(可扩展的文本到图像生成中的排序偏好优化)
04:23 🌆 DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes(动态城市:动态场景的大规模LiDAR生成)
05:05 🩺 MedINST: Meta Dataset of Biomedical Instructions(医学指令元数据集:MedINST)
05:52 🌍 M-RewardBench: Evaluating Reward Models in Multilingual Settings(多语言环境下的奖励模型评估:M-RewardBench)
06:27 📊 TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts(TP-Eval:通过定制提示挖掘多模态大语言模型的评估潜力)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递