【目录】
本期的 15 篇论文如下:
00:23 👗 Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items(Tstars-Tryon 1.0:面向多样化时尚商品的鲁棒且逼真的虚拟试穿系统)
01:05 🎬 CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation(CoInteract:通过空间结构化协同生成实现物理一致的人-物交互视频合成)
01:58 🤖 AgentSPEX: An Agent SPecification and EXecution Language(AgentSPEX:一种智能体规范与执行语言)
02:51 📐 AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model(AnyRecon:基于视频扩散模型的任意视角三维重建)
03:33 🚀 TEMPO: Scaling Test-time Training for Large Reasoning Models(TEMPO:扩展大型推理模型的测试时训练规模)
04:26 🎮 PlayCoder: Making LLM-Generated GUI Code Playable(PlayCoder:让LLM生成的GUI代码可玩)
05:08 🕶 ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning(ShadowPEFT:用于参数高效微调的影子网络)
05:58 🤖 Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language(Chat2Workflow:基于自然语言生成可执行视觉工作流的基准)
06:44 ⚖ AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation(AJ-Bench:面向环境感知评估的Agent-as-a-Judge基准测试)
07:31 🔄 Dual-View Training for Instruction-Following Information Retrieval(面向指令跟随信息检索的双视图训练)
08:41 🔍 Code-Switching Information Retrieval: Benchmarks, Analysis, and the Limits of Current Retrievers(代码转换信息检索:基准测试、分析与当前检索系统的局限)
09:20 🔗 Understanding and Enforcing Weight Disentanglement in Task Arithmetic(理解与强制任务算术中的权重解耦)
10:00 ⚡ Speculative Decoding for Autoregressive Video Generation(用于自回归视频生成的推测解码)
11:01 🧠 Target-Oriented Pretraining Data Selection via Neuron-Activated Graph(基于神经元激活图的目标导向预训练数据选择)
11:41 🧩 UniMesh: Unifying 3D Mesh Understanding and Generation(UniMesh:统一三维网格理解与生成)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
