你有没有想过,我们如何给AI这头“吞金兽”来一次彻底的瘦身和压缩?如何为它设计一张分工明确的“组织架构图”,让“调度员”和“专家”各司其职?我们又该如何给它装上一个既能规避灾难性风险,又能动态调整预算的“安全大脑”?当AI自己当上“裁判”时,我们如何确保它不是在抛硬币?本期节目,我们将通过几篇最新的研究,一起探索如何让AI变得更高效、更聪明,也更可靠。
驯服AI的新兵法,“共享”与“压缩”
给AI画一张“组织架构图”,谁是调度员,谁是专家?
如何让AI既能干,又不出事?
AI当裁判,是明察秋毫,还是抛硬币?
给AI上好“紧箍咒”,它才能学得又快又稳
本期介绍的几篇论文:
[LG] Gefen: Optimized Stochastic Optimizer
[Reichman University & Tel Aviv University]
---
[LG] A theoretical model for task routing in mixture-of-expert transformers
[University of Sydney & Zhejiang University]
---
[LG] Utility-Constrained Policy Optimization
[York University & Google DeepMind]
---
[CL] The Coin Flip Judge? Reliability and Bias in LLM-as-a-Judge Evaluation
[A Yagubyan]
---
[LG] Diffusion Policy Optimization without Drifting Apart
[UC Berkeley]
![[人人能懂AI前沿] AI的瘦身术、组织图与紧箍咒](https://image.xyzcdn.net/FqWpK8fpivLboaqBbRHUe_BCOvxu.png@small)