你有没有想过,AI也能不走寻常路,学会“抄近路”写文章吗?或者,当AI陷入追求高分的“内卷陷阱”时,我们该怎么教它“最小化遗憾”而不是盲目刷分?本期节目,我们将从几篇最新的论文出发,看看AI如何通过更换“发动机”、打通不同门派的“武功”,甚至用更少的考题更精准地对齐我们的真实感受,实现一次漂亮的思维跃迁。
抄近路,人工智能学会了新“导航术”
AI的“注意力”,正在成为它的“负担”
AI的“高分陷阱”,我们怎样教得更聪明?
当规则遇上“混沌”,AI大神们的两种武功,原来同宗同源
为什么最好的考卷,题目反而最少?
本期介绍的几篇论文:
[LG] Consistent Diffusion Language Models
[Microsoft & Purdue University]
---
[LG] Caracal: Causal Architecture via Spectral Mixing
[Huawei Technologies]
---
[LG] Wasserstein Distributionally Robust Regret Optimization for Reinforcement Learning from Human Feedback
[University of North Carolina & Imperial College London & Stanford University]
---
[LG] Trees to Flows and Back: Unifying Decision Trees and Diffusion Models
[Technical University of Munich]
---
[CL] Putting HUMANS first: Efficient LAM Evaluation with Human Preference Alignment
[University of Southern California & Stanford University]
![[人人能懂AI前沿] AI学会了抄近路、换引擎和吃“后悔药”](https://image.xyzcdn.net/FqWpK8fpivLboaqBbRHUe_BCOvxu.png@small)