[人人能懂] AI的“开窍”秘诀:一行代码如何胜过千军万马?

[人人能懂] AI的“开窍”秘诀:一行代码如何胜过千军万马?

25分钟 ·
播放数135
·
评论数0

00:41:15 AI防忽悠指南:如何让聪明的机器不说胡话? 

00:05:37 想变强?别再刷旧题了,你得学会自己“造”难题 

00:10:06 AI进阶的秘密:一行代码如何让“学霸”真正开窍? 

00:14:46 AI的新玩法:从“搬运工”到“侦探” 

00:19:10 AI也会“想太多”?聊聊如何给模型一颗“定心丸” 

本期介绍的五篇论文:

[CL] Learning to Reason for Factuality  

[FAIR at Meta]  

arxiv.org 

---

[CL] MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy  

[Tsinghua University]  

arxiv.org 

---

[LG] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification  

[Southeast University & University of California, Los Angeles]  

arxiv.org 

---

[LG] GRAIL: Learning to Interact with Large Knowledge Graphs for Retrieval Augmented Reasoning  

[Tsinghua University]  

arxiv.org 

---

[CL] Efficient Reasoning for Large Reasoning Language Models via Certainty-Guided Reflection Suppression  

[Peking University & The Hong Kong University of Science and Technology]  

arxiv.org