[CL] DiffuCoder:Understanding and Improving Masked Diffusion Models for Code Generation
[Apple]
---
[LG] Language Modeling by Language Models
[Allen Institute for AI]
---
[CL] Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs
[Harvard University]
---
[LG] Mastering Multiple-Expert Routing: Realizable H-Consistency and Strong Guarantees for Learning to Defer
[Courant Institute of Mathematical Sciences & Google Research]
---
[LG] Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards
[FAIR at Meta]

