“Evaluate LLM-powered Products” series, EP2!
In this episode, I share what “accuracy” really means when it comes to LLMs and AI-powered products. We explore why traditional metrics like BLEU and ROUGE often fall short, how LLM-as-a-judge methods work, and why multi-turn conversations are especially tricky to evaluate. I also share practical tips, rubrics, and personal lessons learned from my own experiments.
Subscribe to the "Data Science x AI" newsletter to get updates!

