AI Radio FM - Technology Channel: GShard and Giant Models

AI Radio FM - Technology Channel: GShard and Giant Models

9分钟 ·
播放数0
·
评论数0

A deep dive into GShard, a module for scaling giant neural networks, focusing on its application to multilingual machine translation and its impact on training efficiency and model quality.