人工智能论文人工智能论文Using DeepSpeed and Megatron to Train Megatron-Turing NLG.pdfVIP

  • 0
  • 0
  • 约14.92万字
  • 约 44页
  • 2026-04-28 发布于浙江
  • 举报

人工智能论文人工智能论文Using DeepSpeed and Megatron to Train Megatron-Turing NLG.pdf

UsingDeepSpeedandMegatrontoTrainMegatron-TuringNLG

530B,ALarge-ScaleGenerativeLanguageModel

,,

ShadenSmith,MostofaPatwary,BrandonNorick,PatrickLeGresley,Samyam

*

Rajbhandari,JaredCasper,ZhunLiu,ShrimaiPrabhumoye,GeorgeZerveas,Vijay

文档评论(0)

1亿VIP精品文档

相关文档