- 1
- 0
- 约16.1万字
- 约 52页
- 2026-05-22 发布于浙江
- 举报
DeepSeek-V2:AStrong,Economical,andEfficient
Mixture-of-ExpertsLanguageModel
DeepSeek-AI
research@
Abstract
4
2
0WepresentDeepSeek-V2,astrongMixture-of-Experts(MoE)languagemodelcharacterizedby
2
economicaltrainingandefficientinference.Itcomprises236Btotalparameters,ofwhich21B
n
原创力文档

文档评论(0)