深度并非越优:通过置信层解码降低对齐损耗 Deeper is Not Always Better Mitigating the Alignment Tax via Confident Layer Decoding.pptx
May,2026.
DeeperisNotAlwaysBetter:MitigatingtheAlignmentTaxvia
ConfidentLayerDecoding
XuanmingZhang*1,SiningZhoubian*2,YuxuanChen1,TianyiTang1,AnYang1,Sean
Du3,ChujieZheng1,FeiHuang1,DayihengLiu1,GaoHuang2,JingrenZhou1
1QwenTeam,AlibabaInc.2TsinghuaUniversity
原创力文档

文档评论(0)