- 1、本文档共34页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
ConstitutionalAI:HarmlessnessfromAIFeedback
YuntaoBai,SauravKadavath,SandipanKundu,AmandaAskell,JacksonKernion,
AndyJones,AnnaChen,AnnaGoldie,AzaliaMirhoseini,CameronMcKinnon,
CarolChen,CatherineOlsson,ChristopherOlah,DannyHernandez,DawnDrain,
2
2DeepGanguli,DustinLi,EliTran-Johnson,EthanPerez,JamieKerr,JaredMueller,
0
2JeffreyLadish,JoshuaLandau,KamalNdousse,KamileLukosuite,LianeLovitt,
cMichaelSellitto,NelsonElhage,NicholasSchiefer,NoemiMercado,NovaDasSarma,
e
DRobertLasenby,RobinLarson,SamRinger,ScottJohnston,ShaunaKravec,
5SheerElShowk,StanislavFort,TameraLanham,TimothyTelleen-Lawton,TomConerly,
1
TomHenighan,TristanHume,SamuelR.Bowman,ZacHatfield-Dodds,BenMann,
]
LDarioAmodei,NicholasJoseph,SamMcCandlish,TomBrown,JaredKaplan
C
.
s
cAnthropic
[
1
v
3Abstract
7
0
8AsAIsystemsbecomemorecapable,wewouldliketoenlisttheirhelptosupervise
0otherAIs.WeexperimentwithmethodsfortrainingaharmlessAIassistantthroughself-
.improvement,withoutanyhumanlabelsidentifyingharmfuloutputs.Theonlyhuman
2oversightisprovidedthroughalistofrulesorprinciples,andsowerefertothemethodas
1‘ConstitutionalAI’.Theprocessinvolvesbothasupervisedlearnin
您可能关注的文档
- 《知识图谱与大模型融合实践研究报告》.pdf
- 6G内生AI架构及AI大模型.pdf
- 2023中国大模型市场商业化进展研究报告.pdf
- AIGC人才趋势洞察报告-猎聘.pdf
- PyTorch模型训练调优&GPU并行加速宝典.pdf
- 大模型综述 97页 英文版.pdf
- 大语言模型在推荐系统的实践应用.pdf
- 多态大模型平台的应用研发与思考.pdf
- 推荐系统&大模型.pdf
- 26-YOUR VIT BUT FASTER大模型资料高清版.pdf
- 19-Scaling Language Models Methods, Analysis大模型资料高清版.pdf
- 14-CRAMMING TRAINING A LANGUAGE MODEL ON A大模型资料高清版.pdf
- 13-Efficient Transformers A Survey大模型资料高清版.pdf
- 17-A Suite for Analyzing Large Language Models大模型资料高清版.pdf
- 15-LOW-RANK ADAPTATION OF LARGE LANGUAGE MODELS大模型资料高清版.pdf
- 10-Fast and Memory-Efficient Exact Attention大模型资料高清版.pdf
- 09-A Survey on Efficient Training of Transformers大模型资料高清版.pdf
- 18-Training Compute-Optimal Large Language Models大模型资料高清版.pdf
- 05-Transformer with Dual Residual大模型资料高清版.pdf
- 22-Fine-Tuning Language Models from Human Preferences大模型资料高清版.pdf
文档评论(0)