微软Salesforce大模型容易在多轮对话中迷失方向.pdfVIP

  • 0
  • 0
  • 约20.34万字
  • 约 36页
  • 2026-03-02 发布于北京
  • 举报

微软Salesforce大模型容易在多轮对话中迷失方向.pdf

LLMSGETLOSTINMULTI-TURNCONVERSATION

PhilippeLaban∗♢HiroakiHayashi∗♣YingboZhou♣JenniferNeville♢

♢MicrosoftResearch♣SalesforceResearch

{plaban,jenneville}@

{hiroakihayashi,yingbo.zhou}@

5ABSTRACT

2

0LargeLanguageModels(LLMs)areconversationalinterfaces.Assuch,LLMshavethepotentialto

2assisttheirusersnotonlywhentheycanfullyspecifythetaskathand,butalsotohelpthemdefine,

yexplore,andrefinewhattheyneedthroughmulti-turnconversationalexchange.Althoughanalysisof

aLLMconversationlogshasconfirmedthatunderspecificationoccursfrequentlyinuserinstructions,

LLMevaluationhaspredominantlyfocusedonthesingle-turn,fully-specifiedinstructionsetting.In

Mthiswork,weperformlarge-scalesimulationexperimentstocompareLLMperformanceinsingle-

9andmulti-turnsettings.Ourexperimentsconfirmthatallthetopopen-andclosed-weightLLMs

wetestexhibitsignificantlylowerperformanceinmulti-turnconversationsthansingle-turn,with

]anaveragedropof39%acrosssixgenerationtasks.Analysisof200,000+simulatedconversations

Ldecomposestheperformancedegradationintotwocomponents:aminorlossinaptitudeanda

Csignificantincreaseinunreliability.WefindthatLLMsoft

文档评论(0)

1亿VIP精品文档

相关文档