人工智能论文英文版-Let’s Put Ourselves in Sally’s Shoes:Shoes-of-Others Prefixing Improves Theory of Mind in Large Language Models.pdfVIP
- 0
- 0
- 约6.7万字
- 约 14页
- 2025-06-13 发布于湖南
- 举报
Let’sPutOurselvesinSally’sShoes:Shoes-of-OthersPrefixingImproves
TheoryofMindinLargeLanguageModels
KazutoshiShinodaNobukatsuHojoKyosukeNishidaYoshihiroYamazaki
KeitaSuzukiHiroakiSugiyamaKunikoSaito
NTTCorporation,Japan
kazutoshi.shinoda@
Abstract
RecentstudieshaveshownthatTheoryof
5Mind(ToM)inlargelanguagemodels(LLMs)
2
0hasnotreachedhuman-levelperformanceyet.
2Sincefine-tuningLLMsonToMdatasetsoften
degradestheirgeneralization,severalinference-
n
utimemethodshavebeenproposedtoenhance
JToMinLLMs.However,existinginference-
6timemethodsforToMarespecializedforin-
ferringbeliefsfromcontextsinvolvingchanges
]intheworldstate.Inthisstudy,wepresent
Lanewinference-timemethodforToM,Shoes-
C.of-Others(SoO)prefixing,whichmakesfewer
sassumptionsaboutcontextsandisapplicableto
c
[broaderscenarios.SoOprefixingsimplyspeci-
fiesthebeginningofLLMoutputswith“Let’s
1putourselvesinA’sshoes.”,whereAdenotes
vthetargetcharacter’sname.WeevaluateSoOFigure1:Shoes-of-Othersprefixingspecifiesthebe-
0
7prefixingontwobenchmarksthatassessToMginningofoutputsandthenLLMsgeneratethecontin-
9inconversationalandnarrativecontextswithoutuation.TheaboveexamplefromToMATO(Shinoda
5chan
您可能关注的文档
- 人工智能论文英文版-Eigenspectrum Analysis of Neural Networks without Aspect Ratio.pdf
- 人工智能论文英文版-Cartridges:Lightweight and general-purpose long context.pdf
- 人工智能论文英文版-Distillation Robustifies Unlearning.pdf
- 人工智能论文英文版-PersonaAgent:When Large Language Model Agents Meet Personalization at Test Time.pdf
- 人工智能论文英文版-Reflect-then-Plan:Offline Model-Based Planning through a Doubly Bayesian Lens.pdf
- 人工智能论文英文版-DesignBench:A Comprehensive Benchmark for MLLM-based Front-end Code Generation.pdf
- 人工智能论文英文版-Visual Graph Arena: Evaluating Visual Conceptualization of Vision and Multimodal Large Language Models.pdf
- 人工智能论文英文版-“We need to avail ourselves of [GenAI] to enhance knowledge distribution”: Empowering Older Adults through GenAI Literacy.pdf
- 人工智能论文英文版-GenIR: Generative Visual Feedback for Mental Image Retrieval.pdf
- 人工智能论文英文版-Integer Linear Programming Preprocessing for Maximum Satisfiability.pdf
原创力文档

文档评论(0)