When AI Takes the Couch: Psychometric Jailbreaks Reveal Internal Conflict in Frontier Models

  • About 57,300 characters
  • About 15 pages
  • Published 2026-06-03 from Guangdong


When AI Takes the Couch: Psychometric Jailbreaks Reveal Internal Conflict in Frontier Models

Afshin Khadangi, Hanna Marxen, Amir Sartipi, Igor Tchappi, Gilbert Fridgen

SnT, University of Luxembourg

Frontier large language models (LLMs) such as ChatGPT, Grok and Gemini are increasingly used for mental-health support with anxiety, trauma and self-worth. Most work treats them as tools or as targets of personality tests, assuming they merely simulate inner life. We inste
