- Published 2025-06-13 in Hunan
Hey, That’s My Data! Label-Only Dataset Inference in Large Language Models

Chen Xiong¹, Zihao Wang², Rui Zhu³, Tsung-Yi Ho¹, Pin-Yu Chen⁴, Jingwei Xiong⁵, Haixu Tang², Lucila Ohno-Machado³

¹The Chinese University of Hong Kong, ²Indiana University Bloomington, ³Yale University, School of Medicine, ⁴IBM Research AI, ⁵University of California, Davis
6 Jun 2025

Abstract—Large Language Models (LLMs) have revolutionized Natural Language Processing by excelling at interpreting, reasoning about, and generating human language. However, their reliance on large-scale, often proprietary datasets poses a critical challenge: unauthorized usage of such data can lead to copyright infringement and significant financial harm. Existing dataset-inference methods typically depend on log probabilities to detect suspicious training material, yet many leading LLMs have begun wit

Nonetheless, this surge in development has also sparked debates regarding unauthorized usage of data. In particular, copyrighted content may be embedded in model training sets without proper consent, infringing on the rights of authors and potentially leading to financial harm [23], [34], [44]. A notable example is the New York Times filing a lawsuit against OpenAI and Microsoft over the alleged improper use of their copyrighted materials for training