End-to-End Neural Network Model for Question Answering on the SQuAD Dataset.pdf

About 41,500 characters
About 30 pages
Published 2026-01-20 in Beijing

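Question Answering on SQuAD Dataset

Zihuan Diao
Department of Computer Science
Stanford University
diaozh@stanford.edu

Junjie Dong, Jiaxing Geng
Department of Electrical Engineering
Stanford University
{junjied,jg755}@stanford.edu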

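Machine comprehension (MC), answering a question based on a given context, has gained significant popularity in recent years. In this paper, we present an end-to-end neural network model for the SQuAD dataset. On the hidden test set, our single model achieved an F1 score of 79.9 and an EM score of 70.7, and an ensemble of 7 models achieved an F1 score of 81.9 and an EM score of 73.8.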

1 Introduction

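Machine comprehension has attracted wide attention in recent years because of its broad real-world applications and its theoretical value to the field of natural language processing. One dataset that has driven major progress in the field is the Stanford Question Answering Dataset (SQuAD) [1], which consists of more than 100,000 questions posed by crowdworkers on Wikipedia articles.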

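In the SQuAD machine comprehension task, given a context and a question, the machine needs to read and synthesize the context, and then answer the question in natural language based on the information presented in the context. The context is a sequence of word tokens $C = [c_1, \ldots, c_N]$ and the question is $Q = [q_1, \ldots, q_M]$, where $N$ is the length of the context and $M$ is the length of the question. The SQuAD dataset restricts the answer $A$ to be a contiguous sub-span of the context;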
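To make the span restriction concrete, the following minimal sketch (not the authors' code) represents an answer as an inclusive pair of start and end indices into the context tokens. The whitespace tokenizer, the function name `extract_span`, and the example sentence are illustrative assumptions that do not come from the paper.

```python
from typing import List, Tuple


def extract_span(context_tokens: List[str], span: Tuple[int, int]) -> List[str]:
    """Return the answer tokens for an inclusive (start, end) index pair."""
    start, end = span
    # A valid answer must lie fully inside the context, with start <= end.
    if not (0 <= start <= end < len(context_tokens)):
        raise ValueError(
            f"invalid span {span} for context of length {len(context_tokens)}"
        )
    return context_tokens[start:end + 1]


# Hypothetical example; whitespace splitting stands in for a real tokenizer.
context = "the cat sat on the red mat".split()
question = "where did the cat sit ?".split()

# The answer is identified only by its position in the context.
answer_tokens = extract_span(context, (4, 6))
answer_text = " ".join(answer_tokens)
print(answer_text)
```

Under this representation a model only has to predict two positions in the context rather than generate free-form text, which is what the contiguous-span restriction permits.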

Question Answering on SQuAD Dataset

Zihuan Diao
Department of Computer Science
Stanford University
diaozh@stanford.edu

Junjie Dong, Jiaxing Geng
Department of Electrical Engineering
Stanford University
{junjied,jg755}@stanford.edu

Machine comprehension (MC), answering a question based on a given context, has gained significant popularity in recent years. In this paper, we present an end-to-end neural network model for the SQuAD dataset. On the hidden test set, our single model achieved F1 score 79.9 and EM 70.7, and an ensemble of 7 models achieved F1 score 81.9 and EM 73.8.

1 Introduction

Machine comprehension has gained significant popularity over recent years due to its wide applications in real life as well as theoretical values for the field of natural language processing. One particular dataset that led to huge advancement in the field is the Stanford Question Answering Dataset (SQuAD) [1], which consists of more than 100,000 questions posed by crowdworkers on Wikipedia articles.

In the SQuAD machine comprehension task, given a context and a question, the machine needs to read and synthesize the context, and then answer the question in natural language based on information presented in the context.
