- 1、本文档共8页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
Generic Sentence Fusion is an Ill-Defined Summarization Task
Generic Sentence Fusion is an Ill-Defined Summarization Task
Hal Daume? III and Daniel Marcu
Information Sciences Institute
University of Southern California
4676 Admiralty Way, Suite 1001
Marina del Rey, CA 90292
{hdaume,marcu}@
Abstract
We report on a series of human evaluations of
the task of sentence fusion. In this task, a hu-
man is given two sentences and asked to produce
a single coherent sentence that contains only the
important information from the original two.
Thus, this is a highly constrained summariza-
tion task. Our investigations show that even
at this restricted level, there is no measurable
agreement between humans regarding what in-
formation should be considered important. We
further investigate the ability of separate eval-
uators to assess summaries, and find similarly
disturbing lack of agreement.
1 Introduction and Motivation
The practices of automatic summarization
vary widely across many dimensions, including
source length, summary length, style, source,
topic, language, and structure. Most typical are
summaries of a single news document down to
a headline or short summary, or of a collection
of news documents down to a headline or short
summary (Hahn and Harman, 2002). A few re-
searchers have focused on other aspects of sum-
marization, including single sentence (Knight
and Marcu, 2002), paragraph or short document
(Daume? III and Marcu, 2002), query-focused
(Berger and Mittal, 2000), or speech (Hori et
al., 2003).
The techniques relevant to, and the challenges
faced in each of these tasks can be quite dif-
ferent. Nevertheless, they all rely on one crit-
ical assumption: there exists a notion of (rel-
ative) importance between pieces of informa-
tion in a document (or utterance), regardless
of whether we can detect this or not. Indeed,
recent research has looked at this question in de-
tail, and can be rather cleanly divided into two
partitions. The first partition aims to develop
manual evaluation criteria for determining the
quality o
您可能关注的文档
- EasyASURO 中文精要操作步骤.pdf
- E-SMARTER-COMMERCE概况.ppt [Compatibility Mode].pdf
- easyUI属性汇总.pdf
- EasyUI组件使用.pdf
- EBSCOhost2.0a.ppt
- EB_Propsim_F8.pdf
- ebookchain白皮书.pdf
- Ecological Optimization For A Generalized Irreversible Carnot Engine With An Universal Heat.pdf
- Econometrics_Slide02.pdf
- Eclipse+ADT+NDK调试C源码的方法.pdf
- GeneralDynamics_PLM_ERP_15March2012.pdf
- genetic analysis and lactose hydrogen.pdf
- Genetic programming for knowledge discovery in chest pain diagnosis.pdf
- Genetic variation within a dominant shrub species determines plant.pdf
- Genome Relationships The Grass Model in Current Research.pdf
- Genomics of Macadamia, a.pdf
- Genomic analysis of increased host immune and cell death.pdf
- Fully dynamic transitive closure in plane dags with one source and one sink.pdf
- Genotoxic and Carcinogenic Impurities in Drug Substances and Products Recommended Approaches.pdf
- Geometric height inequality on varieties with ample cotangent bundles.pdf
最近下载
- 2024年广东省初中学业水平考试模拟地理试卷(一)课件.pptx VIP
- 关于烹饪的策划书3.pptx
- 广州市人民南历史文化街区保护利用规划(文本+图纸).pdf VIP
- WALL·E《机器人总动员(2008)》完整中英文对照剧本.pdf VIP
- LDT 99.13-2008 建设工程劳动定额市政工程-维修养护工程.docx
- 实验报告之spss频数分析.docx VIP
- 新教科版科学小学科学五年级下册全册教案(表格式,可打印).docx
- 2022年新改版教科版五年级上册科学全册教案教学设计(新整理版).doc
- 某小区高楼变频恒压供水系统设计.docx
- 教育智能化AI技术在教学中的应用与影响培训课件.pptx
文档评论(0)