- 1、本文档共5页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
A Systematic Cross-Comparison of Sequence Classifiers
A Systematic Cross-Comparison of Sequence Classifiers
Binyamin Rozenfeld, Ronen Feldman, Moshe Fresko
Bar-Ilan University, Computer Science Department, Israel
grurgrur@, feldman@cs.biu.ac.il, freskom1@cs.biu.ac.il,
Abstract
In the CoNLL 2003 NER shared task, more
than two thirds of the submitted systems used
a feature-rich representation of the task. Most
of them used the maximum entropy principle
to combine the features together. Others used
large margin linear classifiers, such as SVM
and RRM. In this paper, we compare several
common classifiers under exactly the same
conditions, demonstrating that the ranking of
systems in the shared task is due to feature se-
lection and other causes and not due to inher-
ent qualities of the algorithms, which should
be ranked otherwise. We demonstrate that
whole-sequence models generally outperform
local models, and that large margin classifiers
generally outperform maximum entropy-based
classifiers.
1 Introduction
Recently, feature-rich classifiers became state-of-the-art
in sequence labeling tasks, such as NP chunking, PoS
tagging, and Named Entity Recognition. Such classifiers
are able to use any property of tokens and their contexts,
if the property can be represented in the form of real-
valued (usually binary) feature functions. Since almost
all local properties can be represented in such a way,
this ability is very powerful.
Maximum-entropy-based models are currently the
most prevalent type of feature-rich classifiers in se-
quence labeling tasks. Such models define a probability
distribution over the set of labelings of a sentence, given
the sentence. In this, such classifiers differ from the
generative probabilistic classifiers, such as HMM-based
Nymble (Bikel et al., 1999) and SCFG-based TEG
(Rosenfeld et al., 2004), which model the joint probabil-
ity of sequences and their labelings, and which can use
only a very limited range of context features.
An alternative feat
您可能关注的文档
- 3.3高分子的“理想溶液”--θ状态.pdf
- 3.4Flory稀溶液理论.pdf
- 3.5 高分子溶液的相平衡和相分离.pdf
- 3.BETAFENCE 机场安全围界1.pdf
- 31Open-Economy Macroeconomics_Basic Concepts.pdf
- 3200nt timer 中文说明书.pdf
- 30天冲刺雅思大作文高分计划之DAY 18 保护老建筑-智课教育出国考试.pdf
- 365句ENGLISH.pdf
- 33-lesson 35.ppt
- 3300 ELSD ChemStationB.03DriverV04108 Instructions.pdf
- 人教版英语5年级下册全册教学课件.pptx
- 部编人教版2年级上册语文全册教学课件含单元及专项复习.pptx
- 人教版8年级上册英语全册教学课件(2021年8月修订).pptx
- 教科版(2017版)6年级上册科学全册课件+课时练.pptx
- 人教版PEP版6年级英语下册全册教学课件(2022年12月修订).pptx
- 部编人教版2年级下册语文全册课件(2021年春修订).pptx
- 人教版数学6年级下册全册教学课件(2023年教材).pptx
- 湘少版5年级下册英语全册教学课件(2021年春修订).pptx
- 人教PEP4年级下册英语全册教学课件 [2}.pptx
- 人教版6年级上册英语全册教学课件.pptx
文档评论(0)