HypertextInterfacesforChinese.docVIP

  • 2
  • 0
  • 约4.86万字
  • 约 20页
  • 2017-05-04 发布于天津
  • 举报
HypertextInterfacesforChinese.doc

Chinese Word Segmentation Using Minimal Linguistic Knowledge Aitao Chen School of Information Management and Systems University of California at Berkeley Berkeley, CA 94720-4600, USA aitao@ Abstract This paper presents a primarily data-driven Chinese word segmentation system and its performances on the closed track using two corpora at the first International Chinese word segmentation bakeoff. The system consists of a new words recognizer, a baseline segmentation algorithm based on a unigram language model, and procedures for combining single characters and checking segmentation consistencies.

文档评论(0)

1亿VIP精品文档

相关文档