【精品】Natural Language Processing .pptVIP

  • 47
  • 0
  • 约4.45万字
  • 约 70页
  • 2015-08-01 发布于河南
  • 举报
【精品】Natural Language Processing .ppt

Collins’s Parser # of Distinct CFG Rules in Penn Treebank: 14,000 in 50,000 sentences Michael Collins (now at MIT) 1998 UPenn PhD Thesis Generative model of tree probabilities: P(Tree) Parses WSJ with ~90% constituent precision/recall Best performance for single parser Not a full who-did-what-to-whom problem, though Dependencies 50%-95% accurate depending on type) Similar to GPSG + Categirla Grammar (aka HPSG) model Subcat frames: adjuncts / complements distinguished Generalized Coordination Unbounded Dependencies via slash percolation Punctuation model Distance metric codes word order (cano

文档评论(0)

1亿VIP精品文档

相关文档