同济大学计算机前沿技术概论 第3章_信息检索和语义Web.pptVIP

  • 10
  • 0
  • 约1.04万字
  • 约 52页
  • 2016-12-09 发布于江西
  • 举报

同济大学计算机前沿技术概论 第3章_信息检索和语义Web.ppt

Identifying mentions of entities (e.g., person names, locations, companies) in text MUC (1997): Person, Location, Organization, Date/Time/Currency ACE (2005): more than 100 more specific types Hand-coded vs. Machine Learning approaches Best approach depends on entity type and domain: Closed class (e.g., geographical locations, disease names, gene protein names): hand coded + dictionaries Syntactic (e.g., phone numbers, zip codes): regular expressions Semantic (e.g., person and company names): mixture of context, syntactic features, dictionaries, heuristics, etc. “Almost solved” for common/

文档评论(0)

1亿VIP精品文档

相关文档