Text Mining Tools and Applications - Institute of Computer Science and Technology, Peking University

  • Published 2018-01-26 (Tianjin)

Text Mining Technology (Spring 2013)
Chapter 15: Text Mining Tools and Applications
杨建武 (Yang Jianwu)
Institute of Computer Science and Technology, Peking University
Email: yangjw@

Gartner View of Unstructured Data Management

Text Mining by Task
  • Information retrieval
  • Text categorization
  • Document clustering
  • Information filtering / topic detection
  • Text summarization
  • Question and answer
  • Taxonomy/concept/relationship mining
  • Visualization and user interface

Text Mining by Industry
  • Biotechnology
  • Consumer products
  • CRM, Consulting, Marketing
  • Education
  • Government
  • Healthcare
  • Insurance
  • Other Industry

Applications in Traditional Business

Discovering Unexpected Information From a Competitor
  • Assume your boss asks you to find out what new information your competitor provides
      • E.g., to learn from the competitor
      • E.g., to design countermeasures (对策)
  • Text mining techniques that may be useful
      • novelty detection, text classification, information extraction
  • Major problems:
      • How to model what you already know?
          » Incorporating user's existing knowledge
      • What unexpected information about competitors to find?
      • Algorithms
      • System architecture

Find Unexpected Information About Competitors
  • What is unexpected information?
      • Is relevant to the user
      • Is unknown to the user, or contradicts the user's existing beliefs or expectations
  • Examples
    Unexpe
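
The definition of unexpected information given in the slides (relevant to the user, yet unknown to the user) can be illustrated with a small scoring sketch. This is not the lecture's algorithm; it is a minimal illustration under stated assumptions: relevance is approximated by cosine similarity to a hypothetical user-interest string, and "unknown" is approximated by low similarity to documents the user has already seen. The function names `bag_of_words`, `cosine`, and `unexpectedness` are made up for this example, and the "contradicts existing beliefs" case is not modeled.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count its alphabetic tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def unexpectedness(candidate, user_interest, known_docs):
    """Score a candidate text: high when relevant to the user AND unlike known documents.

    Assumption (not from the slides): score = relevance * (1 - familiarity).
    """
    cand = bag_of_words(candidate)
    # Relevance: similarity between the candidate and the user's stated interests.
    relevance = cosine(cand, bag_of_words(user_interest))
    # Familiarity: similarity to the closest document the user already knows.
    familiarity = max(
        (cosine(cand, bag_of_words(doc)) for doc in known_docs), default=0.0
    )
    # Clamp to guard against tiny negative values from floating-point rounding.
    return relevance * max(0.0, 1.0 - familiarity)


# Hypothetical data, for illustration only.
known = ["our phone has a long battery life"]
interest = "phone battery camera screen"

score_known = unexpectedness("our phone has a long battery life", interest, known)
score_new = unexpectedness("the competitor phone adds a waterproof camera", interest, known)
score_irrelevant = unexpectedness("stock markets fell sharply today", interest, known)
```

Under these assumptions, a competitor page that repeats what the user already knows scores near zero, an off-topic page scores zero, and a relevant page with new content scores highest. A real system would likely replace raw term counts with a weighted representation and would need a separate mechanism to detect contradictions.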
