- 10
- 0
- 约1.12万字
- 约 6页
- 2017-09-16 发布于内蒙古
- 举报
基于文本分类中特征提取的领域词语聚类
刘华
[摘要]本文以领域特征明显的词和短语作为聚类对象,在分类系统的大规模语料库中,利用文
本分类的特征提取方法进行词语的领域聚类,从而获得大规模的领域知识,用于文本分类和主
题分析。
[关键词]特征提取 领域词语 聚类
Clustering Field Words by Character Extraction
in Text Classification
Abstract: Towards building a large-scale domanial repository for text categorization and topic
analysis, this paper presents an algorithm that clusters field Words in classed large-scale corpus by
character extraction in text categorization.
Keywords: Character Extraction, domanial word
原创力文档

文档评论(0)