面向产品评论挖掘的特征粒度树研究-计算机软件与理论专业毕业论文.docxVIP

  • 1
  • 0
  • 约3.21万字
  • 约 43页
  • 2019-05-08 发布于上海
  • 举报

面向产品评论挖掘的特征粒度树研究-计算机软件与理论专业毕业论文.docx

摘 要 摘 要 产品评论挖掘就是从用户发表的评论中挖掘出产品特征、用户观点,并判断观点极 性,为生产、营销商家和潜在的用户提供参考。通过对提取出的产品特征进行分析,发 现用户对产品特征粒度的关注是不同的,本文针对这一问题进行了研究,主要工作如下: 利用基于索引的标签路径的方法找到数据区路径,抽取产品说明书和原始评论。定 义标注细则,完成原始评论的人工标注,为后续研究准备基础数据。 给出了基于特征粒度树获得产品特征粒度关系的方法。由于单个说明文档中的特征 分类效果不好,本文利用改进的相似度公式判断来自不同说明文档特征记录的相似性, 相似度公式的改进使得特征记录相似性判断的准确性有了较大提高;基于相似特征记录 将特征组进行重组,根据新的特征组集合建立特征粒度树;由于特征记录来源于同一型 号的产品,其特征覆盖不完全,本文抽取了多种类型产品的说明文档,用于完善粒度树, 增加特征粒度树的广泛适用性;根据相似度计算和《同义词词林》判断从产品评论中抽 取的特征与特征粒度树中结点的相似性,将产品特征在特征粒度树中进行定位,从而获 得产品特征之间的粒度关系。 实验结果表明本文改进的相似度公式提高了相似判断的准确性,也验证了基于特征 粒度树获得产品特征粒度关系方法的有效性和本文建立的特征粒度树的实用性。 关键词 评论挖掘 特征粒度 特征粒度树 特征抽取 相似度计算 I Abstract Abstract With the explosive growth of the network information, how to find useful information from it comes to a hot research focus. Mining product reviews is to extract the product features, users’ attitudes and judge the emotional polarity, in order to offer reference information for potential users and merchants. However, after analyzing the extracted product features, we find that the granularities of product features which users concern are different. So the paper studies this problem, and the main work as follows: Using the method of label path basin on index, this study finds the path of data area, and extracts the product manual as well as original product reviews. Then define the label rules and mark reviews artificially, preparing the adequate data for follow-up. This paper proposes a method about how to get the granularity distribution of features based on feature-granularity tree. Firstly, because the category of the feature-groups from single specification file is indistinct, we judge the similarity of feature-records from from different specification files by using an improved formula of similarity calculation which improves the precision of judging the similarity of feature-records. Secondly, restructure the feature-groups based on similar feature-records. After these, a feature-granularity tree is built according to the new feature-groups. Secondly, restructure the fe

您可能关注的文档

文档评论(0)

1亿VIP精品文档

相关文档