个性化服务中用户兴趣模型的研究与设计-计算机应用技术专业论文.docx

下载文档 降价啦

2
0
约6.81万字
约 76页
2018-12-05 发布于上海
举报
版权申诉
保障服务

个性化服务中用户兴趣模型的研究与设计-计算机应用技术专业论文.docx

1、本文档共76页，可阅读全部内容。
2、原创力文档（book118）网站文档一经付费（服务费），不意味着购买了该文档的版权，仅供个人/单位学习、研究之用，不得用于商业用途，未经授权，严禁复制、发行、汇编、翻译或者网络传播等，侵权必究。
3、本站所有内容均由合作方或网友上传，本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺！文档内容仅供研究参考，付费前请自行鉴别。如您付费，意味着您自己接受本站规则且自行承担风险，本站不退款、不进行额外附加服务；查看《如何避免下载的几个坑》。如果您已付费下载过本站文档，您可以点击这里二次下载。
4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等，请点击“版权申诉”（推荐），也可以打举报电话：400-050-0827(电话支持时间：9:00-18:30)。

个性化服务中用户兴趣模型的研究与设计-计算机应用技术专业论文

摘摘要 I万方数据 I 万方数据摘要随着网络信息的高速增长，为了解决信息过载和信息迷航所带来的种种问题，个性化服务已经成为信息领域研究的热点之一。个性化服务针对不同的用户采取不同的服务策略，提供不同的服务内容，用户兴趣建模是其关键技术之一。用户兴趣模型能否准确地反映用户的兴趣决定了系统提供个性化服务的质量。本文针对用户兴趣建模进行了以下几方面的研究：首先进行数据的采集。系统隐式地收集用户浏览页面和浏览行为作为用户兴趣建模的主要数据来源，在对页面进行预处理，抽取页面内容后，采用本文提出的适用于中文文本聚类的单文档特征提取方法——基于综合指标的特征提取方案来提取页面的特征向量。其次，本文讨论了用户兴趣聚类的特殊性，指出了经典聚类方法应用于用户兴趣聚类时的不足，在基于图论的 NEOREN 算法基础上进行实验改进，提出了基于相似度阈值的聚类算法，实验证明，该算法能够显著提高聚类质量，有效区分孤立点，适用于用户兴趣聚类。接着，本文采用细兴趣粒度与向量空间模型相结合的表示方法，并在此基础上进行扩展，给出了用户兴趣模型的形式化表示。在用户兴趣聚类分析的基础上创建用户兴趣模型；结合活跃度、关注度、遗忘因子对模型进行更新，生成长、短期兴趣；并给出了该模型应用于个性化服务时的推荐算法。最后进行全面的模拟实验，通过实验分析表明，本文提出的用户兴趣模型能够比较全面的描述用户兴趣，准确地跟踪用户兴趣变化，具有良好的效率。关键词：个性化服务，用户兴趣模型，特征提取，文本聚类，向量空间模型 AB ABSTRACT II万方数据 II 万方数据 Abstract With the explosive growth of information available on the Internet, personalized service has become to be a focus research in the domain of information service to deal with the problem of information overloading and information amazing. Personalized service is to give different service-strategy and different service-content to different user. How to construct user profile is one of its core technologies. Therefore, the quality of personalized information service provided by the system is determined by the fact whether or not the user profiles reflect user interests exactly. This paper plans to make a study from the following aspects about user interest modeling: First, the system collects the user’s browsing content and behavior as the main initial data in implicit way. The content, obtained by cleaning the web page, is been expressed by Vector Space Model. And a new term selection and weighting approach to Chinese text clustering is presented in this paper. Second, the particularity of the user interests clustering is discussesed, and the disadvantage of traditional clustering methods is pointed out. With the improvement of the NEOREN clustering method based on the graph theory, the clustering method based on the similarity is proposed. The experiment results that this method