Computer Engineering and Applications (计算机工程与应用), 2013, 49(9), p. 144
A Query Recommendation Algorithm for Professional Search Engines
WANG Guihua (王桂华), QIN Xiangqing (秦湘清), CHEN Li (陈黎), WANG Yaqiang (王亚强), YU Zhonghua (于中华)
College of Computer Science, Sichuan University, Chengdu 610065, China
WANG Guihua, QIN Xiangqing, CHEN Li, et al. Query recommendation algorithm for professional search engines.
Computer Engineering and Applications, 2013, 49(9): 144-149.
Abstract: In light of the differences between professional and general-purpose search engines, a novel Query Recommendation
algorithm based on word Co-occurrence and the HITS algorithm (QR-CH) is proposed for professional search engines. To improve
the relevance between the recommended words and the initial query, QR-CH uses the HITS algorithm to rank the candidate words
filtered by word co-occurrence, and then selects the highest-ranked candidates as recommended words. The algorithm also
reduces redundancy effectively: whether a recommended word is redundant is judged from its query results, which are likewise
ranked by the HITS algorithm. QR-CH stores the recommendation information in a domain knowledge tree, which is then used for
query recommendation. Experimental results show that QR-CH outperforms existing similar algorithms in both relevance and
redundancy.
Key words: query recommendation; word co-occurrence; Hypertext Induced Topic Search (HITS) algorithm; professional
search engine
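Abstract (translated from the Chinese): Based on the characteristics of professional search engines, a novel query
recommendation algorithm built on word co-occurrence and the HITS algorithm, QR-CH (Query Recommendation algorithm based on
word Co-occurrence and HITS algorithm), is proposed. On the one hand, the algorithm uses HITS to rank the associated words
selected by word co-occurrence according to their semantic relatedness and takes the top-ranked associated words as
recommended words, which improves the relevance between the recommended words and the original query. On the other hand, it
uses HITS to rank the associated documents and judges whether a recommendation is redundant from the perspective of the
query result document set, which reduces the redundancy of the recommended words. The algorithm stores the
recommendation-related information in a knowledge tree and uses that tree to carry out query recommendation. Experimental
results show that QR-CH outperforms similar algorithms in the literature in both the relevance of the recommended words and
the identification of redundant words.
Key words (translated from the Chinese): query recommendation; word co-occurrence; Hypertext Induced Topic Search (HITS)
algorithm; professional search engine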
Document code: A    CLC number: TP391.1    doi: 10.377
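
The relevance step described in the abstract (filter candidates by word co-occurrence, then rank them with HITS) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how QR-CH builds its graph, so the sketch assumes a simple bipartite graph in which each document containing the query acts as a hub pointing to the candidate words it contains. The function names `cooccurring_terms`, `hits`, and `recommend`, the `min_count` threshold, and the input layout (document id mapped to a set of terms) are all hypothetical.

```python
from collections import defaultdict
from math import sqrt


def cooccurring_terms(docs, query, min_count=2):
    """Return terms that share at least `min_count` documents with `query`.

    `docs` maps a document id to the set of terms in that document
    (an assumed input format, not taken from the paper).
    """
    counts = defaultdict(int)
    for terms in docs.values():
        if query in terms:
            for term in terms:
                if term != query:
                    counts[term] += 1
    return {term for term, count in counts.items() if count >= min_count}


def hits(edges, iterations=50):
    """Standard HITS power iteration over directed edges (source, target).

    Returns two dicts: hub scores and authority scores, each L2-normalised.
    """
    nodes = {node for edge in edges for node in edge}
    hub = {node: 1.0 for node in nodes}
    auth = {node: 1.0 for node in nodes}
    for _ in range(iterations):
        # Authority update: sum of hub scores of nodes pointing to it.
        new_auth = {node: 0.0 for node in nodes}
        for src, dst in edges:
            new_auth[dst] += hub[src]
        norm = sqrt(sum(v * v for v in new_auth.values())) or 1.0
        auth = {node: v / norm for node, v in new_auth.items()}
        # Hub update: sum of authority scores of nodes it points to.
        new_hub = {node: 0.0 for node in nodes}
        for src, dst in edges:
            new_hub[src] += auth[dst]
        norm = sqrt(sum(v * v for v in new_hub.values())) or 1.0
        hub = {node: v / norm for node, v in new_hub.items()}
    return hub, auth


def recommend(docs, query, top_k=3, min_count=2):
    """Rank co-occurrence candidates by HITS authority and return the top_k."""
    candidates = cooccurring_terms(docs, query, min_count)
    # Assumed graph: documents containing the query link to their candidates.
    edges = [
        (("doc", doc_id), ("term", term))
        for doc_id, terms in docs.items()
        if query in terms
        for term in terms
        if term in candidates
    ]
    if not edges:
        return []
    _, auth = hits(edges)
    # Sort by descending authority; break ties alphabetically for stability.
    ranked = sorted(candidates, key=lambda t: (-auth[("term", t)], t))
    return ranked[:top_k]
```

Calling `recommend(docs, "hits", top_k=3)` on a small dictionary of term sets returns the candidate words ordered by authority score. The redundancy check and the knowledge tree mentioned in the abstract are not covered here, since the abstract gives too little detail to sketch them faithfully.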