- About 50,100 characters
- About 61 pages
- Published 2019-01-09 in Shanghai
Research on the Crawler of an Agent-Based Topic-Specific Search Engine - Master's Thesis in Computer Application Technology
Jiangsu University Master's Thesis
ABSTRACT
With the extensive application of WWW technology, traditional search engines face enormous challenges: low recall, low retrieval accuracy, and updates that are not timely. They cannot express users' needs well, and they return search results containing a substantial amount of information unrelated to the topic. At the same time, as the number of users from different fields grows, professional search engines that provide much more efficient retrieval are needed in users' specific industries.
By taking advantage of site-specific acquisition, topic-specific acquisition, and web structure mining, a topic-specific search engine can improve recall and retrieval accuracy, better guarantee timeliness and professionalism, and provide better personalized services. In that way, it can be highly effective in identifying information in specific areas and providing a unique retrieval service. The design of the web crawler is therefore the core of the search engine. This paper is mainly about crawler research for an Agent-based Topic-Specific Search Engine and the related key technologies. The major work of this paper is:
1. Based on an analysis of search engine technology, agent adaptive technology, and machine learning, a crawler framework for Agent-based Topic-Specific Search is presented.
2. A Chinese segmentation algorithm combining a word library with statistics is presented. Using an improved VSM (vector space model) and improved methods of web mining and content mining, an automatic text classification algorithm is designed based on the support vector machine.
3. A search strategy based on the Q-learning algorithm is presented. The algorithm combines web page evaluation technology with web link structure technology; with the help of an adaptive agent, it can reduce the greed of the search to some extent and has a more effective manner to choose web
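
The abstract does not give the thesis's actual update rule, reward definition, or parameters, so the following is only a minimal sketch of standard tabular Q-learning applied to link selection. It is not the author's algorithm. The page and link names, the reward values, and the constants `ALPHA`, `GAMMA`, and `EPSILON` are all assumptions made for illustration. The epsilon-greedy choice shows one common way to make a search less greedy: most of the time the best-valued link is followed, but occasionally a random link is explored.

```python
import random
from collections import defaultdict

ALPHA = 0.5    # learning rate (assumed value)
GAMMA = 0.8    # discount factor (assumed value)
EPSILON = 0.1  # exploration probability (assumed value)


def q_update(q, state, action, reward, next_state, next_actions):
    """Apply one tabular Q-learning update and return the new Q-value."""
    # Value of the best link available on the next page (0.0 if it has none).
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    q[(state, action)] += ALPHA * (reward + GAMMA * best_next - q[(state, action)])
    return q[(state, action)]


def choose_link(q, state, links, rng=random):
    """Epsilon-greedy choice: usually the best-valued link, sometimes a random one."""
    if rng.random() < EPSILON:
        return rng.choice(links)
    return max(links, key=lambda link: q[(state, link)])


q = defaultdict(float)
# Hypothetical pages and links; a reward of 1.0 stands for an on-topic page.
q_update(q, "page_a", "link_1", 1.0, "page_b", ["link_3"])
q_update(q, "page_a", "link_2", 0.0, "page_c", [])
```

In a real crawler the reward would presumably come from the page evaluation step (for example, a topic relevance score), but how the thesis defines it is not stated in this abstract.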