- About 15,500 characters
- About 52 pages
- Published 2017-02-15 in Beijing
Web Crawling.ppt
Concurrent crawlers
- Can use multi-processing or multi-threading
- Each process or thread works like a sequential crawler, except that they share data structures: the frontier and the repository
- Shared data structures must be synchronized (locked for concurrent writes)
- Speedups of a factor of 5-10 are easy to achieve this way

Outline
- Motivation and taxonomy of crawlers
- Basic crawlers and implementation issues
- Universal crawlers
- Crawler ethics and conflicts

Universal crawlers
- Support universal search engines
- Large-scale
- Huge cost (ne
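The concurrent crawler design from the slides (several workers, each behaving like a sequential crawler but sharing a locked frontier and repository) can be illustrated with a minimal multi-threaded sketch. Everything named here is an assumption for illustration and not taken from the slides: `FAKE_WEB` is a hypothetical in-memory link graph standing in for real HTTP fetches, and `fetch_links` and `crawl` are hypothetical helpers.

```python
import threading
from collections import deque

# Hypothetical in-memory "web": URL -> outgoing links (replaces real network fetches).
FAKE_WEB = {
    "a": ["b", "c"],
    "b": ["c", "d"],
    "c": ["a"],
    "d": [],
}


def fetch_links(url):
    """Stand-in for downloading a page and extracting its links."""
    return FAKE_WEB.get(url, [])


def crawl(seeds, num_workers=4):
    """Crawl from the seeds with several threads sharing one frontier and repository."""
    unique_seeds = list(dict.fromkeys(seeds))
    frontier = deque(unique_seeds)    # shared frontier of URLs still to visit
    seen = set(unique_seeds)          # URLs already queued, to avoid duplicates
    repository = {}                   # shared repository: URL -> extracted links
    cond = threading.Condition()      # one lock guarding all shared structures
    active = 0                        # workers currently processing a page

    def worker():
        nonlocal active
        while True:
            with cond:
                # Wait while there is no work yet but another worker may add some.
                while not frontier and active > 0:
                    cond.wait()
                if not frontier:
                    # Frontier is empty and nobody is working: the crawl is done.
                    cond.notify_all()
                    return
                url = frontier.popleft()
                active += 1
            links = fetch_links(url)  # slow step runs outside the lock
            with cond:
                repository[url] = links
                for link in links:
                    if link not in seen:
                        seen.add(link)
                        frontier.append(link)
                active -= 1
                cond.notify_all()

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return repository


if __name__ == "__main__":
    print(sorted(crawl(["a"])))
```

Holding the lock only while touching the frontier and repository, and releasing it during the fetch, is what lets the workers overlap their slow I/O; a real crawler would replace `fetch_links` with network requests and add politeness delays.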