Google工程师在VLDB2008所做演讲的ppt.pptVIP

  • 2
  • 0
  • 约1.99万字
  • 约 37页
  • 2016-12-10 发布于北京
  • 举报
Google工程师在VLDB2008所做演讲的ppt

Googles Deep-Web Crawl (VLDB 2008) Google’s Deep-Web Crawl Jayant Madhavan Google Inc. David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, and Alon Halevy What is the Deep Web? Content hidden behind HTML forms Why is it important? Large source of structured data Forms present a search interface over backend databases Significant gap in search engine coverage Potentially more content that currently searchable web [Bergman+, Madhavan+, He+] More than 10 million distinct HTML forms Likely to increase and more data comes online Challenge: make the Deep Web accessible to web search What is

文档评论(0)

1亿VIP精品文档

相关文档