Web搜索引擎及算法.pptVIP

  • 6
  • 0
  • 约1.69万字
  • 约 60页
  • 2017-03-03 发布于江苏
  • 举报
* The benefit for customers in using a smaller index is savings. It costs them more to query against the biggest collection of documents * PageRank Explained PageRank relies on the uniquely democratic nature of the web by using its vast link structure as an indicator of an individual pages value. In essence, Google interprets a link from page A to page B as a vote, by page A, for page B. But, Google looks at more than the sheer volume of votes, or links a page receives; it also analyzes the page that casts the vote. Votes cast by pages that are themselves important weigh more heavily and help to make other pages important. Important, high-quality sites receive a higher PageRank, which Google remembers each time it conducts a search. Of course, important pages mean nothing to you if they dont match your query. So, Google combines PageRank with sophisticated text-matching techniques to find pages that are both important and relevant to your search. Google goes far beyond the number of times a term appears on a page and examines all aspects of the pages content (and the content of the pages linking to it) to determine if its a good match for your query. * /~page/papers/pagerank/ppframe.htm * pages from your site appear in about 4 to 6 weeks. Google has fairly large index, so it should gather a significant number of your pages. * Authorities -- should have relevant content Hubs: should point to similar content * ... help to organize information on the Web, however informally and inadvertently. Authorities ( blue ) are sites that other Web pages happen to link to frequently on a particular topic. For the subject of human rights, for instance, the home page of Amnesty International might be one such location. Hubs ( red ) are sites that tend to cite many of those authorities, perhaps in a resource list or in a My Favorite Links section on a personal home page. * Jon M. Kleinbergs Authoritative Sources in a Hyperlinked Environment in Proceeding

文档评论(0)

1亿VIP精品文档

相关文档