- 7
- 0
- 约3.1万字
- 约 12页
- 2017-03-13 发布于湖北
- 举报
FocusedSearchontheWebusing
Focused Search on the Web using WeQueL
Amar-Djalil MEZAOUR Laboratoire de Recherche en Informatique (LRI), France
Email: mezaour@lri.fr
Abstract
Keyword-based web query languages suffer from a lack of precision when searching for a precise kind of documents. Indeed, some documents cannot be simply characterized by a list of keywords. For example, searching for on-line pictures dealing with formula one using only simple keywords with general-purpose search-engines gives imprecise answers. This imprecision is due to the method that considers that a relevant document to a query is one that contains a lot of query keywords occurrences. This method is totally unef?cient for poor textual-content documents like pictures, video streams . On the other hand, keyword based languages are often not powerful enough for expressing sophisticated document search like on-line java or c++ tutorials.
We propose ” WeQueL ” a multi-criteria query langage for a better characterization of documents. The aim is to increase the precision of document retrieval on the web. In our experiments, we show the gain in accuracy for web document searching using our language.
1 Introduction
Nowadays, the web represents an important heterogeneous data source (news, articles, pictures, streams ). The information is stored in entities called documents. These documents are identi?ed in a unique way by their urls and are linked together by hyperlinks. Searching for an information in the web consists of ?nding the urls of documents containing this information. General-purpose search engines have been developed to offer simple and powerful tools for users in order to search web documents. A general-purpose search engine, like Google [7], can be divided into three general components : a web crawler, a local web-index repository and a user query language. The web crawler is a program that visits the most possible web documents in the web in order to download them to the local web-index repository. In this local
原创力文档

文档评论(0)