Abstract |
: |
This paper presents the Metacrawler, a fielded Web service that represents the next level up in the information "food chain." The Metacrawler provides a single, central interface for Web document searching. Upon receiving a query, the Metacrawler posts the query to multiple search services in parallel, collates the returned references, and loads those references to verify their existence and to ensure that they contain relevant information. The Metacrawler is sufficiently lightweight to reside on a user's machine, which facilitates customization, privacy, sophisticated filtering of references, and more. Standard Web search services, though useful, are far from ideal. There are over a dozen of different search services currently in existence, each with a unique interface and a database covering a different portion of the Web. As a result, users are forced to repeatedly try and retry their queries across different services. Furthermore, the services return many responses that are irrelevant, outdated, or unavailable, forcing the user to manually sift through the responses searching for useful information.
|