e-ISSN : 0975-3397
Print ISSN : 2229-5631
Home | About Us | Contact Us

ARTICLES IN PRESS

Articles in Press

ISSUES

Current Issue
Archives

CALL FOR PAPERS

CFP 2021

TOPICS

IJCSE Topics

EDITORIAL BOARD

Editors

Indexed in

oa
 

ABSTRACT

Title : HIDDEN WEB EXTRACTOR DYNAMIC WAY TO UNCOVER THE DEEP WEB
Authors : DR. ANURADHA, BABITA AHUJA
Keywords : WWW; Hidden Web; Surface Web; Query Interface; Crawler; Semantic Web; XML;RDF.
Issue Date : June 2012.
Abstract :
In this era of digital tsunami of information on the web, everyone is completely dependent on the WWW for information retrieval. This has posed a challenging problem in extracting relevant data. Traditional web crawlers focus only on the surface web while the deep web keeps expanding behind the scene. The web databases are hidden behind the query interfaces. In this paper, we propose a Hidden Web Extractor (HWE) that can automatically discover and download data from the Hidden Web databases. Since the only “entry point” to a Hidden Web site is a query interface, the main challenge that a Hidden Web Extractor has to face is how to automatically generate meaningful queries for the unlimited number of website pages.
Page(s) : 1137-1145
ISSN : 0975–3397
Source : Vol. 4, Issue.06

All Rights Reserved © 2009-2024 Engg Journals Publications
Page copy protected against web site content infringement by CopyscapeCreative Commons License