z-logo
open-access-imgOpen Access
Focused crawling from the basic approach to context aware notification architecture
Author(s) -
Venugopal Boppana,
P Sandhya
Publication year - 2019
Publication title -
indonesian journal of electrical engineering and computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.241
H-Index - 17
eISSN - 2502-4760
pISSN - 2502-4752
DOI - 10.11591/ijeecs.v13.i2.pp492-498
Subject(s) - web crawler , crawling , focused crawler , world wide web , computer science , context (archaeology) , architecture , the internet , information retrieval , web server , static web page , medicine , art , paleontology , biology , visual arts , anatomy
The large and wide range of information has become a tough time for crawlers and search engines to extract related information. This paper discusses about focused crawlers also called as topic specific crawler and variations of focused crawlers leading to distributed architecture, i.e., context aware notification architecture. To get the relevant pages from a huge amount of information available in the internet we use the focused crawler. This can bring out the relevant pages for the given topic with less number of searches in a short time. Here the input to the focused crawler is a topic specified using exemplary documents, but not using the keywords. Focused crawlers avoid the searching of all the web documents instead it searches over the links that are relevant to the crawler boundary. The Focused crawling mechanism helps us to save CPU time to large extent to keep the crawl up-to-date.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here