A Novel Framework for Context Based Distributed Focused Crawler (CBDFC)
Author(s) -
Pooja Gupta,
Ashok Kumar Sharma,
J. P. Gupta,
Komal Kumar Bhatia
Publication year - 2010
Publication title -
international journal of computer and communication technology
Language(s) - English
Resource type - Journals
eISSN - 2231-0371
pISSN - 0975-7449
DOI - 10.47893/ijcct.2010.1003
Subject(s) - web crawler , crawling , computer science , focused crawler , context (archaeology) , world wide web , information retrieval , index (typography) , web page , web navigation , static web page , medicine , paleontology , biology , anatomy
Focused crawling aims to search only the relevant subset of the WWW for a specific topic of user interest; leading to the necessity to decide about the relevancy of a document to the topic of interest; especially when the user is not perfect in specifying the exact context of the topic. This paper provides a novel framework of a context based distributed focused crawler that maintains an index of web documents pertaining to the context of keywords resulting in storage of more related documents.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom