Review of web crawlers
Author(s) - S.R. Sreeja, Sangita Chaudhari
Publication year - 2014
Publication title - International Journal of Knowledge and Web Intelligence
Language(s) - English
Resource type - Journals
eISSN - 1755-8263
pISSN - 1755-8255
DOI - 10.1504/ijkwi.2014.065035
Subject(s) - computer science, world wide web, web crawler, information retrieval
The web is a repository of a large amount of data. Information available on the web is organised in the form of pages. Because the amount of information is effectively unlimited, searching for and finding appropriate information on the web is a task that needs expertise. Web crawlers are programmes that assist search engines by automating the task of visiting web pages and downloading their contents. They also help in ranking the downloaded web pages. Thus, search engines can produce a list of web pages ordered by relevance and display this list as the result of a search. Crawling also helps to validate web pages, analyse them, send notifications about page updates, and visualise web pages; it is sometimes also used to collect e-mail addresses for spam purposes. Crawlers can be of different types, each using different strategies and techniques to crawl web pages. This paper presents a review of various types of web crawlers.
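The core loop described above (visit a page, download its contents, follow its links) can be illustrated with a minimal breadth-first sketch. This is not taken from the paper; it is one simple strategy among the many the review covers. The names `crawl`, `LinkExtractor`, `fetch`, and `FAKE_WEB` are hypothetical, and the `fetch` callable is an assumed interface that returns a page's HTML or `None`. An in-memory dictionary stands in for the web so the example runs without network access.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkExtractor(HTMLParser):
    """Collects the href targets of <a> tags in one HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed, fetch, max_pages=10):
    """Breadth-first crawl from `seed`; returns {url: html} for visited pages.

    `fetch` is a caller-supplied function (an assumption of this sketch)
    that maps a URL to its HTML text, or None if it cannot be downloaded.
    """
    frontier = deque([seed])   # URLs waiting to be visited, in FIFO order
    seen = {seed}              # URLs already queued, to avoid revisiting
    pages = {}
    while frontier and len(pages) < max_pages:
        url = frontier.popleft()
        html = fetch(url)
        if html is None:
            continue           # skip pages that could not be downloaded
        pages[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop any #fragment part.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if absolute not in seen:
                seen.add(absolute)
                frontier.append(absolute)
    return pages


# Hypothetical in-memory "web" used instead of real HTTP requests.
FAKE_WEB = {
    "http://example.test/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://example.test/a": '<a href="/b">B</a> <a href="/">home</a>',
    "http://example.test/b": '<a href="/missing">gone</a>',
}
visited = crawl("http://example.test/", FAKE_WEB.get)
print(list(visited))
```

A real crawler would replace `FAKE_WEB.get` with an HTTP client and would typically add politeness delays and robots.txt checks, which this sketch omits.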