Open Access
Hybrid Algorithm for Improving Efficiency of Keywords based Search Engine
Author(s) - K. F. Bharati
Publication year - 2012
Publication title - International Journal of Computer Science and Informatics
Language(s) - English
Resource type - Journals
ISSN - 2231-5292
DOI - 10.47893/ijcsi.2012.1051
Subject(s) - computer science , crawling , hits algorithm , data mining , web page , web search engine , web crawler , information retrieval , search engine , web mining , set (abstract data type) , task (project management) , process (computing) , algorithm , world wide web , web search query , engineering , medicine , operating system , systems engineering , anatomy , programming language
Web mining is the application of data mining techniques to discover interesting patterns from the Web. Web usage mining is the process of extracting useful information from server logs, i.e., users' browsing history. Discovering interesting patterns across multiple agents, however, reduces efficiency. This paper presents a hybrid algorithm designed to improve the efficiency of keyword-based search engines. The model divides the mining task among several parallel agents that work together in a coordinated way, which greatly improves mining efficiency. The hybrid algorithm is derived from the HITS algorithm. It removes link-farm pages during expansion of the root set, computes anchor-text similarity when crawling linked pages, and selects pages through a brief conceptual analysis of page content. By overcoming the shortcomings of relying on text analysis or link analysis alone, the hybrid algorithm helps the search engine understand user interest and crawl more Web pages that meet users' needs.
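The combination of link analysis and anchor-text similarity described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not state which similarity measure or weighting scheme is used, so the Jaccard measure, the function names `jaccard` and `weighted_hits`, and the toy link data are all assumptions made for illustration. Link-farm removal and content analysis are not shown.

```python
import math


def jaccard(a, b):
    """Jaccard similarity of two whitespace-tokenised strings (assumed measure)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def weighted_hits(edges, iterations=50):
    """HITS hub/authority iteration on weighted edges {(src, dst): weight}."""
    nodes = sorted({n for edge in edges for n in edge})
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Authority score: weighted sum of hub scores of pages linking in.
        auth = {n: 0.0 for n in nodes}
        for (src, dst), w in edges.items():
            auth[dst] += w * hub[src]
        norm = math.sqrt(sum(v * v for v in auth.values())) or 1.0
        auth = {n: v / norm for n, v in auth.items()}
        # Hub score: weighted sum of authority scores of pages linked to.
        hub = {n: 0.0 for n in nodes}
        for (src, dst), w in edges.items():
            hub[src] += w * auth[dst]
        norm = math.sqrt(sum(v * v for v in hub.values())) or 1.0
        hub = {n: v / norm for n, v in hub.items()}
    return hub, auth


# Hypothetical toy data: (source page, target page, anchor text).
query = "python web crawler"
links = [
    ("a", "c", "python web crawler tutorial"),
    ("b", "c", "web crawler"),
    ("a", "d", "cheap watches"),
    ("b", "d", "click here"),
]

# Weight each link by how closely its anchor text matches the query.
edges = {(src, dst): jaccard(query, text) for src, dst, text in links}
hub, auth = weighted_hits(edges)
best_page = max(auth, key=auth.get)
print(best_page, auth)
```

In this toy graph, links whose anchor text shares no words with the query receive weight zero, so page `d` gains no authority even though it has as many inbound links as page `c`. This is one simple way that text analysis can temper pure link analysis.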