Machine Learning Techniques for Automated Web Page Classification Using URL Features

Explosive growth of the Internet makes it difficult for search engines to give relevant results to the users within a stipulated time. Search engines store the Web pages in classified directories and for this process even though some search engines depend on human expertise; most of the search engin...

Full description

Saved in:
Bibliographic Details
Published inComputational Intelligence and Multimedia Applications; Proceedings: 2007: Tamil Nadu, India. Vol. 4 Vol. 2; pp. 116 - 120
Main Authors Devi, M.I., Rajaram, D.R., Selvakuberan, K.
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2007
Subjects
Online AccessGet full text
ISBN0769530508
9780769530505
DOI10.1109/ICCIMA.2007.342

Cover

More Information
Summary:Explosive growth of the Internet makes it difficult for search engines to give relevant results to the users within a stipulated time. Search engines store the Web pages in classified directories and for this process even though some search engines depend on human expertise; most of the search engines use automated methods for classification of web pages. In this paper we use machine-learning techniques for the automated classification of Web pages. We consider only URL features for classification as the URL name is unique, meaningful and helps identification of their subject categories most of the times. Experimental results show that machine learning techniques for automated classification of Web pages with URL features proves to be the best and more useful method for search engines.
ISBN:0769530508
9780769530505
DOI:10.1109/ICCIMA.2007.342