An Algorithm for Multi-Domain Website Classification

The web is the largest world-wide communication system of computers. The web has local, academic, commercial and government sites. As the types of websites increases in numbers, the cost and accuracy of manual classification became cumbersome and cannot satisfy the increasing internet service demand...

Full description

Saved in:

Bibliographic Details
Published in	International journal of web-based learning and teaching technologies Vol. 15; no. 4; pp. 57 - 65
Main Authors	Ullah, Mohammad Aman, Tahrin, Anika, Marjan, Sumaiya
Format	Journal Article
Language	English
Published	Hershey IGI Global 01.10.2020
Subjects	Accuracy Algorithms Artificial Intelligence Automatic classification Automation Classification Communications systems Computer mediated communication Computers Copyrights Cybercrime Data collection Decision trees Dictionaries Education Experiments Influence of Technology Internet Internet/Web search services Machine learning Mathematics Open access publishing Open Source Technology Periodicals Search engines Service enhancement Social Media Social networks Social Problems Software Support vector machines Web Sites Websites World Problems Algeria
Online Access	Get full text
ISSN	1548-1093 1548-1107 1548-1107
DOI	10.4018/IJWLTT.2020100104

Cover

More Information
Summary:	The web is the largest world-wide communication system of computers. The web has local, academic, commercial and government sites. As the types of websites increases in numbers, the cost and accuracy of manual classification became cumbersome and cannot satisfy the increasing internet service demands, thereby automated classification became important for better and more accurate search engine results. Therefore, this research has proposed an algorithm for classifying different websites automatically by using randomly collected textual data from the webpages. This research also contributed ten dictionaries covering different domains and used as training data in the classification process. Finally, the classification was carried out using the proposed and Naïve Bayes algorithms and found the proposed algorithm outperformed on the scale of accuracy by 1.25%. This research suggests that the proposed algorithm could be applied to any number of domains if the related dictionaries are available.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1548-1093 1548-1107 1548-1107
DOI:	10.4018/IJWLTT.2020100104