Significance of machine learning algorithms in professional blogger's classification

•Blogger's data is categorized into professional and seasonal bloggers using machine learning algorithms.•Decision tree algorithms, lazy learning methods and ensembling techniques are applied to standard datasets to analyze and compare the data classification algorithms on nominal data.•Random...

Full description

Saved in:
Bibliographic Details
Published inComputers & electrical engineering Vol. 65; pp. 461 - 473
Main Authors Asim, Yousra, Shahid, Ahmad Raza, Malik, Ahmad Kamran, Raza, Basit
Format Journal Article
LanguageEnglish
Published Amsterdam Elsevier Ltd 01.01.2018
Elsevier BV
Subjects
Online AccessGet full text
ISSN0045-7906
1879-0755
DOI10.1016/j.compeleceng.2017.08.001

Cover

More Information
Summary:•Blogger's data is categorized into professional and seasonal bloggers using machine learning algorithms.•Decision tree algorithms, lazy learning methods and ensembling techniques are applied to standard datasets to analyze and compare the data classification algorithms on nominal data.•Random Forest and Nearest-neighbor classifiers achieved high accuracy for data classification.•Importance of decision tree rules for factor identification behind the professionalism of bloggers is elaborated. Outreach of internet has opened new horizons for the people who want quick and widespread dissemination of their ideas, and the tool to do so is blogging. Bloggers can broadly be classified into two groups: professional and non-professional bloggers. As for professional bloggers, there are many factors that influence individuals to opt this profession. This study, with the help of an online dataset, attempts to identify such factors. Data analysis was made by using decision tree algorithms, lazy learning algorithms and ensembling methods. Nearest-neighbour classifier (IB1) and RandomForest have results with 85% accuracy and 84.8% precision for classification. The proof of concept is provided for result validation. The causes behind the varying performance of algorithms are elaborated. The factors that influence a blogger to behave professionally are identified based on the classifier with the best results. [Display omitted]
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0045-7906
1879-0755
DOI:10.1016/j.compeleceng.2017.08.001