A Review on Comparison of Machine Learning Algorithms for Text Classification

The majority of the data is preserved as text (about 75%), hence It is believed that text mining has a significant commercial potential. Unstructured texts continue to be the most readily available source of knowledge, despite the fact that knowledge may be accessed in many other places. Text classi...

Full description

Saved in:
Bibliographic Details
Published in2022 5th International Conference on Contemporary Computing and Informatics (IC3I) pp. 1818 - 1823
Main Authors Dhingra, Mallika, Dhabliya, Dharmesh, Dubey, M. K., Gupta, Ankur, Reddy, Dhoma Harshavardhan
Format Conference Proceeding
LanguageEnglish
Published IEEE 14.12.2022
Subjects
Online AccessGet full text
DOI10.1109/IC3I56241.2022.10072502

Cover

Abstract The majority of the data is preserved as text (about 75%), hence It is believed that text mining has a significant commercial potential. Unstructured texts continue to be the most readily available source of knowledge, despite the fact that knowledge may be accessed in many other places. Text classification that assigns documents to predetermined categories. Machine learning approaches can categories texts more accurately. The goal of this work is to introduce text classification, give a description of the text classification technique, a general review of the classifiers, and a comparison of a few of the current classifiers. It is based on performance, time complexity, and other factors. On the basis of speed, accuracy, benefits, and drawbacks of existing classification methods such as Decision Tree, Naive Bayes, Support Vector Machine, and k-Nearest Neighbours are compared.
AbstractList The majority of the data is preserved as text (about 75%), hence It is believed that text mining has a significant commercial potential. Unstructured texts continue to be the most readily available source of knowledge, despite the fact that knowledge may be accessed in many other places. Text classification that assigns documents to predetermined categories. Machine learning approaches can categories texts more accurately. The goal of this work is to introduce text classification, give a description of the text classification technique, a general review of the classifiers, and a comparison of a few of the current classifiers. It is based on performance, time complexity, and other factors. On the basis of speed, accuracy, benefits, and drawbacks of existing classification methods such as Decision Tree, Naive Bayes, Support Vector Machine, and k-Nearest Neighbours are compared.
Author Dhabliya, Dharmesh
Dubey, M. K.
Dhingra, Mallika
Reddy, Dhoma Harshavardhan
Gupta, Ankur
Author_xml – sequence: 1
  givenname: Mallika
  surname: Dhingra
  fullname: Dhingra, Mallika
  email: mallikadhingra13@gmail.com
  organization: Manipal University Jaipur,Department of Mathematics and Statistics,Jaipur,Rajasthan,India
– sequence: 2
  givenname: Dharmesh
  surname: Dhabliya
  fullname: Dhabliya, Dharmesh
  email: dharmesh.dhabliya@viit.ac.in
  organization: Vishwakarma Institute of Information Technology,Department of Information Technology,Pune,Maharashtra,India
– sequence: 3
  givenname: M. K.
  surname: Dubey
  fullname: Dubey, M. K.
  email: maheshdubey6@gmail.com
  organization: Manipal University Jaipur,Department of Mathematics and Statistics,Jaipur,Rajasthan,India
– sequence: 4
  givenname: Ankur
  surname: Gupta
  fullname: Gupta, Ankur
  email: ankurdujana@gmail.com
  organization: Vaish College of Engineering,Department of Computer Science and Engineering,Rohtak,Haryana,India
– sequence: 5
  givenname: Dhoma Harshavardhan
  surname: Reddy
  fullname: Reddy, Dhoma Harshavardhan
  email: dhreddy2001@gmail.com
  organization: Trainee Security Analyst JunoClinic, Davman Technology Services Private Limited,Mumbai,Maharashtra,India
BookMark eNo1j8tKxDAYRiPoQsd5A8G8QGuuTbIsxUuhgyDjekgyf2YCbTKkxcvbO6CuvgMHDnw36DLlBAjdU1JTSsxD3_FeNkzQmhHGakqIYpKwC7Q2ymguCTeaNeoabVr8Bh8RPnFOuMvTyZY4nzEHvLH-GBPgAWxJMR1wOx5yictxmnHIBW_ha8HdaOc5hujtEnO6RVfBjjOs_3aF3p8et91LNbw-9107VJFSs1RcOyVC0IZxqZVwwKQzeyqDd4paohUVgVqmwBNmnQEtGrpXPggXgj87vkJ3v90IALtTiZMt37v_l_wHYApMlw
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/IC3I56241.2022.10072502
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore Digital Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350398267
EndPage 1823
ExternalDocumentID 10072502
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i119t-38b74ff89235874be25b9d15fcb71a08714f1a27ec02ab9e8461d7cf4bffc14f3
IEDL.DBID RIE
IngestDate Thu Jan 18 11:14:58 EST 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i119t-38b74ff89235874be25b9d15fcb71a08714f1a27ec02ab9e8461d7cf4bffc14f3
PageCount 6
ParticipantIDs ieee_primary_10072502
PublicationCentury 2000
PublicationDate 2022-Dec.-14
PublicationDateYYYYMMDD 2022-12-14
PublicationDate_xml – month: 12
  year: 2022
  text: 2022-Dec.-14
  day: 14
PublicationDecade 2020
PublicationTitle 2022 5th International Conference on Contemporary Computing and Informatics (IC3I)
PublicationTitleAbbrev IC3I
PublicationYear 2022
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.9853216
Snippet The majority of the data is preserved as text (about 75%), hence It is believed that text mining has a significant commercial potential. Unstructured texts...
SourceID ieee
SourceType Publisher
StartPage 1818
SubjectTerms Decision Tree
k-Nearest Neighbour
Machine learning
Machine learning algorithms
Naive Bayes
Support Vector Machine
Support vector machine classification
Text categorization
Text classification
Text mining
Web pages
Writing
Title A Review on Comparison of Machine Learning Algorithms for Text Classification
URI https://ieeexplore.ieee.org/document/10072502
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NSwMxEA3akycVK36Tg9fdbnazyeZYiqUVWjy00FvJx6QWdVfq9uKvN8luKwqCt5AEEjIwb5K8N4PQPTOKEJnoSDAXvlENNpIqTyNJpWGMQ8FDuqbJlI3m9HGRL1qxetDCAEAgn0Hsm-Ev31R665_Kev5H30G287iHnItGrNVytkgieuNBNnZwTv21L03j3ewfdVMCbAyP0XS3YMMWeYm3tYr1569cjP_e0Qnqfiv08NMee07RAZRnaNLHzVM_rko82BcYxJXFk8CZBNymU13h_uuq2qzr57cP7MJWPHM-GocCmZ46FKzVRfPhw2wwitpyCdGaEFFHWaE4tbYQXv3KqYI0V8KQ3GrFnTnczYhaIlMOOkmlEuAiD2K4tlRZq91Ydo46ZVXCBcLKgBUaBDMO4XOggltFWE5sIgqWKXmJuv4slu9NRozl7hiu_ui_RkfeJJ4GQugN6tSbLdw6MK_VXTDiFyKAoFY
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA4yD3pSceJvc_DarmnTpDmO4dh0HR422G006cscajtmd_GvN0m7iYLgLSSBhDzI917yfe8hdM9ySUgWKE8w475RBdrLZBx6Gc1yxjgk3KVrSsdsMKWPs3jWiNWdFgYAHPkMfNt0f_l5qTb2qaxjf_QNZJsbdz82YQWv5VoNa4sEojPsRUMD6NQGfmHob-f_qJzigKN_hMbbJWu-yKu_qaSvPn9lY_z3no5R-1ujh5936HOC9qA4RWkX14_9uCxwb1diEJcap441CbhJqLrA3bdFuV5WL-8f2DiueGJuaexKZFrykLNXG037D5PewGsKJnhLQkTlRYnkVOtEWP0rpxLCWIqcxFpJbgxiYiOqSRZyUEGYSQHG9yA5V5pKrZUZi85QqygLOEdY5qCFAsFyg_ExUMG1JCwmOhAJi2R2gdr2LOarOifGfHsMl3_036GDwSQdzUfD8dMVOrTmsaQQQq9Rq1pv4MZAeyVvnUG_AL_Po6c
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2022+5th+International+Conference+on+Contemporary+Computing+and+Informatics+%28IC3I%29&rft.atitle=A+Review+on+Comparison+of+Machine+Learning+Algorithms+for+Text+Classification&rft.au=Dhingra%2C+Mallika&rft.au=Dhabliya%2C+Dharmesh&rft.au=Dubey%2C+M.+K.&rft.au=Gupta%2C+Ankur&rft.date=2022-12-14&rft.pub=IEEE&rft.spage=1818&rft.epage=1823&rft_id=info:doi/10.1109%2FIC3I56241.2022.10072502&rft.externalDocID=10072502