A Review on Comparison of Machine Learning Algorithms for Text Classification
The majority of the data is preserved as text (about 75%), hence It is believed that text mining has a significant commercial potential. Unstructured texts continue to be the most readily available source of knowledge, despite the fact that knowledge may be accessed in many other places. Text classi...
Saved in:
Published in | 2022 5th International Conference on Contemporary Computing and Informatics (IC3I) pp. 1818 - 1823 |
---|---|
Main Authors | , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
14.12.2022
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/IC3I56241.2022.10072502 |
Cover
Abstract | The majority of the data is preserved as text (about 75%), hence It is believed that text mining has a significant commercial potential. Unstructured texts continue to be the most readily available source of knowledge, despite the fact that knowledge may be accessed in many other places. Text classification that assigns documents to predetermined categories. Machine learning approaches can categories texts more accurately. The goal of this work is to introduce text classification, give a description of the text classification technique, a general review of the classifiers, and a comparison of a few of the current classifiers. It is based on performance, time complexity, and other factors. On the basis of speed, accuracy, benefits, and drawbacks of existing classification methods such as Decision Tree, Naive Bayes, Support Vector Machine, and k-Nearest Neighbours are compared. |
---|---|
AbstractList | The majority of the data is preserved as text (about 75%), hence It is believed that text mining has a significant commercial potential. Unstructured texts continue to be the most readily available source of knowledge, despite the fact that knowledge may be accessed in many other places. Text classification that assigns documents to predetermined categories. Machine learning approaches can categories texts more accurately. The goal of this work is to introduce text classification, give a description of the text classification technique, a general review of the classifiers, and a comparison of a few of the current classifiers. It is based on performance, time complexity, and other factors. On the basis of speed, accuracy, benefits, and drawbacks of existing classification methods such as Decision Tree, Naive Bayes, Support Vector Machine, and k-Nearest Neighbours are compared. |
Author | Dhabliya, Dharmesh Dubey, M. K. Dhingra, Mallika Reddy, Dhoma Harshavardhan Gupta, Ankur |
Author_xml | – sequence: 1 givenname: Mallika surname: Dhingra fullname: Dhingra, Mallika email: mallikadhingra13@gmail.com organization: Manipal University Jaipur,Department of Mathematics and Statistics,Jaipur,Rajasthan,India – sequence: 2 givenname: Dharmesh surname: Dhabliya fullname: Dhabliya, Dharmesh email: dharmesh.dhabliya@viit.ac.in organization: Vishwakarma Institute of Information Technology,Department of Information Technology,Pune,Maharashtra,India – sequence: 3 givenname: M. K. surname: Dubey fullname: Dubey, M. K. email: maheshdubey6@gmail.com organization: Manipal University Jaipur,Department of Mathematics and Statistics,Jaipur,Rajasthan,India – sequence: 4 givenname: Ankur surname: Gupta fullname: Gupta, Ankur email: ankurdujana@gmail.com organization: Vaish College of Engineering,Department of Computer Science and Engineering,Rohtak,Haryana,India – sequence: 5 givenname: Dhoma Harshavardhan surname: Reddy fullname: Reddy, Dhoma Harshavardhan email: dhreddy2001@gmail.com organization: Trainee Security Analyst JunoClinic, Davman Technology Services Private Limited,Mumbai,Maharashtra,India |
BookMark | eNo1j8tKxDAYRiPoQsd5A8G8QGuuTbIsxUuhgyDjekgyf2YCbTKkxcvbO6CuvgMHDnw36DLlBAjdU1JTSsxD3_FeNkzQmhHGakqIYpKwC7Q2ymguCTeaNeoabVr8Bh8RPnFOuMvTyZY4nzEHvLH-GBPgAWxJMR1wOx5yictxmnHIBW_ha8HdaOc5hujtEnO6RVfBjjOs_3aF3p8et91LNbw-9107VJFSs1RcOyVC0IZxqZVwwKQzeyqDd4paohUVgVqmwBNmnQEtGrpXPggXgj87vkJ3v90IALtTiZMt37v_l_wHYApMlw |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/IC3I56241.2022.10072502 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Xplore Digital Library IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9798350398267 |
EndPage | 1823 |
ExternalDocumentID | 10072502 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i119t-38b74ff89235874be25b9d15fcb71a08714f1a27ec02ab9e8461d7cf4bffc14f3 |
IEDL.DBID | RIE |
IngestDate | Thu Jan 18 11:14:58 EST 2024 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i119t-38b74ff89235874be25b9d15fcb71a08714f1a27ec02ab9e8461d7cf4bffc14f3 |
PageCount | 6 |
ParticipantIDs | ieee_primary_10072502 |
PublicationCentury | 2000 |
PublicationDate | 2022-Dec.-14 |
PublicationDateYYYYMMDD | 2022-12-14 |
PublicationDate_xml | – month: 12 year: 2022 text: 2022-Dec.-14 day: 14 |
PublicationDecade | 2020 |
PublicationTitle | 2022 5th International Conference on Contemporary Computing and Informatics (IC3I) |
PublicationTitleAbbrev | IC3I |
PublicationYear | 2022 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.9853216 |
Snippet | The majority of the data is preserved as text (about 75%), hence It is believed that text mining has a significant commercial potential. Unstructured texts... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1818 |
SubjectTerms | Decision Tree k-Nearest Neighbour Machine learning Machine learning algorithms Naive Bayes Support Vector Machine Support vector machine classification Text categorization Text classification Text mining Web pages Writing |
Title | A Review on Comparison of Machine Learning Algorithms for Text Classification |
URI | https://ieeexplore.ieee.org/document/10072502 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NSwMxEA3akycVK36Tg9fdbnazyeZYiqUVWjy00FvJx6QWdVfq9uKvN8luKwqCt5AEEjIwb5K8N4PQPTOKEJnoSDAXvlENNpIqTyNJpWGMQ8FDuqbJlI3m9HGRL1qxetDCAEAgn0Hsm-Ev31R665_Kev5H30G287iHnItGrNVytkgieuNBNnZwTv21L03j3ewfdVMCbAyP0XS3YMMWeYm3tYr1569cjP_e0Qnqfiv08NMee07RAZRnaNLHzVM_rko82BcYxJXFk8CZBNymU13h_uuq2qzr57cP7MJWPHM-GocCmZ46FKzVRfPhw2wwitpyCdGaEFFHWaE4tbYQXv3KqYI0V8KQ3GrFnTnczYhaIlMOOkmlEuAiD2K4tlRZq91Ydo46ZVXCBcLKgBUaBDMO4XOggltFWE5sIgqWKXmJuv4slu9NRozl7hiu_ui_RkfeJJ4GQugN6tSbLdw6MK_VXTDiFyKAoFY |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA4yD3pSceJvc_DarmnTpDmO4dh0HR422G006cscajtmd_GvN0m7iYLgLSSBhDzI917yfe8hdM9ySUgWKE8w475RBdrLZBx6Gc1yxjgk3KVrSsdsMKWPs3jWiNWdFgYAHPkMfNt0f_l5qTb2qaxjf_QNZJsbdz82YQWv5VoNa4sEojPsRUMD6NQGfmHob-f_qJzigKN_hMbbJWu-yKu_qaSvPn9lY_z3no5R-1ujh5936HOC9qA4RWkX14_9uCxwb1diEJcap441CbhJqLrA3bdFuV5WL-8f2DiueGJuaexKZFrykLNXG037D5PewGsKJnhLQkTlRYnkVOtEWP0rpxLCWIqcxFpJbgxiYiOqSRZyUEGYSQHG9yA5V5pKrZUZi85QqygLOEdY5qCFAsFyg_ExUMG1JCwmOhAJi2R2gdr2LOarOifGfHsMl3_036GDwSQdzUfD8dMVOrTmsaQQQq9Rq1pv4MZAeyVvnUG_AL_Po6c |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2022+5th+International+Conference+on+Contemporary+Computing+and+Informatics+%28IC3I%29&rft.atitle=A+Review+on+Comparison+of+Machine+Learning+Algorithms+for+Text+Classification&rft.au=Dhingra%2C+Mallika&rft.au=Dhabliya%2C+Dharmesh&rft.au=Dubey%2C+M.+K.&rft.au=Gupta%2C+Ankur&rft.date=2022-12-14&rft.pub=IEEE&rft.spage=1818&rft.epage=1823&rft_id=info:doi/10.1109%2FIC3I56241.2022.10072502&rft.externalDocID=10072502 |