A Study of BERT-Based Classification Performance of Text-Based Health Counseling Data

The entry into a hyper-connected society increases the generalization of communication using SNS. Therefore, research to analyze big data accumulated in SNS and extract meaningful information is being conducted in various fields. In particular, with the recent development of Deep Learning, the perfo...

Full description

Saved in:

Bibliographic Details
Published in	Computer modeling in engineering & sciences Vol. 135; no. 1; pp. 795 - 808
Main Authors	Woo Sung, Yeol, Seung Park, Dae, Ghil Kim, Cheong
Format	Journal Article
Language	English
Published	Henderson Tech Science Press 2023
Subjects	Algorithms Big Data Classification Counseling Deep learning Machine learning Natural language processing
Online Access	Get full text
ISSN	1526-1506 1526-1492 1526-1506
DOI	10.32604/cmes.2022.022465

Cover

More Information
Summary:	The entry into a hyper-connected society increases the generalization of communication using SNS. Therefore, research to analyze big data accumulated in SNS and extract meaningful information is being conducted in various fields. In particular, with the recent development of Deep Learning, the performance is rapidly improving by applying it to the field of Natural Language Processing, which is a language understanding technology to obtain accurate contextual information. In this paper, when a chatbot system is applied to the healthcare domain for counseling about diseases, the performance of NLP integrated with machine learning for the accurate classification of medical subjects from text-based health counseling data becomes important. Among the various algorithms, the performance of Bidirectional Encoder Representations from Transformers was compared with other algorithms of CNN, RNN, LSTM, and GRU. For this purpose, the health counseling data of Naver Q&A service were crawled as a dataset. KoBERT was used to classify medical subjects according to symptoms and the accuracy of classification results was measured. The simulation results show that KoBERT model performed high performance by more than 5% and close to 18% as large as the smallest.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1526-1506 1526-1492 1526-1506
DOI:	10.32604/cmes.2022.022465