Systematic Literature Review in the Development of Datasets and Fact Verification Models for Indonesian Language

This study explores the major challenges in fact verification for the Indonesian language due to gaps in managing dynamic information, uncertainties from linguistic variations, and the proliferation of online narratives. Most text classification research for fact verification relies on English-langu...

Full description

Saved in:

Bibliographic Details
Published in	2024 7th International Conference of Computer and Informatics Engineering (IC2IE) pp. 1 - 9
Main Authors	Agustina, Nova, Kusrini, Utami, Ema, Hidayat, Tonny
Format	Conference Proceeding
Language	English
Published	IEEE 12.09.2024
Subjects	Accuracy Adaptation models Context modeling dataset fact verification indonesian-language Measurement Natural language processing Object recognition Organizations Reliability engineering Text categorization Uncertainty
Online Access	Get full text
DOI	10.1109/IC2IE63342.2024.10748079

Cover

More Information
Summary:	This study explores the major challenges in fact verification for the Indonesian language due to gaps in managing dynamic information, uncertainties from linguistic variations, and the proliferation of online narratives. Most text classification research for fact verification relies on English-language datasets, which may not be suitable for the Indonesian context. Through a Systematic Literature Review (SLR) approach, this research analyzes the development of datasets and models for Indonesian-language fact verification tasks. Utilizing the PICOC framework, this research conducted a systematic literature review to explore the development of datasets and models for Indonesian-language fact verification tasks. The main objectives of this research are to identify existing research shortcomings, evaluate the performance of current fact verification models, and propose recommendations for future research. These findings highlight the lack of research specifically addressing fact verification in Indonesia, as well as the lack of datasets and models specifically designed for the fact verification context. Existing models often perform poorly in terms of precision, recall, and F1 score, with average scores below 70%, indicating difficulty in identifying relevant evidence and consistently verifying facts. The key findings indicate a lack of research specifically addressing fact verification in Indonesia, as well as limited datasets and models specifically designed for fact verification. Additionally, most fact-verification tasks do not cover specific topics, except those related to politics. Common performance metrics used in this analysis include accuracy, precision, recall, F1-Score, and Dataset Score.
DOI:	10.1109/IC2IE63342.2024.10748079