Named Entity Recognition in Indonesian History Textbook Using BERT Model
Published in | Cogito smart journal Vol. 11; no. 1; pp. 140 - 151 |
---|---|
Main Authors | Muslim, Ichwanul; Firdaus, Muliawan; Habibi, Rizki |
Format | Journal Article |
Language | English |
Published | 30.06.2025 |
ISSN | 2541-2221 2477-8079 |
DOI | 10.31154/cogito.v11i1.880.140-151 |
Abstract | History is no longer taught as a separate subject in some primary and secondary education institutions, which raises concern about the younger generation's knowledge of their nation's history. Although history textbooks are available in digital form and contain a great deal of information, their presentation remains unstructured and difficult to understand. This research develops a model that extracts historical entities from textbooks using Named Entity Recognition (NER) based on BERT (Bidirectional Encoder Representations from Transformers). The text data come from the history chapter of the 8th-grade Social Science textbook published by the Ministry of Education. The research stages are data extraction, preprocessing, IOB labeling, entity identification with BERT, and performance evaluation. Preprocessing reduced irrelevant words and improved analysis efficiency. The BERT model achieved a precision of 88.68%, a recall of 74.60%, and an F1-score of 81.03%. Training time fluctuated between epochs, influenced by entity variation and sentence complexity. Overall, the results show that the model can extract historical entities automatically and accurately, and thus has the potential to enrich historical understanding for students and society through Natural Language Processing technology. |
---|---|
Author | Firdaus, Muliawan; Muslim, Ichwanul; Habibi, Rizki |
Author_xml | – sequence: 1 givenname: Ichwanul surname: Muslim fullname: Muslim, Ichwanul – sequence: 2 givenname: Muliawan surname: Firdaus fullname: Firdaus, Muliawan – sequence: 3 givenname: Rizki surname: Habibi fullname: Habibi, Rizki |
ContentType | Journal Article |
DOI | 10.31154/cogito.v11i1.880.140-151 |
EISSN | 2477-8079 |
EndPage | 151 |
ISSN | 2541-2221 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Language | English |
License | http://creativecommons.org/licenses/by/4.0 |
OpenAccessLink | https://doi.org/10.31154/cogito.v11i1.880.140-151 |
PageCount | 12 |
PublicationCentury | 2000 |
PublicationDate | 2025-06-30 |
PublicationDateYYYYMMDD | 2025-06-30 |
PublicationDate_xml | – month: 06 year: 2025 text: 2025-06-30 day: 30 |
PublicationDecade | 2020 |
PublicationTitle | Cogito smart journal |
PublicationYear | 2025 |
StartPage | 140 |
Title | Named Entity Recognition in Indonesian History Textbook Using BERT Model |
Volume | 11 |
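The abstract names IOB labeling and a precision/recall/F1 evaluation as stages of the study. The following is a minimal sketch of how those two steps are commonly implemented, not the authors' code. The example sentence, the tag set (`PER`, `LOC`, `DATE`), and the helper names `iob_to_spans` and `precision_recall_f1` are all hypothetical, and entity-level exact matching is an assumption, since the record does not say how the scores were computed. The last lines check that the reported F1-score is consistent with the reported precision and recall.

```python
from typing import List, Tuple

Span = Tuple[int, int, str]  # (start index, end index exclusive, entity type)


def iob_to_spans(tags: List[str]) -> List[Span]:
    """Convert a sequence of IOB tags into entity spans."""
    spans: List[Span] = []
    start, label = None, None
    # A trailing "O" sentinel closes any entity that runs to the end.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and label == tag[2:]
        if not continues and label is not None:
            spans.append((start, i, label))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
        elif tag.startswith("I-") and label is None:
            # Stray I- tag without a B-: treat it as the start of an entity.
            start, label = i, tag[2:]
    return spans


def precision_recall_f1(gold: List[Span], pred: List[Span]) -> Tuple[float, float, float]:
    """Entity-level scores; a prediction counts only on an exact span and type match."""
    gold_set, pred_set = set(gold), set(pred)
    tp = len(gold_set & pred_set)
    precision = tp / len(pred_set) if pred_set else 0.0
    recall = tp / len(gold_set) if gold_set else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical example sentence and labels (not taken from the study's data).
tokens = ["Soekarno", "membacakan", "proklamasi", "di", "Jakarta",
          "pada", "17", "Agustus", "1945"]
gold_tags = ["B-PER", "O", "O", "O", "B-LOC", "O", "B-DATE", "I-DATE", "I-DATE"]
pred_tags = ["B-PER", "O", "O", "O", "B-LOC", "O", "B-DATE", "I-DATE", "O"]

gold_spans = iob_to_spans(gold_tags)
pred_spans = iob_to_spans(pred_tags)
p, r, f1 = precision_recall_f1(gold_spans, pred_spans)
print(gold_spans, pred_spans, p, r, f1)

# Consistency check on the figures reported in the abstract.
reported_p, reported_r = 0.8868, 0.7460
reported_f1 = 2 * reported_p * reported_r / (reported_p + reported_r)
print(round(reported_f1, 4))
```

In the toy example the predicted date span stops one token early, so it does not count as a match under exact-span scoring. This strictness is one plausible reason a recall figure can sit well below precision, though the record gives no per-entity breakdown to confirm that for this study.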