Named Entity Recognition in Indonesian History Textbook Using BERT Model

History is not recognized as an explicit subject in some primary or secondary education institutions anymore. Certainly, this can cause concern for the younger generation about their nation's history. Whereas history textbooks are available in digital form and contain much information, the pres...

Full description

Saved in:
Bibliographic Details
Published inCogito smart journal Vol. 11; no. 1; pp. 140 - 151
Main Authors Muslim, Ichwanul, Firdaus, Muliawan, Habibi, Rizki
Format Journal Article
LanguageEnglish
Published 30.06.2025
Online AccessGet full text
ISSN2541-2221
2477-8079
DOI10.31154/cogito.v11i1.880.140-151

Cover

Abstract History is not recognized as an explicit subject in some primary or secondary education institutions anymore. Certainly, this can cause concern for the younger generation about their nation's history. Whereas history textbooks are available in digital form and contain much information, the presentation is still unstructured and difficult to understand. This research aims to develop a model of extracting historical entities from textbooks using the Named Entity Recognition (NER) approach based on the BERT (Bidirectional Encoder Representations from Transformers). The text data is derived from the history chapter of the 8th Social Science published by the Ministry of Education. The research stages include data extraction, preprocessing, IOB labeling, identifying entities by the BERT algorithm, and performance evaluation. The preprocessing results successfully reduced irrelevant words and improved analysis efficiency. The BERT model showed high performance with a precision value of 88.68%, a recall of 74.60%, and an F1-score of 81.03%. In addition, there were fluctuations in training time between epochs that were influenced by entity variation and sentence complexity. Overall, this research shows, the model application can extract historical entities automatically and accurately, thus potentially enriching historical understanding for students and society through the utilization of Natural Language Processing technology
AbstractList History is not recognized as an explicit subject in some primary or secondary education institutions anymore. Certainly, this can cause concern for the younger generation about their nation's history. Whereas history textbooks are available in digital form and contain much information, the presentation is still unstructured and difficult to understand. This research aims to develop a model of extracting historical entities from textbooks using the Named Entity Recognition (NER) approach based on the BERT (Bidirectional Encoder Representations from Transformers). The text data is derived from the history chapter of the 8th Social Science published by the Ministry of Education. The research stages include data extraction, preprocessing, IOB labeling, identifying entities by the BERT algorithm, and performance evaluation. The preprocessing results successfully reduced irrelevant words and improved analysis efficiency. The BERT model showed high performance with a precision value of 88.68%, a recall of 74.60%, and an F1-score of 81.03%. In addition, there were fluctuations in training time between epochs that were influenced by entity variation and sentence complexity. Overall, this research shows, the model application can extract historical entities automatically and accurately, thus potentially enriching historical understanding for students and society through the utilization of Natural Language Processing technology
Author Firdaus, Muliawan
Muslim, Ichwanul
Habibi, Rizki
Author_xml – sequence: 1
  givenname: Ichwanul
  surname: Muslim
  fullname: Muslim, Ichwanul
– sequence: 2
  givenname: Muliawan
  surname: Firdaus
  fullname: Firdaus, Muliawan
– sequence: 3
  givenname: Rizki
  surname: Habibi
  fullname: Habibi, Rizki
BookMark eNot0N9KwzAUBvAgE5xz7xAfoDUnTZr2Ukd1g6kwuuuQ5s8Ibok0RdzbGzavzndx-D743aNZiMEi9AikrAA4e9Lx4KdY_gB4KJuGlMBIARxu0JwyIYqGiHaWM2dQUErhDi1T8gNhTFQVZWSO1h_qZA3uwuSnM97Z3Bj85GPAPuBNMHkweRXw2qcpjmfc299piPEL75MPB_zS7Xr8Ho09PqBbp47JLv_vAu1fu361Lrafb5vV87bQQACKgZvauFYL3hrNK2cV1EJTbkDbljPCapofBXBdE-5q7RrVGmBOcU4NJ6paoPbaq8eY0mid_B79SY1nCUReVORVRV5UZFaRWUVmleoPXmxaGA
ContentType Journal Article
DBID AAYXX
CITATION
DOI 10.31154/cogito.v11i1.880.140-151
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList CrossRef
DeliveryMethod fulltext_linktorsrc
EISSN 2477-8079
EndPage 151
ExternalDocumentID 10_31154_cogito_v11i1_880_140_151
GroupedDBID AAYXX
ADBBV
ALMA_UNASSIGNED_HOLDINGS
BCNDV
CITATION
GROUPED_DOAJ
ID FETCH-LOGICAL-c1011-b5d6df9c759dc53fea167c25d1ce9540462c10715c605f6cf8a9d14fa552d50a3
ISSN 2541-2221
IngestDate Thu Sep 11 00:19:36 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
License http://creativecommons.org/licenses/by/4.0
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c1011-b5d6df9c759dc53fea167c25d1ce9540462c10715c605f6cf8a9d14fa552d50a3
OpenAccessLink https://doi.org/10.31154/cogito.v11i1.880.140-151
PageCount 12
ParticipantIDs crossref_primary_10_31154_cogito_v11i1_880_140_151
PublicationCentury 2000
PublicationDate 2025-06-30
PublicationDateYYYYMMDD 2025-06-30
PublicationDate_xml – month: 06
  year: 2025
  text: 2025-06-30
  day: 30
PublicationDecade 2020
PublicationTitle Cogito smart journal
PublicationYear 2025
SSID ssib044733240
ssj0002039597
Score 2.2993672
Snippet History is not recognized as an explicit subject in some primary or secondary education institutions anymore. Certainly, this can cause concern for the younger...
SourceID crossref
SourceType Index Database
StartPage 140
Title Named Entity Recognition in Indonesian History Textbook Using BERT Model
Volume 11
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2477-8079
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssib044733240
  issn: 2541-2221
  databaseCode: M~E
  dateStart: 20150101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1JSwMxFA6iIF5EUXEngrcyaiZb56hSqUJ7kBa8DZlkBgdrEW0VPfjbfUlmc8PlMgyhTTPzfbx8L30LQvsikUnEknaQUsoDuCGBYpQF4ImFFDCX7cwmCvf6ojtkF1f8qj5Vctklk-RAv3yZV_IfVGEMcLVZsn9AtpoUBuAe8IUrIAzXX2HcV7CXtTo209ZK6SIUyAcvnts2HalLkfSlQJ5bAzDEVlS3fJzASedy4HqhjZoK9RRsIejRh1v4vVZzFQ4VUKWOQOf6-kmN66DCs_zeqKk_uZ6OcvVUk66rbIafT-J_ucmbxwwhL2PiSmsEjiQJQEx4IqR-jEm7xfluMJU5JZ9o420j8XWZim2W-DqzHy24Lf7D4L1r97AHj4TkBGAH027_gy5q076rmv1hN6tiDMG7cZPFfqrYTRXDVLENZyM27X4ulELYthe9105phBiT1BYprA7qwiMacdekp3oF82ivXOrhdwtt6JyGYBksocXC08DHnjbLaCYdr6Cuowz2lMENyuB8jGvK4IIyuKQMdpTBljLYUWYVDc86g9NuUDTTCDSxx-AJN8JkkZY8MprTLFVESB1yQ3QagWxnIoQPSsI1OLiZ0FlbRYawTHEeGn6k6BqaHcMi1hGmtG0oCGtwRg3LpFaC8YQyKk2iItgkNlBYPnp852umxD-isfmfL22hhZqr22h2cj9Nd0AkTpJdB-obIedicQ
linkProvider ISSN International Centre
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Named+Entity+Recognition+in+Indonesian+History+Textbook+Using+BERT+Model&rft.jtitle=Cogito+smart+journal&rft.au=Muslim%2C+Ichwanul&rft.au=Firdaus%2C+Muliawan&rft.au=Habibi%2C+Rizki&rft.date=2025-06-30&rft.issn=2541-2221&rft.eissn=2477-8079&rft.volume=11&rft.issue=1&rft.spage=140&rft.epage=151&rft_id=info:doi/10.31154%2Fcogito.v11i1.880.140-151&rft.externalDBID=n%2Fa&rft.externalDocID=10_31154_cogito_v11i1_880_140_151
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2541-2221&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2541-2221&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2541-2221&client=summon