Performance of ChatGPT‐3.5 and ChatGPT‐4o in the Japanese National Dental Examination

Objectives: In this study, we compared the performance of ChatGPT‐3.5 to that of ChatGPT‐4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education. Methods: ChatGPT's...

Full description

Saved in:

Bibliographic Details
Published in	Journal of dental education Vol. 89; no. 4; pp. 459 - 466
Main Authors	Uehara, Osamu, Morikawa, Tetsuro, Harada, Fumiya, Sugiyama, Nodoka, Matsuki, Yuko, Hiraki, Daichi, Sakurai, Hinako, Kado, Takashi, Yoshida, Koki, Murata, Yukie, Matsuoka, Hirofumi, Nagasawa, Toshiyuki, Furuichi, Yasushi, Abiko, Yoshihiro, Miura, Hiroko
Format	Journal Article
Language	English
Published	United States 01.04.2025
Subjects	artificial intelligence Chat Generative Pre‐trained Transformer (ChatGPT) Clinical Competence dental education Education, Dental - methods Educational Measurement - methods Generative Artificial Intelligence Humans Japan Japanese National Dental Examination natural language processing Japan dental education Chat Generative Pre‐trained Transformer (ChatGPT) Japanese National Dental Examination natural language processing artificial intelligence
Online Access	Get full text
ISSN	0022-0337 1930-7837 1930-7837
DOI	10.1002/jdd.13766

Cover

Abstract	Objectives: In this study, we compared the performance of ChatGPT‐3.5 to that of ChatGPT‐4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education. Methods: ChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111−117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance. Results: A significant improvement was noted in the percentage of correct answers from ChatGPT‐4o (84.63%) compared with those from ChatGPT‐3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT‐4o consistently outperformed ChatGPT‐3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT‐4o provided more stable and higher correct answer rates, especially for complex subjects. Conclusions: This study found that advanced natural language processing models, such as ChatGPT‐4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation.
AbstractList	Objectives: In this study, we compared the performance of ChatGPT‐3.5 to that of ChatGPT‐4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education. Methods: ChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111−117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance. Results: A significant improvement was noted in the percentage of correct answers from ChatGPT‐4o (84.63%) compared with those from ChatGPT‐3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT‐4o consistently outperformed ChatGPT‐3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT‐4o provided more stable and higher correct answer rates, especially for complex subjects. Conclusions: This study found that advanced natural language processing models, such as ChatGPT‐4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation. In this study, we compared the performance of ChatGPT-3.5 to that of ChatGPT-4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education.OBJECTIVESIn this study, we compared the performance of ChatGPT-3.5 to that of ChatGPT-4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education.ChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111-117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance.METHODSChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111-117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance.A significant improvement was noted in the percentage of correct answers from ChatGPT-4o (84.63%) compared with those from ChatGPT-3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT-4o consistently outperformed ChatGPT-3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT-4o provided more stable and higher correct answer rates, especially for complex subjects.RESULTSA significant improvement was noted in the percentage of correct answers from ChatGPT-4o (84.63%) compared with those from ChatGPT-3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT-4o consistently outperformed ChatGPT-3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT-4o provided more stable and higher correct answer rates, especially for complex subjects.This study found that advanced natural language processing models, such as ChatGPT-4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation.CONCLUSIONSThis study found that advanced natural language processing models, such as ChatGPT-4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation. In this study, we compared the performance of ChatGPT-3.5 to that of ChatGPT-4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education. ChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111-117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance. A significant improvement was noted in the percentage of correct answers from ChatGPT-4o (84.63%) compared with those from ChatGPT-3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT-4o consistently outperformed ChatGPT-3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT-4o provided more stable and higher correct answer rates, especially for complex subjects. This study found that advanced natural language processing models, such as ChatGPT-4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation.
Author	Matsuoka, Hirofumi Uehara, Osamu Nagasawa, Toshiyuki Furuichi, Yasushi Sugiyama, Nodoka Hiraki, Daichi Abiko, Yoshihiro Kado, Takashi Murata, Yukie Yoshida, Koki Morikawa, Tetsuro Harada, Fumiya Matsuki, Yuko Miura, Hiroko Sakurai, Hinako
Author_xml	– sequence: 1 givenname: Osamu orcidid: 0000-0001-5602-4448 surname: Uehara fullname: Uehara, Osamu organization: Health Sciences University of Hokkaido – sequence: 2 givenname: Tetsuro surname: Morikawa fullname: Morikawa, Tetsuro organization: Health Sciences University of Hokkaido – sequence: 3 givenname: Fumiya surname: Harada fullname: Harada, Fumiya organization: Health Sciences University of Hokkaido – sequence: 4 givenname: Nodoka surname: Sugiyama fullname: Sugiyama, Nodoka organization: Health Sciences University of Hokkaido – sequence: 5 givenname: Yuko surname: Matsuki fullname: Matsuki, Yuko organization: Health Sciences University of Hokkaido – sequence: 6 givenname: Daichi surname: Hiraki fullname: Hiraki, Daichi organization: Health Sciences University of Hokkaido – sequence: 7 givenname: Hinako surname: Sakurai fullname: Sakurai, Hinako organization: Health Sciences University of Hokkaido – sequence: 8 givenname: Takashi surname: Kado fullname: Kado, Takashi organization: Health Sciences University of Hokkaido – sequence: 9 givenname: Koki surname: Yoshida fullname: Yoshida, Koki organization: Health Sciences University of Hokkaido – sequence: 10 givenname: Yukie surname: Murata fullname: Murata, Yukie organization: Health Sciences University of Hokkaido – sequence: 11 givenname: Hirofumi surname: Matsuoka fullname: Matsuoka, Hirofumi organization: Health Sciences University of Hokkaido – sequence: 12 givenname: Toshiyuki surname: Nagasawa fullname: Nagasawa, Toshiyuki organization: Health Sciences University of Hokkaido – sequence: 13 givenname: Yasushi surname: Furuichi fullname: Furuichi, Yasushi organization: Health Sciences University of Hokkaido – sequence: 14 givenname: Yoshihiro surname: Abiko fullname: Abiko, Yoshihiro email: yoshi-ab@hoku-iryo-u.ac.jp organization: Health Sciences University of Hokkaido – sequence: 15 givenname: Hiroko surname: Miura fullname: Miura, Hiroko organization: Health Sciences University of Hokkaido
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/39538434$$D View this record in MEDLINE/PubMed
BookMark	eNp1kLtOwzAUhi0EohcYeAGUEYa0vsbJiNpSqCroUAYmy3FsNVXilDgVdOMReEaeBNOLhBBMRz7-_l_21wHHtrIagAsEewhC3F9mWQ8RHkVHoI0SAkMeE34M2v4Oh5AQ3gId55b-mFCKT0GLJIzElNA2eJ7p2lR1Ka3SQWWCwUI249n88_2D9FggbfZjQ6sgt0Gz0MFErqTVTgcPsskrK4tgqG3jx-hNlrndLs_AiZGF0-f72QVPt6P54C6cPo7vBzfTUJGER2HCmYFpIiWCRHMYY0wRS3HCUEYMjOI4URmEjBmjkKZpTCWNqFGMqwwTxTLSBVe73lVdvay1a0SZO6WLwr-wWjtBEI5j7C1Aj17u0XVa6kys6ryU9UYcdHigvwNUXTlXayNU3mx_09QyLwSC4lu48MLFVrhPXP9KHEr_Yvftr3mhN_-DYjIc7hJfLgaOMg
CitedBy_id	crossref_primary_10_1016_j_jds_2025_02_022 crossref_primary_10_2147_JPR_S509845
Cites_doi	10.1007/s10439-023-03296-w 10.1016/j.jds.2024.06.015 10.1016/j.jds.2024.02.019 10.3352/jeehp.2024.21.4 10.1016/j.jds.2023.12.007 10.7759/cureus.50369 10.2196/48002 10.1111/j.0305-182X.2007.01820.x 10.36740/WLek202311101
ContentType	Journal Article
Copyright	2024 American Dental Education Association.
Copyright_xml	– notice: 2024 American Dental Education Association.
DBID	AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8
DOI	10.1002/jdd.13766
DatabaseName	CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic
DatabaseTitle	CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic
DatabaseTitleList	CrossRef MEDLINE - Academic MEDLINE
Database_xml	– sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Dentistry
EISSN	1930-7837
EndPage	466
ExternalDocumentID	39538434 10_1002_jdd_13766 JDD13766
Genre	article Journal Article
GeographicLocations	Japan
GeographicLocations_xml	– name: Japan
GroupedDBID	--- 0R~ 18M 1CY 1OC 2WC 33P 34H 53G 5GY 5RE 5VS AAHHS AAHQN AAHSB AAIPD AAMNL AANLZ AAOGT AAWTL AAYCA ABCUV ABJNI ABQWH ACCFJ ACCZN ACGFO ACGOF ACPOU ACXQS ADBBV ADBTR ADKYN ADZMN AEEZP AEIGN AENEX AEQDE AEUYR AFFNX AFFPM AFWVQ AGHNM AGYGG AHBTC AITYG AIWBW AJBDE ALMA_UNASSIGNED_HOLDINGS ALUQN ALVPJ AMYDB BAWUL BFHJK BTFSW C45 DCZOG E3Z EBS EJD F5P GK1 GX1 H13 HGLYW HW5 KQ8 LATKE LEEKS MEWTI OK1 OVD P2P RHI ROL SUPJJ TDE TEORI TR2 UCV UMD USG W8F WH7 WOQ WXSBR XZL YRY ZGI ZVN AAYXX AEYWJ CITATION AAMMB AEFGJ AGXDD AIDQK AIDYY CGR CUY CVF ECM EIF NPM 7X8 LH4
ID	FETCH-LOGICAL-c3976-975f0b9aa103e70822415b2951d3f06889cd0055ffc1e4b84a464fc57cd23c5d3
ISSN	0022-0337 1930-7837
IngestDate	Thu Sep 04 16:14:43 EDT 2025 Mon Jul 21 05:28:40 EDT 2025 Tue Jul 01 05:04:01 EDT 2025 Thu Apr 24 22:58:37 EDT 2025 Thu Apr 17 09:30:29 EDT 2025
IsPeerReviewed	true
IsScholarly	true
Issue	4
Keywords	dental education Chat Generative Pre‐trained Transformer (ChatGPT) Japanese National Dental Examination natural language processing artificial intelligence
Language	English
License	2024 American Dental Education Association.
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-c3976-975f0b9aa103e70822415b2951d3f06889cd0055ffc1e4b84a464fc57cd23c5d3
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ORCID	0000-0001-5602-4448
PMID	39538434
PQID	3128820330
PQPubID	23479
PageCount	8
ParticipantIDs	proquest_miscellaneous_3128820330 pubmed_primary_39538434 crossref_citationtrail_10_1002_jdd_13766 crossref_primary_10_1002_jdd_13766 wiley_primary_10_1002_jdd_13766_JDD13766
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	April 2025
PublicationDateYYYYMMDD	2025-04-01
PublicationDate_xml	– month: 04 year: 2025 text: April 2025
PublicationDecade	2020
PublicationPlace	United States
PublicationPlace_xml	– name: United States
PublicationTitle	Journal of dental education
PublicationTitleAlternate	J Dent Educ
PublicationYear	2025
References	2023; 76 2024; 52 2008; 35 2024; 21 2024 2023; 15 2023; 9 2024; 19 e_1_2_8_3_1 e_1_2_8_2_1 e_1_2_8_5_1 e_1_2_8_4_1 e_1_2_8_7_1 e_1_2_8_6_1 e_1_2_8_9_1 e_1_2_8_8_1 e_1_2_8_10_1 e_1_2_8_11_1 e_1_2_8_12_1
References_xml	– volume: 21 year: 2024 article-title: ChatGPT (GPT‐4) passed the Japanese National License Examination for Pharmacists in 2022, answering all items including those with diagrams: a descriptive study publication-title: J Educ Eval Health Prof – volume: 9 year: 2023 article-title: Performance of GPT‐3.5 and GPT‐4 on the Japanese medical licensing examination: comparison Study publication-title: JMIR Med Educ – volume: 35 start-page: 446 issue: 6 year: 2008 end-page: 453 article-title: Dental occlusion: a critical reflection on past, present and future concepts publication-title: J Oral Rehabil – volume: 15 year: 2023 article-title: The performance of GPT‐3.5, GPT‐4, and bard on the japanese national dentist examination: a comparison study publication-title: Cureus – volume: 19 start-page: 1595 issue: 3 year: 2024 end-page: 1600 article-title: Evaluating GPT‐4V's performance in the Japanese national dental examination: a challenge explored publication-title: J Dent Sci – year: 2024 article-title: Evaluating the image recognition capabilities of GPT‐4V and Gemini Pro in the Japanese national dental examination publication-title: J Dent Sci – volume: 52 start-page: 130 issue: 2 year: 2024 end-page: 133 article-title: The role and potential contributions of the artificial intelligence language model ChatGPT publication-title: Ann Biomed Eng – volume: 76 start-page: 2345 issue: 11 year: 2023 end-page: 2350 article-title: From text to diagnose: chatGPT's efficacy in medical decision‐making publication-title: Wiad Lek – volume: 19 start-page: 2262 issue: 4 year: 2024 end-page: 2267 article-title: Evaluating the efficacy of leading large language models in the Japanese national dental hygienist examination: a comparative analysis of ChatGPT, Bard, and Bing Chat publication-title: J Dent Sci – ident: e_1_2_8_3_1 doi: 10.1007/s10439-023-03296-w – ident: e_1_2_8_9_1 doi: 10.1016/j.jds.2024.06.015 – ident: e_1_2_8_10_1 doi: 10.1016/j.jds.2024.02.019 – ident: e_1_2_8_4_1 doi: 10.3352/jeehp.2024.21.4 – ident: e_1_2_8_8_1 doi: 10.1016/j.jds.2023.12.007 – ident: e_1_2_8_6_1 – ident: e_1_2_8_7_1 doi: 10.7759/cureus.50369 – ident: e_1_2_8_11_1 – ident: e_1_2_8_5_1 doi: 10.2196/48002 – ident: e_1_2_8_12_1 doi: 10.1111/j.0305-182X.2007.01820.x – ident: e_1_2_8_2_1 doi: 10.36740/WLek202311101
SSID	ssj0029442
Score	2.4303641
Snippet	Objectives: In this study, we compared the performance of ChatGPT‐3.5 to that of ChatGPT‐4o in the context of the Japanese National Dental Examination, which... In this study, we compared the performance of ChatGPT-3.5 to that of ChatGPT-4o in the context of the Japanese National Dental Examination, which assesses...
SourceID	proquest pubmed crossref wiley
SourceType	Aggregation Database Index Database Enrichment Source Publisher
StartPage	459
SubjectTerms	artificial intelligence Chat Generative Pre‐trained Transformer (ChatGPT) Clinical Competence dental education Education, Dental - methods Educational Measurement - methods Generative Artificial Intelligence Humans Japan Japanese National Dental Examination natural language processing
Title	Performance of ChatGPT‐3.5 and ChatGPT‐4o in the Japanese National Dental Examination
URI	https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fjdd.13766 https://www.ncbi.nlm.nih.gov/pubmed/39538434 https://www.proquest.com/docview/3128820330
Volume	89
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 1930-7837 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0029442 issn: 0022-0337 databaseCode: KQ8 dateStart: 20010401 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1bb9MwFLbKeIAXxH0bFxnEAxJKSWKniR8R3ZiKGEO00niKfAsqo83UJuLyxE_giR_IL-HYTtyMdtLgJW2dI7f19-X4JD7-DkJPlAilETILUpHRgKowC7IiEwGPIHaWImE6NXuH3xwODiZ0dJwc93q_OllLdSX68vvGfSX_gyq0Aa5ml-w_IOs7hQZ4D_jCERCG44UwPupk_du0Cl69Ohr7_AXST-zawN_ttGyzG0cwVZoSlM8O22eCQ7c9cu8rNzkyHrX18FU5Q90miLTITbSRgDbIvV3yWe0BLRfTE_7FnhjralkvypXzW3BlT-zXs-k3P0-8rz_Cpxl3y0uqPOHdRxRx0slsWW0ZCIlTd-lr52kZMamMTVvjil01oYZytONXaSMb7qZo6gq1rHl_pyb7Sal-BH5zg8K2N0rONbNT-2g4tKcuocsxvJjKGK_f-dWpmFHqVejN32oVq8L4ue_1bJyzdvNy9l7IBjPj6-haAyN-4Sh1A_X0_Ca6YrC3xf9uoQ8dauGywA2Ffv_4CaTCQKpOCy3xdI6BTrilE27phB2dcIdOt9Fkf2_88iBoqnAE0sSqAUuTIhSM8ygkOjUFAiDmEzFE5ooUpmQRk8oouRWFjDSFK57TAS1kkkoVE5kocgdtzcu53kZYcJhNIUYdaKP6L3jGmeQRo6lkTBAd7qCn7ZjlspGoN5VSPudOXDvOYXhzO7w76LE3PXW6LJuMHrUDn4PXNEthMAplvcwJhGUQ-xIC33nXIeK7IQyCAEoo_BoL0fn95y1Pdi9ueg9dXV0i99FWtaj1A4hrK_HQkuwPmT-cuA
linkProvider	Colorado Alliance of Research Libraries
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Performance+of+ChatGPT%E2%80%903.5+and+ChatGPT%E2%80%904o+in+the+Japanese+National+Dental+Examination&rft.jtitle=Journal+of+dental+education&rft.au=Uehara%2C+Osamu&rft.au=Morikawa%2C+Tetsuro&rft.au=Harada%2C+Fumiya&rft.au=Sugiyama%2C+Nodoka&rft.date=2025-04-01&rft.issn=0022-0337&rft.eissn=1930-7837&rft.volume=89&rft.issue=4&rft.spage=459&rft.epage=466&rft_id=info:doi/10.1002%2Fjdd.13766&rft.externalDBID=10.1002%252Fjdd.13766&rft.externalDocID=JDD13766
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0022-0337&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0022-0337&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0022-0337&client=summon