Performance of ChatGPT‐3.5 and ChatGPT‐4o in the Japanese National Dental Examination
Objectives: In this study, we compared the performance of ChatGPT‐3.5 to that of ChatGPT‐4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education. Methods: ChatGPT's...
Saved in:
Published in | Journal of dental education Vol. 89; no. 4; pp. 459 - 466 |
---|---|
Main Authors | , , , , , , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
United States
01.04.2025
|
Subjects | |
Online Access | Get full text |
ISSN | 0022-0337 1930-7837 1930-7837 |
DOI | 10.1002/jdd.13766 |
Cover
Abstract | Objectives: In this study, we compared the performance of ChatGPT‐3.5 to that of ChatGPT‐4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education.
Methods: ChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111−117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance.
Results: A significant improvement was noted in the percentage of correct answers from ChatGPT‐4o (84.63%) compared with those from ChatGPT‐3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT‐4o consistently outperformed ChatGPT‐3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT‐4o provided more stable and higher correct answer rates, especially for complex subjects.
Conclusions: This study found that advanced natural language processing models, such as ChatGPT‐4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation. |
---|---|
AbstractList | Objectives: In this study, we compared the performance of ChatGPT‐3.5 to that of ChatGPT‐4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education.
Methods: ChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111−117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance.
Results: A significant improvement was noted in the percentage of correct answers from ChatGPT‐4o (84.63%) compared with those from ChatGPT‐3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT‐4o consistently outperformed ChatGPT‐3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT‐4o provided more stable and higher correct answer rates, especially for complex subjects.
Conclusions: This study found that advanced natural language processing models, such as ChatGPT‐4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation. In this study, we compared the performance of ChatGPT-3.5 to that of ChatGPT-4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education.OBJECTIVESIn this study, we compared the performance of ChatGPT-3.5 to that of ChatGPT-4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education.ChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111-117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance.METHODSChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111-117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance.A significant improvement was noted in the percentage of correct answers from ChatGPT-4o (84.63%) compared with those from ChatGPT-3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT-4o consistently outperformed ChatGPT-3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT-4o provided more stable and higher correct answer rates, especially for complex subjects.RESULTSA significant improvement was noted in the percentage of correct answers from ChatGPT-4o (84.63%) compared with those from ChatGPT-3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT-4o consistently outperformed ChatGPT-3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT-4o provided more stable and higher correct answer rates, especially for complex subjects.This study found that advanced natural language processing models, such as ChatGPT-4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation.CONCLUSIONSThis study found that advanced natural language processing models, such as ChatGPT-4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation. In this study, we compared the performance of ChatGPT-3.5 to that of ChatGPT-4o in the context of the Japanese National Dental Examination, which assesses clinical reasoning skills and dental knowledge, to determine their potential usefulness in dental education. ChatGPT's performance was assessed using 1399 (55% of the exam) of 2520 questions from the Japanese National Dental Examinations (111-117). The 1121 excluded questions (45% of the exam) contained figures or tables that ChatGPT could not recognize. The questions were categorized into 18 different subjects based on dental specialty. Statistical analysis was performed using SPSS software, with McNemar's test applied to assess differences in performance. A significant improvement was noted in the percentage of correct answers from ChatGPT-4o (84.63%) compared with those from ChatGPT-3.5 (45.46%), demonstrating enhanced reliability and subject knowledge. ChatGPT-4o consistently outperformed ChatGPT-3.5 across all dental subjects, with significant improvements in subjects such as oral surgery, pathology, pharmacology, and microbiology. Heatmap analysis revealed that ChatGPT-4o provided more stable and higher correct answer rates, especially for complex subjects. This study found that advanced natural language processing models, such as ChatGPT-4o, potentially have sufficiently advanced clinical reasoning skills and dental knowledge to function as a supplementary tool in dental education and exam preparation. |
Author | Matsuoka, Hirofumi Uehara, Osamu Nagasawa, Toshiyuki Furuichi, Yasushi Sugiyama, Nodoka Hiraki, Daichi Abiko, Yoshihiro Kado, Takashi Murata, Yukie Yoshida, Koki Morikawa, Tetsuro Harada, Fumiya Matsuki, Yuko Miura, Hiroko Sakurai, Hinako |
Author_xml | – sequence: 1 givenname: Osamu orcidid: 0000-0001-5602-4448 surname: Uehara fullname: Uehara, Osamu organization: Health Sciences University of Hokkaido – sequence: 2 givenname: Tetsuro surname: Morikawa fullname: Morikawa, Tetsuro organization: Health Sciences University of Hokkaido – sequence: 3 givenname: Fumiya surname: Harada fullname: Harada, Fumiya organization: Health Sciences University of Hokkaido – sequence: 4 givenname: Nodoka surname: Sugiyama fullname: Sugiyama, Nodoka organization: Health Sciences University of Hokkaido – sequence: 5 givenname: Yuko surname: Matsuki fullname: Matsuki, Yuko organization: Health Sciences University of Hokkaido – sequence: 6 givenname: Daichi surname: Hiraki fullname: Hiraki, Daichi organization: Health Sciences University of Hokkaido – sequence: 7 givenname: Hinako surname: Sakurai fullname: Sakurai, Hinako organization: Health Sciences University of Hokkaido – sequence: 8 givenname: Takashi surname: Kado fullname: Kado, Takashi organization: Health Sciences University of Hokkaido – sequence: 9 givenname: Koki surname: Yoshida fullname: Yoshida, Koki organization: Health Sciences University of Hokkaido – sequence: 10 givenname: Yukie surname: Murata fullname: Murata, Yukie organization: Health Sciences University of Hokkaido – sequence: 11 givenname: Hirofumi surname: Matsuoka fullname: Matsuoka, Hirofumi organization: Health Sciences University of Hokkaido – sequence: 12 givenname: Toshiyuki surname: Nagasawa fullname: Nagasawa, Toshiyuki organization: Health Sciences University of Hokkaido – sequence: 13 givenname: Yasushi surname: Furuichi fullname: Furuichi, Yasushi organization: Health Sciences University of Hokkaido – sequence: 14 givenname: Yoshihiro surname: Abiko fullname: Abiko, Yoshihiro email: yoshi-ab@hoku-iryo-u.ac.jp organization: Health Sciences University of Hokkaido – sequence: 15 givenname: Hiroko surname: Miura fullname: Miura, Hiroko organization: Health Sciences University of Hokkaido |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/39538434$$D View this record in MEDLINE/PubMed |
BookMark | eNp1kLtOwzAUhi0EohcYeAGUEYa0vsbJiNpSqCroUAYmy3FsNVXilDgVdOMReEaeBNOLhBBMRz7-_l_21wHHtrIagAsEewhC3F9mWQ8RHkVHoI0SAkMeE34M2v4Oh5AQ3gId55b-mFCKT0GLJIzElNA2eJ7p2lR1Ka3SQWWCwUI249n88_2D9FggbfZjQ6sgt0Gz0MFErqTVTgcPsskrK4tgqG3jx-hNlrndLs_AiZGF0-f72QVPt6P54C6cPo7vBzfTUJGER2HCmYFpIiWCRHMYY0wRS3HCUEYMjOI4URmEjBmjkKZpTCWNqFGMqwwTxTLSBVe73lVdvay1a0SZO6WLwr-wWjtBEI5j7C1Aj17u0XVa6kys6ryU9UYcdHigvwNUXTlXayNU3mx_09QyLwSC4lu48MLFVrhPXP9KHEr_Yvftr3mhN_-DYjIc7hJfLgaOMg |
CitedBy_id | crossref_primary_10_1016_j_jds_2025_02_022 crossref_primary_10_2147_JPR_S509845 |
Cites_doi | 10.1007/s10439-023-03296-w 10.1016/j.jds.2024.06.015 10.1016/j.jds.2024.02.019 10.3352/jeehp.2024.21.4 10.1016/j.jds.2023.12.007 10.7759/cureus.50369 10.2196/48002 10.1111/j.0305-182X.2007.01820.x 10.36740/WLek202311101 |
ContentType | Journal Article |
Copyright | 2024 American Dental Education Association. |
Copyright_xml | – notice: 2024 American Dental Education Association. |
DBID | AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8 |
DOI | 10.1002/jdd.13766 |
DatabaseName | CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
DatabaseTitleList | CrossRef MEDLINE - Academic MEDLINE |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Dentistry |
EISSN | 1930-7837 |
EndPage | 466 |
ExternalDocumentID | 39538434 10_1002_jdd_13766 JDD13766 |
Genre | article Journal Article |
GeographicLocations | Japan |
GeographicLocations_xml | – name: Japan |
GroupedDBID | --- 0R~ 18M 1CY 1OC 2WC 33P 34H 53G 5GY 5RE 5VS AAHHS AAHQN AAHSB AAIPD AAMNL AANLZ AAOGT AAWTL AAYCA ABCUV ABJNI ABQWH ACCFJ ACCZN ACGFO ACGOF ACPOU ACXQS ADBBV ADBTR ADKYN ADZMN AEEZP AEIGN AENEX AEQDE AEUYR AFFNX AFFPM AFWVQ AGHNM AGYGG AHBTC AITYG AIWBW AJBDE ALMA_UNASSIGNED_HOLDINGS ALUQN ALVPJ AMYDB BAWUL BFHJK BTFSW C45 DCZOG E3Z EBS EJD F5P GK1 GX1 H13 HGLYW HW5 KQ8 LATKE LEEKS MEWTI OK1 OVD P2P RHI ROL SUPJJ TDE TEORI TR2 UCV UMD USG W8F WH7 WOQ WXSBR XZL YRY ZGI ZVN AAYXX AEYWJ CITATION AAMMB AEFGJ AGXDD AIDQK AIDYY CGR CUY CVF ECM EIF NPM 7X8 LH4 |
ID | FETCH-LOGICAL-c3976-975f0b9aa103e70822415b2951d3f06889cd0055ffc1e4b84a464fc57cd23c5d3 |
ISSN | 0022-0337 1930-7837 |
IngestDate | Thu Sep 04 16:14:43 EDT 2025 Mon Jul 21 05:28:40 EDT 2025 Tue Jul 01 05:04:01 EDT 2025 Thu Apr 24 22:58:37 EDT 2025 Thu Apr 17 09:30:29 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 4 |
Keywords | dental education Chat Generative Pre‐trained Transformer (ChatGPT) Japanese National Dental Examination natural language processing artificial intelligence |
Language | English |
License | 2024 American Dental Education Association. |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c3976-975f0b9aa103e70822415b2951d3f06889cd0055ffc1e4b84a464fc57cd23c5d3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ORCID | 0000-0001-5602-4448 |
PMID | 39538434 |
PQID | 3128820330 |
PQPubID | 23479 |
PageCount | 8 |
ParticipantIDs | proquest_miscellaneous_3128820330 pubmed_primary_39538434 crossref_citationtrail_10_1002_jdd_13766 crossref_primary_10_1002_jdd_13766 wiley_primary_10_1002_jdd_13766_JDD13766 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | April 2025 |
PublicationDateYYYYMMDD | 2025-04-01 |
PublicationDate_xml | – month: 04 year: 2025 text: April 2025 |
PublicationDecade | 2020 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States |
PublicationTitle | Journal of dental education |
PublicationTitleAlternate | J Dent Educ |
PublicationYear | 2025 |
References | 2023; 76 2024; 52 2008; 35 2024; 21 2024 2023; 15 2023; 9 2024; 19 e_1_2_8_3_1 e_1_2_8_2_1 e_1_2_8_5_1 e_1_2_8_4_1 e_1_2_8_7_1 e_1_2_8_6_1 e_1_2_8_9_1 e_1_2_8_8_1 e_1_2_8_10_1 e_1_2_8_11_1 e_1_2_8_12_1 |
References_xml | – volume: 21 year: 2024 article-title: ChatGPT (GPT‐4) passed the Japanese National License Examination for Pharmacists in 2022, answering all items including those with diagrams: a descriptive study publication-title: J Educ Eval Health Prof – volume: 9 year: 2023 article-title: Performance of GPT‐3.5 and GPT‐4 on the Japanese medical licensing examination: comparison Study publication-title: JMIR Med Educ – volume: 35 start-page: 446 issue: 6 year: 2008 end-page: 453 article-title: Dental occlusion: a critical reflection on past, present and future concepts publication-title: J Oral Rehabil – volume: 15 year: 2023 article-title: The performance of GPT‐3.5, GPT‐4, and bard on the japanese national dentist examination: a comparison study publication-title: Cureus – volume: 19 start-page: 1595 issue: 3 year: 2024 end-page: 1600 article-title: Evaluating GPT‐4V's performance in the Japanese national dental examination: a challenge explored publication-title: J Dent Sci – year: 2024 article-title: Evaluating the image recognition capabilities of GPT‐4V and Gemini Pro in the Japanese national dental examination publication-title: J Dent Sci – volume: 52 start-page: 130 issue: 2 year: 2024 end-page: 133 article-title: The role and potential contributions of the artificial intelligence language model ChatGPT publication-title: Ann Biomed Eng – volume: 76 start-page: 2345 issue: 11 year: 2023 end-page: 2350 article-title: From text to diagnose: chatGPT's efficacy in medical decision‐making publication-title: Wiad Lek – volume: 19 start-page: 2262 issue: 4 year: 2024 end-page: 2267 article-title: Evaluating the efficacy of leading large language models in the Japanese national dental hygienist examination: a comparative analysis of ChatGPT, Bard, and Bing Chat publication-title: J Dent Sci – ident: e_1_2_8_3_1 doi: 10.1007/s10439-023-03296-w – ident: e_1_2_8_9_1 doi: 10.1016/j.jds.2024.06.015 – ident: e_1_2_8_10_1 doi: 10.1016/j.jds.2024.02.019 – ident: e_1_2_8_4_1 doi: 10.3352/jeehp.2024.21.4 – ident: e_1_2_8_8_1 doi: 10.1016/j.jds.2023.12.007 – ident: e_1_2_8_6_1 – ident: e_1_2_8_7_1 doi: 10.7759/cureus.50369 – ident: e_1_2_8_11_1 – ident: e_1_2_8_5_1 doi: 10.2196/48002 – ident: e_1_2_8_12_1 doi: 10.1111/j.0305-182X.2007.01820.x – ident: e_1_2_8_2_1 doi: 10.36740/WLek202311101 |
SSID | ssj0029442 |
Score | 2.4303641 |
Snippet | Objectives: In this study, we compared the performance of ChatGPT‐3.5 to that of ChatGPT‐4o in the context of the Japanese National Dental Examination, which... In this study, we compared the performance of ChatGPT-3.5 to that of ChatGPT-4o in the context of the Japanese National Dental Examination, which assesses... |
SourceID | proquest pubmed crossref wiley |
SourceType | Aggregation Database Index Database Enrichment Source Publisher |
StartPage | 459 |
SubjectTerms | artificial intelligence Chat Generative Pre‐trained Transformer (ChatGPT) Clinical Competence dental education Education, Dental - methods Educational Measurement - methods Generative Artificial Intelligence Humans Japan Japanese National Dental Examination natural language processing |
Title | Performance of ChatGPT‐3.5 and ChatGPT‐4o in the Japanese National Dental Examination |
URI | https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fjdd.13766 https://www.ncbi.nlm.nih.gov/pubmed/39538434 https://www.proquest.com/docview/3128820330 |
Volume | 89 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
journalDatabaseRights | – providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 1930-7837 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0029442 issn: 0022-0337 databaseCode: KQ8 dateStart: 20010401 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1bb9MwFLbKeIAXxH0bFxnEAxJKSWKniR8R3ZiKGEO00niKfAsqo83UJuLyxE_giR_IL-HYTtyMdtLgJW2dI7f19-X4JD7-DkJPlAilETILUpHRgKowC7IiEwGPIHaWImE6NXuH3xwODiZ0dJwc93q_OllLdSX68vvGfSX_gyq0Aa5ml-w_IOs7hQZ4D_jCERCG44UwPupk_du0Cl69Ohr7_AXST-zawN_ttGyzG0cwVZoSlM8O22eCQ7c9cu8rNzkyHrX18FU5Q90miLTITbSRgDbIvV3yWe0BLRfTE_7FnhjralkvypXzW3BlT-zXs-k3P0-8rz_Cpxl3y0uqPOHdRxRx0slsWW0ZCIlTd-lr52kZMamMTVvjil01oYZytONXaSMb7qZo6gq1rHl_pyb7Sal-BH5zg8K2N0rONbNT-2g4tKcuocsxvJjKGK_f-dWpmFHqVejN32oVq8L4ue_1bJyzdvNy9l7IBjPj6-haAyN-4Sh1A_X0_Ca6YrC3xf9uoQ8dauGywA2Ffv_4CaTCQKpOCy3xdI6BTrilE27phB2dcIdOt9Fkf2_88iBoqnAE0sSqAUuTIhSM8ygkOjUFAiDmEzFE5ooUpmQRk8oouRWFjDSFK57TAS1kkkoVE5kocgdtzcu53kZYcJhNIUYdaKP6L3jGmeQRo6lkTBAd7qCn7ZjlspGoN5VSPudOXDvOYXhzO7w76LE3PXW6LJuMHrUDn4PXNEthMAplvcwJhGUQ-xIC33nXIeK7IQyCAEoo_BoL0fn95y1Pdi9ueg9dXV0i99FWtaj1A4hrK_HQkuwPmT-cuA |
linkProvider | Colorado Alliance of Research Libraries |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Performance+of+ChatGPT%E2%80%903.5+and+ChatGPT%E2%80%904o+in+the+Japanese+National+Dental+Examination&rft.jtitle=Journal+of+dental+education&rft.au=Uehara%2C+Osamu&rft.au=Morikawa%2C+Tetsuro&rft.au=Harada%2C+Fumiya&rft.au=Sugiyama%2C+Nodoka&rft.date=2025-04-01&rft.issn=0022-0337&rft.eissn=1930-7837&rft.volume=89&rft.issue=4&rft.spage=459&rft.epage=466&rft_id=info:doi/10.1002%2Fjdd.13766&rft.externalDBID=10.1002%252Fjdd.13766&rft.externalDocID=JDD13766 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0022-0337&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0022-0337&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0022-0337&client=summon |