Tehran University of Medical Sciences

Science Communicator Platform

Performance of ChatGPT 3.5 and 4 on U.S. Dental Examinations: The INBDE, ADAT, and DAT



Mahmood Dashti; Shohreh Ghasemi; Niloofar Ghadimi; Delband Hefzi; Azizeh Karimian; Niusha Zare; Amir Fahimipour; Zohaib Khurshid; Maryam Mohammadalizadeh Chafjiri; Sahar Ghaedsharaf
Authors

Source: Imaging Science in Dentistry, Published: 2024


Abstract

Purpose: Recent advancements in artificial intelligence (AI), particularly tools such as ChatGPT developed by OpenAI, a U.S.-based AI research organization, have transformed the healthcare and education sectors. This study investigated the effectiveness of ChatGPT in answering dentistry exam questions, demonstrating its potential to enhance professional practice and patient care.

Materials and Methods: This study assessed the performance of ChatGPT 3.5 and 4 on U.S. dental exams - specifically, the Integrated National Board Dental Examination (INBDE), Dental Admission Test (DAT), and Advanced Dental Admission Test (ADAT) - excluding image-based questions. Using customized prompts, ChatGPT's answers were evaluated against official answer sheets.

Results: ChatGPT 3.5 and 4 were tested with 253 questions from the INBDE, ADAT, and DAT exams. For the INBDE, both versions achieved 80% accuracy in knowledge-based questions and 66-69% in case history questions. In ADAT, they scored 66-83% in knowledge-based and 76% in case history questions. ChatGPT 4 excelled on the DAT, with 94% accuracy in knowledge-based questions, 57% in mathematical analysis items, and 100% in comprehension questions, surpassing ChatGPT 3.5's rates of 83%, 31%, and 82%, respectively. The difference was significant for knowledge-based questions (P = 0.009). Both versions showed similar patterns in incorrect responses.

Conclusion: Both ChatGPT 3.5 and 4 effectively handled knowledge-based, case history, and comprehension questions, with ChatGPT 4 being more reliable and surpassing the performance of 3.5. ChatGPT 4's perfect score in comprehension questions underscores its trainability in specific subjects. However, both versions exhibited weaker performance in mathematical analysis, suggesting this as an area for improvement. © 2024 Elsevier B.V. All rights reserved.
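The scoring step described in Materials and Methods - comparing the model's answers against an official answer key and reporting accuracy per question category - can be sketched as follows. This is a minimal illustration, not the study's actual pipeline; the question data, category names, and function name are hypothetical.

```python
# Hypothetical sketch: score model answers against an answer key and
# compute accuracy per question category (e.g., knowledge-based,
# case history, comprehension). All data below are illustrative.
from collections import defaultdict

def accuracy_by_category(questions):
    """questions: list of dicts with 'category', 'model_answer', 'key'.

    Returns a dict mapping each category to its fraction of correct answers.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for q in questions:
        total[q["category"]] += 1
        if q["model_answer"] == q["key"]:
            correct[q["category"]] += 1
    return {cat: correct[cat] / total[cat] for cat in total}

# Illustrative sample, not real exam items:
sample = [
    {"category": "knowledge", "model_answer": "B", "key": "B"},
    {"category": "knowledge", "model_answer": "C", "key": "A"},
    {"category": "case_history", "model_answer": "D", "key": "D"},
]
print(accuracy_by_category(sample))
# → {'knowledge': 0.5, 'case_history': 1.0}
```

Grouping tallies by category before dividing mirrors how the abstract reports separate accuracy figures for knowledge-based, case history, and comprehension questions on each exam.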