Tehran University of Medical Sciences

Science Communicator Platform

Share By
Accuracy of Large Language Models in Answering Dental Examination Questions: A Systematic Review and Meta-Analysis Publisher



Dashti M ; Khosraviani F ; Meyari A ; Amirzadeiranaq MH ; Chaurasia A ; Hefzi D ; Ghadimi N ; Tichy A ; Khurshid Z ; Schwendicke F
Authors

Source: International Dental Journal Published:2026


Abstract

Introduction Large language models (LLMs), including OpenAI’s GPT family accessed via interfaces such as ChatGPT and Microsoft Copilot, as well as non-GPT systems such as Google Gemini, are increasingly applied in healthcare and dental education. However, the accuracy of these systems in specialized tasks such as answering dental examination questions remains unclear. Methods This systematic review and meta-analysis evaluated LLM performance in answering dental questions. Databases searched were PubMed, Embase, Scopus, and Web of Science. Data on question type and number, LLM versions, and accuracy rates were extracted. Pooled accuracy was estimated using a random-effects model; heterogeneity and publication bias were assessed. Results A total of 39 studies were included, with ChatGPT-4 being the most frequently evaluated model. The pooled accuracy for LLMs was 63.7% (95% CI: 60.3%-67.1%), with high heterogeneity (I ² = 91.5%). Subgroup analysis revealed ChatGPT-4 and Copilot (a GPT-based interface) achieved the highest pooled accuracies (∼73% and ∼75%, respectively). Direct comparisons confirmed ChatGPT-4 significantly outperformed earlier versions and some competitor models. Sensitivity analyses supported the robustness of findings. Conclusion LLMs demonstrate moderate accuracy in answering dental examination questions and are currently insufficient for autonomous clinical decision-making. When their limitations are explicitly recognized, however, these systems may serve as valuable adjuncts in dental education and examination preparation. Methodological strategies such as structured prompting and retrieval-augmented approaches warrant further investigation but were not the primary focus of the present analysis. © 2026 The Authors.
Other Related Docs
12. Cryotherapy and Post-Treatment Endodontic Pain, Journal of Craniomaxillofacial Research (2024)