Performance of Ai Chatbots on Controversial Topics in Oral Medicine, Pathology, and Radiology

Performance of Ai Chatbots on Controversial Topics in Oral Medicine, Pathology, and Radiology Publisher Pubmed

Mohammadrahimi H^{1, 2} ; Khoury ZH³ ; Alamdari MI⁴ ; Rokhshad R² ; Motie P⁵ ; Parsa A⁶ ; Tavares T⁷ ; Sciubba JJ⁸ ; Price JB^{1, 6} ; Sultan AS^{1, 6, 9}

Source: Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology Published:2024

Abstract

Objectives: In this study, we assessed 6 different artificial intelligence (AI) chatbots (Bing, GPT-3.5, GPT-4, Google Bard, Claude, Sage) responses to controversial and difficult questions in oral pathology, oral medicine, and oral radiology. Study Design: The chatbots’ answers were evaluated by board-certified specialists using a modified version of the global quality score on a 5-point Likert scale. The quality and validity of chatbot citations were evaluated. Results: Claude had the highest mean score of 4.341 ± 0.582 for oral pathology and medicine. Bing had the lowest scores of 3.447 ± 0.566. In oral radiology, GPT-4 had the highest mean score of 3.621 ± 1.009 and Bing the lowest score of 2.379 ± 0.978. GPT-4 achieved the highest mean score of 4.066 ± 0.825 for performance across all disciplines. 82 out of 349 (23.50%) of generated citations from chatbots were fake. Conclusions: The most superior chatbot in providing high-quality information for controversial topics in various dental disciplines was GPT-4. Although the majority of chatbots performed well, it is suggested that developers of AI medical chatbots incorporate scientific citation authenticators to validate the outputted citations given the relatively high number of fabricated citations. © 2024 Elsevier Inc.

Related Docs

View other Related Docs

1. Efficacy and Empathy of Ai Chatbots in Answering Frequently Asked Questions on Oral Oncology, Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology (2025)

2. Efficacy and Empathy of Ai Chatbots in Answering Frequently Asked Questions on Oral Oncology, Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology (2025)

3. Examining the Diagnostic Accuracy of Artificial Intelligence for Detecting Dental Caries Across a Range of Imaging Modalities: An Umbrella Review With Meta-Analysis, PLOS ONE (2025)

Experts (# of related papers)

Maryam Yazdi (1)

Parisa Soltani (1)

Style	Citing Format
MLA	Mohammadrahimi H, et al.. "Performance of Ai Chatbots on Controversial Topics in Oral Medicine, Pathology, and Radiology." Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology, vol. 137, no. 5, 2024, pp. 508-514.
APA	Mohammadrahimi H, Khoury ZH, Alamdari MI, Rokhshad R, Motie P, Parsa A, Tavares T, Sciubba JJ, Price JB, Sultan AS (2024). Performance of Ai Chatbots on Controversial Topics in Oral Medicine, Pathology, and Radiology. Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology, 137(5), 508-514.
Chicago	Mohammadrahimi H, Khoury ZH, Alamdari MI, Rokhshad R, Motie P, Parsa A, Tavares T, Sciubba JJ, Price JB, Sultan AS. "Performance of Ai Chatbots on Controversial Topics in Oral Medicine, Pathology, and Radiology." Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology 137, no. 5 (2024): 508-514.
Harvard	Mohammadrahimi H et al. (2024) 'Performance of Ai Chatbots on Controversial Topics in Oral Medicine, Pathology, and Radiology', Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology, 137(5), pp. 508-514.
Vancouver	Mohammadrahimi H, Khoury ZH, Alamdari MI, Rokhshad R, Motie P, Parsa A, et al.. Performance of Ai Chatbots on Controversial Topics in Oral Medicine, Pathology, and Radiology. Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology. 2024;137(5):508-514.
BibTex	@article{ author = {Mohammadrahimi H and Khoury ZH and Alamdari MI and Rokhshad R and Motie P and Parsa A and Tavares T and Sciubba JJ and Price JB and Sultan AS}, title = {Performance of Ai Chatbots on Controversial Topics in Oral Medicine, Pathology, and Radiology}, journal = {Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology}, volume = {137}, number = {5}, pages = {508-514}, year = {2024} }
RIS	TY - JOUR AU - Mohammadrahimi H AU - Khoury ZH AU - Alamdari MI AU - Rokhshad R AU - Motie P AU - Parsa A AU - Tavares T AU - Sciubba JJ AU - Price JB AU - Sultan AS TI - Performance of Ai Chatbots on Controversial Topics in Oral Medicine, Pathology, and Radiology JO - Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology VL - 137 IS - 5 SP - 508 EP - 514 PY - 2024 ER -

Science Communicator Platform

Authors

Abstract