Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification

Science Communicator Platform

Stay connected! Follow us on X network (Twitter):

Share By

Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification Publisher Pubmed

Am Vahdani Amir M ; Mm Shariatnia M MOEIN ; P Rajpurkar PRANAV ; A Pareek AYOOSH

Source: Knee Surgery, Sports Traumatology, Arthroscopy Published:2025

Abstract

Introduction: Deep learning (DL) models have achieved remarkable performance in musculoskeletal (MSK) medical imaging research, yet their clinical integration remains hindered by their black-box nature and the absence of reliable confidence measures. Uncertainty quantification (UQ) seeks to bridge this gap by providing each DL prediction with a calibrated estimate of uncertainty, thereby fostering clinician trust and safer deployment. Methods: We conducted a targeted narrative review, performing expert-driven searches in PubMed, Scopus, and arXiv and mining references from relevant publications in MSK imaging utilizing UQ, and a thematic synthesis was used to derive a cohesive taxonomy of UQ methodologies. Results: UQ approaches encompass multi-pass methods (e.g., test-time augmentation, Monte Carlo dropout, and model ensembling) that infer uncertainty from variability across repeated inferences; single-pass methods (e.g., conformal prediction, and evidential deep learning) that augment each individual prediction with uncertainty metrics; and other techniques that leverage auxiliary information, such as inter-rater variability, hidden-layer activations, or generative reconstruction errors, to estimate confidence. Applications in MSK imaging, include highlighting uncertain areas in cartilage segmentation and identifying uncertain predictions in joint implant design detections; downstream applications include enhanced clinical utility and more efficient data annotation pipelines. Conclusion: Embedding UQ into DL workflows is essential for translating high-performance models into clinical practice. Future research should prioritize robust out-of-distribution handling, computational efficiency, and standardized evaluation metrics to accelerate the adoption of trustworthy AI in MSK medicine. Level of Evidence: Not applicable. © 2025 Elsevier B.V., All rights reserved.

Related Docs

View other Related Docs

1. Cultivating Diagnostic Clarity: The Importance of Reporting Artificial Intelligence Confidence Levels in Radiologic Diagnoses, Clinical Imaging (2025)

2. Artificial Intelligence for Tumor [18F]Fdg-Pet Imaging: Advancement and Future Trends—Part I, Seminars in Nuclear Medicine (2025)

3. Revolutionizing Personalized Medicine Using Artificial Intelligence: A Meta-Analysis of Predictive Diagnostics and Their Impacts on Drug Development, Clinical and Experimental Medicine (2025)

Experts (# of related papers)

View all Related Experts

Kolahi Shahriar (2)

Amir Reza Radmard (1)

Other Related Docs

4. Implementation of Artificial Intelligence in Detection, Classification, and Prognostication of Osteosarcoma Utilizing Different Assessment Techniques: A Systematic Review, Intelligence-Based Medicine (2025)

5. Methodological Insights Into Chatgpt’S Screening Performance in Systematic Reviews, BMC Medical Research Methodology (2024)

6. Diagnostic Accuracy of Deep Learning Models in Predicting Glioma Molecular Markers: A Systematic Review and Meta-Analysis, Diagnostics (2025)

7. Explainable Artificial Intelligence for Pneumonia Classification: Clinical Insights Into Deformable Prototypical Part Network in Pediatric Chest X-Ray Images, Journal of Medical Imaging and Radiation Sciences (2025)

8. Ultrasound-Based Machine Learning Models for Predicting Response to Neoadjuvant Chemotherapy in Breast Cancer: A Meta-Analysis, Clinical Imaging (2025)

9. Concurrent Learning Approach for Estimation of Pelvic Tilt From Anterior–Posterior Radiograph, Bioengineering (2024)

10. Lung Cancer Management: Revolutionizing Patient Outcomes Through Machine Learning and Artificial Intelligence, Cancer Reports (2025)

11. Predicting the Need for Cardiovascular Surgery: A Comparative Study of Machine Learning Models, Journal of Electronics# Electromedical Engineering# and Medical Informatics (2024)

12. Oncology in the Modern Era: Artificial Intelligence Is Reshaping Cancer Diagnosis, Prognosis and Treatment, Iranian Journal of Blood and Cancer (2023)

13. Exploring a Decade of Deep Learning in Dentistry: A Comprehensive Mapping Review, Clinical Oral Investigations (2025)

14. Diagnostic Performance of Deep Learning Models Versus Radiologists in Covid-19 Pneumonia: A Systematic Review and Meta-Analysis, Clinical Imaging (2024)

15. Differential Privacy Preserved Federated Learning for Prognostic Modeling in Covid-19 Patients Using Large Multi-Institutional Chest Ct Dataset, Medical Physics (2024)

16. Computer-Aided Detection (Cade) and Segmentation Methods for Breast Cancer Using Magnetic Resonance Imaging (Mri), Journal of Magnetic Resonance Imaging (2025)

17. Deep Conformal Supervision: Leveraging Intermediate Features for Robust Uncertainty Quantification, Journal of Imaging Informatics in Medicine (2025)

18. Diagnostic Performance of Neural Network Algorithms in Skull Fracture Detection on Ct Scans: A Systematic Review and Meta-Analysis, Emergency Radiology (2025)

19. Diagnostic Performance of Artificial Intelligence in Multiple Sclerosis: A Systematic Review and Meta-Analysis, Neurological Sciences (2023)

20. Prediction of In-Hospital Adverse Clinical Outcomes in Patients With Pulmonary Thromboembolism, Machine Learning Based Models, Frontiers in Cardiovascular Medicine (2023)

Style	Citing Format
MLA	Am Vahdani Amir M, et al.. "Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification." Knee Surgery, Sports Traumatology, Arthroscopy, vol. 33, no. 9, 2025, pp. 3418-3437.
APA	Am Vahdani Amir M, Mm Shariatnia M MOEIN, P Rajpurkar PRANAV, A Pareek AYOOSH (2025). Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification. Knee Surgery, Sports Traumatology, Arthroscopy, 33(9), 3418-3437.
Chicago	Am Vahdani Amir M, Mm Shariatnia M MOEIN, P Rajpurkar PRANAV, A Pareek AYOOSH. "Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification." Knee Surgery, Sports Traumatology, Arthroscopy 33, no. 9 (2025): 3418-3437.
Harvard	Am Vahdani Amir M et al. (2025) 'Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification', Knee Surgery, Sports Traumatology, Arthroscopy, 33(9), pp. 3418-3437.
Vancouver	Am Vahdani Amir M, Mm Shariatnia M MOEIN, P Rajpurkar PRANAV, A Pareek AYOOSH. Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification. Knee Surgery, Sports Traumatology, Arthroscopy. 2025;33(9):3418-3437.
BibTex	@article{ author = {Am Vahdani Amir M and Mm Shariatnia M MOEIN and P Rajpurkar PRANAV and A Pareek AYOOSH}, title = {Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification}, journal = {Knee Surgery, Sports Traumatology, Arthroscopy}, volume = {33}, number = {9}, pages = {3418-3437}, year = {2025} }
RIS	TY - JOUR AU - Am Vahdani Amir M AU - Mm Shariatnia M MOEIN AU - P Rajpurkar PRANAV AU - A Pareek AYOOSH TI - Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification JO - Knee Surgery, Sports Traumatology, Arthroscopy VL - 33 IS - 9 SP - 3418 EP - 3437 PY - 2025 ER -

Science Communicator Platform

Authors

Abstract