Tehran University of Medical Sciences

Science Communicator Platform

Towards Trustworthy Artificial Intelligence in Musculoskeletal Medicine: A Narrative Review on Uncertainty Quantification



Authors: Amir M. Vahdani; M. Moein Shariatnia; Pranav Rajpurkar; Ayoosh Pareek

Source: Knee Surgery, Sports Traumatology, Arthroscopy. Published: 2025


Abstract

Introduction: Deep learning (DL) models have achieved remarkable performance in musculoskeletal (MSK) medical imaging research, yet their clinical integration remains hindered by their black-box nature and the absence of reliable confidence measures. Uncertainty quantification (UQ) seeks to bridge this gap by providing each DL prediction with a calibrated estimate of uncertainty, thereby fostering clinician trust and safer deployment.

Methods: We conducted a targeted narrative review, performing expert-driven searches in PubMed, Scopus, and arXiv and mining the references of relevant publications on UQ in MSK imaging. A thematic synthesis was then used to derive a cohesive taxonomy of UQ methodologies.

Results: UQ approaches encompass multi-pass methods (e.g., test-time augmentation, Monte Carlo dropout, and model ensembling) that infer uncertainty from variability across repeated inferences; single-pass methods (e.g., conformal prediction and evidential deep learning) that augment each individual prediction with uncertainty metrics; and other techniques that leverage auxiliary information, such as inter-rater variability, hidden-layer activations, or generative reconstruction errors, to estimate confidence. Applications in MSK imaging include highlighting uncertain areas in cartilage segmentation and identifying uncertain predictions in joint implant design detection; downstream applications include enhanced clinical utility and more efficient data annotation pipelines.

Conclusion: Embedding UQ into DL workflows is essential for translating high-performance models into clinical practice. Future research should prioritize robust out-of-distribution handling, computational efficiency, and standardized evaluation metrics to accelerate the adoption of trustworthy AI in MSK medicine.

Level of Evidence: Not applicable.

© 2025 Elsevier B.V., All rights reserved.
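To make the two main families named in the Results more concrete, the following is a minimal illustrative sketch and not code from the review itself. It assumes a classifier that outputs class probabilities. The function names (`predictive_entropy`, `conformal_threshold`, `prediction_set`) and the choice of `1 - p(true class)` as the nonconformity score are assumptions made for illustration. The first function summarizes disagreement across repeated passes (as Monte Carlo dropout, ensembling, or test-time augmentation would produce); the other two implement a basic split conformal procedure on a held-out calibration set.

```python
import math


def predictive_entropy(pass_probs):
    """Multi-pass uncertainty (illustrative).

    pass_probs: list of probability vectors, one per stochastic pass
    (e.g. dropout samples, ensemble members, or augmented inputs).
    Returns the mean distribution and its entropy in nats; higher entropy
    means the passes are, on average, less decisive.
    """
    n_passes = len(pass_probs)
    n_classes = len(pass_probs[0])
    mean = [sum(p[c] for p in pass_probs) / n_passes for c in range(n_classes)]
    entropy = -sum(q * math.log(q) for q in mean if q > 0)
    return mean, entropy


def conformal_threshold(cal_probs, cal_labels, alpha):
    """Split conformal calibration (illustrative).

    Nonconformity score (an assumption here): 1 - probability of the true class.
    Returns the score threshold targeting roughly (1 - alpha) coverage.
    """
    scores = sorted(1.0 - p[y] for p, y in zip(cal_probs, cal_labels))
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))  # rank of the finite-sample quantile
    if k > n:
        return 1.0  # too few calibration points: every class is admitted
    return scores[k - 1]


def prediction_set(probs, threshold):
    """Single-pass output: all classes whose score is within the threshold."""
    return [c for c, p in enumerate(probs) if 1.0 - p <= threshold]


if __name__ == "__main__":
    # Two passes that disagree yield higher entropy than two that agree.
    _, h_agree = predictive_entropy([[0.9, 0.1], [0.9, 0.1]])
    _, h_disagree = predictive_entropy([[0.9, 0.1], [0.1, 0.9]])
    print(h_agree < h_disagree)

    # Hypothetical calibration data: true class is always index 0.
    p_true = [0.9, 0.8, 0.7, 0.6, 0.95, 0.85, 0.75, 0.65, 0.55]
    cal_probs = [[p, 1.0 - p] for p in p_true]
    thr = conformal_threshold(cal_probs, [0] * len(p_true), alpha=0.2)
    print(prediction_set([0.7, 0.3], thr))
```

In a workflow like the ones the review describes, a large entropy or a prediction set containing more than one class could be used to flag a case for clinician review; the exact thresholds and scores would depend on the model and task.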