Triage: Trustworthy Reporting and Assessment for Clinical Gain and Effectiveness of Ai Models

Science Communicator Platform

Share By

Triage: Trustworthy Reporting and Assessment for Clinical Gain and Effectiveness of Ai Models Publisher

Fazilati F ; Rajabi MZ ; Alihosseini N ; Farsani ME ; Sandid SH ; Zamani S ; Alirezaei Farahani M ; Biriaei F ; Sadeghipour F ; Mirshamsi MT ; Fahami M ; Marateb HR

Source: Diagnostics Published:2026

Abstract

Machine learning (ML), including deep learning, kernel-based classifiers, and ensemble methods, is increasingly used to support clinical diagnosis in medical imaging, biosignal interpretation, and electronic health record (EHR)-based decision support. Despite rapid progress, many diagnostic AI studies still rely on limited retrospective evaluation and single summary measures (e.g., accuracy or AUC), creating a gap between reported model performance and evidence required for safe clinical adoption. This review proposes TRIAGE, a clinically grounded evaluation framework designed to organize diagnostic AI testing as an evidence pipeline aligned with real clinical use cases (screening, triage, second reading, and confirmatory testing). We summarize core discrimination metrics derived from the confusion matrix (sensitivity, specificity, predictive values, likelihood ratios, diagnostic odds ratio, and F-scores) and highlight the importance of prevalence and spectrum effects for interpreting predictive value and clinical workload. We further review evaluation strategies for multi-class and multi-label diagnostic tasks using appropriate aggregation methods (micro, macro, and weighted averaging) and set-based measures such as Hamming loss, exact match ratio, and Jaccard/IoU. Because diagnostic deployment is threshold-dependent, we integrate representation curves (ROC, precision–recall, lift, and cumulative gain) with calibration assessment and clinical utility analysis, including calibration slope, Brier score, and decision-curve analysis. We also address robustness and fairness evaluation, leakage-resistant validation designs (patient-grouped splits, stratified and temporal validation, and external validation), computational constraints relevant to deployment (latency, throughput, and energy use), and statistically sound model comparison with multiplicity control. A structured TRIAGE checklist table summarizing the evaluation parameters described in this review is provided in the main text to support reproducible and clinically interpretable reporting. © 2026 by the authors.

Related Docs

View other Related Docs

1. Artificial Intelligence and Acute Appendicitis: A Systematic Review of Diagnostic and Prognostic Models, World Journal of Emergency Surgery (2023)

2. Artificial Intelligence for Tumor [18F]Fdg-Pet Imaging: Advancement and Future Trends—Part I, Seminars in Nuclear Medicine (2025)

3. Revolutionizing Personalized Medicine Using Artificial Intelligence: A Meta-Analysis of Predictive Diagnostics and Their Impacts on Drug Development, Clinical and Experimental Medicine (2025)

Experts (# of related papers)

View all Related Experts

Hanieh Radkhah (1)

Ozra Tabatabaei-Malazy (1)

Other Related Docs

4. Lung Cancer Management: Revolutionizing Patient Outcomes Through Machine Learning and Artificial Intelligence, Cancer Reports (2025)

5. Artificial Intelligence–Based Radiomic Model in Craniopharyngiomas: A Systematic Review and Meta-Analysis on Diagnosis, Segmentation, and Classification, World Neurosurgery (2025)

6. Applications of Deep Learning in Intracranial Aneurysm Imaging: A Scoping Review of Detection, Risk Prediction, and Emerging Prognostic Models, Current Journal of Neurology (2025)

7. Evaluating the Diagnostic Accuracy of Artificial Intelligence in Spondylolisthesis Detection: A Systematic Review and Meta-Analysis, Academic Radiology (2026)

8. Diagnostic Performance of Artificial Intelligence in Detection of Hepatocellular Carcinoma: A Meta-Analysis, Journal of Imaging Informatics in Medicine (2024)

9. The Diagnostic Accuracy of Artificial Intelligence-Assisted Ct Imaging in Covid-19 Disease: A Systematic Review and Meta-Analysis, Informatics in Medicine Unlocked (2021)

10. Can Machine Learning Be a Reliable Tool for Predicting Hematoma Progression Following Traumatic Brain Injury? a Systematic Review and Meta-Analysis, Neuroradiology (2025)

11. Diagnostic Accuracy of Ct-Based Radiomics and Deep Learning for Predicting Lymph Node Metastasis in Esophageal Cancer, Clinical Imaging (2024)

12. Machine Learning and Deep Learning Algorithms in Stroke Medicine: A Systematic Review of Hemorrhagic Transformation Prediction Models, Journal of Neurology (2025)

13. Evaluation of Machine Learning Methods for Prediction of Heart Failure Mortality and Readmission: Meta-Analysis, BMC Cardiovascular Disorders (2025)

14. Prediction of Recurrence After Surgery for Pituitary Adenoma Using Machine Learning- Based Models: Systematic Review and Meta-Analysis, BMC Endocrine Disorders (2025)

15. Myo-Guide: A Machine Learning-Based Web Application for Neuromuscular Disease Diagnosis With Mri, Journal of Cachexia, Sarcopenia and Muscle (2025)

16. Risk Scores for Prediction of Paroxysmal Atrial Fibrillation After Acute Ischemic Stroke or Transient Ischemic Attack: A Systematic Review and Meta-Analysis, International Journal of Cardiology: Cardiovascular Risk and Prevention (2024)

17. Advancing Diabetic Retinopathy Screening: A Systematic Review of Artificial Intelligence and Optical Coherence Tomography Angiography Innovations, Diagnostics (2025)

18. Machine Learning and Microbiome Analysis for Early Detection of Pancreatic Cancer, Gastroenterology and Hepatology from Bed to Bench (2025)

19. Can Artificial Intelligence Accurately Predict the Risk of Hematoma Expansion in Intracerebral Hemorrhage? a Systematic Review and Meta-Analysis of 7,665 Patients, Neurosurgical Review (2026)

20. Can We Rely on Machine Learning Algorithms As a Trustworthy Predictor for Recurrence in High-Grade Glioma? a Systematic Review and Meta-Analysis, Clinical Neurology and Neurosurgery (2025)

Style	Citing Format
MLA	Fazilati F, et al.. "Triage: Trustworthy Reporting and Assessment for Clinical Gain and Effectiveness of Ai Models." Diagnostics, vol. 16, no. 5, 2026, pp. -.
APA	Fazilati F, Rajabi MZ, Alihosseini N, Farsani ME, Sandid SH, Zamani S, Alirezaei Farahani M, Biriaei F, Sadeghipour F, Mirshamsi MT, Fahami M, Marateb HR (2026). Triage: Trustworthy Reporting and Assessment for Clinical Gain and Effectiveness of Ai Models. Diagnostics, 16(5), -.
Chicago	Fazilati F, Rajabi MZ, Alihosseini N, Farsani ME, Sandid SH, Zamani S, Alirezaei Farahani M, et al.. "Triage: Trustworthy Reporting and Assessment for Clinical Gain and Effectiveness of Ai Models." Diagnostics 16, no. 5 (2026): -.
Harvard	Fazilati F et al. (2026) 'Triage: Trustworthy Reporting and Assessment for Clinical Gain and Effectiveness of Ai Models', Diagnostics, 16(5), pp. -.
Vancouver	Fazilati F, Rajabi MZ, Alihosseini N, Farsani ME, Sandid SH, Zamani S, et al.. Triage: Trustworthy Reporting and Assessment for Clinical Gain and Effectiveness of Ai Models. Diagnostics. 2026;16(5):-.
BibTex	@article{ author = {Fazilati F and Rajabi MZ and Alihosseini N and Farsani ME and Sandid SH and Zamani S and Alirezaei Farahani M and Biriaei F and Sadeghipour F and Mirshamsi MT and Fahami M and Marateb HR}, title = {Triage: Trustworthy Reporting and Assessment for Clinical Gain and Effectiveness of Ai Models}, journal = {Diagnostics}, volume = {16}, number = {5}, pages = {-}, year = {2026} }
RIS	TY - JOUR AU - Fazilati F AU - Rajabi MZ AU - Alihosseini N AU - Farsani ME AU - Sandid SH AU - Zamani S AU - Alirezaei Farahani M AU - Biriaei F AU - Sadeghipour F AU - Mirshamsi MT AU - Fahami M AU - Marateb HR TI - Triage: Trustworthy Reporting and Assessment for Clinical Gain and Effectiveness of Ai Models JO - Diagnostics VL - 16 IS - 5 SP - EP - PY - 2026 ER -

Science Communicator Platform

Authors

Abstract