Tehran University of Medical Sciences

Science Communicator Platform

Stay connected! Follow us on X network (Twitter):
Share this content! On (X network) By
Machine Learning-Based Clinical Decision Support System for Automatic Diagnosis of Covid-19 Based on Clinical Data



Afrash MR1 ; Erfannia L2, 3 ; Amraei M4 ; Mehrabi N5 ; Jelvay S6 ; Nopour R7 ; Shanbehzadeh M8
Authors
Show Affiliations
Authors Affiliations
  1. 1. School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
  2. 2. Department of Health Information Technology, Faculty of Paramedical, Zaheda University of Medical Sciences, Zahedan, Iran
  3. 3. Clinical Education Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
  4. 4. Department of Health Information Technology, School of Allied Medical Sciences, Lorestan University of Medical Sciences, Khorramabad, Iran
  5. 5. Department of Health Information Technology, Aja University of Medical Sciences, Tehran, Iran
  6. 6. Instructor of Health Information Technology, Abadan University of Medical Sciences, Abadan, Iran
  7. 7. Department of Health Information Technology and Management, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
  8. 8. Department of Health Information Technology, School of Paramedical, Ilam University of Medical Sciences, Ilam, Iran

Source: Journal of Biostatistics and Epidemiology Published:2022

Abstract

Introduction: Needless to say that correct and real-time detection and effective prognosis of the COVID-19 are necessary to deliver the best possible care for patients and, accordingly, diminish the pressure on the healthcare industries. Hence our paper aims to present an intelligent algorithm for selecting the best features from the dataset and developing Machine Learning(ML) based models to predict the COVID-19 and finally opted for the best-performing algorithm. Methods: In this developmental study, the clinical data of 1703 COVID-19 and non-COVID-19 patients Using a single-center registry from February 9, 2020, to December 20, 2020, were used. The Minimum Redundancy Maximum Relevance (mRMR) feature selection algorithm identified the most relevant variables. Then, chosen features feed into the several data mining methods, including K-Nearest Neighbors, AdaBoost Classifier, Decision Tree, HistGradient Boosting Classifier, and Support Vector Machine. A 10-fold cross-validation method and six performance evaluation metrics were used to evaluate and compare these implemented algorithms, and finally, the best model was implemented. Results: Out of the 34 included features, 11 variables were selected as the essential features. The results of using ML algorithms indicated that the best performance belongs to the AdaBoost classifier with mean accuracy = 92.9%, mean specificity = 89.3%, mean sensitivity = 94.2%, mean F-measure = 91.6 %, mean KAPA = 94.3% and mean ROC = 92.1 %. Conclusion: The empirical results reveal that the Adaboost model yielded higher performance than other classification models and developed our Clinical Decision Support Systems (CDSS) interface to discriminate positive COVID-19 from negative cases. © 2022 Tehran University of Medical Sciences. Published by Tehran University of Medical Sciences. This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International license (https://creativecommons.org/licenses/by-nc/4.0/). Noncommercial uses of the work are permitted, provided the original work is properly cited.