Tehran University of Medical Sciences

Science Communicator Platform

Investigating the Performance of Machine Learning Algorithms in Predicting the Survival of Covid-19 Patients: A Cross Section Study of Iran



Yazdani A (1, 2, 3); Bigdeli SK (4); Zahmatkeshan M (5, 6)
Author Affiliations
  1. Department of Health Information Management, School of Health Management and Information Sciences, Shiraz University of Medical Sciences, Shiraz, Iran
  2. Clinical Education Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
  3. Health Human Resources Research Center, School of Health Management and Information Sciences, Shiraz University of Medical Sciences, Shiraz, Iran
  4. Health Information Management Department, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
  5. Noncommunicable Diseases Research Center, Fasa University of Medical Sciences, Fasa, Iran
  6. School of Allied Medical Sciences, Fasa University of Medical Sciences, Fasa, Iran

Source: Health Science Reports. Published: 2023


Abstract

Background and Aims: Like early diagnosis, predicting the survival of patients with Coronavirus Disease 2019 (COVID-19) is of great importance. Survival prediction models help doctors take extra care in treating patients who are at high risk of dying because of their medical conditions. This study aims to predict the survival of hospitalized patients with COVID-19 by comparing the accuracy of machine learning (ML) models.

Methods: This cross-sectional study was performed in 2022 in Fasa, Iran. The research data set covers the period from February 18, 2020 to February 10, 2021, and contains the records of 2442 hospitalized patients with 84 features. The efficiency of five ML algorithms in predicting survival was compared: Naive Bayes (NB), K-nearest neighbors (KNN), random forest (RF), decision tree (DT), and multilayer perceptron (MLP). Modeling was done in Python in the Anaconda Navigator 3 environment.

Results: The NB algorithm performed better than the others, with accuracy, precision, recall, F-score, and area under the receiver operating characteristic curve of 97%, 96%, 96%, 96%, and 97%, respectively. Based on the analysis of factors affecting survival, heart disease, pulmonary diseases, and blood-related diseases were the most important diseases related to death.

Conclusion: Developing software systems based on NB will be effective for predicting the survival of COVID-19 patients.

© 2023 The Authors. Health Science Reports published by Wiley Periodicals LLC.
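
The five evaluation measures named in the abstract (accuracy, precision, recall, F-score, and area under the ROC curve) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the 0.5 decision threshold, and the eight toy labels and scores below are assumptions made up for illustration, and the numbers bear no relation to the study's data set. Label 1 stands for whichever outcome is treated as the positive class.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for binary labels, where 1 is the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, precision, recall, and F-score (F1) from hard predictions."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    total = tp + fp + fn + tn
    if total == 0:
        raise ValueError("no samples given")
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f_score = 2 * precision * recall / denom if denom else 0.0
    return {
        "accuracy": (tp + tn) / total,
        "precision": precision,
        "recall": recall,
        "f_score": f_score,
    }


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs in which the positive
    sample receives the higher score; tied scores count as half a win."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example (made-up values): true labels and model scores for eight patients.
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
# Assumed decision threshold of 0.5 to turn scores into hard predictions.
toy_preds = [1 if s >= 0.5 else 0 for s in toy_scores]

toy_metrics = classification_metrics(toy_labels, toy_preds)
toy_auc = roc_auc(toy_labels, toy_scores)
print(toy_metrics)
print("auc:", toy_auc)
```

Accuracy, precision, recall, and F-score depend on the chosen threshold, whereas the area under the ROC curve is computed from the ranking of the scores alone, which is why the two groups of measures are computed from different inputs here.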