Isfahan University of Medical Sciences

Science Communicator Platform

Predicting High Sensitivity C-Reactive Protein Levels and Their Associations in a Large Population Using Decision Tree and Linear Regression



Ghiasi Hafezi S [1, 2]; Sahranavard T [3]; Kooshki A [3]; Hosseini M [4]; Mansoori A [1, 6, 7]; Fakhrian EA [3]; Rezaeifard H [3]; Ghamsary M [5]; Esmaily H [1, 6]; Ghayourmobarhan M [7]
Author Affiliations
  1. Department of Biostatistics, School of Health, Mashhad University of Medical Sciences, Mashhad, Iran
  2. Department of Applied Mathematics, School of Mathematical Sciences, Ferdowsi University of Mashhad, Mashhad, Iran
  3. Student Research Committee, Faculty of Pharmacy, Mashhad University of Medical Sciences, Mashhad, Iran
  4. Department of Biostatistics, College of Health, Isfahan University of Medical Sciences, Isfahan, Iran
  5. School of Public Health, Loma Linda University, Loma Linda, CA, United States
  6. Social Determinants of Health Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
  7. Metabolic Syndrome Research Center, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran

Source: Scientific Reports. Published: 2024


Abstract

High-sensitivity C-reactive protein (hs-CRP) is a biomarker of inflammation that predicts the incidence of various pathologies. In this study, we aimed to evaluate the association of hematological and demographic factors with hs-CRP levels using decision tree (DT) and linear regression (LR) modeling. The study was conducted on a population of 9,704 males and females aged 35 to 65 years recruited from the Mashhad Stroke and Heart Atherosclerotic Disorder (MASHAD) cohort study. We used a data mining approach based on the DT methodology to construct a predictive model of hs-CRP levels from biochemical factors and clinical features. Of the 9,704 individuals included in the analysis, 57% were female. Except for fasting blood glucose (FBG), hypertension (HTN), and type 2 diabetes mellitus (T2DM), all variables differed significantly between the two groups. In the LR models, anxiety score, depression score, systolic blood pressure, cardiovascular disease, and HTN were significant predictors of hs-CRP levels. In the DT models, depression score, FBG, cholesterol, and anxiety score were identified as the most important factors in predicting hs-CRP levels. The DT model predicted hs-CRP level with an accuracy of 72.1% in training and 71.4% in testing for both genders. The proposed DT model appears able to predict hs-CRP levels based on anxiety score, depression score, fasting blood glucose, systolic blood pressure, and history of cardiovascular disease. © The Author(s) 2024.
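As a rough illustration of the two model families named in the abstract, the sketch below fits a one-predictor least-squares regression and a depth-1 decision tree (a decision stump chosen by Gini impurity), then reports training and testing accuracy on a held-out split. It is not the authors' pipeline: the data are synthetic, the names `score`, `marker`, and `high` are hypothetical, and the 3.0 cut-off used to binarize the outcome is an assumption made only so that an accuracy can be computed. The study itself used many predictors and a full tree.

```python
import random

def fit_simple_lr(x, y):
    """Ordinary least squares for one predictor: y ~ intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def fit_stump(x, labels):
    """Depth-1 tree: pick the threshold that minimizes weighted Gini impurity."""
    best = None
    for t in sorted(set(x)):
        left = [l for xi, l in zip(x, labels) if xi <= t]
        right = [l for xi, l in zip(x, labels) if xi > t]
        if not left or not right:
            continue  # a split must leave samples on both sides
        impurity = (len(left) * gini(left) + len(right) * gini(right)) / len(x)
        if best is None or impurity < best[0]:
            # Each leaf predicts its majority class.
            left_class = int(2 * sum(left) >= len(left))
            right_class = int(2 * sum(right) >= len(right))
            best = (impurity, t, left_class, right_class)
    return best[1:]

def predict_stump(stump, x):
    t, left_class, right_class = stump
    return [left_class if xi <= t else right_class for xi in x]

def accuracy(pred, labels):
    return sum(p == l for p, l in zip(pred, labels)) / len(labels)

# Synthetic data only: one hypothetical predictor and one hypothetical outcome.
rng = random.Random(0)
score = [rng.uniform(0, 20) for _ in range(400)]
marker = [1.0 + 0.15 * s + rng.gauss(0, 1.0) for s in score]
high = [int(m >= 3.0) for m in marker]  # illustrative cut-off, not from the study

train, test = slice(0, 300), slice(300, 400)
intercept, slope = fit_simple_lr(score[train], marker[train])
stump = fit_stump(score[train], high[train])
train_acc = accuracy(predict_stump(stump, score[train]), high[train])
test_acc = accuracy(predict_stump(stump, score[test]), high[test])

print(f"LR: marker ~ {intercept:.2f} + {slope:.2f} * score")
print(f"DT stump: threshold={stump[0]:.2f}, train acc={train_acc:.3f}, test acc={test_acc:.3f}")
```

Comparing training and testing accuracy, as the abstract does (72.1% versus 71.4%), is a quick check for overfitting: a small gap suggests the tree generalizes to unseen participants.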