Tehran University of Medical Sciences

Science Communicator Platform

Stay connected! Follow us on X network (Twitter):
Share this content! On (X network) By
Analysis of the Most Influential Factors Affecting Outcomes of Lung Transplant Recipients: Amultivariate Prediction Model Based on Unos Data Publisher Pubmed



Gholamzadeh M1 ; Safdari R1 ; Asadi Gharabaghi M2 ; Abtahi H3
Authors
Show Affiliations
Authors Affiliations
  1. 1. Health Information Management and Medical Informatics Department, School of Allied Medical Sciences, Tehran University of Medical Sciences, Tehran, Iran
  2. 2. Department of Pulmonary Medicine, Faculty of Medicine, Tehran University of Medical Sciences, Tehran, Iran
  3. 3. Pulmonary and Critical Care Medicine Department, Thoracic Research Center, Imam Khomeini Hospital Complex, Tehran University of Medical Sciences, Tehran, Iran

Source: BMJ Open Published:2025


Abstract

Objectives In lung transplantation (LTx), apriority is assigned to each candidate on the waiting list. Our primary objective was to identify the key factors that influence the allocation of priorities in LTx using machine learning (ML) techniques to enhance the process of prioritising patients. Design Developing aprediction model. Setting and participants Our datawere retrieved from the United Network for Organ Sharing (UNOS) open-source database of transplant patients between 2005 and 2023. Interventions After the preprocessing process, afeature engineering technique was employed to select the most relevant features. Then, six ML models with optimised hyperparameters including multiple linear regression, random forest regressor (RF), support vector machine regressor, XGBoost regressor, amultilayer perceptron model and adeep learning model were developed based on the UNOS dataset. Primary and secondary outcome measures The performance of each model was evaluated using R-squared (R 2) and other error rate metrics. Next, the Shapley Additive Explanations (SHAP) technique was used to identify the most important features in the prediction. Results The raw dataset contains 196 270 records with 545 features in all organs. After preprocessing, 32 966 records with 15 features remain. Among various models, the RF model achieved ahigh R 2 score. Additionally, the RF model exhibited the lowest error values, indicating its superior precision compared with other regression models. The SHAP technique in conjunction with the RF model revealed the 11 most important features for priority allocation. Subsequently, we developed aweb-based decision support tool using Python and the Streamlit framework based on the best-fine-tuned model. Conclusion The deployment of the ML model has the potential to act as an automated tool to aid physicians in assessing the priority of lung transplants and identifying significant factors that play arole in patient survival. © Author(s) (or their employer(s)) 2025. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ Group.