Jung In Park, PhD, RN; Doyub Kim, PhD; Jung-Ah Lee PhD, RN; Kai Zheng PhD; Alpesh Amin MD, MBA; First published: April 22, 2021; DOI: https://doi.org/10.1111/jnu.12637



The aim of the study was to develop and validate machine learning models to predict the personalized risk for 30-day readmission with venous thromboembolism (VTE).


This study was a retrospective, observational study.


We extracted and preprocessed the structured electronic health records (EHRs) from a single academic hospital. Then we developed and evaluated three prediction models using logistic regression, the balanced random forest model, and the multilayer perceptron.


The study sample included 158,804 total admissions; VTE-positive cases accounted for 2,080 admissions from among 1,695 patients (1.31%). Based on the evaluation results, the balanced random forest model outperformed the other two risk prediction models.


This study delivered a high-performing, validated risk prediction tool using machine learning and EHRs to identify patients at high risk for VTE after discharge.

Clinical Relevance

The risk prediction model developed in this study can potentially guide treatment decisions for discharged patients for better patient outcomes.