White, James ALei, Lucy Y2022-11-152021-09-24Lei, L. Y. (2021). Phenotype-based prediction of incident cardiovascular hospitalization and inpatient care costs in patients referred for cardiovascular magnetic resonance imaging: Applications of traditional statistical modelling and machine learning (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.http://hdl.handle.net/1880/115443Background: Cardiovascular disease has an estimated lifetime prevalence of 48% in adults and imposes the highest economic burden on health care systems among noncommunicable diseases. These costs are largely related to chronic disease management, clinical procedures, and hospitalization, particularly for major adverse cardiovascular events (MACE). Importantly, health expenditures incurred by cardiovascular care are expected to increase substantially as the global population ages and life expectancies continue to rise. To improve health system efficiency and resource allocation in preparation for future cardiovascular care needs, it is necessary to improve baseline patient characterization and offer more accurate personalized risk predictions to optimally plan for opportunities to improve cardiovascular health while controlling costs.Aims: The aim of this thesis was to develop and validate models for the prediction of MACE and one-year cumulative inpatient care costs in a large cohort of patients referred for cardiovascular magnetic resonance imaging.Methods: Patients were recruited from the Cardiovascular Imaging Registry of Calgary, a prospective clinical outcomes registry that provides automated linkages of data abstracted from electronic health records, cardiovascular magnetic resonance imaging reports, and patient-reported health questionnaires. These data were used for predictive modelling using both traditional statistical methodologies and machine learning approaches.Results: Random survival forest and Cox proportional hazards models were developed for time-to-event prediction of hospitalization for MACE. Both models achieved time dependent AUCs of 0.83 in holdout validation. Patients with predicted risk in the upper tertile experienced 29- and 21-fold (p < 0.001) increased risk of MACE, respectively. A two-part hurdle model was developed for cost regression to predict one-year cumulative inpatient expenditures following cardiovascular magnetic resonance imaging. When binning the cost predictions into zero-, low-, and high-cost brackets, the model achieved 0.73 precision, 0.76 recall, and 0.74 F1. The best performing machine learning classification model combined predictions from random forest and artificial neural network algorithms to achieve 0.76 precision, 0.82 recall, and 0.79 F1.Conclusions: The results of this thesis demonstrate the prognostic capacity of multi-domain health data and its utility in the development of patient-specific risk models for adverse cardiovascular events and cumulative inpatient care costs. Additionally, while machine learning modelling methodologies offer advantages in handling large health care data sets, the interpretability of traditional statistical models remains valuable for delineating relationships between health-related variables and outcomes.enUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.risk predictionmachine learningHealth Sciences--Medicine and SurgeryPhenotype-based prediction of incident cardiovascular hospitalization and inpatient care costs in patients referred for cardiovascular magnetic resonance imaging: Applications of traditional statistical modelling and machine learningmaster thesis