Creating a Frailty Case Definition for Primary Care EMR Using Machine Learning

dc.contributor.advisorWilliamson, Tyler
dc.contributor.advisorLee, Joon
dc.contributor.authorAponte-Hao, Zhi Yun (Sylvia)
dc.contributor.committeememberMcBrien, Kerry
dc.contributor.committeememberRonksley, Paul
dc.date2021-06
dc.date.accessioned2021-05-10T20:21:10Z
dc.date.available2021-05-10T20:21:10Z
dc.date.issued2021-05-04
dc.description.abstractBackground: Frailty is a geriatric syndrome characterized by increased vulnerability and increased risk of adverse events. The Clinical Frailty Scale (CFS) is a judgement-based scale used to identify frailty in senior populations (over the age of 65). Primary care electronic medical records (EMRs) contain routinely collected medical data and can be used for frailty screening. There is currently no method to detect frailty automatically using primary care electronic medical records that aligns with the CFS definition. Purpose: To create a machine learning based algorithm for the identification of frailty in routinely collected primary care electronic medical records. Methods: Primary care physicians within the Canadian Primary Care Sentinel Surveillance Network retrospectively identified frailty in 5466 senior patients from their own practice using the CFS, and the corresponding patient EMR data were extracted and processed as features. The patient data were split 30-70, with 30% being the hold-out set used for final testing and 70% for the training set. A collection of machine learning algorithms was created using the training dataset, including regularized logistic regression models, support vector machines, random forests, k-nearest neighbours, classification and regression trees, feedforward neural networks, Naïve Bayes, and XGBoost. A balanced training dataset was also created by oversampling. Sensitivity analyses were also performed using two alternative dichotomization cut-offs of frailty. Final model performance was assessed using the hold-out dataset, and reported using ROC, accuracy, F1-score, sensitivity, specificity, positive and negative predictive values. Results: 18.4% of patients were classified as frail based on a CFS score of 5 and above. Of the 8 models developed, an XGBoost model had the best classification performance, with sensitivity of 78.14% and specificity of 74.41%. Neither the balanced training dataset, nor the sensitivity analyses using two alternative cut-offs resulted in improved performance. Conclusion: Supervised machine learning was able to distinguish between frail and non-frail patients with good performance. Future work may wish to develop a protocol for standardized assignment of the CFS, use all available unstructured and structured data, and supplement with additional geriatric-specific data.en_US
dc.identifier.citationAponte-Hao, Z. Y. (2021). Creating a Frailty Case Definition for Primary Care EMR Using Machine Learning (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.en_US
dc.identifier.doihttp://dx.doi.org/10.11575/PRISM/38846
dc.identifier.urihttp://hdl.handle.net/1880/113392
dc.language.isoengen_US
dc.publisher.facultyCumming School of Medicineen_US
dc.publisher.institutionUniversity of Calgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.en_US
dc.subjectSupervised Machine Learningen_US
dc.subjectFrailtyen_US
dc.subjectElectronic Medical Recordsen_US
dc.subjectEpidemiologyen_US
dc.subjectCase Definitionen_US
dc.subject.classificationBiostatisticsen_US
dc.subject.classificationEpidemiologyen_US
dc.subject.classificationPublic Healthen_US
dc.titleCreating a Frailty Case Definition for Primary Care EMR Using Machine Learningen_US
dc.typemaster thesisen_US
thesis.degree.disciplineMedicine – Community Health Sciencesen_US
thesis.degree.grantorUniversity of Calgaryen_US
thesis.degree.nameMaster of Science (MSc)en_US
ucalgary.item.requestcopytrueen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2021_aponte-hao_zhiyun.pdf
Size:
1.85 MB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: