Causal Inference With Non-probability Sample and Misclassified Covariate

dc.contributor.advisorShen, Hua
dc.contributor.authorSevinc, Emir
dc.contributor.committeememberLu, Xuewen
dc.contributor.committeememberDeardon, Robert
dc.contributor.committeememberShen, Hua
dc.contributor.committeememberBadescu, Alexandru
dc.date2022-11
dc.date.accessioned2022-09-27T16:21:15Z
dc.date.available2022-09-27T16:21:15Z
dc.date.issued2022-09
dc.description.abstractCausal inference refers to the study of analyzing data that is explicitly defined on a question of causality. The problems motivating many, if not most studies in social and biological sciences, tend to be causative and not associative. A well defined and systematically representative sample tends to be the base in such studies. However, sometimes a sample may result from a non-probability process. This often provides a unique challenge in estimating the probability of an individual being in the sample, and generalizing the causality conclusions made off of the non-probability samples to the target population. Additionally, due to issues such as difficulty of precise measurements and human error, certain variables may be classified incorrectly. In this thesis, we address both challenges by implementing causal inferential methods in a case where we have a main non-probability sample with response available, and a probability sample with auxiliary information only. We deal with the presence of incorrectly classified confounder in the non-probability sample only, or both samples. We examine the consequences of naively ignoring misclassification, and develop a latent-variable based method via an Expectation-Maximization algorithm to correct for the misclassified confounder. We incorporate this method with a double-robust mean estimator requiring only the correct specification of either the regression model or the non-probability sample selection model to estimate the average treatment effect. We demonstrate the effectiveness of our methodology via simulation studies, and implement it on smoking data from the Centre of Disease Control and Prevention (CDC).en_US
dc.identifier.citationSevinc, E. (2022). Causal inference with non-probability sample and misclassified covariate (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.en_US
dc.identifier.urihttp://hdl.handle.net/1880/115301
dc.identifier.urihttps://dx.doi.org/10.11575/PRISM/40307
dc.language.isoengen_US
dc.publisher.facultyScienceen_US
dc.publisher.institutionUniversity of Calgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.en_US
dc.subject.classificationStatisticsen_US
dc.titleCausal Inference With Non-probability Sample and Misclassified Covariateen_US
dc.typemaster thesisen_US
thesis.degree.disciplineMathematics & Statisticsen_US
thesis.degree.grantorUniversity of Calgaryen_US
thesis.degree.nameMaster of Science (MSc)en_US
ucalgary.item.requestcopytrueen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2022_sevinc_emir.pdf
Size:
900.01 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: