Causal Inference With Non-probability Sample and Misclassified Covariate

Date
2022-09
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Causal inference refers to the study of analyzing data that is explicitly defined on a question of causality. The problems motivating many, if not most studies in social and biological sciences, tend to be causative and not associative. A well defined and systematically representative sample tends to be the base in such studies. However, sometimes a sample may result from a non-probability process. This often provides a unique challenge in estimating the probability of an individual being in the sample, and generalizing the causality conclusions made off of the non-probability samples to the target population. Additionally, due to issues such as difficulty of precise measurements and human error, certain variables may be classified incorrectly. In this thesis, we address both challenges by implementing causal inferential methods in a case where we have a main non-probability sample with response available, and a probability sample with auxiliary information only. We deal with the presence of incorrectly classified confounder in the non-probability sample only, or both samples. We examine the consequences of naively ignoring misclassification, and develop a latent-variable based method via an Expectation-Maximization algorithm to correct for the misclassified confounder. We incorporate this method with a double-robust mean estimator requiring only the correct specification of either the regression model or the non-probability sample selection model to estimate the average treatment effect. We demonstrate the effectiveness of our methodology via simulation studies, and implement it on smoking data from the Centre of Disease Control and Prevention (CDC).
Description
Keywords
Citation
Sevinc, E. (2022). Causal inference with non-probability sample and misclassified covariate (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.