On Some New Variable Selection Methods for Multivariate Survival Data

dc.contributor.advisorLu, Xuewen
dc.contributor.authorMahmoudi, Fatemeh
dc.contributor.committeememberWu, Jingjing
dc.contributor.committeememberDeardon, Rob
dc.contributor.committeememberWang, Liqun
dc.contributor.committeememberAmbagaspitiya, Rohana
dc.contributor.committeememberLu, Xuewen
dc.date2023-11
dc.date.accessioned2023-08-22T20:42:06Z
dc.date.available2023-08-22T20:42:06Z
dc.date.issued2023-08
dc.description.abstractThis dissertation proposes variable selection methods for reducing dimensionality in complex lifetime data for survival analysis. With the advent of big data, survival analysis often involves a large number of covariates, necessitating their identification. High-dimensional data, especially with increasing sample size, presents challenges in terms of variable selection. The dissertation focuses on simultaneous estimation and variable selection methods under various censored data types and survival models, examining their theoretical properties and performance in finite samples. The analysis of complex lifetime data encounters challenges stemming from different sources, including various types of censoring, diverse models, and multiple outcomes. Traditional survival analysis primarily deals with univariate survival data, focusing on a single event of interest. However, real-world applications frequently involve multiple event types with distinct underlying causes and risk factors. This research investigates three types of multiple events data: competing risks, semi-competing risks, and multivariate failure time data. For competing risks data, Chapter 2 considers interval-censored models. A penalized variable selection method is proposed, utilizing the LASSO, Adaptive LASSO, and broken adaptive ridge regression. The proposed method effectively selects important variables based on results of simulation studies. It is also successfully applied to a real-life HIV study dataset. In the context of semi-competing risks data, Chapter 3 explores an illness-death model with shared frailty. Parametric and semiparametric models are employed to examine the effects of covariates and conduct variable selection. The proposed method demonstrates good performance through simulation studies and analysis of colon cancer data. For multivariate failure time data, Chapter 4 introduces the sparse group broken adaptive ridge (SGBAR) penalty. This penalty facilitates variable selection at both the individual and group levels and is applied to interval-censored data. Extensive simulation studies confirm the good performance of the method, and the method is further validated using real-life data from the Aerobic Center Longitudinal Study (ACLS). In summary, this dissertation proposes new variable selection methods for complex lifetime data. It addresses challenges associated with competing risks, semi-competing risks, and multivariate failure time data. The proposed methods are supported by theoretical analysis, simulation studies, and real-life applications.
dc.identifier.citationMahmoudi, F. (2023). On some new variable selection methods for multivariate survival data (Doctoral thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.
dc.identifier.urihttps://hdl.handle.net/1880/116877
dc.identifier.urihttps://dx.doi.org/10.11575/PRISM/41719
dc.language.isoen
dc.publisher.facultyGraduate Studies
dc.publisher.institutionUniversity of Calgary
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subjectPenalized Variable Selection
dc.subjectCompeting Risks Data
dc.subjectSemi-competing Risks Data
dc.subjectMultivariate Failure Time Data
dc.subjectLifetime Data Analysis
dc.subject.classificationBiostatistics
dc.titleOn Some New Variable Selection Methods for Multivariate Survival Data
dc.typedoctoral thesis
thesis.degree.disciplineMathematics & Statistics
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameDoctor of Philosophy (PhD)
ucalgary.thesis.accesssetbystudentI require a thesis withhold – I need to delay the release of my thesis due to a patent application, and other reasons outlined in the link above. I have/will need to submit a thesis withhold application.
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2023_mahmoudi_fatemeh.pdf
Size:
4.07 MB
Format:
Adobe Portable Document Format
Description:
This is the main file of my Ph.D. thesis.
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.64 KB
Format:
Item-specific license agreed upon to submission
Description: