Novel stabilized models to characterize gene-gene interactions by utilizing transcriptome data

dc.contributor.advisorLong, Quan
dc.contributor.advisorZhang, Qingrun
dc.contributor.authorKossinna, Thalagala Kossinnage Pathum Subhashana
dc.contributor.committeememberArnold, Paul Daniel
dc.contributor.committeememberDe Leon, Alexander
dc.date2022-11
dc.date.accessioned2022-10-03T19:16:44Z
dc.date.available2022-10-03T19:16:44Z
dc.date.issued2022-09-28
dc.description.abstractMachine learning models employed in genetics often grapple with issues related to the "curse of dimensionality". Furthermore, due to the inherent noisy nature of most -omics data, most methods suffer from the problem of "stability": i.e., even slight perturbations of the original data may result in wholly different outcomes. This becomes particularly true when dealing with interactions as the number of potential interactions are usually astronomical. In this thesis, we present two novel methods: 1) Stabilized COre gene and Pathway Election (SCOPE) and 2) Interaction Bridged Association Study (IBAS) that uses two differing approaches in analyzing biological interactions. SCOPE employs a stabilized form of the LASSO that is better able to handle highly correlated expression data and a co-expression network analysis that identifies "core" genes that may be of interest as well as the underlying biological pathways or mechanisms by which they interact. Stabilizing these results across six cancers of The Cancer Genome Atlas uncovered hallmark cancer pathways as well as a novel potential therapeutic target of kidney cancer, CD63. IBAS utilizes a "data-bridge" composed of dimensionality reduced pathway level interactions of the transcriptome to identify genes associated with a phenotype of interest using the Sequence Kernel Association Test (SKAT), in a disentangled form of the Transcriptome Wide Association Study. Application to the Wellcome Trust Case Control Consortium reveals novel gene candidates with literature reviews highlighting their potential for further study. In conclusion, we have developed two novel methodologies in analyzing complex interaction patterns in -omics data using stabilized machine learning methods, paving the way to further understand the biological interactions underlying complex disease.en_US
dc.identifier.citationKossinna, T. K. P. S. (2022). Novel stabilized models to characterize gene-gene interactions by utilizing transcriptome data (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.en_US
dc.identifier.urihttp://hdl.handle.net/1880/115339
dc.identifier.urihttps://dx.doi.org/10.11575/PRISM/40339
dc.language.isoengen_US
dc.publisher.facultyCumming School of Medicineen_US
dc.publisher.institutionUniversity of Calgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.en_US
dc.subjectMachine Learningen_US
dc.subjectPan-cancer Analysisen_US
dc.subjectTCGAen_US
dc.subjectCD63en_US
dc.subjectSCOPEen_US
dc.subjectIBASen_US
dc.subjectStabilized Methodsen_US
dc.subjectBiological Interactionsen_US
dc.subjectdata-bridgeen_US
dc.subject.classificationBioinformaticsen_US
dc.subject.classificationStatisticsen_US
dc.titleNovel stabilized models to characterize gene-gene interactions by utilizing transcriptome dataen_US
dc.typemaster thesisen_US
thesis.degree.disciplineMedicine – Biochemistry and Molecular Biologyen_US
thesis.degree.grantorUniversity of Calgaryen_US
thesis.degree.nameMaster of Science (MSc)en_US
ucalgary.item.requestcopytrueen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2022_kossinna_thalagalakossinnagepathum.pdf
Size:
15.28 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: