Rapid Large-Scale Inference of Genome-Wide Mutational Heterogeneity

atmire.migration.oldid: 5017
dc.contributor.advisor: de Koning, A.P. Jason
dc.contributor.author: Mathankeri, Aaron
dc.contributor.committeemember: Chan, Jennifer
dc.contributor.committeemember: Deardon, Robert
dc.date.accessioned: 2016-10-17T16:19:35Z
dc.date.available: 2016-10-17T16:19:35Z
dc.date.issued: 2016
dc.date.submitted: 2016 [en]
dc.description.abstract: Tumours arise by mutation and natural selection among cellular lineages. Understanding and modelling mutation is thus a central aspect of cancer research. Genes that confer a selective advantage to their cell line when mutated are known as drivers and are usually identified by statistical enrichment of mutations. Current approaches to detecting drivers make several simplifying assumptions, sacrificing biological realism for computational speed when modelling mutation. The main novel technical contribution of this thesis is a principled mathematical framework for mutational analysis of genomic data, termed "Mut-HMM". Calculations required for large-scale inference were parallelized to take advantage of many-core CPU clusters. Based on this work, I present a new software package that can be orders of magnitude faster than previous state-of-the-art methods for analysis of genome-wide mutation patterns. I then present an exploratory analysis of chromosome 22 germline mutation data, the results of which highlight the need for more complex and sophisticated mutation models in cancer and human genomics. [en_US]
dc.identifier.citation: Mathankeri, A. (2016). Rapid Large-Scale Inference of Genome-Wide Mutational Heterogeneity (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/27529 [en_US]
dc.identifier.doi: http://dx.doi.org/10.11575/PRISM/27529
dc.identifier.uri: http://hdl.handle.net/11023/3432
dc.language.iso: eng
dc.publisher.faculty: Graduate Studies
dc.publisher.institution: University of Calgary [en]
dc.publisher.place: Calgary [en]
dc.rights: University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subject: Bioinformatics
dc.subject.classification: Hidden Markov Models [en_US]
dc.subject.classification: Parallel Computing [en_US]
dc.subject.classification: Continuous Time Markov Chains [en_US]
dc.subject.classification: Genomics [en_US]
dc.subject.classification: Large-Scale Inference [en_US]
dc.subject.classification: Mutation [en_US]
dc.title: Rapid Large-Scale Inference of Genome-Wide Mutational Heterogeneity
dc.type: master thesis
thesis.degree.discipline: Biochemistry and Molecular Biology
thesis.degree.grantor: University of Calgary
thesis.degree.name: Master of Science (MSc)
ucalgary.item.requestcopy: true