Identifying and explaining large-scale genome sequence convergence

atmire.migration.oldid5665
dc.contributor.advisorde Koning, A.P. Jason
dc.contributor.authorBryans, Nathaniel
dc.contributor.committeememberCastoe, Todd
dc.contributor.committeememberYeaman, Samuel
dc.contributor.committeememberWasmuth, James
dc.date.accessioned2017-06-06T20:20:01Z
dc.date.available2017-06-06T20:20:01Z
dc.date.issued2017
dc.date.submitted2017en
dc.description.abstractRecently it has been shown that convergent sequence evolution can happen in nature at unexpectedly large scales, systematically misleading methods of phylogenetic reconstruction. For this reason, among others, there has been growing interest in sequence convergence in recent years. Although various techniques for detecting sequence convergence, such as site-specific log-likelihood support and ancestral sequence reconstruction, have been used, there does not yet exist a general statistical procedure for reliably distinguishing between random convergence and convergence resulting from parallel selective pressures or time-heterogeneous evolutionary processes. Here, I intend to further our understanding of sequence convergence by creating a new algorithm for detecting, quantifying and understanding non-random sequence convergence in a principled and unbiased manner. I design and implement a new approach for detecting convergence across entire phylogenies, making it amenable to a wider variety of datasets than was previously possible. Finally, I investigate the role of effective population size in contributing to sequence convergence, where I show for the first time that time-heterogeneity in effective population sizes can be sufficient to cause large-scale episodes of convergent sequence evolution. This surprising finding suggests an apparently non-adaptive mechanistic explanation since it can occur without changes to the underlying fitness landscape and is instead driven by lineages with increased effective population sizes becoming enabled to climb higher on the same adaptive peaks. As a result, we believe this phenomenon to be of adaptive significance even though it does not require adaptation to a changing environment per se. These finding suggest that time-heterogeneous evolutionary processes must be integrated into the models used for phylogenomic reconstruction and in comparative genomics more broadly.en_US
dc.identifier.citationBryans, N. (2017). Identifying and explaining large-scale genome sequence convergence (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/26425en_US
dc.identifier.doihttp://dx.doi.org/10.11575/PRISM/26425
dc.identifier.urihttp://hdl.handle.net/11023/3872
dc.language.isoeng
dc.publisher.facultyGraduate Studies
dc.publisher.institutionUniversity of Calgaryen
dc.publisher.placeCalgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subjectBioinformatics
dc.subjectGenetics
dc.subject.otherSequence Convergence
dc.subject.otherConvergent Evolution
dc.subject.otherAdaptation
dc.subject.otherPhylogenetics
dc.titleIdentifying and explaining large-scale genome sequence convergence
dc.typemaster thesis
thesis.degree.disciplineBiochemistry and Molecular Biology
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameMaster of Science (MSc)
ucalgary.item.requestcopytrue
Files