Identifying and explaining large-scale genome sequence convergence

Bryans, Nathaniel

atmire.migration.oldid	5665
dc.contributor.advisor	de Koning, A.P. Jason
dc.contributor.author	Bryans, Nathaniel
dc.contributor.committeemember	Castoe, Todd
dc.contributor.committeemember	Yeaman, Samuel
dc.contributor.committeemember	Wasmuth, James
dc.date.accessioned	2017-06-06T20:20:01Z
dc.date.available	2017-06-06T20:20:01Z
dc.date.issued	2017
dc.date.submitted	2017	en
dc.description.abstract	Recently it has been shown that convergent sequence evolution can happen in nature at unexpectedly large scales, systematically misleading methods of phylogenetic reconstruction. For this reason, among others, there has been growing interest in sequence convergence in recent years. Although various techniques for detecting sequence convergence, such as site-specific log-likelihood support and ancestral sequence reconstruction, have been used, there does not yet exist a general statistical procedure for reliably distinguishing between random convergence and convergence resulting from parallel selective pressures or time-heterogeneous evolutionary processes. Here, I intend to further our understanding of sequence convergence by creating a new algorithm for detecting, quantifying and understanding non-random sequence convergence in a principled and unbiased manner. I design and implement a new approach for detecting convergence across entire phylogenies, making it amenable to a wider variety of datasets than was previously possible. Finally, I investigate the role of effective population size in contributing to sequence convergence, where I show for the first time that time-heterogeneity in effective population sizes can be sufficient to cause large-scale episodes of convergent sequence evolution. This surprising finding suggests an apparently non-adaptive mechanistic explanation since it can occur without changes to the underlying fitness landscape and is instead driven by lineages with increased effective population sizes becoming enabled to climb higher on the same adaptive peaks. As a result, we believe this phenomenon to be of adaptive significance even though it does not require adaptation to a changing environment per se. These findings suggest that time-heterogeneous evolutionary processes must be integrated into the models used for phylogenomic reconstruction and in comparative genomics more broadly.	en_US
dc.identifier.citation	Bryans, N. (2017). Identifying and explaining large-scale genome sequence convergence (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/26425	en_US
dc.identifier.doi	http://dx.doi.org/10.11575/PRISM/26425
dc.identifier.uri	http://hdl.handle.net/11023/3872
dc.language.iso	eng
dc.publisher.faculty	Graduate Studies
dc.publisher.institution	University of Calgary	en
dc.publisher.place	Calgary	en
dc.rights	University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subject	Bioinformatics
dc.subject	Genetics
dc.subject.other	Sequence Convergence
dc.subject.other	Convergent Evolution
dc.subject.other	Adaptation
dc.subject.other	Phylogenetics
dc.title	Identifying and explaining large-scale genome sequence convergence
dc.type	master thesis
thesis.degree.discipline	Biochemistry and Molecular Biology
thesis.degree.grantor	University of Calgary
thesis.degree.name	Master of Science (MSc)
ucalgary.item.requestcopy	true

Collections

Open Theses and Dissertations
