Cluster analysis of correlated non-Gaussian continuous data via finite mixtures of Gaussian copula distributions

dc.contributor.advisorDe Leon, Alexander R.
dc.contributor.advisorWu, Jingjing
dc.contributor.authorBurak, Katherine L.
dc.contributor.committeememberKopciuk, Karen Arlene
dc.contributor.committeememberLu, Xuewen
dc.date2019-11
dc.date.accessioned2019-06-14T20:12:30Z
dc.date.available2019-06-14T20:12:30Z
dc.date.issued2019-06-12
dc.description.abstractModel-based cluster analysis in non-Gaussian settings is not straightforward due to a lack of standard models for non-Gaussian data. In this thesis, we adopt the class of Gaussian copula distributions (GCDs) to develop a flexible model-based clustering methodology that can accommodate a variety of correlated, non-Gaussian continuous data, where variables may have different marginal distributions and come from different parametric families. Unlike conventional model-based approaches that rely on the assumption of conditional independence, GCDs model conditional dependence among the disparate variables using the matrix of so-called normal correlations. We outline a hybrid approach to cluster analysis that combines the method of inference functions for margins (IFM) and the parameter-expanded EM (PX-EM) algorithm. We then report simulation results to investigate the performance of our methodology. Finally, we highlight the applications of this research by applying this methodology to a dataset regarding the purchases made by clients of a wholesale distributor.en_US
dc.identifier.citationBurak, K. L. (2019). Cluster analysis of correlated non-Gaussian continuous data via finite mixtures of Gaussian copula distributions (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.en_US
dc.identifier.doihttp://dx.doi.org/10.11575/PRISM/36637
dc.identifier.urihttp://hdl.handle.net/1880/110497
dc.language.isoengen_US
dc.publisher.facultyScienceen_US
dc.publisher.institutionUniversity of Calgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.en_US
dc.subjectCluster analysisen_US
dc.subjectCopulaen_US
dc.subject.classificationStatisticsen_US
dc.titleCluster analysis of correlated non-Gaussian continuous data via finite mixtures of Gaussian copula distributionsen_US
dc.typemaster thesisen_US
thesis.degree.disciplineMathematics & Statisticsen_US
thesis.degree.grantorUniversity of Calgaryen_US
thesis.degree.nameMaster of Science (MSc)en_US
ucalgary.item.requestcopytrue
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2019_burak_katherine.pdf
Size:
2.26 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.74 KB
Format:
Item-specific license agreed upon to submission
Description: