Open-set Speaker Recognition with Bounded Laguerre Voronoi Clustering

dc.contributor.advisorGavrilova, Marina L
dc.contributor.authorOhi, Abu Quwsar
dc.contributor.committeememberSousa, Mario Costa
dc.contributor.committeememberBezdek, Karoly
dc.date.accessioned2024-08-20T17:34:00Z
dc.date.available2024-08-20T17:34:00Z
dc.date.issued2024-08-19
dc.description.abstractSpeaker recognition is a challenging problem in behavioral biometrics. It has been rigorously investigated over the last decade. Although numerous supervised closed-set systems successfully harvest the power of deep neural networks, limited studies have been made on open-set speaker recognition. This thesis proposes a self-supervised open-set speaker recognition that leverages the geometric properties of speaker distribution for accurate and robust speaker identification. The proposed framework consists of a deep neural network incorporating a wider viewpoint of temporal speech features and Laguerre–Voronoi diagram-based speech feature extraction. The deep neural network is trained with a specialized clustering criterion that only requires positive pairs during training. The framework further incorporates a novel approach of clustering by integrating concepts from Voronoi diagrams in Laguerre geometry. This approach offers flexibility by necessitating only one hyperparameter, an upper-bound value for the number of centroids. The experiments validated that the proposed system outperformed current state-of-the-art methods in open-set speaker verification and identification.
dc.identifier.citationOhi, A. Q. (2024). Open-set speaker recognition with bounded Laguerre Voronoi clustering (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.
dc.identifier.urihttps://hdl.handle.net/1880/119443
dc.language.isoen
dc.publisher.facultyGraduate Studies
dc.publisher.institutionUniversity of Calgary
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subject.classificationComputer Science
dc.subject.classificationArtificial Intelligence
dc.titleOpen-set Speaker Recognition with Bounded Laguerre Voronoi Clustering
dc.typemaster thesis
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameMaster of Science (MSc)
ucalgary.thesis.accesssetbystudentI do not require a thesis withhold – my thesis will have open access and can be viewed and downloaded publicly as soon as possible.
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2024_ohi_md.pdf
Size:
17.01 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: