Improving Deep Neural Networks: Optimization, Regularization, and Generative Modeling

dc.contributor.advisor: Li, Zongpeng
dc.contributor.advisor: Samavati, Faramarz
dc.contributor.author: Zhang, Zijun
dc.contributor.committeemember: Li, Zongpeng
dc.contributor.committeemember: Denzinger, Jorg
dc.contributor.committeemember: Krishnamurthy, Diwakar
dc.date: 2020-06
dc.date.accessioned: 2019-12-13T20:09:12Z
dc.date.available: 2019-12-13T20:09:12Z
dc.date.issued: 2019-12
dc.description.abstract: In the past decade, deep learning has revolutionized computer vision, speech recognition, and natural language processing, and it continues to spread into many other fields. It is therefore important to better understand and improve deep neural networks (DNNs), which serve as the backbone of deep learning. This thesis approaches the topic from three perspectives: optimization, regularization, and generative modeling. First, we address the generalization gap recently observed between adaptive optimization methods, such as Adam, and simple stochastic gradient descent (SGD). We develop a tailored version of Adam for training DNNs, which is shown to close the gap on image classification tasks. Second, we identify a side effect of dropout, a widely used regularization technique, and of multiplicative noise in general: multiplicative noise tends to increase the correlation between features. We then exploit batch normalization to efficiently remove this correlation effect. Finally, we focus on generative modeling, a fundamental application of DNNs, and propose a framework for training autoencoder-based generative models with non-adversarial losses and unrestricted neural network architectures.
dc.identifier.citation: Zhang, Z. (2019). Improving Deep Neural Networks: Optimization, Regularization, and Generative Modeling (Doctoral thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca
dc.identifier.doi: http://dx.doi.org/10.11575/PRISM/37338
dc.identifier.uri: http://hdl.handle.net/1880/111343
dc.language.iso: eng
dc.publisher.faculty: Science
dc.publisher.institution: University of Calgary
dc.rights: University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subject.classification: Computer Science
dc.title: Improving Deep Neural Networks: Optimization, Regularization, and Generative Modeling
dc.type: doctoral thesis
thesis.degree.discipline: Computer Science
thesis.degree.grantor: University of Calgary
thesis.degree.name: Doctor of Philosophy (PhD)
ucalgary.item.requestcopy: true
Files
Original bundle
Name: ucalgary_2019_zhang_zijun.pdf
Size: 8.75 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 2.62 KB
Format: Item-specific license agreed upon at submission