Meta-Feature Taxonomy for Supporting Automatic Machine Learning

Davies, Cooper

Meta-Feature Taxonomy for Supporting Automatic Machine Learning

dc.contributor.advisor	Maurer, Frank
dc.contributor.advisor	Denzinger, Jörg
dc.contributor.author	Davies, Cooper
dc.contributor.committeemember	Jacob, Christian
dc.contributor.committeemember	Alim, Usman R.
dc.contributor.committeemember	Oehlberg, Lora A.
dc.date	2020-02
dc.date.accessioned	2020-01-02T06:48:45Z
dc.date.available	2020-01-02T06:48:45Z
dc.date.issued	2019-12-23
dc.description.abstract	Many automatic machine learning (AutoML) libraries have been developed recently, meeting public demand for more machine learning tools which can be used without an expert. A common tactic illicited by these frameworks is to initially generate meta-features which are then used as an initial heuristic for further evaluation in recent AutoML frameworks. In this thesis we provide a systematic categorization of meta-features in the AutoML literature. Current implementations of automatic machine learning frameworks fail to provide reasoning for meta-feature selection, and a taxonomic categorization is needed. Our approach reviewed current AutoML frameworks and created a taxonomy of five categories into which any meta-feature can be categorized. We have created a general framework with which any currently used meta-features can be described, as well as demonstrate some scenarios for their applications. Additionally, a runtime analysis of the wall-clock time required for meta-feature generation is provided for 18 data collections found in previous CHALearn AutoML competitions, which took between 0:10:26.9, and 98:43:46.5. Additionally we found that a sample percentage of 0.1 is sufficient for use in Sample Variant Landmark Meta-Feature generation when using the Nearest Neighbour, Elite Nearest Neighbour, Best Decision Node, and Random Decision Node Landmarks which indicates potential use as meta-features in AutoML.	en_US
dc.identifier.citation	Cooper T.S. Meta-Feature Taxonomy for Supporting Automatic Machine Learning (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.	en_US
dc.identifier.doi	http://dx.doi.org/10.11575/PRISM/37377
dc.identifier.uri	http://hdl.handle.net/1880/111399
dc.language.iso	eng	en_US
dc.publisher.faculty	Science	en_US
dc.publisher.institution	University of Calgary	en
dc.rights	University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.	en_US
dc.subject	Machine Learning	en_US
dc.subject	Meta-Feature	en_US
dc.subject	Automatic Machine Learning	en_US
dc.subject	AutoML	en_US
dc.subject.classification	Computer Science	en_US
dc.title	Meta-Feature Taxonomy for Supporting Automatic Machine Learning	en_US
dc.type	master thesis	en_US
thesis.degree.discipline	Computer Science	en_US
thesis.degree.grantor	University of Calgary	en_US
thesis.degree.name	Master of Science (MSc)	en_US
ucalgary.item.requestcopy	true	en_US