Meta-Feature Taxonomy for Supporting Automatic Machine Learning
dc.contributor.advisor | Maurer, Frank | |
dc.contributor.advisor | Denzinger, Jörg | |
dc.contributor.author | Davies, Cooper | |
dc.contributor.committeemember | Jacob, Christian | |
dc.contributor.committeemember | Alim, Usman R. | |
dc.contributor.committeemember | Oehlberg, Lora A. | |
dc.date | 2020-02 | |
dc.date.accessioned | 2020-01-02T06:48:45Z | |
dc.date.available | 2020-01-02T06:48:45Z | |
dc.date.issued | 2019-12-23 | |
dc.description.abstract | Many automatic machine learning (AutoML) libraries have been developed recently, meeting public demand for more machine learning tools which can be used without an expert. A common tactic illicited by these frameworks is to initially generate meta-features which are then used as an initial heuristic for further evaluation in recent AutoML frameworks. In this thesis we provide a systematic categorization of meta-features in the AutoML literature. Current implementations of automatic machine learning frameworks fail to provide reasoning for meta-feature selection, and a taxonomic categorization is needed. Our approach reviewed current AutoML frameworks and created a taxonomy of five categories into which any meta-feature can be categorized. We have created a general framework with which any currently used meta-features can be described, as well as demonstrate some scenarios for their applications. Additionally, a runtime analysis of the wall-clock time required for meta-feature generation is provided for 18 data collections found in previous CHALearn AutoML competitions, which took between 0:10:26.9, and 98:43:46.5. Additionally we found that a sample percentage of 0.1 is sufficient for use in Sample Variant Landmark Meta-Feature generation when using the Nearest Neighbour, Elite Nearest Neighbour, Best Decision Node, and Random Decision Node Landmarks which indicates potential use as meta-features in AutoML. | en_US |
dc.identifier.citation | Cooper T.S. Meta-Feature Taxonomy for Supporting Automatic Machine Learning (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. | en_US |
dc.identifier.doi | http://dx.doi.org/10.11575/PRISM/37377 | |
dc.identifier.uri | http://hdl.handle.net/1880/111399 | |
dc.language.iso | eng | en_US |
dc.publisher.faculty | Science | en_US |
dc.publisher.institution | University of Calgary | en |
dc.rights | University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission. | en_US |
dc.subject | Machine Learning | en_US |
dc.subject | Meta-Feature | en_US |
dc.subject | Automatic Machine Learning | en_US |
dc.subject | AutoML | en_US |
dc.subject.classification | Computer Science | en_US |
dc.title | Meta-Feature Taxonomy for Supporting Automatic Machine Learning | en_US |
dc.type | master thesis | en_US |
thesis.degree.discipline | Computer Science | en_US |
thesis.degree.grantor | University of Calgary | en_US |
thesis.degree.name | Master of Science (MSc) | en_US |
ucalgary.item.requestcopy | true | en_US |