A data mining framework for efficient discovery of classification rules

dc.contributor.advisorBarker, Kenneth E.
dc.contributor.authorGopalan, Janaki
dc.date.accessioned2005-08-16T17:00:07Z
dc.date.available2005-08-16T17:00:07Z
dc.date.issued2004
dc.descriptionBibliography: p. 103-110en
dc.description.abstractAssociative classification is an important research topic in data mining (DM). The thesis proposes a framework to derive accurate and interesting classification rules using the association rule mining (ARM) technique. To effectively address the rule discovery task, in the framework, two fundamental problems in the pre-processing and the post-processing components of the DM process are identified. In the preĀ­processing component, it is identified that the choice of the training set is an imporĀ­tant factor in deriving good classification rules. The thesis proposes a novel technique using a genetic algorithm (GA) to find an appropriate split of a dataset into training and test sets. Using the obtained training set as the input to the ARM technique generates high accuracy classification rules. It is also identified that an algorithm (or heuristic) is required to find the best set of interesting and accurate rules from the discovered ones. In the post-processing component, the thesis proposes a pruning strategy using a GA to find the accurate interesting rules.en
dc.format.extentxiv, 110 leaves : ill. ; 30 cm.en
dc.identifier.citationGopalan, J. (2004). A data mining framework for efficient discovery of classification rules (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/10785en_US
dc.identifier.doihttp://dx.doi.org/10.11575/PRISM/10785
dc.identifier.isbn0612976483en
dc.identifier.lccAC1 .T484 2004 G67en
dc.identifier.urihttp://hdl.handle.net/1880/41548
dc.language.isoeng
dc.publisher.institutionUniversity of Calgaryen
dc.publisher.placeCalgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.titleA data mining framework for efficient discovery of classification rules
dc.typemaster thesis
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameMaster of Science (MSc)
ucalgary.item.requestcopytrue
ucalgary.thesis.accessionTheses Collection 58.002:Box 1504 520492021
ucalgary.thesis.notesUARCen
ucalgary.thesis.uarcreleaseyen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2004_Gopalan.pdf
Size:
37.05 MB
Format:
Adobe Portable Document Format
Description:
Collections