The use of concept hierarchies in privacy preserving data acquisition for data mining

dc.contributor.advisor: Barker, Kenneth E.
dc.contributor.author: Williams, Adepele Adeduntan
dc.date.accessioned: 2017-12-18T22:31:03Z
dc.date.available: 2017-12-18T22:31:03Z
dc.date.issued: 2012
dc.description: Bibliography: p. 156-168
dc.description: Includes copy of ethics approval. Original copy with original Partial Copyright Licence.
dc.description.abstract: This thesis presents a concept hierarchy-based approach to privacy preserving data collection for data mining called the p-level model. The p-level model allows data providers to divulge information at any chosen privacy level (p-level), on any attribute. Data collected at a high p-level signifies divulgence at a higher conceptual level and thus ensures more privacy. Data providers have greater control of their privacy preferences, and have provided significantly (25-75%) more personal data values, at various p-levels, than when providing the same information using the regular, fixed-level (f-level) method of data collection. However, the data mining process, which involves the integration of various data values, can constitute a privacy breach if combinations of attributes at the various p-levels result in the inference of knowledge that exists at lower p-levels. Providing anonymity guarantees prior to release can further protect the collected data set from privacy breaches due to linking the released data set with external data sets. This thesis describes the p-level reduction phenomenon and proposes methods to identify and control the occurrence of this privacy breach. One objective of this thesis is to explore the feasibility of applying data collected with the p-level approach to data mining problems. We apply data collected using the p-level approach to a data classification problem, and discover that the mining accuracy of the p-level approach classifier is comparable to that of the f-level (no privacy) approach; thus, we conclude that the p-level approach is beneficial for the purpose of privacy preserving data collection.
dc.format.extent: xiii, 168 leaves : ill. ; 30 cm.
dc.identifier.citation: Williams, A. A. (2012). The use of concept hierarchies in privacy preserving data acquisition for data mining (Doctoral thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/4735
dc.identifier.doi: http://dx.doi.org/10.11575/PRISM/4735
dc.identifier.uri: http://hdl.handle.net/1880/105736
dc.language.iso: eng
dc.publisher.institution: University of Calgary
dc.publisher.place: Calgary
dc.rights: University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.title: The use of concept hierarchies in privacy preserving data acquisition for data mining
dc.type: doctoral thesis
thesis.degree.discipline: Computer Science
thesis.degree.grantor: University of Calgary
thesis.degree.name: Doctor of Philosophy (PhD)
ucalgary.item.requestcopy: true
ucalgary.thesis.accession: Theses Collection 58.002:Box 2097 627942969
ucalgary.thesis.notes: UARC
ucalgary.thesis.uarcrelease: y
Files
Original bundle
Name: thesis_Williams_2012.pdf
Size: 74.16 MB
Format: Adobe Portable Document Format
Description: Thesis