The use of concept hierarchies in privacy preserving data acquisition for data mining
dc.contributor.advisor | Barker, Kenneth E. | |
dc.contributor.author | Williams, Adepele Adeduntan | |
dc.date.accessioned | 2017-12-18T22:31:03Z | |
dc.date.available | 2017-12-18T22:31:03Z | |
dc.date.issued | 2012 | |
dc.description | Bibliography: p. 156-168 | en |
dc.description | Includes copy of ethics approval. Original copy with original Partial Copyright Licence. | en |
dc.description.abstract | This thesis presents a concept hierarchy-based approach to pnvacy preservmg data collection for data mining called the p-level model. The p-level model allows data providers to divulge information at any chosen privacy level (p-level), on any attribute. Data collected at a high p-level signifies divulgence at a higher conceptual level and thus ensures more privacy. Data providers have greater control of their privacy preferences, and have provided significantly (25-75%) more personal data values, at various p-levels, than when providing the same information using the regular, fixed-level Cf-level) method of data collection. However, the data mining process, which involves the integration of various data values, can constitute a privacy breach if combinations of attributes at the various p-levels result in the inference of knowledge that exists at lower p-levels. Providing anonymity guarantees prior to release can further protect the collected data set from privacy breaches due to linking the released data set with external data sets. This thesis describes the plevel reduction phenomenon and proposes methods to identify and control the occurrence of this privacy breach. One objective of this thesis is to explore the feasibility of applying data collected with the p-level approach to data mining problems. We apply data collected using the p-level approach to a data classification problem, and discover that the mining accuracy of the plevel approach classifier is comparable to that of the f-level (no privacy) approach, thus we conclude that the p-level approach is beneficial for the purpose of privacy preserving data collection. | |
dc.format.extent | xiii, 168 leaves : ill. ; 30 cm. | en |
dc.identifier.citation | Williams, A. A. (2012). The use of concept hierarchies in privacy preserving data acquisition for data mining (Doctoral thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/4735 | en_US |
dc.identifier.doi | http://dx.doi.org/10.11575/PRISM/4735 | |
dc.identifier.uri | http://hdl.handle.net/1880/105736 | |
dc.language.iso | eng | |
dc.publisher.institution | University of Calgary | en |
dc.publisher.place | Calgary | en |
dc.rights | University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission. | |
dc.title | The use of concept hierarchies in privacy preserving data acquisition for data mining | |
dc.type | doctoral thesis | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | University of Calgary | |
thesis.degree.name | Doctor of Philosophy (PhD) | |
ucalgary.item.requestcopy | true | |
ucalgary.thesis.accession | Theses Collection 58.002:Box 2097 627942969 | |
ucalgary.thesis.notes | UARC | en |
ucalgary.thesis.uarcrelease | y | en |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- thesis_Williams_2012.pdf
- Size:
- 74.16 MB
- Format:
- Adobe Portable Document Format
- Description:
- Thesis