Towards theoretical and practical evaluation of privacy and utility of data sanitization mechanisms

dc.contributor.advisor: Safavi-Naeini, Reyhaneh
dc.contributor.advisor: Barker, Kenneth E.
dc.contributor.author: Askari, Mina
dc.date.accessioned: 2017-12-18T22:37:16Z
dc.date.available: 2017-12-18T22:37:16Z
dc.date.issued: 2012
dc.description: Bibliography: p. 139-151 (en)
dc.description.abstract: Massive data collection, aggregation and analysis about individuals on the Internet raises the fundamental issue of privacy protection. Releasing collected data is often beneficial for research, testing, marketing, decision making and data mining. However, published data can violate individuals' privacy, especially when aggregated with other sources of data. In response to privacy concerns, and to ensure the privacy of the individuals in the published dataset, data are sanitized by applying specific operations to them prior to publication. The cost of applying these privacy operations to the original collected data is the loss of some information. Hence, data utility is another important factor that should be considered in data sanitization mechanisms. In this thesis, we focus primarily on the privacy and utility issues of sanitization mechanisms. There are several sanitization mechanisms with different notions of privacy and utility. To be able to measure, set and compare the level of privacy protection and utility of these mechanisms, there is a need to translate them into a unified framework for evaluation. In this thesis, a thorough theoretical and empirical investigation of the privacy and utility of sanitization mechanisms in non-interactive data release is proposed by developing two frameworks. Furthermore, we use the specifications of several sanitization mechanisms to evaluate our frameworks. We first propose a novel framework that represents a mechanism as a noisy channel and evaluates its privacy and utility using information theoretic measures. We show that the deterministic publishing property that is used in most of these mechanisms reduces privacy guarantees and causes information to leak. We also show that by using this framework we can compute the sanitization mechanism's utility from the point of view of a data user.
By formalizing the background knowledge of the adversary and the data user, we demonstrate its strong effect on these metrics. We use k-anonymity, a popular sanitization mechanism, as an example and use the framework to analyze the privacy and utility offered by the mechanism. We then provide a mining framework that can be specialized to specific scenarios, modeling privacy and usefulness notions and quantifying their levels for the given dataset. This framework uses a definition of utility of mining tasks that data providers can use to measure and compare the utility of data mining results obtained from the original and sanitized datasets. This will provide a decision support mechanism for data providers to select appropriate sanitization mechanisms. This utility definition is general and captures the information obtained by any data user. The power of the framework is in its adaptability to capture various notions of privacy, utility and adversarial power for comparing sanitization systems in a particular setting.
dc.format.extent: ix, 151 leaves ; 30 cm. (en)
dc.identifier.citation: Askari, M. (2012). Towards theoretical and practical evaluation of privacy and utility of data sanitization mechanisms (Doctoral thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/5037 (en_US)
dc.identifier.doi: http://dx.doi.org/10.11575/PRISM/5037
dc.identifier.uri: http://hdl.handle.net/1880/106038
dc.language.iso: eng
dc.publisher.institution: University of Calgary (en)
dc.publisher.place: Calgary (en)
dc.rights: University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.title: Towards theoretical and practical evaluation of privacy and utility of data sanitization mechanisms
dc.type: doctoral thesis
thesis.degree.discipline: Computer Science
thesis.degree.grantor: University of Calgary
thesis.degree.name: Doctor of Philosophy (PhD)
ucalgary.item.requestcopy: true
ucalgary.thesis.accession: Theses Collection 58.002:Box 2102 627942972
ucalgary.thesis.notes: UARC (en)
ucalgary.thesis.uarcrelease: y (en)
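
Illustrative note: the abstract describes treating a sanitization mechanism as a noisy channel and measuring it with information theoretic quantities, using k-anonymity as the example. The sketch below is one possible reading of that idea, not the thesis's actual framework. It assumes mutual information as the leakage measure, a toy list of ages as the only quasi-identifier, and a fixed-width generalization; the function names (mutual_information, generalize_age, is_k_anonymous) are hypothetical and do not come from the thesis.

```python
import math
from collections import Counter


def mutual_information(pairs):
    """Empirical mutual information I(X;Y) in bits from (x, y) samples."""
    n = len(pairs)
    joint = Counter(pairs)
    px = Counter(x for x, _ in pairs)
    py = Counter(y for _, y in pairs)
    mi = 0.0
    for (x, y), count in joint.items():
        p_xy = count / n
        # p(x,y) / (p(x) p(y)) written with raw counts
        mi += p_xy * math.log2(count * n / (px[x] * py[y]))
    return mi


def generalize_age(age, width=10):
    """Deterministic generalization: map an age to a fixed-width range."""
    low = (age // width) * width
    return f"{low}-{low + width - 1}"


def is_k_anonymous(values, k):
    """True if every published value is shared by at least k records."""
    return min(Counter(values).values()) >= k


# Toy data (assumed): eight records with distinct ages.
ages = [21, 23, 25, 27, 31, 34, 36, 38]
published = [generalize_age(a) for a in ages]

# Information the published column carries about the original one.
leak_bits = mutual_information(list(zip(ages, published)))

print(published)
print(is_k_anonymous(published, 4), leak_bits)
```

Because the generalization here is deterministic, the published value is fully determined by the original value, so the mutual information equals the entropy of the published column. A randomized mechanism would carry less information than that, which is one way to read the abstract's remark about deterministic publishing.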
Files
Original bundle
Name: thesis_Askari_2012.pdf
Size: 74.32 MB
Format: Adobe Portable Document Format
Description: Thesis