Measurement, modeling, and analysis of the file hosting ecosystem

dc.contributor.advisorWilliamson, Carey
dc.contributor.authorMahanti, Aniket
dc.date.accessioned2017-12-18T22:36:43Z
dc.date.available2017-12-18T22:36:43Z
dc.date.issued2012
dc.descriptionBibliography: p. 176-192en
dc.description.abstractThe Web has recently witnessed the emergence of file hosting services. These services provide users with a Web interface to upload, manage, and share files in the cloud. We present a comprehensive, longitudinal characterization study of the file hosting ecosystem. We perform detailed multi-level analysis of the usage behavior, infrastructure properties, content characteristics, and user-perceived performance of several top file hosting services. We instrument a measurement infrastructure that captures the characteristics of the ecosystem from multiple viewpoints across multiple layers. Our study utilizes multiple datasets collected over extended periods of time from passive measurements at an edge network, active measurement of an index site, as well as data collected through third-party Web analytics sources. Our two primary datasets are HTTP transaction and connection summaries of all Internet traffic collected at a large campus edge network over a one-year period. We carefully devised methods to identify user clickstreams in the HTTP transaction summary trace, including the identification of free and premium user instances, as well as the identification of content that is split into multiple pieces and downloaded using multiple transactions. We utilized the connection summary trace to understand and model salient flow-level and host-level properties of file hosting traffic. We augment our analysis with measurements from third-party analytics sources of global file hosting dynamics, as well as crawling file hosting links on an index site. Throughout this characterization, we compare and contrast these services with each other as well as with peer-to-peer file sharing and other media sharing services. To the best of our knowledge, this is the largest characterization study of the file hosting ecosystem. Our results have implications on caching, network management, content placement, and data center provisioning, and are likely to be relevant for both researchers and network administrators.
dc.format.extentxii, 192 leaves : ill. ; 30 cm.en
dc.identifier.citationMahanti, A. (2012). Measurement, modeling, and analysis of the file hosting ecosystem (Doctoral thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/5012en_US
dc.identifier.doihttp://dx.doi.org/10.11575/PRISM/5012
dc.identifier.urihttp://hdl.handle.net/1880/106013
dc.language.isoeng
dc.publisher.institutionUniversity of Calgaryen
dc.publisher.placeCalgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.titleMeasurement, modeling, and analysis of the file hosting ecosystem
dc.typedoctoral thesis
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameDoctor of Philosophy (PhD)
ucalgary.item.requestcopytrue
ucalgary.thesis.accessionTheses Collection 58.002:Box 2113 627942983
ucalgary.thesis.notesUARCen
ucalgary.thesis.uarcreleaseyen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_Mahanti_2012.pdf
Size:
91.34 MB
Format:
Adobe Portable Document Format
Description:
Thesis
Collections