Utilization of Natural Language Processing for Extracting Smart Cities Requirements from Large Social Media Text

dc.contributor.advisorBarcomb, Ann
dc.contributor.advisorTan, Benjamin
dc.contributor.authorMirshafiee Khoozani, Mitra Sadat
dc.contributor.committeememberMessier, Geoffrey
dc.contributor.committeememberFapojuwo, Abraham
dc.date.accessioned2024-05-28T17:03:30Z
dc.date.available2024-05-28T17:03:30Z
dc.date.issued2024-05-14
dc.description.abstractMajor organizations such as urban centers worldwide face challenges from rapid population growth and evolving demands, requiring innovative approaches to stay responsive to residents' needs. This challenge is exemplified by the city of Calgary, where an automated system for aggregating and categorizing resident feedback could improve city planning. What people find important and useful can be seen in the articles they post on social media. One method for determining the performance of urban services and assets for citizens is paying attention to these data generated by the residents. In this regard, we need to examine datasets wherein writing is the primary form of citizen engagement (direct messages, requests, comments, complaints, etc.). To interpret this data, it is necessary to use appropriate tools and techniques for data processing and analysis of large volumes of unstructured text. Some of the most effective tools used by researchers nowadays falls into the scope of computational linguistics, specifically Natural language processing (NLP). Furthermore, Twitter is one of the primary platforms where individuals freely voice their opinions and concerns. In this study, we develop an automated workflow that can scrape, classify, and display tweets in a simplistic view. With the help of this system, local officials will be able to speed up the decision-making process when considering citizens' current problems. Following our research question, we look into the optimal scraping criteria, explore a variety of methods for topic and emotions analysis, and validate these methods both using automatic evaluation and manual assessment. As a result, we are able to identify issues related to city development, senior citizens, taxes, and unemployment using our best performing models (BERTopic for topic modeling and few-shot learning using Setfit for emotion analysis.) Afterward, we collect city employees' opinion regarding our research to determine the usefulness and applicability of this approach. Overall, we demonstrate how delving into these analyses can complement the current systems in place for urban planning.
dc.identifier.citationMirshafiee Khoozani, M. S. (2024). Utilization of natural language processing for extracting smart cities requirements from large social media text (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.
dc.identifier.urihttps://hdl.handle.net/1880/118847
dc.language.isoen
dc.publisher.facultyGraduate Studies
dc.publisher.institutionUniversity of Calgary
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subject.classificationEducation--Technology
dc.titleUtilization of Natural Language Processing for Extracting Smart Cities Requirements from Large Social Media Text
dc.typemaster thesis
thesis.degree.disciplineEngineering – Electrical & Computer
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameMaster of Science (MSc)
ucalgary.thesis.accesssetbystudentI do not require a thesis withhold – my thesis will have open access and can be viewed and downloaded publicly as soon as possible.
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2024_mirshafiee-khoozani_mitrasadat.pdf.pdf
Size:
1004.49 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: