Investigating the Impact of Code Comment Inconsistency on Bug Introducing Using an LLM Model

dc.contributor.advisorMohsirpour, Mohammad
dc.contributor.advisorBelostotski, Leonid
dc.contributor.authorRadmanesh, Shiva
dc.contributor.committeememberBarcomb, Ann
dc.contributor.committeememberDrew, Steve
dc.date.accessioned2024-09-19T20:56:04Z
dc.date.available2024-09-19T20:56:04Z
dc.date.issued2024-09-18
dc.description.abstractCode comments are essential for clarifying code functionality, improving readability, and facilitating collaboration among developers. They serve as a guide to help both current and future developers understand the logic and purpose behind specific code segments. However, as software evolves, code changes frequently, and comments may not always be updated to reflect these changes. Despite their importance, comments often become outdated, leading to inconsistencies with the corresponding code. This can mislead developers and potentially introduce bugs. This thesis investigates the impact of code-comment inconsistency on bug introduction using large language models, specifically GPT-3.5. I first compare the performance of the GPT-3.5 model with other state-of-the-art models in detecting these inconsistencies, demonstrating the superiority of GPT-3.5 in this domain. Additionally, I analyze the temporal evolution of code-comment inconsistencies and their effect on bug proneness over various timeframes using the GPT-3.5 model and odds ratio analysis. Our findings reveal that inconsistent changes are around 1.5 times more likely to lead to a bug-introducing commit than consistent changes, highlighting the necessity of maintaining consistent and up-to-date comments in software development. This thesis provides new insights into the relationship between code-comment inconsistency and software quality, offering a comprehensive analysis of its impact over time.
dc.identifier.citationRadmanesh, S. (2024). Investigating the impact of code comment inconsistency on bug introducing using an LLM model (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.
dc.identifier.urihttps://hdl.handle.net/1880/119796
dc.language.isoen
dc.publisher.facultyGraduate Studies
dc.publisher.institutionUniversity of Calgary
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subject.classificationArtificial Intelligence
dc.subject.classificationComputer Science
dc.titleInvestigating the Impact of Code Comment Inconsistency on Bug Introducing Using an LLM Model
dc.typemaster thesis
thesis.degree.disciplineEngineering – Electrical & Computer
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameMaster of Science (MSc)
ucalgary.thesis.accesssetbystudentI require a thesis withhold – I need to delay the release of my thesis due to a patent application, and other reasons outlined in the link above. I have/will need to submit a thesis withhold application.
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
ucalgary_2024_radmanesh_shiva.pdf
Size:
292.55 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.64 KB
Format:
Item-specific license agreed upon to submission
Description: