Investigating the Impact of Code Comment Inconsistency on Bug Introducing Using an LLM Model

Radmanesh, Shiva

Investigating the Impact of Code Comment Inconsistency on Bug Introducing Using an LLM Model

dc.contributor.advisor	Mohsirpour, Mohammad
dc.contributor.advisor	Belostotski, Leonid
dc.contributor.author	Radmanesh, Shiva
dc.contributor.committeemember	Barcomb, Ann
dc.contributor.committeemember	Drew, Steve
dc.date.accessioned	2024-09-19T20:56:04Z
dc.date.available	2024-09-19T20:56:04Z
dc.date.issued	2024-09-18
dc.description.abstract	Code comments are essential for clarifying code functionality, improving readability, and facilitating collaboration among developers. They serve as a guide to help both current and future developers understand the logic and purpose behind specific code segments. However, as software evolves, code changes frequently, and comments may not always be updated to reflect these changes. Despite their importance, comments often become outdated, leading to inconsistencies with the corresponding code. This can mislead developers and potentially introduce bugs. This thesis investigates the impact of code-comment inconsistency on bug introduction using large language models, specifically GPT-3.5. I first compare the performance of the GPT-3.5 model with other state-of-the-art models in detecting these inconsistencies, demonstrating the superiority of GPT-3.5 in this domain. Additionally, I analyze the temporal evolution of code-comment inconsistencies and their effect on bug proneness over various timeframes using the GPT-3.5 model and odds ratio analysis. Our findings reveal that inconsistent changes are around 1.5 times more likely to lead to a bug-introducing commit than consistent changes, highlighting the necessity of maintaining consistent and up-to-date comments in software development. This thesis provides new insights into the relationship between code-comment inconsistency and software quality, offering a comprehensive analysis of its impact over time.
dc.identifier.citation	Radmanesh, S. (2024). Investigating the impact of code comment inconsistency on bug introducing using an LLM model (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.
dc.identifier.uri	https://hdl.handle.net/1880/119796
dc.language.iso	en
dc.publisher.faculty	Graduate Studies
dc.publisher.institution	University of Calgary
dc.rights	University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subject.classification	Artificial Intelligence
dc.subject.classification	Computer Science
dc.title	Investigating the Impact of Code Comment Inconsistency on Bug Introducing Using an LLM Model
dc.type	master thesis
thesis.degree.discipline	Engineering – Electrical & Computer
thesis.degree.grantor	University of Calgary
thesis.degree.name	Master of Science (MSc)
ucalgary.thesis.accesssetbystudent	I require a thesis withhold – I need to delay the release of my thesis due to a patent application, and other reasons outlined in the link above. I have/will need to submit a thesis withhold application.

Files

Original bundle

Now showing 1 - 1 of 1

Name:: ucalgary_2024_radmanesh_shiva.pdf
Size:: 292.55 KB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.64 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Restricted Theses and Dissertations

Investigating the Impact of Code Comment Inconsistency on Bug Introducing Using an LLM Model

Files

Original bundle

License bundle

Collections

Libraries & Cultural Resources