Investigating the Impact of Code Comment Inconsistency on Bug Introducing Using an LLM Model
dc.contributor.advisor | Mohsirpour, Mohammad | |
dc.contributor.advisor | Belostotski, Leonid | |
dc.contributor.author | Radmanesh, Shiva | |
dc.contributor.committeemember | Barcomb, Ann | |
dc.contributor.committeemember | Drew, Steve | |
dc.date.accessioned | 2024-09-19T20:56:04Z | |
dc.date.available | 2024-09-19T20:56:04Z | |
dc.date.issued | 2024-09-18 | |
dc.description.abstract | Code comments are essential for clarifying code functionality, improving readability, and facilitating collaboration among developers. They serve as a guide to help both current and future developers understand the logic and purpose behind specific code segments. However, as software evolves, code changes frequently, and comments may not always be updated to reflect these changes. Despite their importance, comments often become outdated, leading to inconsistencies with the corresponding code. This can mislead developers and potentially introduce bugs. This thesis investigates the impact of code-comment inconsistency on bug introduction using large language models, specifically GPT-3.5. I first compare the performance of the GPT-3.5 model with other state-of-the-art models in detecting these inconsistencies, demonstrating the superiority of GPT-3.5 in this domain. Additionally, I analyze the temporal evolution of code-comment inconsistencies and their effect on bug proneness over various timeframes using the GPT-3.5 model and odds ratio analysis. Our findings reveal that inconsistent changes are around 1.5 times more likely to lead to a bug-introducing commit than consistent changes, highlighting the necessity of maintaining consistent and up-to-date comments in software development. This thesis provides new insights into the relationship between code-comment inconsistency and software quality, offering a comprehensive analysis of its impact over time. | |
dc.identifier.citation | Radmanesh, S. (2024). Investigating the impact of code comment inconsistency on bug introducing using an LLM model (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. | |
dc.identifier.uri | https://hdl.handle.net/1880/119796 | |
dc.language.iso | en | |
dc.publisher.faculty | Graduate Studies | |
dc.publisher.institution | University of Calgary | |
dc.rights | University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission. | |
dc.subject.classification | Artificial Intelligence | |
dc.subject.classification | Computer Science | |
dc.title | Investigating the Impact of Code Comment Inconsistency on Bug Introducing Using an LLM Model | |
dc.type | master thesis | |
thesis.degree.discipline | Engineering – Electrical & Computer | |
thesis.degree.grantor | University of Calgary | |
thesis.degree.name | Master of Science (MSc) | |
ucalgary.thesis.accesssetbystudent | I require a thesis withhold – I need to delay the release of my thesis due to a patent application, and other reasons outlined in the link above. I have/will need to submit a thesis withhold application. |