Content vs metrics: Using language modeling to evaluate in-line source code comments for Python

dc.contributor.authorBoham, Maame Efua
dc.date.accessioned2022-11-04T11:11:24Z
dc.date.available2022-11-04T11:11:24Z
dc.date.issued2020
dc.descriptionUndergraduate thesis submitted to the Department of Computer Science, Ashesi University, in partial fulfillment of Bachelor of Science degree in / Computer Science, May 2020
dc.description.abstractDocumentation is vital to the understanding, maintenance and, ultimately, survival of software projects . And yet, a lot of software projects either lack documentation, or are very poorly documented. This results in a gradual decline in the quality of the code and may require complete overhauls in extreme cases. It is therefore important to evaluate documentation to ensure that it conveys clear and meaningful ideas. While existing methods of evaluating documentation are metrics based and look at the structure of documentation examples, this paper explores the possibility of evaluating documentation by assessing its contents. There is, however, a lack of an existing corpus of documentation for natural language processing tasks. A corpus of Python function/method comments is assembled, and a language modeling experiment is performed on them. The results of this experiment are mixed. While they show that it is possible to evaluate documentation by looking at its content as opposed to structure, they also show that this approach may not necessarily be more accurate, with lower quality comment examples having higher probability than those of higher quality.
dc.description.sponsorshipAshesi University
dc.identifier.urihttp://hdl.handle.net/20.500.11988/679
dc.subjectdocumentationen
dc.subjectsoftware projectsen
dc.subjectnatural language processing (NLP)en
dc.titleContent vs metrics: Using language modeling to evaluate in-line source code comments for Python
dc.typeUndergraduate thesisen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Boham_Maame_2020_CS_Thesis.pdf
Size:
493.21 KB
Format:
Adobe Portable Document Format
Description: