ACoRA – A Platform for Automating Code Review Tasks

Background: Modern Code Reviews (MCR) are frequently adopted when assuring code and design quality in continuous integration and deployment projects. Although tiresome, they serve a secondary purpose of learning about the software product. Aim: Our objective is to design and evaluate a support tool...

Full description

Saved in:

Bibliographic Details
Published in	E-informatica : software engineering journal Vol. 19; no. 1
Main Authors	Ochodek, M., Staron, Miroslaw
Format	Journal Article
Language	English
Published	Wroclaw University of Science and Technology 01.01.2025
Subjects	BERT Code Reviews Continous Integration Machine Learning
Online Access	Get full text
ISSN	1897-7979 2084-4840 2084-4840
DOI	10.37190/e-Inf250102

Cover

More Information
Summary:	Background: Modern Code Reviews (MCR) are frequently adopted when assuring code and design quality in continuous integration and deployment projects. Although tiresome, they serve a secondary purpose of learning about the software product. Aim: Our objective is to design and evaluate a support tool to help software developers focus on the most important code fragments to review and provide them with suggestions on what should be reviewed in this code. Method: We used design science research to develop and evaluate a tool for automating code reviews by providing recommendations for code reviewers. The tool is based on Transformer-based machine learning models for natural language processing, applied to both programming language code (patch content) and the review comments. We evaluate both the ability of the language model to match similar lines and the ability to correctly indicate the nature of the potential problems encoded in a set of categories. We evaluated the tool on two open-source projects and one industry project. Results: The proposed tool was able to correctly annotate (only true positives) 35%–41% and partially correctly annotate 76%–84% of code fragments to be reviewed with labels corresponding to different aspects of code the reviewer should focus on. Conclusion: By comparing our study to similar solutions, we conclude that indicating lines to be reviewed and suggesting the nature of the potential problems in the code allows us to achieve higher accuracy than suggesting entire changes in the code considered in other studies. Also, we have found that the differences depend more on the consistency of commenting rather than on the ability of the model to find similar lines.
ISSN:	1897-7979 2084-4840 2084-4840
DOI:	10.37190/e-Inf250102