Dolos: Language‐agnostic plagiarism detection in source code
Background Learning to code is increasingly embedded in secondary and higher education curricula, where solving programming exercises plays an important role in the learning process and in formative and summative assessment. Unfortunately, students admit that copying code from each other is a common...
Saved in:
| Published in | Journal of computer assisted learning Vol. 38; no. 4; pp. 1046 - 1061 |
|---|---|
| Main Authors | , , , , , , |
| Format | Journal Article |
| Language | English |
| Published |
Chichester, UK
John Wiley & Sons, Inc
01.08.2022
Wiley Wiley Subscription Services, Inc |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0266-4909 1365-2729 1365-2729 |
| DOI | 10.1111/jcal.12662 |
Cover
| Summary: | Background
Learning to code is increasingly embedded in secondary and higher education curricula, where solving programming exercises plays an important role in the learning process and in formative and summative assessment. Unfortunately, students admit that copying code from each other is a common practice and teachers indicate they rarely use plagiarism detection tools.
Objectives
We want to lower the barrier for teachers to detect plagiarism by introducing a new source code plagiarism detection tool (Dolos) that is powered by state‐of‐the art similarity detection algorithms, offers interactive visualizations, and uses generic parser models to support a broad range of programming languages.
Methods
Dolos is compared with state‐of‐the‐art plagiarism detection tools in a benchmark based on a standardized dataset. We describe our experience with integrating Dolos in a programming course with a strong focus on online learning and the impact of transitioning to remote assessment during the COVID‐19 pandemic.
Results and Conclusions
Dolos outperforms other plagiarism detection tools in detecting potential cases of plagiarism and is a valuable tool for preventing and detecting plagiarism in online learning environments. It is available under the permissive MIT open‐source license at https://dolos.ugent.be.
Implications
Dolos lowers barriers for teachers to discover, prove and prevent plagiarism in programming courses. This helps to enable a shift towards open and online learning and assessment environments, and opens up interesting avenues for more effective learning and better assessment.
Lay description
What is already known about this topic?
Source code plagiarism is common in programming courses.
Students mention lack of checking as one reason to commit plagiarism.
Existing plagiarism detection tools are not broadly used in current practice.
Current tools support few programming languages.
What this paper adds
A user‐friendly plagiarism detection tool that supports many programming languages.
Interactive visualizations to discover, prove and prevent plagiarism.
Plagiarism detection tools can help to prevent plagiarism for remote assessment.
Implications for practice and/or policy
Dolos makes it easier for teachers to detect source code plagiarism.
Dolos helps to prevent students from committing source code plagiarism.
Interactive visualizations support plagiarism discovery as an exploratory process.
Keep in mind that high similarity does not equal proven plagiarism. |
|---|---|
| Bibliography: | Funding information Research Foundation, Grant/Award Number: I002819N ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 0266-4909 1365-2729 1365-2729 |
| DOI: | 10.1111/jcal.12662 |