Measuring Whitespace Pattern Sequences as an Indication of Plagiarism
Author(s) -
Nikolaus Baer,
Robert Zeidman
Publication year - 2012
Publication title -
journal of software engineering and applications
Language(s) - English
Resource type - Journals
eISSN - 1945-3124
pISSN - 1945-3116
DOI - 10.4236/jsea.2012.54029
Subject(s) - computer science , identifier , copying , matching (statistics) , similarity (geometry) , reliability (semiconductor) , data mining , information retrieval , artificial intelligence , programming language , image (mathematics) , statistics , power (physics) , physics , mathematics , quantum mechanics , political science , law
There are several methods and technologies for comparing the statements, comments, strings, identifiers, and other visible elements of source code in order to efficiently identify similarity. In a prior paper we found that comparing the whitespace patterns was not precise enough to identify copying by itself. However, several possible methods for improving the precision of a whitespace pattern comparison were presented, the most promising of which was an examination of the sequences of lines with matching whitespace patterns. This paper demonstrates a method of evaluating the sequences of matching whitespace patterns and a detailed study of the method’s reliability
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom