IR-based technique for linearizing abstract method invocation in plagiarism-suspected source code pair
Author(s) -
Oscar Karnalim
Publication year - 2018
Publication title -
journal of king saud university - computer and information sciences
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.617
H-Index - 33
eISSN - 2213-1248
pISSN - 1319-1578
DOI - 10.1016/j.jksuci.2018.01.012
Subject(s) - computer science , invocation , executable , security token , source code , code (set theory) , programming language , plagiarism detection , object (grammar) , process (computing) , linearization , theoretical computer science , similarity (geometry) , algorithm , information retrieval , image (mathematics) , artificial intelligence , computer security , nonlinear system , set (abstract data type) , sociology , anthropology , physics , quantum mechanics
According to several works, low-level approach is an effective and efficient solution for detecting source code plagiarism. Instead of relying on source code tokens, it compares the executable form of given code; that form only contains semantic-preserving tokens and is resistant to various plagiarism attacks. However, to our knowledge, an issue about statically linearizing abstract method (i.e. replacing each abstract method invocation with its respective invoked method content without considering invocation semantic) has not been handled comprehensively. Such issue, at some extent, will generate inaccurate plagiarism detection result when handling object-oriented source codes. This paper aims to solve such issue locally per plagiarism-suspected pair. It will generate all possible linearization pair alternatives and select the correct one through IR-based similarity. According to our evaluation regarding the reversed number of mismatched token, the number of false positive, and the number of process, proposed technique is more effective and efficient when compared to state-of-the-art and combinatoric technique. In addition, it is also observed that each IR mechanism used in proposed technique has its own exclusive benefit for selecting the correct linearized forms.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom