Deakin University
Browse
gagolewski-detectingsimilarity-2015.pdf (1017.46 kB)

Detecting similarity of R functions via a fusion of multiple heuristic methods

Download (1017.46 kB)
conference contribution
posted on 2015-01-01, 00:00 authored by Maciej Bartoszuk, Marek Gagolewski
In this paper we describe recent advances in our R code similarity detection algorithm. We propose a modification of the Program Dependence Graph (PDG) procedure used in the GPLAG system that better fits the nature of functional programming languages like R. The major strength of our approach lies in a proper aggregation of outputs of multiple plagiarism detection methods, as it is well known that no single technique gives perfect results. It turns out that the incorporation of the PDG algorithm significantly improves the recall ratio, i.e. it is better in indicating true positive cases of plagiarism or code cloning patterns. The implemented system is available as web application at http://SimilaR.Rexamine.com/.

History

Volume

89

Pagination

419-426

Location

Gijon, Spain

Open access

  • Yes

Start date

2015-06-30

End date

2015-07-03

ISSN

1951-6851

ISBN-13

9789462520776

Language

eng

Publication classification

E1.1 Full written paper - refereed

Editor/Contributor(s)

Alonso JM, Bustince H, Reformat M

Title of proceedings

IFSA and EUSFLAT 2019 : Proceedings of the 2015 Combined Conference of the International Fuzzy Systems Association and the European Society for Fuzzy Logic and Technology

Event

International Fuzzy Systems Association and European Society for Fuzzy Logic and Technology. Combined Conference (16th and 9th : 2015, Gijon, Spain)

Publisher

Atlantis Press

Place of publication

Paris, France

Series

Advances in Intelligent Systems Research