
A Fuzzy R Code similarity detection algorithm

conference contribution
posted on 2024-06-12, 15:08 authored by M Bartoszuk, M Gagolewski
R is a programming language and software environment for statistical computing and data analysis that is steadily gaining popularity among practitioners and scientists. In this paper we present a preliminary version of a system for detecting pairs of similar R code blocks within a given set of routines. The system is based on a suitable aggregation of the outputs of three different [0,1]-valued (fuzzy) proximity degree estimation algorithms. Its analysis on empirical data indicates that the system may in the future be applied in practice, e.g., to detect plagiarism among students' homework submissions or to analyze code recycling and code cloning in R's open source package repositories. © Springer International Publishing Switzerland 2014.
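
The general idea of aggregating several [0,1]-valued proximity degrees into one score can be illustrated with a minimal sketch. The abstract names neither the three estimation algorithms nor the aggregation function, so everything below is an assumption made for illustration: the three measures (token-sequence matching, token-set Jaccard, character 3-gram Jaccard), the arithmetic mean used as the aggregation, the 0.8 threshold, and all function names are hypothetical stand-ins, not the authors' method. The sketch is written in Python and treats R routines as plain strings.

```python
import re
from difflib import SequenceMatcher
from itertools import combinations


def tokens(code):
    # Crude tokenizer (an assumption, not a real R lexer):
    # identifiers, integers, and any other single non-space symbol.
    return re.findall(r"[A-Za-z_.][A-Za-z0-9_.]*|\d+|\S", code)


def jaccard(a, b):
    # Jaccard index of two collections, a value in [0, 1].
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def sim_sequence(x, y):
    # Hypothetical measure 1: order-sensitive match of token sequences.
    return SequenceMatcher(None, tokens(x), tokens(y)).ratio()


def sim_tokens(x, y):
    # Hypothetical measure 2: overlap of the sets of distinct tokens.
    return jaccard(tokens(x), tokens(y))


def sim_ngrams(x, y, n=3):
    # Hypothetical measure 3: overlap of character n-grams,
    # computed after removing all whitespace.
    def grams(s):
        s = "".join(s.split())
        return {s[i:i + n] for i in range(len(s) - n + 1)}
    return jaccard(grams(x), grams(y))


def aggregate(x, y):
    # Assumed aggregation: arithmetic mean of the three degrees.
    scores = [sim_sequence(x, y), sim_tokens(x, y), sim_ngrams(x, y)]
    return sum(scores) / len(scores)


def similar_pairs(routines, threshold=0.8):
    # Return the name pairs whose aggregated degree reaches the threshold.
    return [
        (a, b)
        for a, b in combinations(sorted(routines), 2)
        if aggregate(routines[a], routines[b]) >= threshold
    ]


routines = {
    "mean1": "f <- function(x) { sum(x) / length(x) }",
    "mean2": "g <- function(y) { sum(y) / length(y) }",
    "fact": "h <- function(n) { if (n < 2) 1 else n * h(n - 1) }",
}
for a, b in combinations(sorted(routines), 2):
    print(a, b, round(aggregate(routines[a], routines[b]), 3))
```

Each measure returns a value in [0,1], so their mean does as well, and identical routines score exactly 1. In a plagiarism-detection setting the threshold would need to be tuned on labeled data; the value used here is arbitrary.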

History

Volume

444

Pagination

21-30

Location

Montpellier, France

Start date

2014-07-15

End date

2014-07-19

ISSN

1865-0929

ISBN-13

9783319088518

Language

eng

Publication classification

E1.1 Full written paper - refereed

Editor/Contributor(s)

Laurent A, Strauss O, Bouchon-Meunier B, Yager RR

Title of proceedings

IPMU 2014 : Information processing and management of uncertainty in knowledge-based systems : 15th international conference, IPMU 2014, Montpellier France, July 15-19, 2014 : Proceedings

Event

Information Processing and Management of Uncertainty in Knowledge-Based Systems. Conference (15th : 2014 : Montpellier, France)

Issue

Part 3

Publisher

Springer

Place of publication

Berlin, Germany

Series

Communications in Computer and Information Science
