File(s) under permanent embargo
A Fuzzy R Code similarity detection algorithm
Version 2 2024-06-12, 15:08Version 2 2024-06-12, 15:08
Version 1 2019-10-09, 08:15Version 1 2019-10-09, 08:15
conference contribution
posted on 2024-06-12, 15:08 authored by M Bartoszuk, M GagolewskiR is a programming language and software environment for performing statistical computations and applying data analysis that increasingly gains popularity among practitioners and scientists. In this paper we present a preliminary version of a system to detect pairs of similar R code blocks among a given set of routines, which bases on a proper aggregation of the output of three different [0,1]-valued (fuzzy) proximity degree estimation algorithms. Its analysis on empirical data indicates that the system may in future be successfully applied in practice in order e.g. to detect plagiarism among students' homework submissions or to perform an analysis of code recycling or code cloning in R's open source packages repositories. © Springer International Publishing Switzerland 2014.
History
Volume
444Pagination
21-30Location
Montpellier, FrancePublisher DOI
Start date
2014-07-15End date
2014-07-19ISSN
1865-0929ISBN-13
9783319088518Language
engPublication classification
E1.1 Full written paper - refereedEditor/Contributor(s)
Laurent A, Strauss O, Bouchon-Meunier B, Yager RRTitle of proceedings
IPMU 2014 : Information processing and management of uncertainty in knowledge-based systems : 15th international conference, IPMU 2014, Montpellier France, July 15-19, 2014 : ProceedingsEvent
Information Processing and Management of Uncertainty in Knowledge-Based Systems. Conference (15th : 2014 : Montpellier, France)Issue
Part 3Publisher
SpringerPlace of publication
Berlin, GermanySeries
Communications in Computer and Information ScienceUsage metrics
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC