Our approach to the author identification task combines existing authorship attribution methods based on local n-grams (LNG) in a weighted ensemble. The approach placed third in this year's competition using a relatively simple weighting scheme, in which each model is weighted by its training set accuracy. LNG models create profiles, each consisting of a list of character n-grams that best represent a particular author's writing. The weighted ensemble improved on the accuracy of the underlying method without reducing the speed of the algorithm; the submitted solution was not only near the top of the leaderboard in accuracy but also one of the faster algorithms submitted.
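The idea of profile-based models combined by accuracy-weighted voting can be illustrated with a minimal sketch. This is not the submitted system: the function names, the profile size, the relative-difference distance, and the voting rule are all assumptions chosen for illustration, and the abstract does not specify how the ensemble members differ or how their outputs are combined.

```python
from collections import Counter, defaultdict


def ngram_profile(text, n, size):
    """Return the `size` most frequent character n-grams with relative frequencies."""
    grams = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    total = sum(grams.values()) or 1
    return {g: c / total for g, c in grams.most_common(size)}


def dissimilarity(p, q):
    """Assumed metric: squared relative difference over the union of two profiles."""
    score = 0.0
    for g in set(p) | set(q):
        a, b = p.get(g, 0.0), q.get(g, 0.0)
        score += ((a - b) / ((a + b) / 2)) ** 2
    return score


def predict(author_profiles, text, n, size):
    """Attribute `text` to the author whose profile is least dissimilar."""
    doc = ngram_profile(text, n, size)
    return min(author_profiles, key=lambda a: dissimilarity(author_profiles[a], doc))


def weighted_vote(predictions, weights):
    """Add each model's weight to the author it predicts; return the top author."""
    tally = defaultdict(float)
    for pred, weight in zip(predictions, weights):
        tally[pred] += weight
    return max(tally, key=tally.get)
```

In this sketch, each ensemble member would be a `predict` call with its own settings (for example a different `n`), and the `weights` passed to `weighted_vote` would be the accuracy each member achieved on the training set.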
History
Volume
1179
Pagination
1-4
Location
Valencia, Spain
Start date
2013-09-23
End date
2013-09-26
ISSN
1613-0073
Language
eng
Publication classification
E1.1 Full written paper - refereed
Copyright notice
2013, M. Jeusfeld c/o Redaktion Sun SITE, Informatik V, RWTH Aachen
Editor/Contributor(s)
Forner P, Navigli R, Tufis D, Ferro N
Title of proceedings
CLEF 2013 : Proceedings of the CLEF 2013 Conference
Event
Conference and Labs of the Evaluation Forum Association. Conference (2013 : Valencia, Spain)
Publisher
M. Jeusfeld c/o Redaktion Sun SITE, Informatik V, RWTH Aachen
Place of publication
Aachen, Germany
Series
Conference and Labs of the Evaluation Forum Association Conference