File(s) under permanent embargo
Semi-automated extraction of new requirements from online reviews for software product evolution
Version 2 2024-06-12, 15:41
Version 1 2019-11-21, 14:20
conference contribution
posted on 2024-06-12, 15:41 authored by J Buchan, M Bano, D Zowghi, P Volabouth
© 2018 IEEE.
To improve and increase their utility, software products must evolve continually and incrementally to meet the new requirements of current and future users. Online reviews from users of the software provide a rich and readily available resource for discovering candidate new features for future software releases. However, it is challenging to manually analyze a large volume of potentially unstructured and noisy data to extract useful information that supports software release planning decisions. This paper investigates machine learning techniques to automatically identify text that represents users' ideas for new features in their online reviews. A binary classification approach that categorizes extracted text as either a feature or a non-feature was evaluated experimentally. Three machine learning algorithms were evaluated in the experiments: Naïve Bayes (with multinomial and Bernoulli variants), Support Vector Machines (with linear and multinomial variants) and Logistic Regression. Variations in the configuration of k-fold cross-validation, the use of n-grams and the use of review sentiment were also evaluated experimentally. Based on binary classification of over a thousand separate reviews of two products, Trello and Jira, linear Support Vector Machines with review sentiment as an input, using n-grams in the range (1,4) together with 10-fold cross-validation, gave the best performance. The results confirm the feasibility and accuracy of semi-automated extraction of candidate requirements from a large volume of unstructured and noisy online user reviews. The next steps planned are to experiment with machine-supported grouping, prioritizing and visualizing of the extracted features to best support release planners' work, and to extend the sources of candidate requirements.
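As an illustration of two settings named in the abstract, the n-gram range (1,4) and 10-fold cross-validation, the following is a minimal standard-library sketch. It is not the authors' implementation: the function names `word_ngrams` and `k_fold_indices`, the whitespace tokenization, the lowercasing and the contiguous (unshuffled) folds are all assumptions made for this example.

```python
from collections import Counter


def word_ngrams(text, n_min=1, n_max=4):
    """Count every word n-gram of length n_min..n_max in a review.

    Assumption: tokens are produced by lowercasing and splitting on
    whitespace; the paper's actual preprocessing is not stated here.
    """
    tokens = text.lower().split()
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def k_fold_indices(n_items, k=10):
    """Yield (train, test) index lists for k contiguous folds.

    Assumption: folds are contiguous and unshuffled; fold sizes differ
    by at most one item when n_items is not divisible by k.
    """
    sizes = [n_items // k + (1 if i < n_items % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_items))
        yield train, test
        start += size


# Hypothetical review sentence, used only to exercise the functions.
features = word_ngrams("please add a dark mode")
folds = list(k_fold_indices(25, k=10))
```

Each review sentence becomes a bag of 1- to 4-word phrases that a classifier such as a linear Support Vector Machine could take as input, and each of the ten folds serves once as the held-out test set while the remaining nine are used for training.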
Pagination
31-40
Location
Adelaide, South Australia
Start date
2018-11-26
End date
2018-11-30
ISBN-13
9781728112411
Language
eng
Publication classification
E1.1 Full written paper - refereed
Title of proceedings
ASWEC 2018 : Proceedings of the 25th Australasian Software Engineering Conference 2018
Event
Australasian Software Engineering Conference (25th : 2018 : Adelaide, South Australia)
Publisher
IEEE
Place of publication
Piscataway, N.J.