Deakin University
File(s) under permanent embargo

Semi-automated extraction of new requirements from online reviews for software product evolution

Version 2 2024-06-12, 15:41
Version 1 2019-11-21, 14:20
conference contribution
posted on 2024-06-12, 15:41 authored by J Buchan, M Bano, D Zowghi, P Volabouth
© 2018 IEEE. In order to improve and increase their utility, software products must evolve continually and incrementally to meet the new requirements of current and future users. Online reviews from users of the software provide a rich and readily available resource for discovering candidate new features for future software releases. However, it is challenging to manually analyze a large volume of potentially unstructured and noisy data to extract useful information to support software release planning decisions. This paper investigates machine learning techniques to automatically identify text that represents users' ideas for new features from their online reviews. A binary classification approach to categorize extracted text as either a feature or non-feature was evaluated experimentally. Three machine learning algorithms were evaluated in the experiments: Naïve Bayes (with multinomial and Bernoulli variants), Support Vector Machines (with linear and multinomial variants) and Logistic Regression. Variations in the configuration of k-fold cross-validation, the use of n-grams and review sentiment were also evaluated experimentally. Based on binary classification of over a thousand separate reviews of two products, Trello and Jira, linear Support Vector Machines with review sentiment as an input, using an n-gram range of (1,4) together with 10-fold cross-validation, gave the best performance. The results have confirmed the feasibility and accuracy of semi-automated extraction of candidate requirements from a large volume of unstructured and noisy online user reviews. The next steps planned are to experiment with machine-supported grouping, prioritizing and visualizing of the extracted features to best support release planners' work, as well as extending the sources of candidate requirements.
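
As a rough illustration of the kind of pipeline the abstract describes (word n-grams up to length 4 feeding a binary feature/non-feature classifier, scored with k-fold cross-validation), the following is a minimal sketch and not the authors' implementation. It assumes a small hand-rolled multinomial Naïve Bayes, one of the algorithm families the abstract names, in place of the best-performing linear Support Vector Machine, and it omits review sentiment as an input. The review sentences, the labels, and the names `word_ngrams`, `TinyMultinomialNB` and `k_fold_accuracy` are all invented for this example.

```python
import math
from collections import Counter


def word_ngrams(text, n_min=1, n_max=4):
    """Return every word n-gram of length n_min..n_max in the text."""
    tokens = text.lower().split()
    return [
        " ".join(tokens[i:i + n])
        for n in range(n_min, n_max + 1)
        for i in range(len(tokens) - n + 1)
    ]


class TinyMultinomialNB:
    """Hypothetical minimal multinomial Naive Bayes over n-gram counts."""

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.gram_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.gram_counts[label].update(word_ngrams(text))
        self.vocab = set().union(*self.gram_counts.values())
        return self

    def predict(self, text):
        n_docs = sum(self.doc_counts.values())
        # Simplification: n-grams never seen in training are ignored.
        grams = [g for g in word_ngrams(text) if g in self.vocab]

        def log_score(c):
            total = sum(self.gram_counts[c].values())
            score = math.log(self.doc_counts[c] / n_docs)
            for g in grams:
                # Laplace (add-one) smoothing avoids log(0).
                score += math.log(
                    (self.gram_counts[c][g] + 1) / (total + len(self.vocab))
                )
            return score

        return max(self.classes, key=log_score)


def k_fold_accuracy(texts, labels, k=10):
    """Pooled accuracy across k folds; item i is held out in fold i % k."""
    correct = 0
    for fold in range(k):
        train = [i for i in range(len(texts)) if i % k != fold]
        test = [i for i in range(len(texts)) if i % k == fold]
        model = TinyMultinomialNB().fit(
            [texts[i] for i in train], [labels[i] for i in train]
        )
        correct += sum(model.predict(texts[i]) == labels[i] for i in test)
    return correct / len(texts)


# Invented toy data: 1 = feature request, 0 = non-feature.
REVIEWS = [
    "please add a calendar view", "the app is great and easy to use",
    "i wish it had offline mode", "love this tool works well",
    "would be great to add dark mode", "crashes every time i open it",
    "please add export to pdf", "great product for my team",
    "i wish there was a search filter", "the app is slow and crashes",
]
LABELS = [1, 0, 1, 0, 1, 0, 1, 0, 1, 0]

model = TinyMultinomialNB().fit(REVIEWS, LABELS)
```

Calling `model.predict("please add dark mode")` classifies a new sentence, and `k_fold_accuracy(REVIEWS, LABELS, k=10)` gives a cross-validated score. With only ten toy sentences that score says nothing about real performance; the paper's figures come from over a thousand Trello and Jira reviews.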

History

Pagination

31-40

Location

Adelaide, South Australia

Start date

2018-11-26

End date

2018-11-30

ISBN-13

9781728112411

Language

eng

Publication classification

E1.1 Full written paper - refereed

Title of proceedings

ASWEC 2018 : Proceedings of the 25th Australasian Software Engineering Conference 2018

Event

Australasian Software Engineering Conference (25th : 2018 : Adelaide, South Australia)

Publisher

IEEE

Place of publication

Piscataway, N.J.
