Deakin University

Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review

Version 3 2024-06-19, 13:26
Version 2 2024-06-05, 01:56
Version 1 2023-02-09, 03:50
journal contribution
posted on 2024-06-19, 13:26 authored by LJ Hagg, Stephanie Merkouris, GA O'Dea, Lauren Francis, Christopher Greenwood, Matthew Fuller-Tyszkiewicz, Elizabeth Westrupp, Jacqui Macdonald, George Youssef
Background

Topic modeling approaches allow researchers to analyze and represent written texts. One of the commonly used approaches in psychology is latent Dirichlet allocation (LDA), which is used for rapidly synthesizing patterns of text within “big data.” However, its outputs can be sensitive to decisions made during the analytic pipeline, and it may not be suitable for certain scenarios such as short texts; we highlight resources for alternative approaches. This review focuses on the complex analytical practices specific to LDA, which existing practical guides for training LDA models have not addressed.

Objective

This scoping review used key analytical steps (data selection, data preprocessing, and data analysis) as a framework to understand the methodological approaches used in psychology research that applies LDA.

Methods

A total of 4 psychology and health databases were searched. Studies were included if they used LDA to analyze written words and focused on a psychological construct or issue. The data charting processes were constructed and employed based on common data selection, preprocessing, and data analysis steps.

Results

A total of 68 studies were included. These studies explored a range of research areas and mostly sourced their data from social media platforms. Although some studies reported on the preprocessing and data analysis steps taken, most did not provide sufficient detail for reproducibility. Furthermore, the review reveals ongoing debate about the necessity of certain preprocessing and data analysis steps.

Conclusions

Our findings highlight the growing use of LDA in psychological science. However, there is a need to improve analytical reporting standards and identify comprehensive and evidence-based best practice recommendations. To work toward this, we developed an LDA Preferred Reporting Checklist that will allow for consistent documentation of LDA analytic decisions and reproducible research outcomes.

History

Journal

Journal of Medical Internet Research

Volume

24

Pagination

e33166-e33166

Location

Canada

ISSN

1439-4456

eISSN

1438-8871

Language

en

Publication classification

C1 Refereed article in a scholarly journal

Issue

11

Publisher

JMIR Publications Inc.
