Deakin University
Browse

An evaluation of randomized machine learning methods for redundant data: Predicting short and medium-term suicide risk from administrative records and risk assessments

Version 2 2024-06-03, 17:51
Version 1 2023-10-25, 23:22
journal contribution
posted on 2024-06-03, 17:51 authored by Thuong Nguyen, Truyen TranTruyen Tran, Shivapratap Gopakumar, Dinh Phung, Svetha VenkateshSvetha Venkatesh
Accurate prediction of suicide risk in mental health patients remains an open problem. Existing methods including clinician judgments have acceptable sensitivity, but yield many false positives. Exploiting administrative data has a great potential, but the data has high dimensionality and redundancies in the recording processes. We investigate the efficacy of three most effective randomized machine learning techniques random forests, gradient boosting machines, and deep neural nets with dropout in predicting suicide risk. Using a cohort of mental health patients from a regional Australian hospital, we compare the predictive performance with popular traditional approaches clinician judgments based on a checklist, sparse logistic regression and decision trees. The randomized methods demonstrated robustness against data redundancies and superior predictive performance on AUC and F-measure.

History

Journal

arXiv

Article number

1605.01116

Publication classification

CN Other journal article

Publisher

Cornell University

Usage metrics

    Research Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC