Deakin University
Browse

File(s) under permanent embargo

Improving the classification of neuropsychiatric conditions using gene ontology terms as features

journal contribution
posted on 2019-10-01, 00:00 authored by Thomas Quinn, Samuel Lee, Svetha VenkateshSvetha Venkatesh, Thin NguyenThin Nguyen
Although neuropsychiatric disorders have an established genetic background, their molecular foundations remain elusive. This has prompted many investigators to search for explanatory biomarkers that can predict clinical outcomes. One approach uses machine learning to classify patients based on blood mRNA expression. However, these endeavors typically fail to achieve the high level of performance, stability, and generalizability required for clinical translation. Moreover, these classifiers can lack interpretability because not all genes have relevance to researchers. For this study, we hypothesized that annotation-based classifiers can improve classification performance, stability, generalizability, and interpretability. To this end, we evaluated the models of four classification algorithms on six neuropsychiatric data sets using four annotation databases. Our results suggest that the Gene Ontology Biological Process database can transform gene expression into an annotation-based feature space that is accurate and stable. We also show how annotation features can improve the interpretability of classifiers: as annotations are used to assign biological importance to genes, the biological importance of annotation-based features are the features themselves. In evaluating the annotation features, we find that top ranked annotations tend contain top ranked genes, suggesting that the most predictive annotations are a superset of the most predictive genes. Based on this, and the fact that annotations are used routinely to assign biological importance to genetic data, we recommend transforming gene-level expression into annotation-level expression prior to the classification of neuropsychiatric conditions.

History

Journal

American journal of medical genetics part b: neuropsychiatric genetics

Volume

180

Issue

7

Pagination

508 - 518

Publisher

Wiley

Location

Chichester, Eng.

ISSN

1552-4841

eISSN

1552-485X

Language

eng

Publication classification

C1 Refereed article in a scholarly journal

Copyright notice

2019, Wiley Periodicals, Inc.