Deakin University
Browse

File(s) under permanent embargo

Hierarchical dirichlet process for tracking complex topical structure evolution and its application to autism research literature

Version 2 2024-06-03, 16:52
Version 1 2022-11-17, 22:35
chapter
posted on 2024-06-03, 16:52 authored by A Beykikhoshk, O Arandjelovic, Svetha VenkateshSvetha Venkatesh, Q Phung
In this paper we describe a novel framework for the discovery of the topical content of a data corpus, and the tracking of its complex structural changes across the temporal dimension. In contrast to previous work our model does not impose a prior on the rate at which documents are added to the corpus nor does it adopt the Markovian assumption which overly restricts the type of changes that the model can capture. Our key technical contribution is a framework based on (i) discretization of time into epochs, (ii) epoch-wise topic discovery using a hierarchical Dirichlet process-based model, and (iii) a temporal similarity graph which allows for the modelling of complex topic changes: emergence and disappearance, evolution, splitting and merging. The power of the proposed framework is demonstrated on the medical literature corpus concerned with the autism spectrum disorder (ASD) - an increasingly important research subject of significant social and healthcare importance. In addition to the collected ASD literature corpus which we made freely available, our contributions also include two free online tools we built as aids to ASD researchers. These can be used for semantically meaningful navigation and searching, as well as knowledge discovery from this large and rapidly growing corpus of literature.

History

Volume

9077

Chapter number

43

Pagination

550-562

ISSN

0302-9743

eISSN

1611-3349

ISBN-13

9783319180380

Language

eng

Publication classification

X Not reportable, B Book chapter, B1 Book chapter

Copyright notice

2015, Springer

Extent

58

Editor/Contributor(s)

Cao T, Lim EP, Zhou ZH, Ho TB, Cheung D, Motoda H

Publisher

Springer

Place of publication

Berlin, Germany

Title of book

Advances in knowledge discovery and data mining 19th Pacific-Asia Conference, PAKDD 2015, Ho Chi Minh City, Vietnam, May 19-22, 2015, Proceedings, Part I

Series

Lecture notes in artifical intelligence; v.9077