Stability in clinical prediction models is crucial for transferability between studies, yet has received little attention. The problem is paramount in high dimensional data, which invites sparse models with feature selection capability. We introduce an effective method to stabilize sparse Cox model of time-to-events using statistical and semantic structures inherent in Electronic Medical Records (EMR). Model estimation is stabilized using three feature graphs built from (i) Jaccard similarity among features (ii) aggregation of Jaccard similarity graph and a recently introduced semantic EMR graph (iii) Jaccard similarity among features transferred from a related cohort. Our experiments are conducted on two real world hospital datasets: a heart failure cohort and a diabetes cohort. On two stability measures – the Consistency index and signal-to-noise ratio (SNR) – the use of our proposed methods significantly increased feature stability when compared with the baselines.
History
Volume
9078
Chapter number
26
Pagination
331-343
Location
Vietnam
Start date
2015-01-01
End date
2015-01-01
ISSN
0302-9743
eISSN
1611-3349
ISBN-13
9783319180328
Language
eng
Publication classification
B Book chapter, B1 Book chapter
Copyright notice
2015, IEEE
Extent
59
Editor/Contributor(s)
Cao T, Lim EP, Zhou ZH, Ho TB, Cheung D, Motoda H
Publisher
Springer
Place of publication
Berlin, Germany
Title of book
Advances in knowledge discovery and data mining 19th Pacific-Asia Conference, PAKDD 2015, Ho Chi Minh City, Vietnam, May 19-22, 2015, Proceedings, Part II