Small-variance asymptotics for bayesian nonparametric models with constraints

Li, Cheng; Rana, Santu; Phung, Quoc-Dinh; Venkatesh, Svetha

File(s) under permanent embargo

Small-variance asymptotics for bayesian nonparametric models with constraints

chapter

posted on 2015-01-01, 00:00 authored by Cheng Li, Santu RanaSantu Rana, Quoc-Dinh Phung, Svetha VenkateshSvetha Venkatesh

The users often have additional knowledge when Bayesian nonparametric models (BNP) are employed, e.g. for clustering there may be prior knowledge that some of the data instances should be in the same cluster (must-link constraint) or in different clusters (cannot-link constraint), and similarly for topic modeling some words should be grouped together or separately because of an underlying semantic. This can be achieved by imposing appropriate sampling probabilities based on such constraints. However, the traditional inference technique of BNP models via Gibbs sampling is time consuming and is not scalable for large data. Variational approximations are faster but many times they do not offer good solutions. Addressing this we present a small-variance asymptotic analysis of the MAP estimates of BNP models with constraints. We derive the objective function for Dirichlet process mixture model with constraints and devise a simple and efficient K-means type algorithm. We further extend the small-variance analysis to hierarchical BNP models with constraints and devise a similar simple objective function. Experiments on synthetic and real data sets demonstrate the efficiency and effectiveness of our algorithms.

History

Event

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Title of book

Advances in knowledge discovery and data mining 19th Pacific-Asia Conference, PAKDD 2015, Ho Chi Minh City, Vietnam, May 19-22, 2015, Proceedings, Part II

Volume

9078

Series

Lecture notes in computer science; v.9078

Chapter number

8

Pagination

92 - 105

Publisher

Springer

Location

Vietnam

Place of publication

Berlin, Germany

Publisher DOI

https://doi.org/10.1007/978-3-319-18032-8_8

Start date

2015-01-01

End date

2015-01-01

ISSN

0302-9743

eISSN

1611-3349

ISBN-13

9783319180380

Language

eng

Publication classification

B Book chapter; B1 Book chapter

Copyright notice

2015, Springer

Extent

59

Editor/Contributor(s)

T Cao, E Lim, Z Zhou, T Ho, D Cheung, H Motoda

Usage metrics

Keywords

Science & Technology Technology Computer Science, Artificial Intelligence Computer Science, Information Systems Computer Science, Theory & Methods Computer Science

Licence

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) under permanent embargo

Small-variance asymptotics for bayesian nonparametric models with constraints

History

Event

Title of book

Volume

Series

Chapter number

Pagination

Publisher

Location

Place of publication

Publisher DOI

Start date

End date

ISSN

eISSN

ISBN-13

Language

Publication classification

Copyright notice

Extent

Editor/Contributor(s)

Usage metrics

Categories

Keywords

Licence

Exports