- 1、本文档共8页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
A permutation-augmented sampler for DP mixture models
A permutation-augmented sampler for DP mixture models
Percy Liang pliang@
University of California, Berkeley
Michael Jordan jordan@
University of California, Berkeley
Ben Taskar taskar@
University of Pennsylvania
Abstract
We introduce a new inference algorithm for
Dirichlet process mixture models. While
Gibbs sampling and variational methods fo-
cus on local moves, the new algorithm makes
more global moves. This is done by intro-
ducing a permutation of the data points as an
auxiliary variable. The algorithm is a blocked
sampler which alternates between sampling
the clustering and sampling the permutation.
The key to the efficiency of this approach is
that it is possible to use dynamic program-
ming to consider all exponentially many clus-
terings consistent with a given permutation.
We also show that random projections can be
used to effectively sample the permutation.
The result is a stochastic hill-climbing algo-
rithm that yields burn-in times significantly
smaller than those of collapsed Gibbs sam-
pling.
1. Introduction
Dirichlet process (DP) mixture models (Antoniak,
1974) have been usefully employed as a clustering
methodology in a variety of applied areas such as bioin-
formatics (Xing et al., 2004), vision (Sudderth et al.,
2006), and topic modeling (Teh et al., 2006). By treat-
ing the number of mixture components as random,
DP mixtures provide an appealing nonparametric ap-
proach to mixture modeling in which the complexity
of the model adapts to the complexity inherent in the
data.
Posterior inference for DP mixtures is challenging, and
a variety of inference algorithms have been specialized
Appearing in Proceedings of the 24 th International Confer-
ence on Machine Learning, Corvallis, OR, 2007. Copyright
2007 by the author(s)/owner(s).
to the DP mixture setting, including samplers (Ish-
waran James, 2001; Escobar West, 1995), varia-
tional approximations (Blei Jordan, 2005; Kurihara
et al., 2007), and other search algorithms (Daume,
2007). A diffic
您可能关注的文档
- 3种用U盘或外置光驱给MAC_BOOK_AIR或MAC_BOOK安装WIN7系统的方法--非常详细.pdf
- 3t diffusion tensor imaging and electrhy of peripheral carpal tunnel syndrome.pdf
- 3月7日托福考试阅读及词汇题分析-智课教育旗下智课教育.pdf
- 3、团体催眠对高三学生考试焦虑的改善作用.pdf
- 400句意大利口头语.pdf
- 40000_lettres_types_correspondance.pdf
- 4113131 夏考英语 冲刺班 201011 阅读 共66页 第15-20页(life routine).pdf
- 413751导学班 词汇精品课堂 短语搭配 第10-19页.pdf
- 42+MW燃煤热风炉结构设计特点分析.pdf
- 400款奢侈品fillico神户矿泉水包装.pdf
文档评论(0)