Model Assessment, Selection and Averaging (lecture slides).ppt

Published 2015-09-03 in Shanghai

Model Assessment, Selection and Averaging
Presented by: Bibhas Chakraborty

Goals
Model Selection: estimating the performance of different models in order to choose the best one.
Model Assessment: having chosen a final model, estimating its generalization error on new data.
Model Averaging: averaging the predictions from different models to achieve improved performance.

Splitting the data
Split the dataset into three parts:
Training set: used to fit the models.
Validation set: used to estimate prediction error for model selection.
Test set: used to assess the generalization error of the final chosen model.

Bayesian Information Criterion (BIC)
A model selection tool applicable in settings where the fitting is carried out by maximization of a log-likelihood.
It is motivated from a Bayesian point of view.
BIC tends to penalize complex models more heavily than AIC does (whenever $\log N > 2$), giving preference to simpler models in selection.
Its generic form is:
$\mathrm{BIC} = -2 \cdot \mathrm{loglik} + (\log N) \cdot d$
where loglik is the maximized log-likelihood, $N$ is the sample size, and $d$ is the number of parameters in the model.

Bayesian Model Selection
Suppose we have candidate models $\mathcal{M}_m$, $m = 1, \dots, M$, with corresponding model parameters $\theta_m$, and let $Z$ denote the training data.
Prior distribution: $\Pr(\theta_m \mid \mathcal{M}_m)$
Posterior probability: $\Pr(\mathcal{M}_m \mid Z) \propto \Pr(\mathcal{M}_m) \cdot \Pr(Z \mid \mathcal{M}_m)$
Compare two models via posterior odds:
$\frac{\Pr(\mathcal{M}_m \mid Z)}{\Pr(\mathcal{M}_l \mid Z)} = \frac{\Pr(\mathcal{M}_m)}{\Pr(\mathcal{M}_l)} \cdot \frac{\Pr(Z \mid \mathcal{M}_m)}{\Pr(Z \mid \mathcal{M}_l)}$
The second factor on the RHS is called the Bayes factor and describes the contribution of the data towards the posterior odds.

Bayesian Approach Continued
Unless there is strong evidence to the contrary, we typically assume that the prior over models is uniform (non-informative prior).
Using the Laplace approximation, one can establish a simple (but approximate) relationship between the posterior model probability and the BIC.
Lower BIC implies higher posterior probability of the model.
The use of BIC as a model selection criterion is thus justified.

AIC or BIC?
BIC is asymptotically consistent as a selection criterion. That is, given a family of models that includes the true model, the probability that BIC will select the correct one approaches one as the sample size becomes large.
AIC does not have the above proper