Unsupervised Group Discovery in Relational Datasets.ppt

  • About 10,200 characters
  • About 46 pages
  • Published 2017-03-09 in Shanghai

Unsupervised Group Discovery in Relational Datasets: A Nonparametric Bayesian Approach
P.S. Koutsourelakis
School of Civil and Environmental Engineering, Cornell University
Artificial Intelligence Seminar, 10/12/07

Problem Setting

Augmented Problem Setting

Nonparametric Bayesian Methods*
  • Bayesian methods are most powerful when your prior adequately captures your beliefs.
  • Inflexible models (e.g. with a fixed number of groups) might yield unreasonable inferences.
  • Nonparametric methods provide a way of getting very flexible models.
  • Nonparametric models can automatically infer an adequate model size/complexity from the data, without needing to explicitly do Bayesian model comparison.
  • Many can be derived by starting with a finite parametric model and taking the limit as the number of parameters goes to infinity.

Chinese Restaurant Process (CRP)

Infinite Relational Model (IRM)

Application: Object-Feature Dataset

Predicting Missing Links

Infinite Relational Model (IRM)
Advantages:
  • It is an unsupervised learner with only two tunable parameters, β and γ.
  • It can be applied to multiple node types and relations.
  • It has all the advantages of a Bayesian formulation (missing data, confidence intervals) and of nonparametric methods (adaptation to data, outlier accommodation).
  • It has been successfully used for co-clustering object features, learning ontologies and social networks.

“Multiple Personalities”
  • In real data, objects (e.g. people) do not belong exclusively to one group, i.e. their identity is a mixture of basic components.
  • These components can be the same for each object type, but the mixing proportions might vary from one object to another.
  • IRM assumes that each object participates in all the relations it is involved in with a single identity.
  • A proper model should account for a different mixture for each object over all the possible identity components (which are common for the who
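
The Chinese Restaurant Process (CRP) appears in the slides above only as a title, without its construction. The following is a minimal sketch, assuming the standard CRP: object i joins an existing group with probability proportional to that group's current size, or opens a new group with probability proportional to a concentration parameter. Treating the γ listed among the IRM's tunable parameters as that concentration is an assumption here, since the extracted slides do not say so explicitly, and the function name `sample_crp` is hypothetical.

```python
import random
from collections import Counter


def sample_crp(n_objects, gamma, seed=None):
    """Draw one partition of n_objects from a CRP with concentration gamma (> 0).

    Assumption: gamma stands in for the concentration parameter; the slides
    only name it as one of the IRM's two tunable parameters.
    """
    rng = random.Random(seed)
    assignments = []  # assignments[i] = group index of object i
    counts = []       # counts[k] = number of objects currently in group k
    for _ in range(n_objects):
        # Existing group k has weight counts[k]; a new group has weight gamma.
        weights = counts + [gamma]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)   # open a new group
        else:
            counts[k] += 1     # join an existing group
        assignments.append(k)
    return assignments


if __name__ == "__main__":
    z = sample_crp(20, gamma=1.0, seed=0)
    print("assignments:", z)
    print("group sizes:", dict(Counter(z)))
```

The number of groups is not fixed in advance; it grows with the data, and larger values of the concentration tend to produce more groups. This is the sense in which a nonparametric prior lets the model size adapt to the data.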
