DM13 Clustering - Data Mining, Analytics, Big Data, and Data 21.ppt

  • About 6,490 characters
  • About 35 pages
  • Published 2018-04-14 from Hubei

* Overfitting-avoidance heuristic
  If every instance gets put into a different category, the numerator of CU reaches its maximum value:
    $$n - \sum_i \sum_j \Pr[a_i = v_{ij}]^2$$
  where n is the number of attributes.
  So without k in the denominator of the CU formula, every cluster would consist of one instance!

* Other Clustering Approaches
  EM – probability-based clustering
  Bayesian clustering
  SOM – self-organizing maps
  …

* Discussion
  Can interpret clusters by using supervised learning
    learn a classifier based on clusters
  Decrease dependence between attributes?
    pre-processing step
    E.g. use principal compon
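The overfitting-avoidance heuristic above can be checked numerically. The following is a minimal sketch, assuming the standard category utility definition for nominal attributes, CU = (1/k) Σ_l Pr[C_l] Σ_i Σ_j (Pr[a_i = v_ij | C_l]² − Pr[a_i = v_ij]²), which is not spelled out in this excerpt. The function name `category_utility`, the `divide_by_k` flag, and the toy data are hypothetical and chosen only for illustration.

```python
from collections import Counter

def category_utility(instances, clusters, divide_by_k=True):
    """Category utility for nominal attributes (illustrative sketch).

    instances: list of equal-length tuples of nominal values
    clusters:  list of lists of indices into `instances`
    divide_by_k: if False, return only the numerator of CU
    """
    n_total = len(instances)
    n_attrs = len(instances[0])

    def sum_sq_probs(rows):
        # Sum over attributes i and values j of Pr[a_i = v_ij]^2 within `rows`.
        total = 0.0
        for i in range(n_attrs):
            counts = Counter(row[i] for row in rows)
            total += sum((c / len(rows)) ** 2 for c in counts.values())
        return total

    baseline = sum_sq_probs(instances)  # unconditional term
    numerator = 0.0
    for members in clusters:
        rows = [instances[idx] for idx in members]
        numerator += (len(rows) / n_total) * (sum_sq_probs(rows) - baseline)
    return numerator / len(clusters) if divide_by_k else numerator

# Hypothetical toy data: 4 instances, 2 nominal attributes.
data = [("a", "x"), ("a", "y"), ("b", "y"), ("b", "z")]
two_clusters = [[0, 1], [2, 3]]
singletons = [[0], [1], [2], [3]]

# Without k, one instance per cluster gives the largest numerator.
num_two = category_utility(data, two_clusters, divide_by_k=False)
num_single = category_utility(data, singletons, divide_by_k=False)

# With k in the denominator, the two-cluster grouping scores higher.
cu_two = category_utility(data, two_clusters)
cu_single = category_utility(data, singletons)
```

On this toy data the singleton numerator equals the number of attributes minus the unconditional sum of squared probabilities, and dividing by k reverses the ranking so the two-cluster grouping is preferred.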
