K-Means Example, Step 2
[Figure: new cluster centers (marked x) after the 1st iteration]

K-Means Example, Step 3
[Figure: new cluster centers after the 2nd iteration]

Main Techniques (2): Hierarchical Clustering
- Multilevel clustering: level 1 has n clusters → level n has one cluster, or the other way around.
- Agglomerative HC: starts with singleton clusters and merges them (bottom-up).
- Divisive HC: starts with one cluster containing all samples and splits it (top-down).
- The resulting hierarchy is drawn as a dendrogram.

Agglomerative HC Example (Nearest Neighbor)
- Level 2, k = 7 clusters
- Level 3, k = 6 clusters
- Level 4, k = 5 clusters
- Level 5, k = 4 clusters
- Level 6, k = 3 clusters
- Level 7, k = 2 clusters
- Level 8, k = 1 cluster

Remarks
Partitioning Clustering
- Time complexity: O(n)
- Pros: easy to use and relatively efficient.
- Cons: sensitive to initialization (a bad initialization might lead to bad results); needs to store all data in memory.
Hierarchical Clustering
- Time complexity: O(n^2 log n)
- Pros: outputs a dendrogram, which is desired in many applications.
- Cons: higher time complexity; needs to store all data in memory.

Introduction to BIRCH
- Designed for very large data sets
- Time and memory are limited
- Incremental and dynamic clustering of incoming objects
- Only one scan of the data is necessary
- Does not need the whole data set in advance
- Two key phases:
  1. Scans the database to build an in-memory tree
  2. Applies a clustering algorithm to cluster the leaf nodes

Similarity Metric (1)
Given a cluster of instances, we define:
- Centroid: the mean of the member points
- Radius: average distance from member points to the centroid
- Diameter: average pairwise distance within the cluster

Similarity Metric (2)
Distance measures between two clusters:
- Centroid Euclidean distance
- Centroid Manhattan distance
- Average inter-cluster distance
- Average intra-cluster distance
- Variance increase

Clustering Feature
The BIRCH algorithm builds a dendrogram called the clustering feature (CF) tree.
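
The cluster statistics named on the Similarity Metric slides can be sketched in a few lines of Python. This is a minimal illustration, not the slides' own formulas (which did not survive in the text): it assumes the root-mean-square forms of radius and diameter commonly given for BIRCH, and all function names are hypothetical.

```python
import math
from itertools import combinations

def centroid(points):
    """Mean of the member points, coordinate by coordinate."""
    n = len(points)
    return [sum(coords) / n for coords in zip(*points)]

def radius(points):
    """Root-mean-square distance from the member points to the centroid (assumed form)."""
    c = centroid(points)
    total = sum(math.dist(p, c) ** 2 for p in points)
    return math.sqrt(total / len(points))

def diameter(points):
    """Root-mean-square distance over all pairs of distinct members (assumed form)."""
    n = len(points)
    if n < 2:
        return 0.0
    total = sum(math.dist(p, q) ** 2 for p, q in combinations(points, 2))
    # Each unordered pair stands for two ordered pairs, hence the factor 2.
    return math.sqrt(2 * total / (n * (n - 1)))

def centroid_euclidean(a, b):
    """Euclidean distance between the centroids of clusters a and b."""
    return math.dist(centroid(a), centroid(b))

def centroid_manhattan(a, b):
    """Manhattan distance between the centroids of clusters a and b."""
    return sum(abs(x - y) for x, y in zip(centroid(a), centroid(b)))

# Two small illustrative clusters (made-up data).
cluster_a = [(0, 0), (2, 0), (0, 2), (2, 2)]
cluster_b = [(4, 4), (6, 6)]

print(centroid(cluster_a), radius(cluster_a), diameter(cluster_a))
print(centroid_euclidean(cluster_a, cluster_b), centroid_manhattan(cluster_a, cluster_b))
```

Radius and diameter describe how tight a single cluster is, while the two centroid distances compare a pair of clusters, which is what a merge decision would need.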