- 1、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。。
- 2、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
商业智慧的资料探勘
* * * * * * * * * * * * * * * * * I : the expected information needed to classify a given sample E (entropy) : expected information based on the partitioning into subsets by A * * * * * * * * * * Point P P(x,y) p01 a (3, 4) p02 b (3, 6) p03 c (3, 8) p04 d (4, 5) p05 e (4, 7) p06 f (5, 1) p07 g (5, 5) p08 h (7, 3) p09 i (7, 5) p10 j (8, 5) Initial m1 (3, 4) Initial m2 (8, 5) m1 = (3, 4) M2 = (8, 5) K-Means Clustering Step 1: K=2, Arbitrarily choose K object as initial cluster center * Point P P(x,y) m1 distance m2 distance Cluster p01 a (3, 4) 0.00 5.10 Cluster1 p02 b (3, 6) 2.00 5.10 Cluster1 p03 c (3, 8) 4.00 5.83 Cluster1 p04 d (4, 5) 1.41 4.00 Cluster1 p05 e (4, 7) 3.16 4.47 Cluster1 p06 f (5, 1) 3.61 5.00 Cluster1 p07 g (5, 5) 2.24 3.00 Cluster1 p08 h (7, 3) 4.12 2.24 Cluster2 p09 i (7, 5) 4.12 1.00 Cluster2 p10 j (8, 5) 5.10 0.00 Cluster2 Initial m1 (3, 4) Initial m2 (8, 5) M2 = (8, 5) Step 2: Compute seed points as the centroids of the clusters of the current partition Step 3: Assign each objects to most similar center m1 = (3, 4) K-Means Clustering * Point P P(x,y) m1 distance m2 distance Cluster p01 a (3, 4) 0.00 5.10 Cluster1 p02 b (3, 6) 2.00 5.10 Cluster1 p03 c (3, 8) 4.00 5.83 Cluster1 p04 d (4, 5) 1.41 4.00 Cluster1 p05 e (4, 7) 3.16 4.47 Cluster1 p06 f (5, 1) 3.61 5.00 Cluster1 p07 g (5, 5) 2.24 3.00 Cluster1 p08 h (7, 3) 4.12 2.24 Cluster2 p09 i (7, 5) 4.12 1.00 Cluster2 p10 j (8, 5) 5.10 0.00 Cluster2 Initial m1 (3, 4) Initial m2 (8, 5) M2 = (8, 5) Step 2: Compute seed points as the centroids of the clusters of the current partition Step 3: Assign each objects to most similar center m1 = (3, 4) K-Means Clustering Euclidean distance b(3,6) ??m2(8,5) = ((8-3)2 + (5-6)2 )1/2= (52 + (-1)2)1/2 = (25 + 1)1/2 = (26)1/2 = 5.10 Euclidean distance b(3,6) ??m1(3,4) = ((3-3)2 + (4-6)2 )1/2= (02 + (-2)2)1/2 = (0 + 4)1/2 = (4)1/2 = 2.00 * Point P P(x,y) m1 distance m2 distance Cluster p01 a (3, 4) 1.43 4.34 Clust
文档评论(0)