Computer Engineering and Applications (计算机工程与应用), 2018, 54(15): 48
Optimizing k-means initial clustering centers by minimizing sum of squared error
ZHOU Benjin, TAO Yizheng, JI Bin, XIE Yonghui
Institute of Computer Application, China Academy of Engineering Physics, Mianyang, Sichuan 621900, China
ZHOU Benjin, TAO Yizheng, JI Bin, et al. Optimizing k-means initial clustering centers by minimizing sum of squared error. Computer Engineering and Applications, 2018, 54(15): 48-52.
Abstract: The traditional k-means algorithm is sensitive to initial clustering centers and isolated points. Based on the principle of reducing the sum of squared error as much as possible, an optimized method for selecting k-means initial clustering centers is presented. During initial center selection, each time a clustering center is added, the reduction in the sum of squared error is computed for each candidate point, and the point that maximizes this reduction is selected. Experiments on real datasets, compared against the results of other algorithms, show that the method reduces the number of iterations and improves clustering quality. In addition, experiments on an artificial dataset demonstrate that the method is much less sensitive to isolated points.
Key words: clustering; k-means algorithm; sum of squared error; isolated points
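
The selection rule summarized in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not say how the first center is chosen or which points are candidates, so this sketch assumes that every data point is a candidate and that the first center is the point that gives the smallest sum of squared error on its own. The function name `greedy_sse_init` is hypothetical.

```python
import numpy as np

def greedy_sse_init(X, k):
    """Pick k initial centers from the rows of X by greedy SSE reduction.

    Illustrative sketch only; the first-center rule and the candidate set
    are assumptions, not details taken from the paper.
    """
    X = np.asarray(X, dtype=float)
    # Pairwise squared Euclidean distances, shape (n, n).
    diff = X[:, None, :] - X[None, :, :]
    sq_dist = np.einsum("ijk,ijk->ij", diff, diff)

    # Assumption: start with the point whose single-center SSE is smallest.
    first = int(np.argmin(sq_dist.sum(axis=0)))
    chosen = [first]
    # nearest[i] = squared distance from point i to its closest chosen center.
    nearest = sq_dist[:, first].copy()

    for _ in range(1, k):
        # reduction[c] = drop in SSE if point c were added as a new center.
        reduction = np.maximum(nearest[:, None] - sq_dist, 0.0).sum(axis=0)
        c = int(np.argmax(reduction))
        chosen.append(c)
        nearest = np.minimum(nearest, sq_dist[:, c])

    # Return the chosen centers and the SSE they produce before any k-means iteration.
    return X[chosen], float(nearest.sum())

# Two well-separated groups of three points each.
X = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]])
centers, sse = greedy_sse_init(X, 2)
```

The returned centers would then seed an ordinary k-means run. Building the full pairwise distance matrix costs O(n^2) memory, so this form only suits small datasets.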
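Abstract (Chinese): The traditional k-means algorithm is sensitive to initial clustering centers and isolated points. Taking the greatest possible reduction of the sum of squared error as its basic idea, this paper proposes a …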