- 2
- 0
- 约6.29万字
- 约 63页
- 2019-03-28 发布于上海
- 举报
兰州大学硕士学位论文
兰州大学硕士学位论文 基于网格的并行聚类算法及数据流聚类算法研究
Abstract
Clustering analysis,as an important task of data mimng,has wide application fields.These different applications raise some novel requirements for clustering analysis algorithm.
This thesis proposes a novel grid-based parallel clustering algorithm for multi—density datasets,called PGMCLU.The innovative works of it are as follows. Define the concepts,including grd compactness,grid density-connected,grid feature, cluster density and cluster similarity.Propose the method for data partition based on
grd partition,the method for local clustering based on grid density-connected concept, and the method for merging local clusters based on cluster similarity measure.Realize the adaptive set for parameter minPts.PGMCLU algorithm can better handle high—dimensional and massive datasets,and Can be capable of identifying clusters with distinguished shape and density.
Data stream is a sequence composed of a series of infinite,successive, high—speed,and time-ordered data objects.Data stream has the characteristics of real—time and infinity,which determines that clustering algorithm for data stream compared with traditional clustering algorithm for static dataset has some
distinguished properties.
This thesis proposes the grid-based clustering algorithm for data stream,shorten
for GC-Stream.The innovative works of it are as follows.Propose the concept of grid feature vector for describing the grid summary information.Improve the SP—Tree structure,and propose the novel spatial index structure LSP—Tree based on List data structure.Propose the exponential damped strategy for grid information,and the
pruning strategy for noisy grid and outdated grid.GC—Stream algorithm con better satisfy the real-time requirement of data stream clustering,and can be adaptive for
memory size.
.Detailed and complete experiments have proved the correctness and effectiveness of PGMCLU and GC—Stream algorithm,therefore,these novel algorithms will have s
您可能关注的文档
- 基于童谣活动的小学生德育分析-教育学原理专业论文.docx
- 基于搜索排序算法的本体评价系统研究-软件工程专业论文.docx
- 基于历史信息的移动对象轨迹预测研究-软件工程专业论文.docx
- 基于文献计量学的中国档案学者群体研究-情报学专业论文.docx
- 基于利益相关者视角的企业社会责任管理研究-企业管理专业论文.docx
- 基于碳酸氢钠的酸度敏感型静电纺丝纤维构建及作为载药支架的研究-材料科学与工程专业论文.docx
- 基于碳减排的广东省土地利用调控对策研究-土地资源管理专业论文.docx
- 基于系统调用的异常入侵检测技术及IDS扩展功能的研究-计算机科学与技术专业论文.docx
- 基于全矢谱的全信息能量研究-机械电子工程专业论文.docx
- 基于遗传算法对山西工行QOS路由优化的研究-计算机应用技术专业论文.docx
最近下载
- 煤的介绍课件.pptx VIP
- 部编人教版9年级下册《道德与法治》全册课件.pptx
- 官方通用文本离婚协议书 2026年.docx VIP
- 结构力学仿真软件:SAP2000:SAP2000中的材料属性设置.pdf VIP
- 2025WHO脑膜炎指南解读.pptx
- 老年人胆囊结石诊断和治疗专家共识(2026版).pptx VIP
- 普通党员2025年度组织生活会围绕“五个方面”查摆问题50条和整改措施供参考.docx VIP
- 如何开一家废品回收站?.docx VIP
- 2026年河南水利与环境职业学院单招职业适应性测试题库含答案详解.docx VIP
- 小学常用单词分类汇总国标手写斜体英语字帖(含例句).pdf VIP
原创力文档

文档评论(0)