Analysis of Efficient Data Stream and Massive Text Processing Algorithms


methods cannot capture the clusters in each dimension well when they are applied to evolving high-dimensional data streams. In this thesis, we quantify each dimension (attribute) of the data points separately and use the representative data points generated for each dimension in place of fixed-size intervals. These representative points are updated continuously as new data points arrive, so they capture the cluster trends in each dimension more accurately than fixed-size intervals do. Experimental results on synthetic and real data sets demonstrate the high effectiveness and accuracy of the proposed met
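
The idea of replacing fixed-size intervals with per-dimension representative points can be illustrated with a minimal Python sketch. This fragment does not give the thesis's actual update rule, so the sketch substitutes a simple online competitive-learning step (the nearest representative moves a fraction of the way toward each new value). The names `DimensionQuantizer`, `StreamQuantizer`, `max_reps`, and `learning_rate` are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple


class DimensionQuantizer:
    """Maintains a few representative values for ONE attribute.

    Assumption: the update rule below (nearest representative moves
    toward the new value) is an illustrative stand-in, not the rule
    used in the thesis.
    """

    def __init__(self, max_reps: int = 4, learning_rate: float = 0.1):
        self.max_reps = max_reps
        self.learning_rate = learning_rate
        self.reps: List[float] = []

    def update(self, value: float) -> int:
        """Absorb one value and return the index of its representative."""
        # Seed the representatives with the first distinct values seen.
        if len(self.reps) < self.max_reps and value not in self.reps:
            self.reps.append(value)
            return len(self.reps) - 1
        # Otherwise find the nearest representative and pull it toward the value.
        nearest = min(range(len(self.reps)),
                      key=lambda i: abs(self.reps[i] - value))
        self.reps[nearest] += self.learning_rate * (value - self.reps[nearest])
        return nearest


class StreamQuantizer:
    """Holds one independent DimensionQuantizer per attribute."""

    def __init__(self, num_dims: int, max_reps: int = 4,
                 learning_rate: float = 0.1):
        self.dims = [DimensionQuantizer(max_reps, learning_rate)
                     for _ in range(num_dims)]

    def update(self, point: Sequence[float]) -> Tuple[int, ...]:
        """Update every dimension and return the point's per-dimension cell."""
        if len(point) != len(self.dims):
            raise ValueError("point has the wrong number of dimensions")
        return tuple(q.update(v) for q, v in zip(self.dims, point))
```

Each incoming point is mapped to a tuple of representative indices, which plays the role that a grid cell plays in a fixed-interval scheme. Because the representatives drift with the data, the cell boundaries follow the stream instead of staying at preset positions.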
