基于遗传算法的决策树优化算法研究-计算数学专业论文.docxVIP

  • 1
  • 0
  • 约4.83万字
  • 约 57页
  • 2019-03-30 发布于上海
  • 举报

基于遗传算法的决策树优化算法研究-计算数学专业论文.docx

基于 基于遗传算法的决策树优化算法研究 兰州 兰州交通大学硕士学位论文 万方数据 万方数据 万方数据 万方数据 关键词:数据挖掘;决策树;遗传算法;C4.5 算法 论文类型:应用基础研究 - II - Abstract With the rapid development of the network technology and database management system, the past data analysis tools and techniques are hard to meet the demand of processing the massive of data accumulated in different areas of the internal enterprise which leads to a huge waste of the data resources. Thus, the methods of finding the useful data for the enterprise from the existence of the huge information and knowledge data pool becomes a new angle caused extensive attentions. Data mining is a new technique which is to extract information from a data set and transform it into an understandable structure for further use. Among them, the classification and prediction is an important data mining tasks. At present, decision tree algorithm is used as the most commonly method in the data mining classification technology as its highly accurate classification, fast processing speed and comprehensible classification rules. The performance of the decision tree mainly depends on the accuracy and complexity of the classification and prediction model.C4.5, as the classic decision tree classification algorithm, has good nicety of grading (accuracy rate).However, because of the greedy algorithm adopted by the process of the tree construction, the structure of the decision tree often has some defects such as over fitting, too large scale etc. Genetic algorithms categorized as global search heuristics have the potential features of Parallelism and scalability which are easy to combine with other algorithms. Thus, applying the genetic algorithm to the decision tree classification algorithm C4.5 can optimized the decision tree through two different thinking approaches: This paper has deeply analyzed the basic principle of the decision tree algorithm C4.5 and summarized the shortcomings by practical cases on the balance of classification accuracy rate and scale control etc. Parti

您可能关注的文档

文档评论(0)

1亿VIP精品文档

相关文档