- 1
- 0
- 约4.83万字
- 约 57页
- 2019-03-30 发布于上海
- 举报
基于
基于遗传算法的决策树优化算法研究
兰州
兰州交通大学硕士学位论文
万方数据
万方数据
万方数据
万方数据
关键词:数据挖掘;决策树;遗传算法;C4.5 算法
论文类型:应用基础研究
- II -
Abstract
With the rapid development of the network technology and database management system, the past data analysis tools and techniques are hard to meet the demand of processing the massive of data accumulated in different areas of the internal enterprise which leads to a huge waste of the data resources. Thus, the methods of finding the useful data for the enterprise from the existence of the huge information and knowledge data pool becomes a new angle caused extensive attentions. Data mining is a new technique which is to extract information from a data set and transform it into an understandable structure for further use. Among them, the classification and prediction is an important data mining tasks.
At present, decision tree algorithm is used as the most commonly method in the data mining classification technology as its highly accurate classification, fast processing speed and comprehensible classification rules. The performance of the decision tree mainly depends on the accuracy and complexity of the classification and prediction model.C4.5, as the classic decision tree classification algorithm, has good nicety of grading (accuracy rate).However, because of the greedy algorithm adopted by the process of the tree construction, the structure of the decision tree often has some defects such as over fitting, too large scale etc. Genetic algorithms categorized as global search heuristics have the potential features of Parallelism and scalability which are easy to combine with other algorithms. Thus, applying the genetic algorithm to the decision tree classification algorithm C4.5 can optimized the decision tree
through two different thinking approaches:
This paper has deeply analyzed the basic principle of the decision tree algorithm C4.5 and summarized the shortcomings by practical cases on the balance of classification accuracy rate and scale control etc. Parti
您可能关注的文档
- 基于网络安全的政府监管分析-行政管理专业论文.docx
- 基于塑性和弹性模型的日元美元汇率波动实证研究-金融学专业论文.docx
- 基于数据挖掘的体育成绩管理与体能分析系统-软件工程专业论文.docx
- 基于前景理论的随机模糊多属性决策方法的研究-管理科学与工程专业论文.docx
- 基于生活情境的中学物理教学对学生能力培养的研究-课程与教学论(物理)专业论文.docx
- 基于利益相关者的企业社会责任与企业价值关系研究-会计学专业论文.docx
- 基于决策树的港口后方堆场辅助决策应用的研究计算机技术专业论文.docx
- 基于碳排放的 产品质量设计与推广策略研究-企业管理专业论文.docx
- 基于随机波动率和随机利率的亚式期权定价-应用数学专业论文.docx
- 基于数据挖掘的高校成绩分析系统的设计与实现-计算机技术专业论文.docx
- 2025年全国演出经纪人员资格认定考试试卷带答案(研优卷).docx
- 2025年全国演出经纪人员资格认定考试试卷完整版.docx
- 2025年全国演出经纪人员资格认定考试试题库及完整答案.docx
- 2025年全国演出经纪人员资格认定考试试卷完美版.docx
- 2025年全国演出经纪人员资格认定考试试卷含答案(实用).docx
- 2025年全国演出经纪人员资格认定考试试卷及答案(各地真题).docx
- 2025年下半年内江市部分事业单位公开考试招聘工作人员(240人)备考题库附答案.docx
- 2025年全国演出经纪人员资格认定考试试卷及答案1套.docx
- 2025年下半年四川成都市郫都区面向社会引进公共类事业单位人员2人备考题库最新.docx
- 2025年下半年内江市部分事业单位公开考试招聘工作人员(240人)备考题库附答案.docx
最近下载
- 《肖申克救赎》与《人性污点》对比评析.doc VIP
- 陕晋青宁四省2025-2026学年高三上学期(1月)第二次联考数学试卷(含答案详解).pdf
- 2025年AWS认证DynamoDB全局表数据不一致性问题的诊断与解决专题试卷及解析.pdf VIP
- 2025年房地产经纪人高级谈判策略模拟与实战演练专题试卷及解析.pdf VIP
- 2025年公共营养师不同食物类别中碘的分布规律专题试卷及解析.pdf VIP
- 2025年无人机驾驶员执照飞行操作责任归属法律依据专题试卷及解析.pdf VIP
- 2025年招标师招标采购从业人员接受礼品、宴请与旅游的禁止性规定专题试卷及解析.pdf VIP
- 文旅创意产业商业计划书.docx VIP
- 2025年MK 袋鼠数学竞赛Level-D (7-8年级) 真题+解析.pdf
- 【数学卷+解析】苏州零模2601.pdf
原创力文档

文档评论(0)