- 0
- 0
- 约8.82千字
- 约 39页
- 2019-11-28 发布于广东
- 举报
* BUPT AIDM * Discussion When to use ES or DM? 1.5 A Simple Data Mining Process Model Figure 1.3 A simple data mining process model Assembling the Data The Data Warehouse Relational Databases and Flat Files The Data Warehouse The data warehouse is a historical database designed for decision support. Mining the Data Interpreting the Results Result Application * AIDM * A KDD Process (by Han) Data mining: the core of knowledge discovery process. Data Cleaning Data Integration Databases Data Warehouse Task-relevant Data Selection Data Mining Pattern Evaluation * * * Cruncher: 捣弄数字者;能够进行复杂、大量运算的人 * Trivial:不重要的 * * * 欺诈 * * * 终端节点 * * * * * * BUPT AIDM BUPT AIDM Part I Data Mining Fundamentals Chapter 1: Data Mining: A First View * BUPT AIDM * Content 1.1 What is Data Mining? Definition 1.2 What can computers Learn? 1.3 Is Data Mining Appropriate for My Problem? 1.4 Expert Systems or Data Mining? 1.6 Why Not Simple Search? * BUPT AIDM * 1.1 What is data mining: Motivation Data explosion problem Automated data collection tools and mature database technology lead to tremendous amounts of data stored in databases, data warehouses and other information repositories. Such amount of data beyond human understanding. We are drowning in data, but starving for knowledge! Solution: Data warehousing and data mining Data warehousing: for data storage Data mining: for Extraction of interesting knowledge (rules, regularities, patterns, constraints) from data in large databases * BUPT AIDM * 1.1 Data Mining is a result of natural evolution of information technology 1960s: Data collection and database creation 1970s - early 1980s: Database Management Systems Mid-1980s - present: Data warehouse Data analysis and understanding (data mining) * BUPT AIDM * Data Analysis:New Trend This is a time that one must speak with data. 未来属于运算师 (Super Crunchers《超级运算师》, Ian Ayres, 2009):日常决策将变得越来越自动化,人的判断作用将局限于为计算提供数据 葡萄酒味道和香味的预测:奥利.阿申费尔特是普林斯顿大学的经济学家,完全不懂葡萄酒的制作,但可以预测波尔多葡萄酒的价格基于天气(炎热
您可能关注的文档
- 《基础物理实验》高温超导材料特性测试以及低温温度计.ppt
- 《计量经济学导论》chapter.ppt
- 《电商公司的培训文件》自我的培训,团队教练.ppt
- 《计算机图形学教学资料》期末复习题.ppt
- 《计算机网络教学资料》实验一.ppt
- 《电子创新设计自动化eda》组合逻辑电路创新设计.ppt
- 《计算机网络科学应用教学讲义》计算机网络科学应用之六.ppt
- 计算机网络科学应用教学课程分析.ppt
- 《电子电路教学资料》前三章课前小测验合集.ppt
- 计算机网络科学应用教学.ppt
- 基于MOF复合材料修饰丝网印刷电极对Cd2+Cu2+的便携快检研究.pdf
- 不同基材表面铜单原子催化剂的制备及其催化性能的研究.pdf
- 噻吩取代四芳基乙烯分子的构筑及其多重刺激响应性质研究.pdf
- Na助熔剂—坩埚下降法GaN晶体生长中传热传质研究.pdf
- 基于咔唑的氢键有机框架材料的制备及其性能研究.pdf
- 直链二酸化合物及其改性水性环氧树脂的制备与性能研究.pdf
- 基于CADD策略的新型杂环嘧啶类高选择性CDK46抑制剂的设计、合成与抗肿瘤活性研究.pdf
- 具有光响应四芳基乙烯的合成及其吡啶功能化的研究.pdf
- 任务式教学法对高中聋哑学生的心理韧性与体育学习态度的影响研究.pdf
- 思维导图在高中英语语法复习中的应用研究.pdf
原创力文档

文档评论(0)