- About 19,000 characters
- About 53 pages
- Published 2016-12-03 in Henan
data mining 2
How about real world data?

Data in the real world is dirty
- incomplete: lacking attribute value, lacking certain attributes of interest, containing only aggregate data
- noisy: containing errors or outliers
- inconsistent: containing discrepancies in codes or names

Why incomplete? (mainly from data collection)
- attributes of interest may not always be available, e.g. age of a female customer
- relevant data may have been considered unimportant at the time of entry, e.g. height of a customer
- malfunctions of data entry equipment
- inconsistent data may have been removed
……

Why no
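
The three kinds of dirtiness named in the slides (incomplete, noisy, inconsistent) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and does not come from the slides: the sample records, the field names, the plausible age range of 0 to 120, and the reference code list `{"US"}`.

```python
# Hypothetical customer records, invented purely for illustration.
records = [
    {"id": 1, "age": 34, "height_cm": 170, "country": "US"},
    {"id": 2, "age": None, "height_cm": 165, "country": "USA"},
    {"id": 3, "age": 29, "country": "US"},  # height_cm never recorded
    {"id": 4, "age": 290, "height_cm": 180, "country": "U.S."},
    {"id": 5, "age": 41, "height_cm": 175, "country": "US"},
]

EXPECTED_FIELDS = ("id", "age", "height_cm", "country")  # assumed schema
VALID_COUNTRY_CODES = {"US"}  # assumed reference code list


def find_incomplete(rows):
    """Map record id -> list of expected fields that are absent or None."""
    result = {}
    for row in rows:
        missing = [f for f in EXPECTED_FIELDS if row.get(f) is None]
        if missing:
            result[row["id"]] = missing
    return result


def find_noisy(rows, field, low, high):
    """Ids whose value for `field` lies outside an assumed plausible range."""
    return [
        r["id"]
        for r in rows
        if r.get(field) is not None and not (low <= r[field] <= high)
    ]


def find_inconsistent(rows, field, valid_codes):
    """Ids whose value for `field` is present but not in the reference code list."""
    return [
        r["id"]
        for r in rows
        if r.get(field) is not None and r[field] not in valid_codes
    ]


print("incomplete:", find_incomplete(records))
print("noisy:", find_noisy(records, "age", 0, 120))
print("inconsistent:", find_inconsistent(records, "country", VALID_COUNTRY_CODES))
```

A fixed range check is only one simple way to flag errors or outliers; it is used here because it needs no statistical assumptions about the data.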