  • Published 2017-02-26 (Hubei)
Lesson 2: Data Preprocessing Techniques

Xu Congfu (徐从富), Associate Professor
Institute of Artificial Intelligence, Zhejiang University

Outline
  • Why preprocess the data?
  • Data cleaning
  • Data integration and transformation
  • Data reduction
  • Discretization and concept hierarchy generation
  • Summary

Why Data Preprocessing?
Data in the real world is dirty:
  • incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate data
      e.g., occupation=""
  • noisy: containing errors or outliers
      e.g., Salary="-10"
  • inconsistent: containing discrepancies in codes or names
      e.g., Age="42" but Birthday="03/07/1997"
      e.g., rating was "1, 2, 3", now rating is "A, B, C"
      e.g., discrepancy between dupl
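The three kinds of dirty data listed above can be illustrated with a small check over a single record. This is a minimal sketch, not part of the original slides. The function name `find_quality_issues`, the field names, the `as_of` reference date, and the MM/DD/YYYY reading of the birthday are all assumptions made for illustration.

```python
from datetime import date, datetime

# Hypothetical field names, taken from the slide's examples.
REQUIRED_FIELDS = ("occupation", "salary", "age", "birthday")


def find_quality_issues(record, as_of):
    """Return (kind, message) pairs for one record given as a dict of strings."""
    issues = []

    # Incomplete: an attribute value is missing or empty, e.g. occupation="".
    for field in REQUIRED_FIELDS:
        if not record.get(field, "").strip():
            issues.append(("incomplete", f"{field} is missing"))

    # Noisy: a value that cannot be right, e.g. Salary="-10".
    salary = record.get("salary", "").strip()
    if salary:
        try:
            if float(salary) < 0:
                issues.append(("noisy", "salary is negative"))
        except ValueError:
            issues.append(("noisy", "salary is not a number"))

    # Inconsistent: two attributes disagree, e.g. Age="42" with a 1997 birthday.
    age = record.get("age", "").strip()
    birthday = record.get("birthday", "").strip()
    if age and birthday:
        try:
            # Assumption: the birthday is written as MM/DD/YYYY.
            born = datetime.strptime(birthday, "%m/%d/%Y").date()
            had_birthday = (as_of.month, as_of.day) >= (born.month, born.day)
            expected = as_of.year - born.year - (0 if had_birthday else 1)
            if int(age) != expected:
                issues.append(
                    ("inconsistent",
                     f"age {age} does not match birthday (expected {expected})")
                )
        except ValueError:
            issues.append(("noisy", "age or birthday is malformed"))

    return issues


# Usage with the slide's example values; the reference date is arbitrary.
dirty = {"occupation": "", "salary": "-10", "age": "42", "birthday": "03/07/1997"}
kinds = [kind for kind, _ in find_quality_issues(dirty, date(2017, 2, 26))]
print(kinds)  # -> ['incomplete', 'noisy', 'inconsistent']
```

The sketch only detects problems. How to repair them (filling in missing values, smoothing noise, resolving conflicts) belongs to the data cleaning step named in the outline.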
