- 6
- 0
- 约2.5万字
- 约 64页
- 2018-03-26 发布于广东
- 举报
* Data Mining: Concepts and Techniques * http://www.cs.sfu.ca/~han/dmbook Thank you !!! * * * * * * * * * * * * * * * * * * * Mining Class Comparisons Comparison: Comparing two or more classes. Method: Partition the set of relevant data into the target class and the contrasting class(es) Generalize both classes to the same high level concepts Compare tuples with the same high level descriptions Present for every tuple its description and two measures: support - distribution within single class comparison - distribution between classes Highlight the tuples with strong discriminant features Relevance Analysis: Find attributes (features) which best distinguish different classes. * Data Mining: Concepts and Techniques * Example: Analytical comparison Task Compare graduate and undergraduate students using discriminant rule. DMQL query use Big_University_DB mine comparison as “grad_vs_undergrad_students” in relevance to name, gender, major, birth_place, birth_date, residence, phone#, gpa for “graduate_students” where status in “graduate” versus “undergraduate_students” where status in “undergraduate” analyze count% from student * Data Mining: Concepts and Techniques * Example: Analytical comparison (2) Given attributes name, gender, major, birth_place, birth_date, residence, phone# and gpa Gen(ai) = concept hierarchies on attributes ai Ui = attribute analytical thresholds for attributes ai Ti = attribute generalization thresholds for attributes ai R = attribute relevance threshold * Data Mining: Concepts and Techniques * Example: Analytical comparison (3) 1. Data collection target and contrasting classes 2. Attribute relevance analysis remove attributes name, gender, major, phone# 3. Synchronous generalization controlled by user-specified dimension thresholds prime target and contrasting class(es) relations/cuboids * Data Mining: Concepts and Techniques * Example: Analytical comparison (4) Prime generalized relation for the target class: Graduate students Prime
您可能关注的文档
最近下载
- 《变幻的空间》 课件 2026浙美版美术八年级下册.ppt
- 2026年中国豆制品市场深度分析与发展动向研究报告.docx
- 学位论文___土木工程(结构工程)中学学生宿舍楼.doc VIP
- 初中生数学学习困难学生的心理辅导与教育干预策略教学研究课题报告.docx
- 2026浙美版美术八年级下册第二单元第4课《黑白的魅力》课件.pptx
- 职业病诊断医师考试题库及答案.docx VIP
- 火力发电厂典型事故案例汇编.pdf VIP
- 语文学习困难学生帮扶方案.docx VIP
- 2025年四川省广安市高考物理二诊试卷(含详细答案解析).docx VIP
- 全国大学生数学建模竞赛b题全国优秀论文.docx VIP
原创力文档

文档评论(0)