- 1、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。。
- 2、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
Data Warehousing资料仓储.ppt
* Data Mining: Concepts and Techniques * Kinds of Exceptions and their Computation Parameters SelfExp: surprise of cell relative to other cells at same level of aggregation InExp: surprise beneath the cell PathExp: surprise beneath cell for each drill-down path Computation of exception indicator (modeling fitting and computing SelfExp, InExp, and PathExp values) can be overlapped with cube construction Exception themselves can be stored, indexed and retrieved like precomputed aggregates * Data Mining: Concepts and Techniques * Examples: Discovery-Driven Data Cubes * Data Mining: Concepts and Techniques * Complex Aggregation at Multiple Granularities: Multi-Feature Cubes Multi-feature cubes (Ross, et al. 1998): Compute complex queries involving multiple dependent aggregates at multiple granularities Ex. Grouping by all subsets of {item, region, month}, find the maximum price in 1997 for each group, and the total sales among all maximum price tuples select item, region, month, max(price), sum(R.sales) from purchases where year = 1997 cube by item, region, month: R such that R.price = max(price) Continuing the last example, among the max price tuples, find the min and max shelf live, and find the fraction of the total sales due to tuple that have min shelf life within the set of all max price tuples * Data Mining: Concepts and Techniques * Cube-Gradient (Cubegrade) Analysis of changes of sophisticated measures in multi-dimensional spaces Query: changes of average house price in Vancouver in ‘00 comparing against ’99 Answer: Apts in West went down 20%, houses in Metrotown went up 10% Cubegrade problem by Imielinski et al. Changes in dimensions ? changes in measures Drill-down, roll-up, and mutation * Data Mining: Concepts and Techniques * From Cubegrade to Multi-dimensional Constrained Gradients in Data Cubes Significantly more expressive than association rules Capture trends in user-specified measures Serious challenges Many trivial cells in a cube ? “significance co
您可能关注的文档
- Chapter 13 Testing Hypotheses.ppt
- Chapter 13 The Chi-Square Test.ppt
- Chapter 13 Uncertainty.ppt
- Chapter 13 XML.ppt
- Chapter 13-2 IO Systems.ppt
- Chapter 13CRAFTING A DEPLOYMENT STRATEGY.ppt
- Chapter 13Embedded Systems.ppt
- Chapter 13Graphics classes.ppt
- Chapter 13Ideal Transformers.ppt
- Chapter 13Magnetically Coupled Circuits.ppt
- Date 14 Oct 2004Location BrusselsAuthor Telefónica File.ppt
- December 18, 2006.ppt
- Decision Tables.ppt
- DEDUCTIVE vs. INDUCTIVE REASONING.ppt
- DefinitionTotal Quality Management.ppt
- Design Realization lecture 13.ppt
- Design Realization lecture 16.ppt
- Detection of PrPres in plasma.ppt
- Diabetic Ketoacidosis.ppt
- Digestive System & NutritionChp 14.ppt
文档评论(0)