* Larry Page Sergey Brin wrote BigFiles; GFS (Google File System) grew out of that then MapReduce which maps problems across cluster a of worker nodes then collects results aggregates/reduces result (used to generate Google’s index of WWW) Apache came out with Hadoop (used by Facebook, Yahoo, Amazon EC2 S3) which was an Open Source version with HDFS MapReduce – Batch Processing Jobs going after distributed data processing it near the data (same node) – not super fast (seconds vs. ms) not good for interactive/analytic (No updates / only appends) Google then came out with BigTable (compressed, high performance data storage) used by Google Maps, Google Reader, Google Earth, YouTube, and Gmail Apache adds NoSQL DB’s: Cassandra HBase The NoSQL onslaught of systems started (over 100 of them) including Oracle’s NoSQL (BerkeleyDB). * Goal was to Organize Data without moving it! – Hadoop HDFS MapReduce (Cheaper way to access Petabytes). HDFS can store any type of data or structure, but MapReduce works with key/value pairs Acquire Store data – NoSQL (simple key value storage) – Amazon DynamoDB (hosted), Apache Cassandra, HBase, BigTable, MongoDB, Oracle NoSQL (distributed key value) or just use the original HDFS / GFS MapReduce (many are EVENTUALLY consistent!) Analyze Data – Google Dremel, Apache Hive Data Warehouse, Oracle Data Warehouse (OBIEE) 54% of companies doing Big Data say: “Projects are critical!” * Many in the industry have considered ACID properties as integral needs of Databases. But these are more from transactional perspective – and not necessarily required to the fullest extent in analytical situations, as long as the end state continues to be consistent. By carefully dropping certain aspects of ACID support, such systems can be geared to handle Big Data… especially the simpler types of Big Data like web-clicks. * * Notes: Notes: Notes: “As customers look to manage the huge explosion in data from new and evolving sources, such
您可能关注的文档
最近下载
- 新疆兵团考试题型及答案.doc VIP
- 公考:申论26个高分万能写作模板(考前必看).pdf
- 2026年严格对照“带头固本培元、带头干事创业、敬畏人民等(五个带头)”方面检查材料与政法委书记带头强化政治忠诚、提高政治能力等“五个带头”方面检查材料2篇文.docx VIP
- 探界者钟扬-课件.ppt VIP
- 湖北鸿强矿业科技有限公司年产20000吨选矿药剂产品建设项目环境影响报告书.pdf VIP
- 2026年春季青岛版(五四制2024)三年级下册小学科学教学计划含进度表.docx VIP
- 2025年政府采购评审专家考试题库附含答案.docx VIP
- 医疗器械注册质量管理体系核查指南讲解.pptx VIP
- 营销策划 -塔斯汀中国汉堡品牌手册.pdf
- 19.3 二次根式的加法与减法(第2课时)课件 人教版数学八年级下册.pptx VIP
原创力文档

文档评论(0)