- 1、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。。
- 2、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
- 4、该文档为VIP文档,如果想要下载,成为VIP会员后,下载免费。
- 5、成为VIP后,下载本文档将扣除1次下载权益。下载后,不支持退款、换文档。如有疑问请联系我们。
- 6、成为VIP后,您将拥有八大权益,权益包括:VIP文档下载权益、阅读免打扰、文档格式转换、高级专利检索、专属身份标志、高级客服、多端互通、版权登记。
- 7、VIP文档为合作方或网友上传,每下载1次, 网站将根据用户上传文档的质量评分、类型等,对文档贡献者给予高额补贴、流量扶持。如果你也想贡献VIP文档。上传文档
查看更多
Introduction to Hive Cloudera Engineering Blog(蜂巢Cloudera工程概论的博客)
Introduction to Hive
© 2009 Cloudera, Inc.
Outline
• Motivation
• Overview
• Data Model
• Working with Hive
• Wrap up Conclusions
© 2009 Cloudera, Inc.
Background
• Started at Facebook
• Data was collected by
nightly cron jobs into
Oracle DB
• “ETL” via hand-coded
python
• Grew from 10s of
GBs (2006) to 1
TB/day new data
(2007), now 10x that.
© 2009 Cloudera, Inc.
Hadoop as Enterprise Data
Warehouse
• Scribe and MySQL data loaded into
Hadoop HDFS
• Hadoop MapReduce jobs to process data
• Missing components:
– Command-line interface for “end users”
– Ad-hoc query support
• … without writing full MapReduce jobs
– Schema information
© 2009 Cloudera, Inc.
Hive Applications
• Log processing
• Text mining
• Document indexing
• Customer-facing business intelligence
(e.g., Google Analytics)
• Predictive modeling, hypothesis testing
© 2009 Cloudera, Inc.
Hive Components
• Shell: allows interactive queries like
MySQL shell connected to database
– Also supports web and JDBC clients
• Driver: session handles, fetch, execute
• Compiler: parse, plan, optimize
• Execution engine: DAG of stages (M/R,
HDFS, or metadata)
• Metastore: schema, location in HDFS,
SerDe
© 2009 Cloudera, Inc.
Data Model
• Tables
– Typed columns (int, float, string, date,
boolean)
– Also, list: map (for JSON-like data)
• Partitions
– e.g., to range-partition table
您可能关注的文档
- Infrared Spectroscopy BACKGROUND (红外光谱背景).pdf
- INFRARED STIMULATED LUMINESCENCE AND (红外激发发光和).pdf
- Initial Research Report Taglich Brothers(初步研究报告Taglich兄弟).pdf
- Initial Setup UMaine Electrical and Computer (初始设置UMaine电气和计算机).pdf
- Injection Molded Polymer Optics In The 21st (注射成型聚合物光学在21).pdf
- Injection Molding and MicroAbrasive Blasting (注塑和MicroAbrasive爆破).pdf
- Injury Incident Report Form(伤害事故报告形式).pdf
- Inkle WeavIng Schacht Spindle(亚麻织带织造沙赫特主轴).pdf
- Inlet and Outlet Greenheck(格林瀚克进口和出口).pdf
- Innovation and value creation Alan Hippe, CFO (创新和价值创造Alan Hippe首席财务官).pdf
- INTRODUCTION TO IX SIGMA FOR MARKETING (介绍第九σ营销).pdf
- Introduction to Internet NOS(介绍网络号).pdf
- Introduction to IQdemodulation of RFdata(介绍IQdemodulation RFdata).pdf
- Introduction to Java Platform, Enterprise Edition 7(介绍Java Platform,Enterprise Edition 7).pdf
- Introduction to JIGS AND FIXTURES National (介绍了工装夹具的国家).pdf
- Introduction to Lagrangian and Hamiltonian (介绍了拉格朗日和哈密顿).pdf
- Introduction to Modeling and Simulation(建模与仿真的介绍).pdf
- INTRODUCTION TO Melt Pressure Transducers (介绍了熔体压力传感器).pdf
- Introduction to MonteCarlo Methods(介绍蒙特卡洛方法).pdf
- Introduction to MultiModal Transportation (介绍了多通道运输).pdf
文档评论(0)