分布式OLAP引擎Pinot的聚合索引—PINOT Real-time Analytics at LinkedIn.pptx

分布式OLAP引擎Pinot的聚合索引—PINOT Real-time Analytics at LinkedIn.pptx

  1. 1、本文档共27页,可阅读全部内容。
  2. 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
  3. 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载
  4. 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
分布式OLAP引擎Pinot的聚合索引—PINOT Real-time Analytics at LinkedIn

Real-time Analytics at LinkedIn Greg Brandt (2015-­‐09-­‐11) Pinot Agenda Pinot at LinkedIn Design and Architecture Segment Layout StarTree for Fast Aggregation ThirdEye for Business Monitoring Use Cases at LinkedIn Who’s Viewed My Profile – (member) Use Cases at LinkedIn XLNT – A/B tes-ng – (internal) Use Cases at LinkedIn Ad campaigns – (customers) Data Analytics Ecosystem at Li nkedln (Analytics Products) WVMP (Tracking Data) Real-time Offline High-Level Design Goals SQL-like interface Columnar storage and indexing Real-time data load Non-goals JOIN, UDF (unpredictable performance) Not source of truth Mutability – Immutability allows trivial data distribu-on Pinot Architecture 挝蜕毁照E Networked Storage Stream data from Kafka Kafka Segment Assignment (1) Push time segment(s) from t:!到照12. (2) Controller computes segment assignment z。oKeeoer (3) Servers receive notification for new partitions they own Storage (4) Servers pull data from networked storage Apache Helix A framework for building distributed, stateful applications Developed at LinkedIn for Espresso, (distributed OLTP database) Leveraged in Pinot for cluster management (Controller uses Helix libraries) Query Execution: Distributed SElECT SUM(clicks) FROM table WHERE time B盯WEEN 20 AND 60 Broker time- partitions que时, then sca忧ers que即/ gathers results Historical data preferred to real­ time data Controller notifies of segment placement Quer y Execution: Local 1. Parser – SELECT COUNT(*) … = Abstract Syntax Tree Logical Planner Operators like COUNT, WHERE, etc. Physical Planner Consider data layout (sort key, dic-onary encoding, compression, etc.) Layout: Columnar Storage moon-roof,reliable COLUMNAR DATA RAW DATA tags moon-roof,reliable mp3,compact Manual,compact leather,cool automatlc INDEX GENERATION Oocld Model Year Prlce 饱gs Toyota 1997 7500 m∞n-roof ,reliable 2 Toyota 2001 7切。 m阂,compact 3 Mazda 2005 1切。 Manual,compact 4 Honda 2010 8∞o leather ,cool 5 Honda 2011 9∞o aulomalic 100∞ b

文档评论(0)

ipbohn97 + 关注
实名认证
内容提供者

该用户很懒,什么也没介绍

1亿VIP精品文档

相关文档