- Published 2017-09-06 in Tianjin
lecture 1 – introduction
Lecture 2 – MapReduce
CPE 458 – Parallel Programming, Spring 2009

Outline
- MapReduce: Programming Model
- MapReduce Examples
- A Brief History
- MapReduce Execution Overview
- Hadoop
- MapReduce Resources

MapReduce
“A simple and powerful interface that enables automatic parallelization and distribution of large-scale computations, combined with an implementation of this interface that achieves high performance on large clusters of commodity PCs.”

MapReduce
More simply, MapReduce is:
- A parallel programming model and associated implementation.

Programming Model
- Description
  - The mental model the programmer has about the detailed execution of their application.
- Purpose
  - Improve programmer productivity
- Evaluation
  - Expressibility
  - Simplicity
  - Performance

Programming Models
- von Neumann model
  - Execute a stream of instructions (machine code)
  - Instructions can specify
    - Arithmetic operations
    - Data addresses
    - Next instruction to execute
  - Complexity
    - Track billions of data locations and millions of instructions
    - Manage with:
      - Modular design
      - High-level programming languages (isomorphic)

Programming Models
- Parallel Programming Models
  - Message passing
    - Independent tasks encapsulating local data
    - Tasks interact by exchanging messages
  - Shared memory
    - Tasks share a common address space
    - Tasks interact by reading and writing this space asynchronously
  - Data parallelization
    - Tasks execute a sequence of independent operations
    - Data usually evenly partitioned across tasks
    - Also referred to as “Embarrassingly parallel”

MapReduce: Programming Model
- Process data using special map() and reduce() functions
  - The map() function is called on every item in the input and emits a series of intermediate key/value pairs
    - All values associated with a given key are grouped together
  - The reduce() function is called on every unique key, and its value list, and emits a value that is added to the output

MapReduce: Programming Model
More formally,
- Map(k1, v1) -> list(k2, v2)
- Reduce(k2, list(v2)) -> list(v2)

Map
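The map()/reduce() model described in the slides above can be illustrated with a small single-process sketch. This is not Hadoop or any real MapReduce implementation. The names run_mapreduce, map_fn, and reduce_fn and the word-count task are assumptions chosen for illustration. The sketch only mirrors the signatures Map(k1, v1) -> list(k2, v2) and Reduce(k2, list(v2)) -> list(v2), with the grouping step done in memory.

```python
from collections import defaultdict


def map_fn(key, value):
    """Hypothetical map(): key is a document name, value is its text.

    Emits one intermediate (word, 1) pair per word in the text.
    """
    for word in value.split():
        yield word.lower(), 1


def reduce_fn(key, values):
    """Hypothetical reduce(): sums the counts collected for one word."""
    yield sum(values)


def run_mapreduce(inputs, map_fn, reduce_fn):
    """Sequential stand-in for a MapReduce runtime (illustrative only)."""
    # Map phase: call map_fn on every input item and group the
    # intermediate values by their key.
    groups = defaultdict(list)
    for k1, v1 in inputs:
        for k2, v2 in map_fn(k1, v1):
            groups[k2].append(v2)

    # Reduce phase: call reduce_fn once per unique key with its value list.
    output = {}
    for k2 in sorted(groups):
        output[k2] = list(reduce_fn(k2, groups[k2]))
    return output


docs = [
    ("doc1", "the quick brown fox"),
    ("doc2", "the lazy dog and the fox"),
]
counts = run_mapreduce(docs, map_fn, reduce_fn)
print(counts)
```

A real runtime would distribute the map and reduce calls across machines and move the grouped data between them. Here both phases run in one loop each, which keeps the data flow visible but says nothing about performance or fault tolerance.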