Dynamic Warp Formation and Scheduling for GPU Control Flow (.ppt)

  • About 5,870 words
  • About 23 pages
  • Published 2018-06-24 from Sichuan


Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow
Wilson W. L. Fung, Ivan Sham, George Yuan, Tor M. Aamodt
Electrical and Computer Engineering, University of British Columbia
Micro-40, Dec 5, 2007

Motivation
  • GPU: A massively parallel architecture
  • SIMD pipeline: Most computation out of least silicon/energy
  • Goal: Apply GPU to non-graphics computing
  • Many challenges
  • This talk: Hardware Mechanism for Efficient Control Flow

Programming Model
  • Modern graphics pipeline
  • CUDA-like programming model
  • Hide SIMD pipeline from programmer
  • Single-Program-Multiple-Data (SPMD)
  • Programmer expresses parallelism using threads
  • ~Stream processing

Programming Model
  • Warp = Threads grouped into a SIMD instruction
  • From Oxford Dictionary: Warp: In the textile industry, the term “warp” refers to “the threads stretched lengthwise in a loom to be crossed by the weft”.

The Problem: Control flow
  • GPU uses SIMD pipeline to save area on control logic.
  • Group scalar threads into warps
  • Branch divergence occurs when threads inside a warp branch to different execution paths.

Dynamic Warp Formation
  • Consider multiple warps

Outline
  • Introduction
  • Baseline Architecture
  • Branch Divergence
  • Dynamic Warp Formation and Scheduling
  • Experimental Results
  • Related Work
  • Conclusion

Baseline Architecture

SIMD Execution of Scalar Threads
  • All threads run the same kernel
  • Warp = Threads grouped into a SIMD instruction

Latency Hiding via Fine Grain Multithreading
  • Interleave warp execution to hide latencies
  • Register values of all threads stay in register file
  • Need 100~1000 threads
  • Graphics has millions of pixels

SPMD Execution on SIMD Hardware: The Branch Divergence Problem

Baseline: PDOM

Dynamic Warp Formation: Key Idea
  • Idea: Form new warp at divergence
  • Enough threads branching to each path to create full new warps

Dynamic Warp Formation: Example

Dynamic Warp Formation: Hardware Implementation

Methodology
  • Created new cycle-accurate simulat
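
The key idea in the slides (form new warps at divergence when enough threads across warps branch to each path) can be illustrated with a small counting model. The sketch below is not the hardware mechanism from the talk. It is a simplified Python illustration that assumes each branch path is one instruction long, that a diverged warp issues once per distinct path its threads take, and that threads can be regrouped freely across warps with no constraint on which SIMD lane they occupy. The warp size of 4 and the function names are hypothetical, chosen only for the example.

```python
from collections import defaultdict
from math import ceil

WARP_SIZE = 4  # illustrative value (assumption, not taken from the slides)


def issue_slots_per_warp_serialization(warps):
    """Count SIMD issue slots when each warp handles divergence on its own.

    A warp must issue once for every distinct path taken by its threads.
    """
    return sum(len(set(paths)) for paths in warps)


def issue_slots_with_regrouping(warps, warp_size=WARP_SIZE):
    """Count SIMD issue slots when threads are regrouped across warps.

    Threads from all warps are pooled by branch target and packed into
    new warps of up to warp_size threads (lane constraints are ignored).
    """
    threads_per_path = defaultdict(int)
    for paths in warps:
        for path in paths:
            threads_per_path[path] += 1
    return sum(ceil(n / warp_size) for n in threads_per_path.values())


# Two warps of four threads; each letter is the path one thread takes.
warps = [
    ["A", "B", "A", "B"],
    ["B", "A", "B", "A"],
]
baseline = issue_slots_per_warp_serialization(warps)
regrouped = issue_slots_with_regrouping(warps)
print(baseline, regrouped)  # prints "4 2"
```

In this example each warp diverges into two half-empty issues, for four slots in total, while pooling the eight threads by path yields two full warps and two slots. The benefit disappears when only one warp is considered, which is why the approach considers multiple warps.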
