- Published 2018-06-24 (Sichuan)
Dynamic Warp Formation and Scheduling for GPU Control Flow
Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow
Wilson W. L. Fung, Ivan Sham, George Yuan, Tor M. Aamodt
Electrical and Computer Engineering, University of British Columbia
Micro-40, Dec 5, 2007

Motivation
- GPU: A massively parallel architecture
- SIMD pipeline: Most computation out of least silicon/energy
- Goal: Apply GPU to non-graphics computing
- Many challenges
- This talk: Hardware Mechanism for Efficient Control Flow

Programming Model
- Modern graphics pipeline
- CUDA-like programming model
- Hide SIMD pipeline from programmer
- Single-Program-Multiple-Data (SPMD)
- Programmer expresses parallelism using threads
- ~Stream processing

Programming Model
- Warp = Threads grouped into a SIMD instruction
- From Oxford Dictionary: Warp: In the textile industry, the term "warp" refers to "the threads stretched lengthwise in a loom to be crossed by the weft".

The Problem: Control flow
- GPU uses SIMD pipeline to save area on control logic.
- Group scalar threads into warps
- Branch divergence occurs when threads inside a warp branch to different execution paths.

Dynamic Warp Formation
- Consider multiple warps

Outline
- Introduction
- Baseline Architecture
- Branch Divergence
- Dynamic Warp Formation and Scheduling
- Experimental Result
- Related Work
- Conclusion

Baseline Architecture
- SIMD Execution of Scalar Threads
- All threads run the same kernel
- Warp = Threads grouped into a SIMD instruction
- Latency Hiding via Fine Grain Multithreading
- Interleave warp execution to hide latencies
- Register values of all threads stay in register file
- Need 100~1000 threads
- Graphics has millions of pixels

SPMD Execution on SIMD Hardware: The Branch Divergence Problem

Baseline: PDOM

Dynamic Warp Formation: Key Idea
- Idea: Form new warp at divergence
- Enough threads branching to each path to create full new warps

Dynamic Warp Formation: Example

Dynamic Warp Formation: Hardware Implementation

Methodology
- Created new cycle-accurate simulat
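
The "Key Idea" slide (form new warps at divergence when enough threads branch to each path) can be illustrated with a small counting exercise. The Python sketch below is not the authors' simulator or hardware design. It assumes a hypothetical SIMD width of 4, labels branch targets "A" and "B", and ignores any hardware constraint on which SIMD lane a thread may occupy. The "baseline" function is a simplified stand-in that serializes the paths of each diverged warp; it does not model PDOM reconvergence itself.

```python
from collections import defaultdict
from math import ceil

WARP_SIZE = 4  # hypothetical SIMD width, chosen small for illustration


def baseline_issue_slots(warps):
    """Count SIMD issues when each diverged warp runs its paths one after another."""
    return sum(len(set(paths)) for paths in warps)


def regrouped_issue_slots(warps, warp_size=WARP_SIZE):
    """Count SIMD issues when threads from all warps are pooled by branch target."""
    per_path = defaultdict(int)
    for paths in warps:
        for target in paths:
            per_path[target] += 1
    # Each path needs enough new warps to hold all threads that branched to it.
    return sum(ceil(n / warp_size) for n in per_path.values())


def simd_utilization(num_threads, issue_slots, warp_size=WARP_SIZE):
    """Fraction of SIMD lanes doing useful work across all issue slots."""
    return num_threads / (issue_slots * warp_size)


# Two warps of four threads; each entry is the path a thread takes ("A" or "B").
warps = [["A", "B", "A", "B"], ["B", "A", "B", "A"]]
threads = sum(len(w) for w in warps)

baseline = baseline_issue_slots(warps)    # each warp issues path A, then path B
regrouped = regrouped_issue_slots(warps)  # four "A" threads and four "B" threads
print(baseline, simd_utilization(threads, baseline))    # 4 0.5
print(regrouped, simd_utilization(threads, regrouped))  # 2 1.0
```

In this toy case, both warps diverge evenly, so serializing paths per warp leaves half the lanes idle, while pooling threads across the two warps yields one full warp per path. With only a single warp there would be nothing to regroup, which is why the slides say to consider multiple warps.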