- About 42,900 characters
- About 52 pages
- Published 2019-05-08, Shanghai
Huazhong University of Science and Technology Master's Thesis
Abstract
Graphics processing units (GPUs) are inexpensive yet offer powerful computing capacity, which has made them increasingly popular in high-performance computing in recent years. GPU programming languages such as CUDA and OpenCL have made GPU programming widely accepted, but it remains a very complex task for two reasons. On the one hand, designing an algorithm for a specific GPU is very time-consuming and requires programmers to be very familiar with both the algorithm itself and the underlying architecture. On the other hand, such code lacks portability: the performance the same code achieves on different GPU platforms may differ greatly. Code that achieves high performance on one GPU almost never maintains that performance on another, and it often has to be modified to do so.
To reduce the difficulty of GPU programming and make the code portable, we propose a framework that maps dataflow programs to the GPU efficiently. The input of this framework is COStream code, and the output is OpenCL code that is parallelized and optimized for the GPU. The system optimizes programs at two levels: software-pipelining scheduling, and dataflow-program optimization according to the characteristics of the GPU platform. METIS is used to partition the tasks, taking into account both load balancing and communication overhead. In the optimization process, we introduce a variable called the expansion factor, which not only makes full use of GPU computing resources but also effectively reduces the number of synchronizations between threads, thus reducing the synchronization overhead. Pinned memory is used to accelerate data transfer between the CPU and the GPU, and efficiently overlapping computation with communication further improves performance. In addition, the rational use of local memory is not only able to access to the global memory in a coale