2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines
Evaluating Fast Algorithms for Convolutional Neural Networks on FPGAs
Liqiang Lu∗1,3, Yun Liang†1, Qingcheng Xiao1, Shengen Yan2,3
1Center for Energy-efficient Computing and Applications, Peking University, Beijing, China
2Department of Information Engineering, The Chinese University of Hong Kong.
3SenseTime Group Limited.
Email: {liqianglu, ericlyun, walkershaw}@, yanshengen@
Abstract—In recent years, Convolutional Neural Networks (CNNs) have become widely adopted for computer vision tasks. FPGAs have been adequately explored as a promising hardware accelerator for CNNs due to their high performance, energy efficiency, and reconfigurability. However, prior FPGA solutions based on the conventional convolutional algorithm are often bounded by the computational capability of FPGAs (e.g., the number of DSPs). In this paper, we demonstrate that the fast Winograd algorithm can dramatically reduce the

A CNN typically involves multiple layers, where the output feature maps of one layer are the input feature maps of the following layer. Prior studies have shown that the computation of state-of-the-art CNNs is dominated by the convolutional layers [6, 7]. Using the conventional convolution algorithm, each element in the output feature map is computed individually by using multiple multiply-accumulate operations. While the p