- 3
- 0
- 约3.44千字
- 约 24页
- 2017-12-24 发布于湖北
- 举报
TPUTCACHE HIGH-FREQUENCY, MULTI-WAY CACHE FOR HIGH-THROUGHPUT…
* TPUTCACHE: HIGH-FREQUENCY, MULTI-WAY CACHE FOR HIGH-THROUGHPUTFPGA APPLICATIONS Aaron Severance University of British Columbia Advised by Guy Lemieux * Our Problem We use overlays for data processing Partially/fully fixed processing elements Virtual CGRAs, soft vector processors Memory: Large register files/scratchpad in overlay Low latency, local data Trivial (large DMA): burst to/from DDR Non-trivial? Scatter/Gather Data dependent store/load vscatter adr_ptr, idx_vect, data_vect for i in 1..N adr_ptr[idx_vect[i]] = data_vect[i] Random narrow (32-bit) accesses Waste bandwidth on DDR interfaces * * If Data Fits on the FPGA… BRAMs with interconnect network General network… Not customized per application Shared: all masters - all slaves Memory mapped BRAM Double-pump (2x clk) if possible Banking/LVT/etc. for further ports * Example BRAM system * But if data doesn’t fit… (oversimplified) * So Let’s Use a Cache But a throughput focused cache Low latency data held in local memories Amortize latency over multiple accesses Focus on bandwidth Replace on-chip memory or augment memory controller? Data fits on-chip Want BRAM like speed, bandwidth Low overhead compared to shared BRAM Data doesn’t fit on-chip Use ‘leftover’ BRAMs for performance * * TputCache Design Goals Fmax near BRAM Fmax Fully pipelined Support multiple outstanding misses Write coalescing Associativity * TputCache Architecture Replay based architecture Reinsert misses back into pipeline Separate line fill/evict logic in background Token FIFO for completing requests in order No MSHRs for tracking misses Fewer muxes (only single replay request mux) 6 stage pipeline - 6 outstanding misses Good performance with high hit rate Common case fast * TputCache Architecture * Cache Hit * Cache Miss * Evict/Fill Logic * Area Fmax Results Reaches 253MHz compared to 270MHz BRAM fmax on Cyclone IV 423MHz compared to 490MHz BRAM fmax on Stratix IV Minor degredation with increasing size, associativity 13% to 35% extr
您可能关注的文档
- PPT模板 029(品牌战略项目建议书).ppt
- PPT精品文档---恒生_可吸收缝合线.ppt
- PPT精品模板-清晰漂亮-7[整理版].ppt
- PPT高手之路4懂一点配色(无动画版)(冲突文件2014年8月31日 16时11分57秒).ppt
- SBA-40C型分析仪速成操作指南(图解) 请仔细阅读随机提供的说明书详细资料.ppt
- STC系列增强型8051单片机原理与应用-宣传资料(同名1328).doc
- Structural Health Monitoring of Steel Bridges:钢结构桥梁结构健康监测.ppt
- unit 5《learning about china》topic 1 section b(仁爱湘教版九年级下)ppt--初中英语.ppt
- Using Corpora in Translation Studies Research - The :翻译研究中使用语料库研究.ppt
- VRay 渲染器参数讲解[策划].doc
- 广东省广州省实验中学教育集团2025-2026学年八年级上学期期中考试物理试题(解析版).docx
- 广东省广州大学附属中学2025-2026学年八年级上学期奥班期中物理试题(解析版).docx
- 广东省广州市第八十六中学2025-2026学年八年级上学期期中物理试题(含答案).docx
- 广东省广州市第八十九中学2025-2026学年八年级上学期期中考试物理试题(解析版).docx
- 广东省广州市第二中学2025-2026学年八年级上学期期中考试物理试题(含答案).docx
- 广东省广州市第八十六中学2025-2026学年八年级上学期期中物理试题(解析版).docx
- 广东省广州市第八十九中学2025-2026学年八年级上学期期中考试物理试题(含答案).docx
- 广东省广州市第二中学2025-2026学年八年级上学期期中考试物理试题(解析版).docx
- 2026《中国人寿上海分公司营销员培训体系优化研究》18000字.docx
- 《生物探究性实验教学》中小学教师资格模拟试题.docx
原创力文档

文档评论(0)