Chapter: Matrix-Vector Multiplication

  • About 14,100 characters
  • About 68 pages
  • Published 2017-02-06 in Jiangsu
Benchmarking

Procs   Predicted (msec)   Actual (msec)   Speedup   Megaflops
1       63.4               63.4            1.00      31.6
4       17.8               17.4            3.64      114.9
9       9.7                9.7             6.53      206.2
16      6.2                6.2             10.21     322.6

Comparison of Three Algorithms

Summary (1/3)

  • Matrix decomposition ⇒ communications needed
      • Rowwise block striped: all-gather
      • Columnwise block striped: all-to-all exchange
      • Checkerboard block: gather, scatter, broadcast, reduce
  • All three algorithms: roughly same number of messages
  • Elements transmitted per process varies
      • First two algorithms: Θ(n) elements per process
      • Checkerboard algorithm: Θ(n/√p) elements
  • Checkerboard block algorithm has be
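The speedup column in the benchmark table can be checked with a few lines of arithmetic. The sketch below assumes that speedup is the single-process actual time divided by the p-process actual time. The slides do not state the formula, so this is an assumption, and small differences from the table come from the times being rounded to 0.1 msec. The efficiency figure (speedup divided by process count) is not in the table and is added here only for illustration. The names `ROWS`, `speedup`, `efficiency`, and `recompute` are hypothetical.

```python
# Benchmark rows copied from the table: (processes, actual time in msec, reported speedup).
ROWS = [
    (1, 63.4, 1.00),
    (4, 17.4, 3.64),
    (9, 9.7, 6.53),
    (16, 6.2, 10.21),
]


def speedup(t_serial, t_parallel):
    """Assumed definition: single-process time divided by p-process time."""
    return t_serial / t_parallel


def efficiency(t_serial, t_parallel, procs):
    """Speedup divided by process count (not reported in the table)."""
    return speedup(t_serial, t_parallel) / procs


def recompute(rows):
    """Return (procs, speedup, efficiency) for every row, using row 0 as the baseline."""
    t1 = rows[0][1]
    return [(p, speedup(t1, t), efficiency(t1, t, p)) for p, t, _ in rows]


if __name__ == "__main__":
    for (p, s, e), (_, _, reported) in zip(recompute(ROWS), ROWS):
        print(f"p={p:2d}  speedup={s:5.2f} (table: {reported:5.2f})  efficiency={e:4.2f}")
```

Running the script prints the recomputed speedup next to the reported value for each process count, so any discrepancy larger than rounding error would stand out.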
