- About 14,200 characters
- About 55 pages
- Published 2019-05-23, Hubei
ScaLAPACK Team
- Susan Blackford, UT
- Jaeyoung Choi, Soongsil U
- Andy Cleary, LLNL
- Ed D'Azevedo, ORNL
- Jim Demmel, UCB
- Inder Dhillon, UCB
- Jack Dongarra, UT/ORNL
- Sven Hammarling, NAG
- Greg Henry, Intel
- Osni Marques, NERSC
- Antoine Petitet, UT
- Ken Stanley, UCB
- David Walker, Cardiff U
- Clint Whaley, UT
- /scalapack
- scalapack@

Possible Data Layouts
- ScaLAPACK supports all layouts
- 2D block cyclic recommended, for load balance and BLAS3
- Layouts shown: 1D blocked, 1D cyclic, 1D block cyclic, 2D block cyclic

ScaLAPACK Structure
- Components shown: ScaLAPACK, PBLAS, LAPACK, BLACS, BLAS, MPI/PVM/…
- Layers labeled: Global, Local

Parallelism in ScaLAPACK
- Level 3 BLAS block operations: all the reduction routines
- Pipelining: QR iteration, triangular solvers, classic factorizations
- Redundant computations: condition estimators
- Static work assignment: bisection
- Task parallelism: sign function eigenvalue computations
- Divide and conquer: tridiagonal and band solvers, symmetric eigenvalue problem and sign function
- Cyclic reduction: reduced system in the band solver

ScaLAPACK Performance Models (1)
- ScaLAPACK operation counts

ScaLAPACK Performance Models (2)
- Compare predictions and measurements (LU), (Cholesky)

Making the nonsymmetric eigenproblem scalable
- $A x_i = \lambda_i x_i$, Schur form $A = Q T Q^T$
- Parallel HQR
  - Henry, Watkins, Dongarra, Van de Geijn
  - Now in ScaLAPACK
  - Not as scalable as LU: N times as many messages
  - Block-Hankel data layout better in theory, but not in ScaLAPACK
- Sign Function
  - Beavers, Denman, Lin, Zmijewski, Bai, Demmel, Gu, Godunov, Bulgakov, Malyshev
  - $A_{i+1} = (A_i + A_i^{-1})/2 \to$ shifted projector onto $\operatorname{Re} \lambda > 0$
  - Repeat on transformed A to divide-and-conquer spectrum
  - Only uses inversion, so scalable
  - Inverse free version exists (uses QRD)
  - Very high flop count compared to HQR, less stable

The "Holy Grail" (Parlett, Dhillon, Marques)
- Perfect output complexity (O(n * #vectors)), embarrassingly parallel, accurate
- To be propagated throughout LAPACK and ScaLAPACK

Making the symmetric eigenproblem and SVD scalable

ScaLAPACK Summa
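The 2D block cyclic layout recommended on the data-layout slide can be sketched in a few lines. This is a minimal illustration, not ScaLAPACK code: the function names `owner_and_local_index`, `block_cyclic_2d`, and `entries_per_process` are hypothetical, indices are zero-based, and the distribution is assumed to start at process (0, 0).

```python
from collections import Counter


def owner_and_local_index(g, block, nprocs):
    """Map a zero-based global index to (process coordinate, local index)
    under a 1D block-cyclic distribution that starts at process 0."""
    block_id = g // block                      # which block the index falls in
    proc = block_id % nprocs                   # blocks are dealt out cyclically
    local = (block_id // nprocs) * block + g % block
    return proc, local


def block_cyclic_2d(i, j, mb, nb, p_rows, p_cols):
    """Apply the 1D rule independently to rows and columns.

    Returns ((process row, process col), (local row, local col))
    for global entry (i, j), with mb x nb blocks on a p_rows x p_cols grid.
    """
    pr, li = owner_and_local_index(i, mb, p_rows)
    pc, lj = owner_and_local_index(j, nb, p_cols)
    return (pr, pc), (li, lj)


def entries_per_process(n, mb, nb, p_rows, p_cols):
    """Count how many entries of an n x n matrix each process owns."""
    counts = Counter()
    for i in range(n):
        for j in range(n):
            owner, _ = block_cyclic_2d(i, j, mb, nb, p_rows, p_cols)
            counts[owner] += 1
    return counts


if __name__ == "__main__":
    # An 8 x 8 matrix in 2 x 2 blocks on a 2 x 2 process grid.
    print(entries_per_process(8, 2, 2, 2, 2))
```

Because blocks are dealt out cyclically in both dimensions, every process keeps a share of the matrix even as a factorization shrinks the active submatrix, which is the load-balance argument; the block size keeps local work in matrix-matrix (BLAS3) form.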
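The sign function iteration from the nonsymmetric eigenproblem slide, $A_{i+1} = (A_i + A_i^{-1})/2$, can be tried on a small dense matrix. The sketch below is serial NumPy, not the parallel ScaLAPACK-style implementation; the function names, the stopping test, and the tolerance are assumptions made for illustration, and the input is assumed to have no eigenvalues on the imaginary axis.

```python
import numpy as np


def matrix_sign(a, tol=1e-12, max_iter=100):
    """Newton iteration X <- (X + X^{-1}) / 2, started from X = A.

    Converges to sign(A) when A has no purely imaginary eigenvalues.
    The relative-change stopping test is an illustrative choice.
    """
    x = np.array(a, dtype=float)
    for _ in range(max_iter):
        x_next = 0.5 * (x + np.linalg.inv(x))
        if np.linalg.norm(x_next - x, 1) <= tol * np.linalg.norm(x_next, 1):
            return x_next
        x = x_next
    raise RuntimeError("sign iteration did not converge")


def right_half_plane_projector(a):
    """Projector (I + sign(A)) / 2 onto the invariant subspace
    belonging to eigenvalues with positive real part."""
    s = matrix_sign(a)
    return 0.5 * (np.eye(s.shape[0]) + s)


if __name__ == "__main__":
    # Upper triangular test matrix with eigenvalues 2, -3, 5.
    a = np.array([[2.0, 1.0, 0.0],
                  [0.0, -3.0, 1.0],
                  [0.0, 0.0, 5.0]])
    p = right_half_plane_projector(a)
    # The trace of a projector equals the dimension of its range,
    # here the number of eigenvalues with positive real part.
    print(np.trace(p))
```

The trace of the projector counts the eigenvalues in the right half-plane, which is what lets the spectrum be split and the procedure repeated on each part. Each step costs a full matrix inversion, consistent with the slide's remark about the high flop count relative to HQR.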