- 13
- 0
- 约1.73万字
- 约 29页
- 2017-09-02 发布于天津
- 举报
mkl概览 - 杨全胜
* Intel? Math Kernel Library Contents Each of the BLAS has 4 data types: single and double precision real and complex data types. Most all the functions (with some exceptions) have identical functionality in each data type. The extended BLAS are a set of level 1 BLAS, which support sparse data. * Intel? Math Kernel Library Contents Intel MKL’s value-add to the LAPACK code includes: Just building the LAPACK code takes some effort Threading key portions of the functions Optimizing key functions through the use of recursion The new Fourier transforms meet the needs of a far wider audience than did the previous radix-2 FFTs. This list shows key features. Optimization of the functions will continue for some time yet, but the complex transforms are well optimized for IPF-2 now. VML and VSL offer improved performance over scalar implementations of the underlying functions provided the user can vectorize the code. * Roll Your Own/Dot Product Roll Your Own: This is a simple, straightforward dot product approach to matrix multiplication. Note that the innermost loop is a dot product, and thus can be replaced with a call to the dot product, which is shown in the second panel. * DGEMV/DGEMM The two innermost loops comprise a matrix-vector multiply, which can form the central operation of matrix multiplication. DGEMV parameters: incx = 1; incy = ldb; alpha = 1.0; beta = 0.0; transa = t; DGEMM parameters: alpha = 1.0; beta = 0.0; * Intel? Math Kernel Library Optimizations in LAPACK* Threading at higher levels (LAPACK factorization rather than at DGEMM, for instance) opens additional parallelization opportunities. The blocking strategy employed in traditional LAPACK can be extended to the factorization of the block columns to improve locality of reference and minimize vector operations. NETLIB LAPACK has numerous intrinsic function calls, which raises the need for run-time library support. All of these calls have been implemented within Intel MKL, so no run-time
您可能关注的文档
最近下载
- 2026年牛津译林版中考英语新课标1500个单词背诵清单.pdf
- 疥疮诊疗中国专家共识(2026版)解读PPT课件.pptx VIP
- 《烟雾病和烟雾综合征诊断与治疗中国专家共识(2024)》解读PPT课件.pptx VIP
- 2024年改良型新药行业研究报告及未来五至十年预测分析报告.docx
- 乡镇民主生活会批评与自我批评.docx VIP
- 陕西凤翔县马家庄秦墓出土的出土陶罐.docx VIP
- 采血后预防淤青的按压方式.pptx VIP
- 纺织厂供配电系统设计.doc VIP
- 乡镇领导班子成员相互批评意见.docx VIP
- 30.XX中专职业学校“十五五”五年中长期发展规划(2026-2030年).pdf
原创力文档

文档评论(0)