Applied Mathematics

Volume 6, Issue 8 (July 2015)

ISSN Print: 2152-7385   ISSN Online: 2152-7393

Google-based Impact Factor: 0.58  Citations  

Intel® Math Kernel Library PARDISO* for Intel® Xeon PhiTM Manycore Coprocessor

HTML  XML Download Download as PDF (Size: 416KB)  PP. 1276-1281  
DOI: 10.4236/am.2015.68121    4,694 Downloads   6,285 Views  Citations

ABSTRACT

The paper describes an efficient direct method to solve an equation Ax = b, where A is a sparse matrix, on the Intel® Xeon PhiTM coprocessor. The main challenge for such a system is how to engage all available threads (about 240) and how to reduce OpenMP* synchronization overhead, which is very expensive for hundreds of threads. The method consists of decomposing A into a product of lower-triangular, diagonal, and upper triangular matrices followed by solves of the resulting three subsystems. The main idea is based on the hybrid parallel algorithm used in the Intel® Math Kernel Library Parallel Direct Sparse Solver for Clusters [1]. Our implementation exploits a static scheduling algorithm during the factorization step to reduce OpenMP synchronization overhead. To effectively engage all available threads, a three-level approach of parallelization is used. Furthermore, we demonstrate that our implementation can perform up to 100 times better on factorization step and up to 65 times better in terms of overall performance on the 240 threads of the Intel® Xeon PhiTM coprocessor.

Share and Cite:

Kalinkin, A. , Anders, A. and Anders, R. (2015) Intel® Math Kernel Library PARDISO* for Intel® Xeon PhiTM Manycore Coprocessor. Applied Mathematics, 6, 1276-1281. doi: 10.4236/am.2015.68121.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.