TY - GEN
T1 - Efficient parabolic solvers scalable across multi-architectural levels
AU - Zhuang, Yu
AU - Wu, Heng
PY - 2012
Y1 - 2012
N2 - High-end computing hardware has advanced rapidly in both uniprocessor performance and parallel system scale, while memory access speed has improved steadily but lagged behind. Software, and the algorithms underlying it, must therefore adapt well to the architectural features of high-end computing hardware. Stable explicit-implicit domain decomposition (SEIDD) is a class of numerical algorithms originally introduced for solving parabolic equations on parallel computers; it offers adequately high parallelism, flexible control over load balancing, minimal communication cost, and good stability and efficiency. In this paper, we study the effectiveness of SEIDD in harnessing computing power at two architectural levels: the inter-processor level, for parallel processing, and the cache-memory level, for fast memory access.
AB - High-end computing hardware has advanced rapidly in both uniprocessor performance and parallel system scale, while memory access speed has improved steadily but lagged behind. Software, and the algorithms underlying it, must therefore adapt well to the architectural features of high-end computing hardware. Stable explicit-implicit domain decomposition (SEIDD) is a class of numerical algorithms originally introduced for solving parabolic equations on parallel computers; it offers adequately high parallelism, flexible control over load balancing, minimal communication cost, and good stability and efficiency. In this paper, we study the effectiveness of SEIDD in harnessing computing power at two architectural levels: the inter-processor level, for parallel processing, and the cache-memory level, for fast memory access.
KW - parallel algorithm
KW - cache performance
KW - numerical solution
KW - partial differential equation
UR - http://www.scopus.com/inward/record.url?scp=84867251266&partnerID=8YFLogxK
U2 - 10.1109/ISPA.2012.23
DO - 10.1109/ISPA.2012.23
M3 - Conference contribution
AN - SCOPUS:84867251266
SN - 9780769547015
T3 - Proceedings of the 2012 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2012
SP - 111
EP - 118
BT - Proceedings of the 2012 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2012
T2 - 2012 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2012
Y2 - 10 July 2012 through 13 July 2012
ER -