Hiding I/O latency with pre-execution prefetching for parallel applications

Yong Chen; Surendra Byna; Xian He Sun; Rajeev Thakur; William Gropp

doi:10.1109/SC.2008.5213209

Hiding I/O latency with pre-execution prefetching for parallel applications

Yong Chen, Surendra Byna, Xian He Sun, Rajeev Thakur, William Gropp

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

52 Scopus citations

Abstract

Parallel applications are usually able to achieve high computational performance but suffer from large latency in I/O accesses. I/O prefetching is an effective solution for masking the latency. Most of existing I/O prefetching techniques, however, are conservative and their effectiveness is limited by low accuracy and coverage. As the processor-I/O performance gap has been increasing rapidly, data-access delay has become a dominant performance bottleneck. We argue that it is time to revisit the "I/O wall" problem and trade the excessive computing power with data-access speed. We propose a novel pre-execution approach for masking I/O latency. We describe the pre-execution I/O prefetching framework, the pre-execution thread construction methodology, the underlying library support, and the prototype implementation in the ROMIO MPI-IO implementation in MPICH2. Preliminary experiments show that the pre-execution approach is promising in reducing I/O access latency and has real potential.

Original language	English
Title of host publication	2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008
DOIs	https://doi.org/10.1109/SC.2008.5213209
State	Published - 2008
Event	2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008 - Austin, TX, United States Duration: Nov 15 2008 → Nov 21 2008

Publication series

Name	2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008

Conference

Conference	2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008
Country/Territory	United States
City	Austin, TX
Period	11/15/08 → 11/21/08

Access to Document

10.1109/SC.2008.5213209

Cite this

Chen, Y., Byna, S., Sun, X. H., Thakur, R., & Gropp, W. (2008). Hiding I/O latency with pre-execution prefetching for parallel applications. In 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008 Article 5213209 (2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008). https://doi.org/10.1109/SC.2008.5213209

@inproceedings{74ef81e1c5e84ed0a3f73b6e533e6e3a,

title = "Hiding I/O latency with pre-execution prefetching for parallel applications",

abstract = "Parallel applications are usually able to achieve high computational performance but suffer from large latency in I/O accesses. I/O prefetching is an effective solution for masking the latency. Most of existing I/O prefetching techniques, however, are conservative and their effectiveness is limited by low accuracy and coverage. As the processor-I/O performance gap has been increasing rapidly, data-access delay has become a dominant performance bottleneck. We argue that it is time to revisit the {"}I/O wall{"} problem and trade the excessive computing power with data-access speed. We propose a novel pre-execution approach for masking I/O latency. We describe the pre-execution I/O prefetching framework, the pre-execution thread construction methodology, the underlying library support, and the prototype implementation in the ROMIO MPI-IO implementation in MPICH2. Preliminary experiments show that the pre-execution approach is promising in reducing I/O access latency and has real potential.",

author = "Yong Chen and Surendra Byna and Sun, {Xian He} and Rajeev Thakur and William Gropp",

year = "2008",

doi = "10.1109/SC.2008.5213209",

language = "English",

isbn = "9781424428359",

series = "2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008",

booktitle = "2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008",

note = "2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008 ; Conference date: 15-11-2008 Through 21-11-2008",

}

Chen, Y, Byna, S, Sun, XH, Thakur, R & Gropp, W 2008, Hiding I/O latency with pre-execution prefetching for parallel applications. in 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008., 5213209, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008, Austin, TX, United States, 11/15/08. https://doi.org/10.1109/SC.2008.5213209

Hiding I/O latency with pre-execution prefetching for parallel applications. / Chen, Yong; Byna, Surendra; Sun, Xian He et al.
2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008. 2008. 5213209 (2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Hiding I/O latency with pre-execution prefetching for parallel applications

AU - Chen, Yong

AU - Byna, Surendra

AU - Sun, Xian He

AU - Thakur, Rajeev

AU - Gropp, William

PY - 2008

Y1 - 2008

N2 - Parallel applications are usually able to achieve high computational performance but suffer from large latency in I/O accesses. I/O prefetching is an effective solution for masking the latency. Most of existing I/O prefetching techniques, however, are conservative and their effectiveness is limited by low accuracy and coverage. As the processor-I/O performance gap has been increasing rapidly, data-access delay has become a dominant performance bottleneck. We argue that it is time to revisit the "I/O wall" problem and trade the excessive computing power with data-access speed. We propose a novel pre-execution approach for masking I/O latency. We describe the pre-execution I/O prefetching framework, the pre-execution thread construction methodology, the underlying library support, and the prototype implementation in the ROMIO MPI-IO implementation in MPICH2. Preliminary experiments show that the pre-execution approach is promising in reducing I/O access latency and has real potential.

AB - Parallel applications are usually able to achieve high computational performance but suffer from large latency in I/O accesses. I/O prefetching is an effective solution for masking the latency. Most of existing I/O prefetching techniques, however, are conservative and their effectiveness is limited by low accuracy and coverage. As the processor-I/O performance gap has been increasing rapidly, data-access delay has become a dominant performance bottleneck. We argue that it is time to revisit the "I/O wall" problem and trade the excessive computing power with data-access speed. We propose a novel pre-execution approach for masking I/O latency. We describe the pre-execution I/O prefetching framework, the pre-execution thread construction methodology, the underlying library support, and the prototype implementation in the ROMIO MPI-IO implementation in MPICH2. Preliminary experiments show that the pre-execution approach is promising in reducing I/O access latency and has real potential.

UR - http://www.scopus.com/inward/record.url?scp=70350757788&partnerID=8YFLogxK

U2 - 10.1109/SC.2008.5213209

DO - 10.1109/SC.2008.5213209

M3 - Conference contribution

AN - SCOPUS:70350757788

SN - 9781424428359

T3 - 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008

BT - 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008

T2 - 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008

Y2 - 15 November 2008 through 21 November 2008

ER -

Chen Y, Byna S, Sun XH, Thakur R, Gropp W. Hiding I/O latency with pre-execution prefetching for parallel applications. In 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008. 2008. 5213209. (2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008). doi: 10.1109/SC.2008.5213209

Hiding I/O latency with pre-execution prefetching for parallel applications

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this