Iteration based collective I/O strategy for Parallel I/O systems

Zhixiang Wang; Xuanhua Shi; Hai Jin; Song Wu; Yong Chen

doi:10.1109/CCGrid.2014.61

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

14 Scopus citations

Abstract

MPI collective I/O is a widely used I/O method that helps data-intensive scientific applications achieve better I/O performance. However, existing collective I/O strategies have been observed to perform poorly because of access contention. Existing collective I/O optimization strategies focus mainly on the efficiency of the I/O phase and ignore the shuffle cost, which may limit their potential performance improvement. We observe that as the I/O size grows, a single I/O operation from the application is split into several iterations. Consequently, I/O requests in each file domain are not necessarily issued to the parallel file system simultaneously unless they are carried out within the same iteration step. Based on that observation, this paper proposes a new collective I/O strategy that reorganizes I/O requests within each file domain instead of coordinating requests across file domains, eliminating access contention without introducing extra shuffle cost between aggregators and computing processes. Using the IOR benchmark, we evaluate the new strategy and compare it with the conventional one. The proposed strategy achieves up to 47%-63% I/O bandwidth improvement compared to the existing ROMIO collective I/O strategy.
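The observation in the abstract can be illustrated with a toy model. The sketch below is not the paper's algorithm. It assumes round-robin striping with one collective-buffer chunk per stripe, contiguous and equally sized file domains, and a simple per-aggregator rotation of the iteration order; the function names and the rotation scheme are hypothetical and chosen only for illustration. It counts how many aggregators hit the same storage server in a given iteration step.

```python
from collections import Counter


def chunk_server(chunk_index, num_servers):
    """Server holding a chunk under assumed round-robin striping."""
    return chunk_index % num_servers


def max_contention(num_aggregators, chunks_per_domain, num_servers, rotate):
    """Worst-case number of aggregators hitting one server in any step.

    Each aggregator owns a contiguous file domain of `chunks_per_domain`
    chunks and handles one chunk per iteration step. With `rotate=True`,
    aggregator `agg` starts at local chunk `agg` and wraps around, so the
    reordering stays inside its own file domain (assumed scheme).
    """
    worst = 0
    for step in range(chunks_per_domain):
        hits = Counter()
        for agg in range(num_aggregators):
            local = (step + agg) % chunks_per_domain if rotate else step
            global_chunk = agg * chunks_per_domain + local
            hits[chunk_server(global_chunk, num_servers)] += 1
        worst = max(worst, max(hits.values()))
    return worst


# 4 aggregators, 4 chunks per file domain, 4 storage servers.
print(max_contention(4, 4, 4, rotate=False))  # 4: all aggregators hit one server
print(max_contention(4, 4, 4, rotate=True))   # 1: each server serves one aggregator
```

In this model each aggregator still reads or writes exactly the chunks of its own file domain, only in a different order, so the mapping between aggregators and computing processes is unchanged. That is the sense in which reordering within a file domain avoids adding shuffle cost; real striping layouts and buffer sizes would need a more careful schedule.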

Original language	English
Title of host publication	Proceedings - 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2014
Publisher	IEEE Computer Society
Pages	287-294
Number of pages	8
ISBN (Print)	9781479927838
DOIs	https://doi.org/10.1109/CCGrid.2014.61
State	Published - 2014
Event	14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2014 - Chicago, IL, United States
Duration	May 26 2014 → May 29 2014

Publication series

Name	Proceedings - 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2014

Conference

Conference	14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2014
Country/Territory	United States
City	Chicago, IL
Period	05/26/14 → 05/29/14

Keywords

access contention
collective I/O
iteration
parallel system

Access to Document

10.1109/CCGrid.2014.61

Cite this

Wang, Z., Shi, X., Jin, H., Wu, S., & Chen, Y. (2014). Iteration based collective I/O strategy for Parallel I/O systems. In Proceedings - 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2014 (pp. 287-294). Article 6846464 (Proceedings - 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2014). IEEE Computer Society. https://doi.org/10.1109/CCGrid.2014.61

@inproceedings{94d1d9333d9a4ab182580e5f3ce12198,

title = "Iteration based collective I/O strategy for Parallel I/O systems",

keywords = "access contention, collective I/O, iteration, parallel system",

author = "Zhixiang Wang and Xuanhua Shi and Hai Jin and Song Wu and Yong Chen",

year = "2014",

doi = "10.1109/CCGrid.2014.61",

language = "English",

isbn = "9781479927838",

series = "Proceedings - 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2014",

publisher = "IEEE Computer Society",

pages = "287--294",

booktitle = "Proceedings - 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2014",

note = "14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2014 ; Conference date: 26-05-2014 Through 29-05-2014",

}

TY - GEN

T1 - Iteration based collective I/O strategy for Parallel I/O systems

AU - Wang, Zhixiang

AU - Shi, Xuanhua

AU - Jin, Hai

AU - Wu, Song

AU - Chen, Yong

PY - 2014

Y1 - 2014

AB - MPI collective I/O is a widely used I/O method that helps data-intensive scientific applications gain better I/O performance. However, it has been observed that existing collective I/O strategies do not perform well due to the access contention problem. Existing collective I/O optimization strategies mainly focus on the I/O phase efficiency and ignore the shuffle cost that may limit the potential of their performance improvement. We observe that as the size of I/O becomes larger, one I/O operation from the upper application would be separated into several iterations to complete. So, I/O requests in each file domain do not necessarily issue to the parallel file system simultaneously unless they are carried out within the same iteration step. Based on that observation, this paper proposes a new collective I/O strategy that reorganizes I/O requests within each file domain instead of coordinating requests across file domains, such that we can eliminate access contentions without introducing extra shuffle cost between aggregators and computing processes. Using benchmark workloads IOR, we evaluate our new strategy and compare with the conventional one. The proposed strategy achieves up to 47%-63% I/O bandwidth improvement compared to the existing ROMIO collective I/O strategy.

KW - access contention

KW - collective I/O

KW - iteration

KW - parallel system

UR - http://www.scopus.com/inward/record.url?scp=84904573374&partnerID=8YFLogxK

U2 - 10.1109/CCGrid.2014.61

DO - 10.1109/CCGrid.2014.61

M3 - Conference contribution

AN - SCOPUS:84904573374

SN - 9781479927838

T3 - Proceedings - 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2014

SP - 287

EP - 294

BT - Proceedings - 14th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2014

PB - IEEE Computer Society

T2 - 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2014

Y2 - 26 May 2014 through 29 May 2014

ER -
