TY - GEN
T1 - Using property graphs for rich metadata management in HPC systems
AU - Dai, Dong
AU - Ross, Robert B.
AU - Carns, Philip
AU - Kimpe, Dries
AU - Chen, Yong
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/1/20
Y1 - 2014/1/20
N2 - HPC platforms are capable of generating huge amounts of metadata about different entities including jobs, users, and files. Simple metadata, which describe the attributes of these entities (e.g., file size, name, and permissions mode), has been well recorded and used in current systems. However, only a limited amount of rich metadata, which records not only the attributes of entities but also relationships between them, are captured in current HPC systems. Rich metadata may include information from many sources, including users and applications, and must be integrated into a unified framework. Collecting, integrating, processing, and querying such a large volume of metadata pose considerable challenges for HPC systems. In this paper, we propose a rich metadata management approach that unifies metadata into one generic property graph. We argue that this approach supports not only simple metadata operations such as directory traversal and permission validation but also rich metadata operations such as provenance query and security auditing. The property graph approach provides an extensible method to store diverse metadata and presents an opportunity to leverage rapidly evolving graph storage and processing techniques.
AB - HPC platforms are capable of generating huge amounts of metadata about different entities including jobs, users, and files. Simple metadata, which describe the attributes of these entities (e.g., file size, name, and permissions mode), has been well recorded and used in current systems. However, only a limited amount of rich metadata, which records not only the attributes of entities but also relationships between them, are captured in current HPC systems. Rich metadata may include information from many sources, including users and applications, and must be integrated into a unified framework. Collecting, integrating, processing, and querying such a large volume of metadata pose considerable challenges for HPC systems. In this paper, we propose a rich metadata management approach that unifies metadata into one generic property graph. We argue that this approach supports not only simple metadata operations such as directory traversal and permission validation but also rich metadata operations such as provenance query and security auditing. The property graph approach provides an extensible method to store diverse metadata and presents an opportunity to leverage rapidly evolving graph storage and processing techniques.
UR - http://www.scopus.com/inward/record.url?scp=84988222699&partnerID=8YFLogxK
U2 - 10.1109/PDSW.2014.11
DO - 10.1109/PDSW.2014.11
M3 - Conference contribution
AN - SCOPUS:84988222699
T3 - Proceedings of PDSW 2014: 9th Parallel Data Storage Workshop - Held in Conjunction with SC 2014: The International Conference for High Performance Computing, Networking, Storage and Analysis
SP - 7
EP - 12
BT - Proceedings of PDSW 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th Parallel Data Storage Workshop, PDSW 2014 - Held in Conjunction with the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2014
Y2 - 16 November 2014
ER -