Standardized Multi-protocol Data Management for Grid and Cloud GridRPC Frameworks
Caniou, Yves and Croubois, Hadrien and Le Mahec, Gaël
GridRPC is an international standard of the Open Grid Forum defining an API designed to allow applications to be submitted in a seamless way on large scale, heterogeneous and geographically distributed computing platforms. First versions of the standard did not take into account any data management feature. Data were parameters of the Remote Procedure calls, without any possibility to prefetch them, to use persistence, replication, external sources, etc., and making GridRPC codes middleware dependent. The data extension of the standard introduced a short set of functions and data structures to complete the API with simple but powerful data management features. In this paper, we present a modular and extensible implementation of both APIs, which needs only a few developments to be usable with any middleware relying on RPC, and which provides access to numerous and easy to extend protocols and data middleware to access data. Gaining data management functions, it introduces interesting potentiality for optimization that such an approach would provide to large scale applications.
@inproceedings{Caniou2014,
author="Caniou, Yves and Croubois, Hadrien and Le Mahec, Ga{\"e}l",
title="Standardized Multi-protocol Data Management for Grid and Cloud GridRPC Frameworks",
booktitle="Data Management in Cloud, Grid and P2P Systems",
editor="Hameurlain, Abdelkader and Dang, Tran Khanh and Morvan, Franck",
year="2014",
publisher="Springer International Publishing",
address="Cham",
pages="61--72",
abstract = "GridRPC is an international standard of the Open Grid Forum defining an API designed to allow applications to be submitted in a seamless way on large scale, heterogeneous and geographically distributed computing platforms. First versions of the standard did not take into account any data management feature. Data were parameters of the Remote Procedure calls, without any possibility to prefetch them, to use persistence, replication, external sources, etc., and making GridRPC codes middleware dependent. The data extension of the standard introduced a short set of functions and data structures to complete the API with simple but powerful data management features. In this paper, we present a modular and extensible implementation of both APIs, which needs only a few developments to be usable with any middleware relying on RPC, and which provides access to numerous and easy to extend protocols and data middleware to access data. Gaining data management functions, it introduces interesting potentiality for optimization that such an approach would provide to large scale applications.",
isbn="978-3-319-10067-8",
doi="10.1007/978-3-319-10067-8_6",
file="//shared.hadriencroubois.com/publications/2014/2014_Standardized_multi-protocol_data_management_for_grid_and_cloud_gridrpc_frameworks.pdf",
}