HPC-Cloud-Big Data Convergent Architectures and Research Data Management: The LEXIS Approach
Hachinger Stephan, Martinovič Jan, Terzo Olivier, Levrier Marc, Scionti Alberto, Magarielli Donato, Goubier Thierry, Parodi Antonio, Harsh Piyush, Apopei Florin-Ionut, Munke Johannes, García-Hernández Rubén, Golasowski Martin, Hayek Mohamad, Donnat Frédéric, Ganne Laurent, Koch-Hofer Cédric, Vitali Giacomo,
Viviani Paolo, Schorlemmer Danijel, Danovaro Emanuele, Parodi Andrea, Murphy Seán, and Dees Aaron
In Proceedings of International Symposium on Grids & Clouds 2021 — PoS(ISGC2021) Jan 2021
The LEXIS project (Large-scale EXecution for Industry & Society, H2020 GA825532) provides a platform for optimised execution of Cloud-HPC workflows, reducing computation time and increasing energy efficiency. The system will rely on advanced, distributed orchestration solutions (Atos YSTIA Suite, with Alien4Cloud and Yorc, based on TOSCA), the High-End Application Execution Middleware HEAppE, and new hardware capabilities for maximising efficiency in data processing, analysis and transfer (e.g. Burst Buffers with GPU- and FPGA-based data reprocessing). LEXIS handles computation tasks and data from three Pilots, based on representative and demanding HPC/Cloud-Computing use cases in Industry (SMEs) and Science: i) Simulations of complex turbomachinery and gearbox systems in Aeronautics, ii) Tsunami simulations and earthquake loss assessments which are time-constrained to enable immediate warnings and to support well-informed decisions, and iii) Weather and Climate simulations where massive amounts of in-situ data are assimilated to improve forecasts. A user-friendly LEXIS web portal, as a unique entry point, will provide access to data as well as workflow-handling and remote visualisation functionality. As part of its back-end, LEXIS builds an elaborate system for the handling of input, intermediate and result data. At its core, a Distributed Data Infrastructure (DDI) ensures the availability of LEXIS data at all participating HPC sites, which will be federated with a common LEXIS Authentication and Authorisation Infrastructure (with unified security model, user database and policies). The DDI leverages best of breed data-management solutions from EUDAT, such as B2SAFE (based on iRODS) and B2HANDLE. REST APIs on top of it will ensure a smooth interaction with LEXIS workflows and the orchestration layer. Last, but not least, the DDI will provide functionalities for Research Data Management following the FAIR principles (“Findable, Interoperable, Accessible, Reusable”), e.g. DOI acquisition, which helps to publish and disseminate open data products.