How EGI services support the ENVRIplus project

Yin Chen reports on the progress made by four ENVRIplus Science Demonstrators

The ENVRIplus project is now entering its final stage and is due to end in April 2019. The objective of ENVRIplus has been to provide solutions to shared challenges for European Environmental and Earth System Research Infrastructures (RIs) in their efforts to deliver new services for science and society.

The project now reports seven Science Demonstrators that showcase implementation results of community use cases and serve as evidence of the added value brought to RIs.
EGI has been providing full support to the project’s service development and use case implementations. Four of the Science Demonstrators are now fully integrated with EGI services.

Here is an overview of the work done so far:

Cross Research Infrastructure improvements

The first Science Demonstrator addresses a common problem for ENVRIplus RIs: the preparation of data prior to transmission is often not yet sufficiently standardised. This hinders the operation of efficient cross-RI data processing routines, e.g., for data quality checking. The demonstrator introduces a service prototype that allows users to submit and publish raw observational environmental time series data in common standard formats (T-SOS XML and SSNO JSON).

The EGI Monitoring service (ARGO) is used as a messaging API. An Apache Storm topology deployed on the EGI Federated Cloud performs Near Real Time (NRT) quality control (QC) procedures on the incoming data and publishes the quality-controlled and labelled data via a messaging output queue.
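The flow described above (observations arrive through a messaging layer, are quality checked, and are published with a label to an output queue) can be illustrated with a small Python sketch. This is only an illustration under stated assumptions, not the demonstrator's code: it uses in-process queues instead of the ARGO messaging API or Apache Storm, and the `Observation` type, the range-check rule, the thresholds and the flag names are all hypothetical.

```python
from dataclasses import dataclass
from queue import Queue


@dataclass
class Observation:
    """One raw time series value (hypothetical structure)."""
    sensor_id: str
    timestamp: str  # ISO 8601 string
    value: float


def label_observation(obs, lower, upper):
    """Attach a QC flag using a simple range check (assumed rule)."""
    flag = "good" if lower <= obs.value <= upper else "suspect"
    return {
        "sensor_id": obs.sensor_id,
        "timestamp": obs.timestamp,
        "value": obs.value,
        "qc_flag": flag,
    }


def run_qc(incoming, outgoing, lower, upper):
    """Drain the input queue, label each observation, publish to the output queue."""
    while not incoming.empty():
        outgoing.put(label_observation(incoming.get(), lower, upper))


# Stand-ins for the messaging input and output queues.
incoming, outgoing = Queue(), Queue()
incoming.put(Observation("t-01", "2019-01-15T10:00:00Z", 12.4))
incoming.put(Observation("t-01", "2019-01-15T10:10:00Z", 97.0))

run_qc(incoming, outgoing, lower=-40.0, upper=60.0)
labelled = [outgoing.get() for _ in range(outgoing.qsize())]
```

In a real deployment the range check would be replaced by the RI's own QC procedures, and the queues by the messaging service.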

Watch demo.

EuroArgo Data Subscription Service

The Euro-Argo ERIC coordinates the European contribution to the international Argo programme for marine science observation. The EuroArgo Data Subscription Service allows researchers to subscribe to customised views of Argo data, select specific regions and time spans, and choose the frequency of updates. Tailored updates are then provided to researchers’ private storage.
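As a rough illustration of what such a subscription could capture, the Python sketch below models a region, a time span and an update frequency, and selects the matching profiles. It is a minimal sketch under assumptions: the `Subscription` class, its field names and the sample profiles are hypothetical and do not reflect the actual service's data model or API.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Subscription:
    """A customised view of the data (hypothetical fields)."""
    lat_range: tuple        # (min latitude, max latitude)
    lon_range: tuple        # (min longitude, max longitude)
    start: date             # first day of the time span
    end: date               # last day of the time span
    update_every_days: int  # requested update frequency

    def matches(self, lat, lon, day):
        """Return True if a profile falls inside the region and time span."""
        return (
            self.lat_range[0] <= lat <= self.lat_range[1]
            and self.lon_range[0] <= lon <= self.lon_range[1]
            and self.start <= day <= self.end
        )


# Invented sample profiles, for illustration only.
profiles = [
    {"float_id": "A", "lat": 43.5, "lon": 7.9, "day": date(2018, 6, 1)},
    {"float_id": "B", "lat": 60.2, "lon": -20.0, "day": date(2018, 6, 3)},
    {"float_id": "C", "lat": 41.0, "lon": 5.0, "day": date(2017, 1, 1)},
]

sub = Subscription(
    lat_range=(30.0, 46.0),
    lon_range=(-6.0, 36.0),
    start=date(2018, 1, 1),
    end=date(2018, 12, 31),
    update_every_days=7,
)

selected = [
    p["float_id"]
    for p in profiles
    if sub.matches(p["lat"], p["lon"], p["day"])
]
```

Here `update_every_days` is only carried as metadata; a scheduler would use it to decide when to recompute the selection and deliver it to the researcher's storage.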

The demo shows an integration solution that combines the EuroArgo community data portal with the EGI Federated Cloud to compute the data product for each subscription. The pilot activity was initiated by the marine research community. RIs can benefit from the subscription services, e.g., to create more elaborate data products by requesting data from other sources, and can optimise their internal workflows by signing up for automatic updates.

Watch demo.

Transferable data analytics

The third Science Demonstrator describes a service prototype that supports aerosol scientists in studying new atmospheric particle formation events by moving data analysis from local computing environments to interoperable infrastructures. This harmonises data analysis and, more importantly, the syntax and semantics of data derived from analysis. The demonstrator showcases a possible architecture of a socio-technical infrastructure that transforms data into knowledge.

The service allows researchers to access JupyterLab operated on the EGI e-Infrastructure to analyse primary data for the purpose of detecting and describing new particle formation events. JupyterLab is accessible from the corresponding D4Science Virtual Research Environment (VRE). This approach opens up a range of novel possibilities, in particular enabling researchers to focus on data analysis and interpretation while leaving data access and transformation from and to systems to interoperable infrastructure. The demonstrator contributes to implementing the global agenda of FAIR data by promoting the notion of “FAIR by Design”, weaving data FAIRness into the fabric of infrastructures. It builds on the principle that making data FAIR should not be left to researchers but guaranteed by the design of well-engineered infrastructures.

Watch demo.

LifeWatch

LifeWatch is a Research Infrastructure set up to support the fields of ecosystems research and biodiversity by equipping scientists with access to data, analytical tools and state-of-the-art virtual laboratories. The last Science Demonstrator illustrates how a LifeWatch researcher can easily upload and integrate an analysis algorithm in D4Science, and share it with other researchers in a VRE. The proposed use case is an integration solution that links the D4Science/gCube VRE to the LifeWatch RI and to the EGI e-Infrastructure.

This integration enables individual researchers to repeat and reuse algorithms, run trend analysis, and add new parameters and custom data. The VRE provides provenance registration that improves reproducibility and also allows retention of computation results in the user’s workspace. This facilitates the editing and adaptation of algorithms, features that are not provided by the existing LifeWatch ICT.

Watch demo.

More information

Yin Chen is Senior Technical Outreach Expert at the EGI Foundation and works in WP9 of the ENVRIplus project.

ENVRIplus website
