WP2 - Enabling FAIR data for EU Photon and Neutron national RIs

WP2 extends and deepens the adoption and use of FAIR data principles within the Photon and Neutron community to allow publication and access of national RI data and services within the EOSC.

Lead partner: UKRI

Budgeted participation per partner:

UKRI | ALBA | HZB | PSI | SOLEIL | Elettra
45 PMs | 18 PMs | 18 PMs | 15 PMs | 12 PMs | 11 PMs

Tasks

Progress on each of the work package's tasks is reported in this section, with regular updates indicated by dates. The original description of each task, as written in the project's description of work, is given under "Task". Only the latest update of the next steps is displayed.

T2.1 Alignment of policies (PSI)
Task: Relevant Research Infrastructures have a variety of data policies and practices, typically building on the PaNdata Common Policy Framework (2011), and on later activities in CALIPSOplus. This task will review current data policies and revise this framework within the policy recommendation of the EOSC and FAIR data principles. Further factors on data policy, for example, IPR and data licensing, commercial data, and sensitive data (e.g. GDPR) will also be considered. The task will work closely with PaNOSC, participating in a policy workshop and other consultation exercises.
Progress: 29 Apr 2020:
ExPaNDS' involvement in the PaNOSC revision of the 2011 PaN data policy framework has progressed very well over the last two months, thanks to up to two workshops per week.
The PaNOSC draft policy framework is currently being evaluated against the RDA data maturity model and ExPaNDS partners are contributing to these discussions.
10 Feb 2021:
After the publication of the PaNOSC model data policy in May ‘20, WP2 started to consult our national facilities on it. The first consultations revealed the need to change the initial strategy for the D2.1 deliverable. More time and a step back were needed to hold an in-depth dialogue with facility management and to be able to advise them on possible updates to their data policies.
A landscape of the national data policies was included in the deliverable as well as 30 key elements of a data policy, to be used for further consultations. FAIRsFAIR's policy enhancement recommendations were also applied in our framework. This strategy and subsequent deliverable were presented during the INFRA-EOSC-5 task force on FAIR.
08 Oct 2021:
The formal consultations with the facility staff in charge of data policies were carried out between February and May 2021 with each ExPaNDS partner facility. They led to the publication of the final data policy framework of the project in August 2021.
Our work on the FAIRification of data policies was featured by the FAIRsFAIR project as the first of its adoption stories.
29 Sep 2022:
This task is now completed.
T2.2 Data management planning and DMP (HZB)
Task: As the science domains, instruments and techniques vary across national RI experiments, each experiment should describe its approach to providing FAIR data via a tailored Data Management Plan (DMP). This would give details on the metadata collected, the approach to data storage and release, and the intended approach to data management for derived data. The DMP can then be used to guide the collection and validation of data and metadata and their subsequent publication and use. Much of the additional cost of developing DMPs for experiments can be mitigated by taking a common approach for a facility and its instruments, and by automatically populating metadata information based on the proposal and instrument information. We propose to develop a systematic approach to the development of DMPs within national RIs. This would include considering outcomes of various international working groups such as the Commission’s Expert Group on FAIR data, the RDA Active Data Management Plans, IGs and related WGs, as well as the PaNOSC DMP templates for experiments. We would also consider tools for DMPs, such as the Research Data Management Organiser (RDMO).
Thus we will make recommendations on a common DMP framework for national RIs, considering knowledge sources and related roles and activities for DMP-relevant information, and then develop a common DMP template for use within RIs, aligned to that of PaNOSC, which can be tailored to particular instruments and scientific methods. We will then develop and trial an approach to active DMPs, integrating the DMP information into the data lifecycle and metadata collections, and within the RDMO tool for policy enforcement and reporting.
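As an illustration of the automatic population described in this task, the following is a minimal Python sketch, not the project's implementation. It assumes a hypothetical facility where proposal and instrument records are available as simple objects; the class names, field names, DMP question keys and default values are all invented for the example and do not come from the PaNOSC template or from RDMO.

```python
from dataclasses import dataclass


@dataclass
class Proposal:
    """Hypothetical record exported from a facility's proposal system."""
    proposal_id: str
    title: str
    principal_investigator: str


@dataclass
class Instrument:
    """Hypothetical record describing a beamline or instrument."""
    name: str
    technique: str
    data_format: str


def prefill_dmp(proposal, instrument, facility_defaults):
    """Return a DMP as a dict, filled in from already-known sources.

    Questions that only the user can answer are left as None so they
    can be flagged for manual completion.
    """
    # Start from answers that are identical for every experiment at the facility.
    dmp = dict(facility_defaults)
    # Add what the proposal system already knows.
    dmp["proposal_id"] = proposal.proposal_id
    dmp["experiment_title"] = proposal.title
    dmp["data_owner"] = proposal.principal_investigator
    # Add what the instrument description already knows.
    dmp["instrument"] = instrument.name
    dmp["technique"] = instrument.technique
    dmp["raw_data_format"] = instrument.data_format
    # Illustrative questions that cannot be derived automatically.
    dmp["sample_description"] = None
    dmp["derived_data_plan"] = None
    return dmp


def open_questions(dmp):
    """List the DMP questions that still need an answer from the user."""
    return sorted(key for key, value in dmp.items() if value is None)


if __name__ == "__main__":
    # All values below are made up for the example.
    proposal = Proposal("P-0001", "Example experiment", "A. Researcher")
    instrument = Instrument("BL-X", "diffraction", "NeXus/HDF5")
    defaults = {"embargo_period_years": 3, "licence": "CC-BY-4.0"}
    dmp = prefill_dmp(proposal, instrument, defaults)
    print(open_questions(dmp))
```

In this sketch the user is only asked the questions returned by `open_questions`, which is the sense in which a common facility-level approach could reduce the cost of writing a DMP per experiment.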
Progress: 29 Apr 2020:
T2.2 helped WP1 elaborate the project DMP in Jan ‘20. The activities on DMPs for PaN RIs have just started.
10 Feb 2021:
Following introductory work on DMPs with PaNOSC, the task was officially kicked off for WP2 in early Jan. ‘21. In Dec. ‘20, discussions took place with DANS (EOSC-hub) about a possible workshop on DMPs to be organised during the first quarter of 2021. The topic will be followed up in the next few weeks.
08 Oct 2021:
The DMP template developed by PaNOSC, with important contributions from ExPaNDS, is now being finalised (due by the end of November 2021).
As a complement, ExPaNDS is analysing when during the experimental lifecycle the information required to answer the template questions becomes available and where that information comes from, e.g. from the proposal system, from the instruments or directly from the user. This analysis is linked to the experimental lifecycle metadata analysis that was carried out as part of task 2.3. The associated deliverable is being finalised and is due by the end of November.
29 Sep 2022:
D2.8 is under preparation.
Next steps: 29 Sep 2022: The deliverable D2.8 will be finished by the end of November.
T2.3 Mainstreaming of standards for data management (ALBA)
Task: To enable FAIR access to data for users and re-users of data, consistent open metadata standards should be used. However, Photon and Neutron RIs use a variety of metadata standards and formats. A survey of the current use and requirement of metadata within RIs will be undertaken and recommendations on best practice on metadata standards for Findability, Accessibility, Interoperability and Reusability will be developed for the benefit of Photon and Neutron service providers and the wider science user community. This will be undertaken by developing a common metadata framework on different aspects of metadata and other practices required for FAIR data, and in close interaction with the development of detailed metadata ontologies and implementations in data catalogue and API services in WP3, and for use in WP4, coordinating closely with PaNOSC. The framework will include recommendations to profile RI metadata to publish RI data into common EOSC data catalogues, e.g. OpenAire-Zenodo and EUDat-B2Find, to consider the use of common metadata formats and standards, such as schema.org, and to promote cross-search between RI catalogues published in the EOSC Portal.
Further, common metadata will promote data reuse, with increased contextual and provenance metadata, supporting the whole data lifecycle, including preservation, and considering the use of relationships with ontologies for provenance and preservation, such as PROV-O and PREMIS. The metadata framework will take into account the software environment where data were produced and the environment required to reuse the data, as well as events in the data lifecycle such as data creation, validation or verification. This task will provide a draft framework early in the project, for use in WP3 and WP4, and a revised recommendation late in the project.
Progress: 29 Apr 2020:
The baseline for this task was defined thanks to the landscaping survey done with all partner facilities in Dec ‘19, which covered the level of FAIRness and GDPR compliance of existing data policies, and current practices regarding DMPs. A common glossary of terms for data management at national RIs was started in March. As a first step, terms and definitions from a range of sources within PaN (e.g. existing PaN facility data policies) and outside PaN (e.g. RDM glossary) were collected by WP2 partners.
10 Feb 2021:
Building on the glossary work, WP2 provided input to the EOSC glossary in Oct. ‘20, which was included in the next version.
The main effort for this task was the common work with the partners and WP3 to agree on what metadata is to be recorded at each step of an experiment life cycle, in an attempt to update the PaNdata ODI. This work is detailed in the draft recommendations for FAIR Photon and Neutron Data Management published in Dec. ‘20.
08 Oct 2021:
The draft recommendations for FAIR PaN data management were presented to the INFRA-EOSC-5 Task force on FAIR in March 2021.
The strategy to update the recommendations was defined and will focus on the “core” metadata fields, identified with the highest priority in the first version of the document. We will evaluate how these are implemented in practice, e.g. whether they are in WP3’s search API and whether they relate to other open science models such as Dublin Core, B2FIND, DataCite, DCAT2 and NeXus.
29 Sep 2022:
The metadata workshop took place in March 2022 and the deliverable was published in July.
Abigail will ask Nicolas for a stand-alone document for the metadata framework (from 2.2). There was a change in 2.7 and, furthermore, it would be nice to have a separate document (like a technical note or a guidance note).

Next steps: 29 Sep 2022: The deliverable D2.8 will be finished by the end of November.
T2.4 Persistent Identifier infrastructure (UKRI)
Task: Persistent Identifiers (PIDs) are used for data publishing within some existing RIs. We will promote PIDs within the community, sharing best practices in their use and citation by user communities, and on mapping from the metadata framework to PID provider metadata. We will work with projects such as OpenAire Advance (OpenAire Graph) and FREYA (PID Graph) to contribute to the cross referencing of PID information into a graph of connections. Further, we will explore how work on PIDs for additional resources (e.g. instruments, software, samples) could be used within the Photon and Neutron communities.
Progress: 10 Feb 2021:
In June ‘20, WP2 participated in the discussions on the EOSC PID policy during a workshop organised by the EOSC WG on FAIR.
In the project per se, the task was kicked off in Jan. ‘21. It will develop around two working groups: the first is composed of the usual task contributors and will work on the deliverables; the second includes more individuals, notably from WP3, and will be used as a forum on PID practices at our different facilities. A first meeting of the latter took place at the end of Jan. ‘21.
08 Oct 2021:
The work on this task so far has been to survey the PIDs used within and outside the community and to see which possibilities for PIDs and PID graphs we could exploit at our facilities.
A first presentation of this work-in-progress was made at the librarian and data managers symposium in September 2021 (see task 2.6).
29 Sep 2022:
A workshop took place last year and deliverable D2.5 was published in March 2022. This task is now completed.
T2.5 Quality assurance and certification schemes for data repositories (UKRI)
Task: Certification schemes for quality data repositories (e.g. CoreTrustSeal) and assessment schemes for FAIR data are now emerging. This task will assess these schemes and profile their application to the Photon and Neutron community, and will lead an open self-assessment exercise of the national research infrastructures against these schemes. The task will work closely with the successful project within INFRAEOSC-05-c, which includes the objective of developing a certification scheme for evaluating FAIR data sharing and publication. We would evaluate how the collected metadata and associated data management procedures contribute to certification and what can be learned from related standards (e.g. CoreTrustSeal, ISO Standard 16363 Information package, PREMIS) and self-assessment against the appropriate parts of the certification standard to achieve FAIR data principles.
Progress: 10 Feb 2021:
Following the INFRA-EOSC-5 TF on FAIR dedicated to automatic FAIR assessment of repositories, WP2 is evaluating the opportunity to use F-UJI to assess the FAIRness of our facilities' repositories.
08 Oct 2021:
This task started with an assessment of the current background of FAIR assessment schemes, using e.g. material published by FAIRsFAIR (M4.2 and M4.3).
Our early approach to this task tends to focus on the experimental stages at the beamlines and on whether or not we are making the data produced there FAIR. However, this is not the focus of current tools like F-UJI, which focus on the FAIR assessment of datasets or repositories. As a result, we may produce a tailored survey to be used for self-assessment by facilities or beamlines themselves.
29 Sep 2022:
This task is nearly finished.
Next steps: 29 Sep 2022: ongoing
T2.6 Uptake of FAIR data practices (UKRI)
Task: Advocacy of the use of FAIR data practice will need to be tailored to different stakeholder groups, in order to be most effective. In particular we will target the following groups:
• senior management and steering bodies;
• instrument scientists and other facilities staff;
• user groups and specific science communities.
We will prepare promotional material and presentations, highlighting the benefits of FAIR data, including user case studies and stories, developing domain specific guidance and training on benefits and rewards of FAIR data within national RIs, working closely with WP5 and WP6 within workshops and outreach. We will also work closely with the competency centre in FAIRsFAIR to make recommendations for RIs to incentivise the use of FAIR data in policies, DMPs and citations, and also consider the skills and competency framework specifically for national RIs.
Progress: 29 Apr 2020:
In Nov. ‘19, WP2 gave a presentation at the OpenScienceFair in Porto and participated in the FAIRsFAIR synchronisation task force in Budapest.
March ‘20 saw an initial discussion between WP5 and WP2 around the first training workshop on FAIR data practices to be held in Autumn ‘20.
There has also been some initial discussion with the FAIRsFAIR project to determine what training resources and other outputs from that project could prove useful for the ExPaNDS workshop.
Since the FAIR task force started in Jan. ‘20, WP2 has also participated in it together with other 5b projects (~1 meeting per month to share progress and common goals).
In April 2020, separate meetings with the EOSC glossary project and FAIRsFAIR were also undertaken.
10 Feb 2021:
WP2 organised a series of two workshops on FAIR in Oct. ‘20, with ~100 attendees for each session (see recordings). The workshops offered participants an overview of the benefits of FAIR, the difference between FAIR and open, the use of DOIs and what metadata is to be expected at each step of the life cycle of a typical experiment. They were aimed at instrument scientists, computing staff and senior management. Speakers from FAIRsFAIR and PaNOSC also contributed.
WP2 also participated in FAIRsFAIR synchronisation workshops in June 2020 and applied for a session at the next RDA meeting in April under the PaNSIG group banner.
WP2 also met with WP5 to better define the strategy for the training material to be produced, including also FAIRsFAIR contributors.
08 Oct 2021:
The recent major WP2 outreach event has been the librarian and data managers symposium, which was held on the 30th of September 2021.
WP2 also contributed to the “let’s talk about FAIRyfying OS policies” session of the INFRA-EOSC-5 projects at the OSFair in September 2021 and to the synchronisation workshops series of FAIRsFAIR in April, May and June 2021.
29 Sep 2022:
This topic was covered during the face-to-face meeting in Prague. Heike is going to write a guidance note. Some of Patrick's talks about FAIR could be included.
Next steps: 29 Sep 2022: ongoing

Deliverables

"Accepted" deliverables are the ones approved by the European Commission and thus published in CORDIS with other EU projects results.

"Delivered" deliverables are the ones we submitted to the European Commission but are not yet approved. They can only be found in our Zenodo community for now.

"Pending" deliverables are still in progress and were not yet submitted by the project to the European Commission.

Accepted deliverables | Partner | Date
2.1 Draft extended data policy framework for Photon and Neutron RIs | PSI | 18 Sep 2020
2.2 Draft recommendations for FAIR Photon and Neutron Data Management | ALBA | 14 Dec 2020

Delivered deliverables | Partner | Date
2.3 Final data policy framework for Photon and Neutron RIs | PSI | 20 Aug 2021
2.4 DMPs for Photon and Neutron RIs | HZB | 30 Nov 2021
2.5 Advanced infrastructure for PIDs in Photon and Neutron RIs | UKRI | 03 Mar 2022
2.6 Self-evaluation Photon and Neutron RIs for FAIR data certification | UKRI | 19 Dec 2022
2.7 Final Recommendations for FAIR Photon and Neutron Data Management | ALBA | 11 Jul 2022
2.8 Active DMPs for Photon and Neutron RIs | HZB | 21 Dec 2022
2.9 Report on promotion of FAIR data within Photon and Neutron RIs | UKRI | 31 Jan 2023

Pending deliverables | Partner | Date
No pending deliverables at the moment

Milestones

Achieved milestones | Partner | Date
10 Production of draft FAIR data framework | UKRI | 14 Dec 2020
11 Production of final FAIR data framework | UKRI | 30 Dec 2022

Pending milestones | Partner | Date
No pending milestones at the moment