Open data resources

Here we list all PaN-related open data resources we know of, either domain-specific resources or open data portals from our individual facilities. We limit this list to data hosting repositories, excluding aggregators like the PaNOSC data portal or the EOSC search portal.

Domain-specific repositories

AlphaFold

Protein Structure Database

AlphaFold, the state-of-the-art AI system developed by DeepMind, is able to computationally predict protein structures with unprecedented accuracy and speed. Working in partnership with EMBL’s European Bioinformatics Institute (EMBL-EBI), we’ve released over 200 million protein structure predictions by AlphaFold that are freely and openly available to the global scientific community. Included are nearly all catalogued proteins known to science – with the potential to increase humanity’s understanding of biology by orders of magnitude.

API CC-BY-4.0
BMRB

Biological Magnetic Resonance Bank

BMRB makes bio-NMR data FAIR. It collects, annotates, archives, and disseminates spectral and quantitative data derived from NMR spectroscopic investigations of biological macromolecules and metabolites.

API
CSD

Cambridge Structural Database

The Cambridge Structural Database, or CSD, has been curated since 1965 from the published literature, direct deposition, and sources such as patents and PhD theses.
The world’s largest database of small-molecule organic and metal-organic crystal structure data, the CSD is managed by the Cambridge Crystallographic Data Centre (CCDC).

crystallography subscription-based
CXIDB

Coherent X-ray Imaging Data Bank

CXIDB offers scientists from all over the world a unique opportunity to access data from Coherent X-Ray Imaging (CXI) experiments. The website also serves as the reference for the CXI file format, in which most of the experimental data on the database is stored in.

coherent x-ray imaging free
EMDB

Electron Microscopy Data Bank

EMDB (the Electron Microscopy Data Bank) is a public repository for electron cryo-microscopy maps and tomograms of macromolecular complexes and subcellular structures. It covers a variety of techniques, including single-particle analysis, electron tomography, sub-tomogram averaging, fibre diffraction and electron crystallography.

API electron microscopy free
Human Organ Atlas

Human Organ Atlas

The Human Organ Atlas is making Hierarchical Phase-Contrast Tomography (HiP-CT) 3D scans of entire organs, with ca. 20 micron voxels, open access.

CC-BY-4.0 free
International XAFS DB

International XAFS Database Portal of the Japanese XAFS Society

This International XAFS DB Portal was created with the aim of making XAFS data from around the world findable.

delegated-license x-ray absorption fine structure free
MX-RDR

Macromolecular Xtallography Raw Data Repository

The Macromolecular Xtallography Raw Data Repository (MX- RDR) was developed as a part of an EU funded project, coordinated by ICM UW, which aimed to create three open access discipline dedicated raw data repositories.

The MX-RDR repository, which is accessible via the web portal at https://mxrdr.icm.edu.pl, was designed to archive and provide access to raw diffraction data collected for macromolecular crystals. It includes tools for creating datasets of crystallographic metadata by combining information extracted directly from diffraction images and obtained from a PDB deposit and/or user input. Each data set is characterized by rich metadata, both to facilitate their management and long-term curation, and to allow effective scientific reuse. The resource can be searched using various criteria and all data are available for unrestricted access and download.

CC-BY-4.0 / CC0-1.0 / All rights reserved macromolecular crystallography free
PED

Protein Ensemble Database

PED is a platform for the intrinsically disordered proteins (IDP) community where ensembles and their corresponding primary data can be stored and used as benchmarking datasets to facilitate the development of new ensemble calculation methods.

API
Perovskite DB

The Perovskite Database Project

The Perovskite Database Project aims at making all perovskite device data, both past and future, available in a form adherent to the FAIR data principles, i.e. findable, accessible, interoperable, and reusable.

In the initial phase of the project, the project team went through the over 16000 perovskite papers published until the end of February 2020 and extracted data for every single adequately described perovskite solar cell we could find. For papers published after that, the database relies on authors to upload their own data.

The project is based around an open database and open-sourced tools enabling anyone, without any programming experience, to interactively explore, search, filter, analyse, and visualise the data. The core of those tools are a set of interactive graphics. The interactive graphics are hosted by MaterialsZone. To reach the graphics, you will need to create a free account by filling out this form. Shortly after filling out the form, you will receive an invitation by email.

CC-BY-4.0 free after registering
RefXAS

RefXAS- an open access database of XAS spectra

In the frame of DAPHNE4NFDI, an X-ray absorption spectroscopy (XAS) reference database called RefXAS has been set-up where users are provided with well curated XAS reference spectra along with related metadata fields and online processing tools for visualizing the data. The developed online procedure enables users to submit a raw dataset along with its associated metadata via a dedicated website for inclusion in the database. The unique feature of quality criteria formulated for the uploaded reference data at the database make users aware about the usability of the data. These quality criteria are further employed for automatic quality check of the uploaded data which is then followed by manual curation at the interface. Implementation of the database includes an upload of metadata to the Scientific-Catalogue and an upload of files via object storage, with automated query capabilities through a web server and visualisation of the data and files. A prototype of the database with integrated quality control for uploaded spectra has been created, which can process common data types for X-ray absorption spectra and has a standardized metadata schema.

CC-BY-4.0 x-ray absorption spectroscopy free
SASDB

Small Angle Scattering Biological Data Bank

SASBDB is a curated repository of freely accessible and fully searchable SAS experimental data, which are deposited together with the relevant experimental conditions, sample details, instrument characteristic and derived models. The quality of deposited experimental data and the accuracy of models obtained from SAS and complementary techniques is assessed by the site developers.

API small angle scattering free
TomoBank

X-Ray Tomography Data Bank

The X-ray Tomography Data Bank or TomoBank, provides a repository of experimental and simulated data sets with the aim to foster collaboration among computational scientists, beamline scientists and experimentalists, to accelerate the development of tomographic reconstruction and 3D visualization methods and to speed up their implementation in the various synchrotron facility data analysis software packages.

CC-BY-4.0 with exceptions tomography free
wwPDB

Protein Data Bank

CoreTrustSeal

Since 1971, the Protein Data Bank archive (PDB) has served as the single repository of information about the 3D structures of proteins, nucleic acids, and complex assemblies.

CC0-1.0

PaN facilities repositories

Facility Open data repository OAI-PMH endpoint PaN search API endpoint
ALBA data.cells.es/...
Endpoint: link [Identify]
Items: 25
Sets: 2
SetItems Types
Life Science2 Dataset: 2
Spectroscopy22 Dataset: 22

Types 
Type Items
Dataset 25
last check: 2025-01-20
Endpoint: link [count]
Datasets: 1
last check: 2025-01-20
Elettra opendata.elettra.eu/...
Endpoint: link [Identify]
Items: 0
last check: 2025-01-20
Endpoint: link [count]
Status: Error
last check: 2025-01-20
ESRF data.esrf.fr/... CoreTrustSeal
Endpoint: link [Identify]
Items: 7,204
Types 
Type Items
Collection 7,204
last check: 2025-01-20
Endpoint: link [count]
Datasets: 419,228
last check: 2025-01-20
ESS scicat.ess.eu/...
Endpoint: link [Identify]
Harvesting suspended.
last check: 2025-01-20
Endpoint: link [count]
Status: Error
last check: 2025-01-20
EuXFEL in.xfel.eu/metadata/...
Endpoint: link [Identify]
Items: 5
Sets: 1
SetItems
OpenAIRE5
last check: 2025-01-20
Endpoint: link [count]
Datasets: 101
last check: 2025-01-20
HZB
Endpoint: link [Identify]
Items: 28,952
Sets: 3
SetItems Types
Data Publication10 Dataset: 10
Raw Dataset28,733 Dataset: 28,733
Investigation Raw Data209 Collection: 209

Types 
Type Items
Collection 209
Dataset 28,743
last check: 2025-01-20
last check:
HZDR rodare.hzdr.de/...
Endpoint: link [Identify]
Items: 1,037
Sets: 39
SetItems Types
OpenAIRE data sets1,066 Dataset: 1,066
Software217 Software: 217
ATHENA (Accelerator Technology HElmholtz iNfrAstructure)3 Dataset: 3
CASUS23 Audiovisual: 1
Dataset: 14
Software: 8
Electron-beam testing station for detectors (at ELBE)2 Dataset: 2
High-power ultra-short pulse laser DRACO (at ELBE)14 Dataset: 12
Software: 2
DRESDYN (DREsden Sodium facility for DYNamo and thermohydraulic studies)1 Image: 1
OpenAIRE95 Audiovisual: 4
Dataset: 49
Image: 3
Software: 39
ELBE (Electron Linac for beams with high Brilliance and low Emittance)85 Dataset: 74
Image: 2
Other: 1
Software: 8
Research field: Energy103 Dataset: 53
Image: 1
Other: 3
Software: 46
Free-Electron Laser (FELBE)5 Dataset: 4
Other: 1
Department of Information Services and Computing7 Dataset: 1
Image: 1
Other: 1
Software: 4
Institute of Fluid Dynamics161 Audiovisual: 2
Dataset: 104
Other: 6
Software: 47
Text: 2
Helmholtz Institute Freiberg for Resource Technology2 Dataset: 2
Institute of Ion Beam Physics and Materials Research105 Audiovisual: 1
Dataset: 91
Image: 8
Other: 4
Software: 1
Institute of Radiation Physics14 Dataset: 9
Other: 4
Software: 1
Nuclear Physics Department1 Dataset: 1
Institute of Radiooncology – OncoRay1 Other: 1
Institute of Resource Ecology7 Dataset: 7
Research field: Health35 Audiovisual: 1
Dataset: 28
Image: 2
Other: 1
Software: 3
Helmholtz International Beamline for Extreme Fields (HIBEF)4 Dataset: 2
Other: 2
Helmholtz-Zentrum Dresden-Rossendorf218 Audiovisual: 7
Dataset: 126
Image: 7
Other: 19
Software: 54
Text: 5
Ion Beam Center86 Audiovisual: 1
Dataset: 74
Image: 8
Other: 3
Research field: Matter91 Audiovisual: 3
Dataset: 70
Image: 5
Other: 4
Software: 8
Text: 1
Mu2e7 Dataset: 5
Software: 2
Neutron Time-Of-Flight Measurements1 Dataset: 1
Institute of Radiooncology - OncoRay15 Dataset: 14
Other: 1
OpenFOAM29 Software: 29
The Photon and Neutron Open Science Cluster (PaNOSC)3 Dataset: 3
Positrons (pELBE)10 Dataset: 10
Center for Positron Emission Tomography6 Dataset: 6
ROBL – The Rossendorf Beamline at ESRF2 Dataset: 2
Rodare1,022 Audiovisual: 12
Dataset: 736
Image: 36
Other: 47
Software: 185
Text: 6
ROFEX - Ultrafast electron beam X-ray computed tomography17 Dataset: 14
Other: 3
The Superconducting Electron Linear Accelerator (at ELBE)4 Dataset: 4
Superradiant THz source (TELBE)11 Dataset: 9
Other: 1
Software: 1
TOPFLOW -Transient Two Phase FlowTest Facility81 Dataset: 78
Other: 1
Software: 2
ZRT - Institute of Radiopharmaceutical Cancer Research 4 Dataset: 4
Bremsstrahlung (γELBE) 2 Dataset: 2

Types 
Type Items
Audiovisual 12
Dataset 751
Image 36
Other 47
Software 185
Text 6
last check: 2025-01-20
Endpoint: link [count]
Datasets: 47
last check: 2025-01-20
ILL data.ill.eu/...
Endpoint: link [Identify]
Items: 0
last check: 2025-01-20
Endpoint: link [count]
Status: Error
last check: 2025-01-20
ISIS data.isis.stfc.ac.uk/data...
Endpoint: link [Identify]
Harvesting suspended.
last check: 2025-01-20
Endpoint: link [count]
Datasets: 165,270
last check: 2025-01-20
MAX IV scicat.maxiv.lu.se/...
Endpoint: link [Identify]
Querying failed.
last check: 2025-01-20
Endpoint: link [count]
Datasets: 100
last check: 2025-01-20
PSI doi.psi.ch/...
Endpoint: link [Identify]
Items: 0
last check: 2025-01-20
Endpoint: link [count]
Datasets: 3,402
last check: 2025-01-20
SESAME access.sesame.org.jo/get-...
last check:
SOLEIL datacatalog.synchrotron-s...
Endpoint: link [Identify]
Querying failed.
last check: 2025-01-20
Endpoint: link [count]
Status: Error
last check: 2025-01-20
Totals Items: 37,223
Datasets: 29,519
Collections: 7,413
Datasets
+Collections: 36,932
Datasets: 588,149