Ingestion#
Here we introduce you to the tools and processes involved in bringing data in to the DKRZ CDP from different data sources. We aim to teach you how to use these tools yourselves.
New data#
Data that is introducued to the DKRZ CDP is either primary published or replicated. Primary published data is published on the ESGF node at DKRZ. This process is initialized by a request from the data submitter. While this data “only” needs to be transferred securely into the DKRZ CDP storage space for publication, the transfer of data for replication from other nodes is more complex.
Archived Data#
Data archived in the World Data Center of Climate, like CMIP6 data, can be retrieved on Levante with the tool “jblob”. Here we show you how to create a download script with the WDCC CMIP6 data download generator.