DKRZ CMIP Data Pool (DKRZ CDP)

The DKRZ CMIP data pool (DKRZ CDP) contains frequently used flagship collections of climate model data. It is hosted as part of the DKRZ data infrastructure and supports scientists in collecting, accessing and processing high-volume climate data.

Furthermore, DKRZ supports IPCC authors by making the DKRZ CDP and the in-house technical infrastructure available within the framework of the DDC support activity. DDC support covers the IPCC DDC, in the form of the local IPCC DDC activities, and the IPCC Working Group Technical Support Units (WG TSUs).

Working with the CMIP Data Pool

Data access

The DKRZ CDP is part of the in-house file system and can be accessed by any DKRZ user with a current account. 

  • If you do not have an account yet, please see https://www.dkrz.de/up/my-dkrz/getting-started/getting-started-at-dkrz.
  • After registration, users need to join specific groups in order to be assigned personal storage and compute resources. Examples of such groups are:
    • DKRZ_MIP_POOL_Analysis: to support IPCC-related data analysis activities
    • ECAS: to support data analysis activities as part of the EOSCHUB project
    • DICAD: to support data analysis activities in the German DICAD consortium
    • IS-ENES: (starting early 2019) European service activity to support data analysis activities
  • Contact [Email protection active, please enable JavaScript.] for dedicated support regarding the DKRZ CDP

Data pool content 

Flagship datasets contained in the DKRZ CDP can be accessed on the HPC file system via symbolic links:

  • CMIP3 -> /pool/data/CMIP3
  • CMIP5 -> /pool/data/CMIP5
  • CMIP6 -> /pool/data/CMIP6
  • CORDEX -> /pool/data/CORDEX
  • REKLIES -> /pool/data/REKLIES
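A quick way to see which of these entry points are visible from the current machine is to resolve each link. The following is a minimal sketch, assuming it is run on a DKRZ node where /pool/data is mounted. The helper name `resolve_pool_links` and the `POOL_LINKS` mapping are hypothetical and not part of any DKRZ tooling.

```python
import os

# Entry points as listed above; the mapping itself is illustrative only.
POOL_LINKS = {
    "CMIP3": "/pool/data/CMIP3",
    "CMIP5": "/pool/data/CMIP5",
    "CMIP6": "/pool/data/CMIP6",
    "CORDEX": "/pool/data/CORDEX",
    "REKLIES": "/pool/data/REKLIES",
}


def resolve_pool_links(links=POOL_LINKS):
    """Return {name: resolved target path, or None if the path is not reachable}."""
    resolved = {}
    for name, path in links.items():
        # os.path.exists() is False for missing paths and dangling links;
        # os.path.realpath() follows symbolic links to the final target.
        resolved[name] = os.path.realpath(path) if os.path.exists(path) else None
    return resolved


if __name__ == "__main__":
    for name, target in resolve_pool_links().items():
        print(f"{name}: {target if target else 'not available on this machine'}")
```

On a machine without access to the pool, every entry is reported as not available, which makes the script safe to run anywhere.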

A complete overview of the DKRZ CDP data is available by browsing through

  • /work/kd0956 and /work/ik1017

Based on current estimates, around 2 PByte are reserved for providing replicated CMIP data from other ESGF data nodes from around the world. Around 100 TByte are currently reserved for storing derived data products.

If ESGF CMIP(6) data that is available at other ESGF nodes is missing from the pool, you can request its replication by contacting [Email protection active, please enable JavaScript.]. If you have requirements regarding the storage of derived data products or the inclusion of data sets that are not accessible via ESGF, please contact [Email protection active, please enable JavaScript.].

Data search and browsing

There are several ways to search and browse data items in the DKRZ CDP:

  • Use the command line in your favorite UNIX shell environment
  • Use the web accessible search index (only for CMIP5 and CMIP6 data, requires login):
  • Use the search index on the login nodes
    • Log into the interactive nodes at DKRZ
    • Load the required software: module load cmip6-dicad/1.0
    • Start with: freva --databrowser --help
  • Use the DKRZ ESGF portal
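For the command-line route, browsing usually means stepping down the directory tree a few levels at a time. The following is a minimal sketch, assuming only that the pool is mounted as a regular directory tree. The function `list_levels` and its depth limit are hypothetical choices, and no particular directory layout below the pool root is implied.

```python
import os


def list_levels(root, max_depth=2):
    """Yield (depth, path) for every directory below root, down to max_depth levels."""
    root = os.path.abspath(root)
    # os.walk() lists a root that is itself a symbolic link, but does not
    # descend into symbolic links found further down the tree.
    for dirpath, dirnames, _ in os.walk(root):
        if dirpath == root:
            depth = 0
        else:
            depth = os.path.relpath(dirpath, root).count(os.sep) + 1
        dirnames.sort()  # visit subdirectories in a stable, alphabetical order
        if depth >= max_depth:
            dirnames[:] = []  # prune: do not descend below max_depth
        if depth > 0:
            yield depth, dirpath


if __name__ == "__main__":
    # Prints nothing if the path is not available on this machine.
    for depth, path in list_levels("/pool/data/CMIP6", max_depth=2):
        print("  " * (depth - 1) + os.path.basename(path))
```

Limiting the depth keeps the listing short and avoids walking the full archive, which can be slow on a large shared file system.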

Data processing

  • Batch jobs: Memory- and compute-intensive parallel data analysis is supported through the submission of batch jobs to the DKRZ HPC system. The DKRZ CDP is accessible from all compute nodes.
  • Virtual machines: Users can request virtual machines with direct access to the data pool. Requests need to be sent to [Email protection active, please enable JavaScript.]. Each request is evaluated, and resources are granted based on the outcome. Users can then log in directly to these virtual machines to install and run their analysis codes.

 For an overview of the pre-installed libraries and tools please refer to

Data access from remote resources

Data from the data pool can be replicated to remote, e.g. institutional, resources using different methods:

User support

  • [Email protection active, please enable JavaScript.]
