Data Hubs

 

The GHGA data infrastructure is designed as a federated network: it consists of a central coordination node (GHGA Central) and several GHGA Data Hubs. These are located at six sites throughout Germany.

The GHGA Data Portal serves as a central point of contact for uploading, downloading and analysing genome data. Behind the scenes, data processing is coordinated by GHGA Central and the GHGA Data Hubs. 

Responsibilities and Locations of GHGA Data Hubs

The GHGA Data Hubs are operated by universities and research institutions. They are connected to local genome research centres that make genome data available via GHGA. Our Data Hubs, hosted at academic research institutions across Germany, act as infrastructure providers, offering storage, computational resources and expert staff to support the operation of the GHGA Data Portal.

Currently, the GHGA Data Hubs are located at seven sites:

  • DKFZ Heidelberg
  • University of Tübingen
  • TU Dresden
  • University of Cologne
  • TU Munich
  • Berlin, jointly run by MDC and BIH at Charité

Data Security

Due to the sensitive nature of the archived human genome data, all GHGA Data Hubs are obliged to guarantee the highest level of data security. To this end, the data is encrypted during transmission and storage. The software used follows the latest security  concepts such as the Zero Trust Security principle. 

Additionally, the sites conduct regular security audits and penetration tests and implement the established security standards in an information security management system (ISMS). As consortium leader of GHGA, the DKFZ approves and audits the GHGA Data Hubs with regard to compliance with the defined security standards. 

Contribution to genomDE Model Project

The GHGA Data Hubs also play an important role as part of the federated data platform in the genomDE model project, which is supported by the Federal Institute for Drugs and Medical Devices (BfArM). As part of this framework, some GHGA Data Hubs operate so-called Genome Data Centres (GRZ).