Unique infrastructures

DataHub

DataHub is a research and experimentation infrastructure for storing and processing large volumes of data quickly and securely.

This infrastructure is composed of three types of components:

  • Control Components: monitor the status of the infrastructure and configure it to meet the requirements at any given time.

  • Storage Components: store information in a shared data space, with efficiency and security levels matched to the volume and type of data.

  • Computing Components: process the information held in the storage components using a combination of CPUs and GPUs.

This architecture enables efficient execution of tasks that demand high computational power and storage capacity, such as artificial intelligence and large-scale data processing. The purpose of DataHub is to create a universal shared Data Space accessible to any entity, regardless of its characteristics.

Examples of projects using the DataHub:

MODERATE (Marketable Open Data Solutions for Optimized Building-Related Energy Services): the GPU node is being used to develop ML models, both by CTIC and by the European project partners, with the University of Vienna as the main contributor.

GAIA-X: the DataHub enables the development and deployment of solutions for creating Data Spaces aligned with the standards of the GAIA-X initiative. It also serves as the technological foundation for deploying Data Spaces in the agri-food sector, which began development in 2025.

AI4ES (Excellence Network in Data-Driven Enabling Technologies): the DataHub was used for distributed training of AI models and for developing STT (speech-to-text) models.

CEL.IA (Cervera Consortium for Leadership in Applied AI R&D&I): the GPU node hosts and continues to run NLP (natural language processing) models for speech-to-text.

AI.MEE: the DataHub also provides computational support for CTIC's "Generative AI Laboratory," enabling the deployment and operation of Large Language Models (LLMs), including the internally developed AI.MEE application, which uses generative AI to leverage private knowledge bases.

CTIC-DATAHUB has been funded by the Asturias Program.