The Lymphoma Data Hub (LDH)

Technology assistant for your research projects

The LDH (Lymphoma Data Hub) is a platform for centralizing, enriching, and leveraging data through artificial intelligence.

In development since 2020 by the R&D team of the Carnot CALYM Institute, the Lymphoma Data Hub is available to all entities within the consortium. The platform's strength lies in its ability to leverage a substantial volume of existing data. It was initially designed to give researchers and physicians a secure sharing platform with harmonized data processing, so that they can take their analyses further in a fully cloud-based solution.

The platform can be deployed for academic projects as well as for collaborative projects between industrial partners and academic teams. Although initially conceived and developed for projects in the field of lymphoma, the LDH is now open to anyone leading a scientific research project, with no restriction on the research domain.

The LDH allows project stakeholders to share data so that it can be utilized by other project participants without the data passing through third-party environments. The data remains within the platform, and various contributors (data providers, physicians responsible for data enrichment, data scientists, etc.) work directly on the platform.

The LDH is hosted in an HDS-certified cloud (HDS, Hébergeur de Données de Santé, is the French certification for health data hosting). The entire architecture is designed to keep each project's data completely isolated from that of other projects.


Accelerate lymphoma research

Collaborate with CALYM

How can the LDH assist you?

The LDH can support your project in several ways:

By depositing your data in the LDH, you can subsequently benefit from increased computational power to process this data, notably through cloud computing technology.

By depositing your data in the LDH, you can benefit from a secure and decentralized cloud environment to engage external expertise (data scientists, data managers, etc.).

By depositing your data and your colleagues’ data in the LDH, you can use our data lake to link the datasets together easily for collaborative exploration!

You can deposit your data on the LDH and allow your colleagues to enrich it directly in this secure environment.

For example, if you have pathology slide scans that you want a physician colleague to annotate for use by an AI model, the LDH is the solution!


Various types of data are deposited on the LDH, including:

  • NIfTI (.nii)
  • SAS (.sas7bcat)
  • SQL (.sql)
  • Medical image in Aperio format (.svs)
  • Variant Call Format (.vcf)
  • Excel (.xls/.xlsx)
  • Binary Alignment Map (.bam)
  • CSV (.csv)
  • DAT file (.dat)
  • DICOM (.dcm)
  • Word (.docx)
  • FASTQ file (.fastq)
  • INI file (.ini)
  • MIRAX file (.mrxs)
  • NanoZoomer Digital Pathology image file (.ndpi)
  • Zip file (.zip)

And many more...

They have used the LDH

Needs: External expertise in AI as well as enrichment of deposited data.

Objective: Define predictive histophenomic algorithms for the classification of DLBCL tumors and the stratification of DLBCL patients.

Camille Laurent, Christiane Copie
ILIA Project

Needs: External expertise in AI and centralization of data from LYSARC and the University Hospitals of Toulouse and Dijon.

Objective: Predict POD24 (progression or death within 24 months) for patients with follicular lymphoma from heterogeneous imaging data, using deep learning algorithms.

Cédric Rossi, Luc Xerri, Salim Kanoun
PAPET24 Project



Everything is feasible on the LDH; we just have to build it together!

Initially conceived and developed for projects on lymphoid pathologies, the LDH can now be deployed in research projects on other pathologies. Most of the tools currently available on the LDH were built for previous projects, so it is entirely possible to add new features as part of a specific project and thereby extend the platform's range of services.


To best respond to your needs, we will review together:

  • Your field of scientific research,
  • The type of data you want to use (images, tabular data, …),
  • The expected volume of data,
  • The enrichment planned for the data,
  • The ways you intend to exploit your data (statistics, machine learning, deep learning, …).

Based on the information you provide, we will estimate the costs associated with your project. Working closely with you, we take into account the technical aspects, the necessary resources, and any complexities in order to provide the most detailed assessment possible. This estimate includes an in-depth analysis of your requirements, possible developments, the duration of the project, and the costs of human and technological resources.

Our goal is to offer you a transparent and accurate estimate, allowing you to plan your project with confidence. This will cover:

  • The cost of preparing the platform and, if necessary, extending it,
  • Operating costs once the project has started.

Before launching your project, we define the regulatory framework together:

  • Data governance: sharing and validation of policies, procedures and governance structures to ensure the effective, secure and ethical management of data throughout its life cycle.
  • Contractual agreements: establishment of clear and detailed contractual agreements, defining the rights, responsibilities and obligations of all parties involved in the collection, processing and sharing of data.
  • Regulatory compliance: adherence to applicable regulations and frameworks, such as GDPR, EDS or other industry-specific standards, so that the project meets legal and ethical requirements.


Depending on your needs, CALYM can take charge of:

  • The LDH component only,
  • The complete management of the research project.

At the end of the project, and in accordance with the governance rules set out in the EDS framework, CALYM continues to oversee the data until its deletion.

To discuss your project, whether on lymphoid or other pathologies