Supported modalities

As of March 2026, only the modalities below are supported by K-Pro Data Model:

  • Clinical data

  • Molecular data:

    • Bulk RNA seq

    • Single / nuclei cell RNA seq

    • Spatial transcriptomics (VisiumSD)

    • Whole Exome Sequencing

    • Proteomics

  • Histology - slide annotation linked to a patient

circle-info

Extra modalities / technologies are available upon request. By default K-Pro expects to receive processed data (in the format of count matrixes for example for bulk and single cell data), however Owkin has also developed in-house processing pipeline that can process raw files for both bulk RNAseq, Chromium single-cell and Visium spatial sequencing.

Clinical data

Standard clinical data formats are supported (.csv, .tsv, .xlsx, …).

Molecular data

Here is a table of supported modalities and source formats.

Modalities

Formats

Bulk RNAseq

Count matrix (.txt / .tsv / .csv); AnnData (.h5ad)

Single cell RNAseq

Matrix Market (.mtx + .tsv); HDF5 (.h5); AnnData (.h5ad); Seurat object (.rds)

Spatial transcriptomics

Matrix Market (.mtx + barcodes.tsv + features.tsv); HDF5 (.h5); AnnData (.h5ad); Seurat object (.rds)

WES / WGS

VCF

Proteomics

Normalized intensity matrix (.txt / .tsv / .csv); AnnData (.h5ad)

Imaging data

Here is a table of supported modalities and formats.

Modalities

Formats

Histology (H&E WSI)

.tif, .tiff, .svs, .dcm, .svs, .ndpi, .mrxs

Immunohistochemistry (IHC)

.tif, .tiff, .svs, .dcm, .svs, .ndpi, .mrxs

Note that Owkin developed several imaging processing pipelines that can be applied to imaging data, including:

  • Cell segmentation and cell annotation via the HIPEarrow-up-right model . Resulting cell-type quantifications are then saved (format: .csv) and used by K pro.

  • IHC score extraction with proprietary models (e.g., HER2 score, NMR, etc.). Resulting patient-level scores are then saved and used by K-pro.

Self Supervised Features extraction via Foundation Modelsarrow-up-right: slides are divided into tiles from which latent representations are computed, saved (format: .npy)

Last updated

Was this helpful?