> For the complete documentation index, see [llms.txt](https://docs.owkin.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.owkin.com/help-center/getting-started/what-data-is-available-in-k-pro.md).

# What data is available in K-Pro?

{% hint style="info" %}
**What this article covers** This article describes the datasets available in K-Pro, clarifies how biological data is labelled, and explains what to do when you can't find expected data.
{% endhint %}

### Standard datasets included in K-Pro

* **TCGA** — The Cancer Genome Atlas: pan-cancer genomic and clinical data
* **MOSAIC Window** — Owkin's proprietary multi-omics dataset. Covers **11 cancer indications, 6 data modalities, and 2,716 patients** (clinical, genomics, transcriptomics, spatial transcriptomics, proteomics, histology). *Numbers reflect the May 2026 dataset snapshot — see the live Dataset Catalog for current totals.* Full dataset catalog available at docs.owkin.com
* **Scientific literature search via Consensus** — AI-powered scientific search engine (consensus.app) integrated into K-Pro. Enables literature-backed hypothesis validation, paper discovery on a gene/disease, and scientific background research.

### K-Pro free vs. add-on data scope

K-Pro free includes the public MOSAIC and TCGA scope. Some datasets, indications, and modalities (e.g., additional proteomics for Epkin, client-specific cohorts for Servier) are only enabled as add-ons for specific organizations. Your Owkin contact will have shared the exact list of datasets included in your instance during onboarding.

### Why can't I find certain data?

{% hint style="info" %}
**"I can't find blood or adjacent non-tumor tissue data"**

The MOSAIC dataset focuses on tumoral biopsies. It does not include blood transcriptomics data. Tissue types that might seem equivalent to "adjacent non-tumor" are labelled by cell type (e.g., epithelial, stromal, immune cells) rather than by tissue origin label.

**Example prompt:** "Plot the expression of EGFR in malignant and epithelial cells in lung patients from MOSAIC"
{% endhint %}

{% hint style="info" %}
**"I can't find data for my cancer indication (e.g., lung cancer, PAAD)"**

MOSAIC Window covers 11 cancer indications in total. However, in some trial configurations, only a subset of indications is enabled for your instance. To see the full, up-to-date list of indications available in K-Pro, visit **docs.owkin.com → Dataset Catalog**. If you believe your trial should include an indication that you cannot query, contact your Owkin project manager.
{% endhint %}


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.owkin.com/help-center/getting-started/what-data-is-available-in-k-pro.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
