> For the complete documentation index, see [llms.txt](https://docs.owkin.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.owkin.com/integrate-your-data/connecting-your-data-sources.md).

# Connecting your data sources

Once data has been prepared, a secure connection must be established between the client's data repositories and K. The preferred access model should be agreed upon between the client and Owkin during the pre-sales phase.

Three connectivity options are available:

* **Transfer to Owkin-managed storage** Data is transferred into an Owkin-managed environment where it is stored and served directly to K. This is the simplest option and is well suited for clients who prefer a fully managed approach.
* **Expose customer-managed storage** Data remains in the client's own infrastructure. The client provides the necessary credentials and configuration so that K can access the data remotely. This option preserves the client's existing data residency and governance controls.
* **Connect an existing data platform** K integrates directly with platforms such as Databricks or Snowflake, leveraging the client's existing governance policies and data pipelines. This is the preferred option for organizations that have already invested in a centralized data platform.
* **Upload files in product**: K Pro users can add datasets themselves, best suited for small, simple, public datasets (e.g., a GEO dataset linked to a publication)
  * Upload interface limits: files must be ≤1 GB; accepted formats are CSV/TSV/Parquet (tabular) and PDF/MD/text (text)
  * Not intended for raw FASTQ files or voluminous raw data
  * Also supports web browser-based retrieval to fetch resources from the internet

For proprietary assets like screening results, assay data, target assessments, or compound libraries, the same pattern applies: bring the data in your preferred tabular or platform-backed structure, then align it to the supported model.

Owkin can also provide tooling and services for transformation, and optional AI enrichment can extract additional features before analysis.

***

K Pro's connection can be tailored to the use case. For instance, when connecting to a client's data platform, we can establish a live connection to a specific view of the data, or alternatively connect to a safeguarded version that requires manual updates — the choice depends entirely on the use cases being served. The same flexibility applies to other integration methods, such as connecting to storage (whether Owkin-managed or client-managed).


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.owkin.com/integrate-your-data/connecting-your-data-sources.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
