X-Cell

A diffusion language model for genome-scale perturbation prediction across diverse cellular contexts

License: CC BY-NC-SA 4.0 Model Dataset Paper Docs

--- > **Status: Model weights and inference code coming soon.** > The Python API, model weights, and tutorials are under active development. > Star/watch this repository for release updates.

X-Cell Architecture

X-Cell predicts genome-scale transcriptional responses to genetic perturbations across diverse cellular contexts. Trained on **X-Atlas/Pisces** (25.6M perturbed single cells, 7 CRISPRi Perturb-seq screens), X-Cell integrates multi-modal biological priors through cross-attention and generalizes zero-shot to unseen cell types and perturbations. ## Highlights - **Diffusion LM** with iterative inference-time refinement - **Multi-modal priors** via cross-attention (ESM-2, STRING, GenePT, DepMap, JUMP-Cell Painting, scGPT) - **X-Cell Mini** (55M) — a compact model for single-GPU perturbation prediction - **5× higher Pearson Δ** than the next-best method on held-out perturbations - **Zero-shot generalization** to unseen cell types, confirmed on primary human T cells and melanocyte progenitors ## Installation ```bash pip install xcell ``` ## Quick Start ```python import anndata as ad from xcell import XCell # Load pretrained X-Cell Mini model = XCell.from_pretrained("Xaira-Therapeutics/X-Cell", variant="mini") # Predict from an AnnData object adata = ad.read_h5ad("control_cells.h5ad") predictions = model.predict(adata, perturbation="BRCA1") # Or from one or more .h5ad paths predictions = model.predict( ["screen1.h5ad", "screen2.h5ad"], perturbation="BRCA1", ) ``` See the [documentation](https://xaira-therapeutics.github.io/X-Cell) for full examples. ## License This project is licensed under the [Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License](https://creativecommons.org/licenses/by-nc-sa/4.0/). ## Model | Model | Parameters | HuggingFace | |-------|-----------|-------------| | X-Cell Mini | 55M | [Xaira-Therapeutics/X-Cell](https://huggingface.co/Xaira-Therapeutics/X-Cell) | ## Data **X-Atlas/Pisces** is available at [Xaira-Therapeutics/X-Atlas-Pisces](https://huggingface.co/datasets/Xaira-Therapeutics/X-Atlas-Pisces). ## Citation ```bibtex @article{xcell2026, title = {X-Cell: Scaling Causal Perturbation Prediction Across Diverse Cellular Contexts via Diffusion Language Models}, year = {2026}, } ``` See [LICENSE](LICENSE) for details.