scvi-tools：基于PyTorch的单细胞基因组学概率模型Python框架

scvi-tools by davila7/claude-code-templates

189 周安装量

24,300 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/davila7/claude-code-templates --skill scvi-tools

AI/机器学习 Python Web框架生物信息学

🇨🇳中文介绍

scvi-tools

概述

scvi-tools 是一个用于单细胞基因组学概率模型的综合性 Python 框架。它基于 PyTorch 和 PyTorch Lightning 构建，利用变分推断提供深度生成模型，用于分析多种单细胞数据模态。

何时使用此技能

在以下情况下使用此技能：

分析单细胞 RNA-seq 数据（降维、批次校正、整合）
处理单细胞 ATAC-seq 或染色质可及性数据
整合多模态数据（CITE-seq、多组学、配对/非配对数据集）
分析空间转录组学数据（反卷积、空间映射）
对单细胞数据进行差异表达分析
执行细胞类型注释或迁移学习任务
处理专门的单细胞模态（甲基化、细胞计数、RNA 速率）
为单细胞分析构建自定义概率模型

核心能力

scvi-tools 提供按数据模态组织的模型：

1. 单细胞 RNA-seq 分析

用于表达分析、批次校正和整合的核心模型。参见 references/models-scrna-seq.md 了解：

scVI : 无监督降维和批次校正
scANVI : 半监督细胞类型注释和整合
AUTOZI : 零膨胀检测和建模
VeloVI : RNA 速率分析
contrastiveVI : 扰动效应分离

2. 染色质可及性（ATAC-seq）

用于分析单细胞染色质数据的模型。参见 references/models-atac-seq.md 了解：

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

🇺🇸English

scvi-tools

Overview

scvi-tools is a comprehensive Python framework for probabilistic models in single-cell genomics. Built on PyTorch and PyTorch Lightning, it provides deep generative models using variational inference for analyzing diverse single-cell data modalities.

When to Use This Skill

Use this skill when:

Analyzing single-cell RNA-seq data (dimensionality reduction, batch correction, integration)
Working with single-cell ATAC-seq or chromatin accessibility data
Integrating multimodal data (CITE-seq, multiome, paired/unpaired datasets)
Analyzing spatial transcriptomics data (deconvolution, spatial mapping)
Performing differential expression analysis on single-cell data
Conducting cell type annotation or transfer learning tasks
Working with specialized single-cell modalities (methylation, cytometry, RNA velocity)
Building custom probabilistic models for single-cell analysis

Core Capabilities

scvi-tools provides models organized by data modality:

1. Single-Cell RNA-seq Analysis

Core models for expression analysis, batch correction, and integration. See references/models-scrna-seq.md for:

scVI : Unsupervised dimensionality reduction and batch correction
scANVI : Semi-supervised cell type annotation and integration
AUTOZI : Zero-inflation detection and modeling
VeloVI : RNA velocity analysis
contrastiveVI : Perturbation effect isolation

2. Chromatin Accessibility (ATAC-seq)

Models for analyzing single-cell chromatin data. See references/models-atac-seq.md for:

PeakVI : Peak-based ATAC-seq analysis and integration
PoissonVI : Quantitative fragment count modeling
scBasset : Deep learning approach with motif analysis

3. Multimodal & Multi-omics Integration

Joint analysis of multiple data types. See references/models-multimodal.md for:

totalVI : CITE-seq protein and RNA joint modeling
MultiVI : Paired and unpaired multi-omic integration
MrVI : Multi-resolution cross-sample analysis

4. Spatial Transcriptomics

Spatially-resolved transcriptomics analysis. See references/models-spatial.md for:

DestVI : Multi-resolution spatial deconvolution
Stereoscope : Cell type deconvolution
Tangram : Spatial mapping and integration
scVIVA : Cell-environment relationship analysis

5. Specialized Modalities

Additional specialized analysis tools. See references/models-specialized.md for:

MethylVI/MethylANVI : Single-cell methylation analysis
CytoVI : Flow/mass cytometry batch correction
Solo : Doublet detection
CellAssign : Marker-based cell type annotation

Typical Workflow

All scvi-tools models follow a consistent API pattern:

# 1. Load and preprocess data (AnnData format)
import scvi
import scanpy as sc

adata = scvi.data.heart_cell_atlas_subsampled()
sc.pp.filter_genes(adata, min_counts=3)
sc.pp.highly_variable_genes(adata, n_top_genes=1200)

# 2. Register data with model (specify layers, covariates)
scvi.model.SCVI.setup_anndata(
    adata,
    layer="counts",  # Use raw counts, not log-normalized
    batch_key="batch",
    categorical_covariate_keys=["donor"],
    continuous_covariate_keys=["percent_mito"]
)

# 3. Create and train model
model = scvi.model.SCVI(adata)
model.train()

# 4. Extract latent representations and normalized values
latent = model.get_latent_representation()
normalized = model.get_normalized_expression(library_size=1e4)

# 5. Store in AnnData for downstream analysis
adata.obsm["X_scVI"] = latent
adata.layers["scvi_normalized"] = normalized

# 6. Downstream analysis with scanpy
sc.pp.neighbors(adata, use_rep="X_scVI")
sc.tl.umap(adata)
sc.tl.leiden(adata)

Key Design Principles:

Raw counts required : Models expect unnormalized count data for optimal performance
Unified API : Consistent interface across all models (setup → train → extract)
AnnData-centric : Seamless integration with the scanpy ecosystem
GPU acceleration : Automatic utilization of available GPUs
Batch correction : Handle technical variation through covariate registration

Common Analysis Tasks

Differential Expression

Probabilistic DE analysis using the learned generative models:

de_results = model.differential_expression(
    groupby="cell_type",
    group1="TypeA",
    group2="TypeB",
    mode="change",  # Use composite hypothesis testing
    delta=0.25      # Minimum effect size threshold
)

See references/differential-expression.md for detailed methodology and interpretation.

Model Persistence

Save and load trained models:

# Save model
model.save("./model_directory", overwrite=True)

# Load model
model = scvi.model.SCVI.load("./model_directory", adata=adata)

Batch Correction and Integration

Integrate datasets across batches or studies:

# Register batch information
scvi.model.SCVI.setup_anndata(adata, batch_key="study")

# Model automatically learns batch-corrected representations
model = scvi.model.SCVI(adata)
model.train()
latent = model.get_latent_representation()  # Batch-corrected

Theoretical Foundations

scvi-tools is built on:

Variational inference : Approximate posterior distributions for scalable Bayesian inference
Deep generative models : VAE architectures that learn complex data distributions
Amortized inference : Shared neural networks for efficient learning across cells
Probabilistic modeling : Principled uncertainty quantification and statistical testing

See references/theoretical-foundations.md for detailed background on the mathematical framework.

Additional Resources

Workflows : references/workflows.md contains common workflows, best practices, hyperparameter tuning, and GPU optimization
Model References : Detailed documentation for each model category in the references/ directory
Official Documentation : https://docs.scvi-tools.org/en/stable/
Tutorials : https://docs.scvi-tools.org/en/stable/tutorials/index.html
API Reference : https://docs.scvi-tools.org/en/stable/api/index.html

Installation

uv pip install scvi-tools
# For GPU support
uv pip install scvi-tools[cuda]

Best Practices

Use raw counts : Always provide unnormalized count data to models
Filter genes : Remove low-count genes before analysis (e.g., min_counts=3)
Register covariates : Include known technical factors (batch, donor, etc.) in setup_anndata
Feature selection : Use highly variable genes for improved performance
Model saving : Always save trained models to avoid retraining
GPU usage : Enable GPU acceleration for large datasets (accelerator="gpu")
Scanpy integration : Store outputs in AnnData objects for downstream analysis

Weekly Installs

145

Repository

davila7/claude-…emplates

GitHub Stars

23.4K

First Seen

Jan 21, 2026

Security Audits

Gen Agent Trust HubWarn SocketPass SnykPass

Installed on

claude-code129

opencode120

gemini-cli115

cursor115

antigravity107

codex104

scvi-tools：基于PyTorch的单细胞基因组学概率模型Python框架

🇨🇳中文介绍

scvi-tools

概述

何时使用此技能

核心能力

1. 单细胞 RNA-seq 分析

2. 染色质可及性（ATAC-seq）

相关 Skills

3. 多模态与多组学整合

4. 空间转录组学

5. 专门模态

典型工作流程

常见分析任务

差异表达

模型持久化

批次校正与整合

理论基础

其他资源

安装

最佳实践

🇺🇸English

scvi-tools

Overview

When to Use This Skill

Core Capabilities

1. Single-Cell RNA-seq Analysis

2. Chromatin Accessibility (ATAC-seq)

3. Multimodal & Multi-omics Integration

4. Spatial Transcriptomics

5. Specialized Modalities

Typical Workflow

Common Analysis Tasks

Differential Expression

Model Persistence

Batch Correction and Integration

Theoretical Foundations

Additional Resources

Installation

Best Practices

最新 Skills