Temporally unified RNA velocity for single cell trajectory inference (UniTVelo) is implementated on Python 3 and TensorFlow 2. The model estimates velocity of each gene and updates cell time based on phase portraits concurrently.
The major features of UniTVelo are,
- Using spliced RNA oriented design to model RNA velocity and transcription rates
- Introducing a unified latent time (
Unified-time mode
) across whole transcriptome to incorporate stably and monotonically changed genes - Retaining gene-spcific time matrics (
Independent mode
) for complex datasets
UniTVelo has proved its robustness in 10 different datasets. Details can be found via our manuscript in bioRxiv which is currently under review (UniTVelo).
UniTVelo is designed based on TensorFlow's automatic differentiation architecture. Please make sure TensorFlow 2 and relative CUDA dependencies are correctly installed.
Use the following scripts to confirm TensorFlow is using the GPU.
import tensorflow as tf
print ("Num GPUs Available: ", len(tf.config.list_physical_devices('GPU')))
If GPU is not available, UniTVelo will automatically switch to CPU for model fitting or it can be spcified in config.py
(see Getting Started
below).
(Optional) Create a separate conda environment for version control and to avoid potential conflicts.
conda create -n unitvelo python=3.7
conda activate unitvelo
UniTVelo package can be conveniently installed via PyPI or directly from GitHub repository.
pip install unitvelo
or
pip install git+https://github.com/StatBiomed/UniTVelo
https://drive.google.com/drive/folders/1A-Gcu0zhjVv4N8UZHttM_RULSztUzaUU?usp=sharing
Examples of UniTVelo and steps for reproducible results are provided in Jupyter Notebook under notebooks
folder. Specifically, please refer to records analyzing Mouse Erythroid and Human Bone Marrow datasets.
UniTVelo has proved its performance through 10 different datasets and 4 of them have been incorporated within scVelo package, see datasets. Others can be obtained via link.
UniTVelo provides an integrated function for velocity analysis by default whilst specific configurations might need to be adjusted accordingly.
- Import package
import unitvelo as utv
- Sub-class and override base configuration file (here lists a few frequently used), please refer
config.py
for detailed arguments.
velo = utv.config.Configuration()
velo.R2_ADJUST = True
velo.IROOT = None
velo.FIT_OPTION = '1'
velo.GPU = 0
- Arguments:
- --
velo.R2_ADJUST
(bool), linear regression R-squared on extreme quantile (default) or full data (adjusted) - --
velo.IROOT
(str), specify root cell cluster would enable diffusion map based time initialization, default None - --
velo.FIT_OPTION
(str), '1' Unified-time mode (default), '2' Independent mode - --
velo.GPU
(int), specify the GPU card used for fitting, -1 will switch to CPU mode, default 0.
- --
- Run model (label refers to column name in adata.obs specifying celltypes)
adata = utv.run_model(path_to_adata, label, config_file=velo)
scv.pl.velocity_embedding_stream(adata, color=label, dpi=100, title='')
- Evaluation metrics (Optional)
# Cross Boundary Direction Correctness
# Ground truth should be given via `cluster_edges`
metrics = {}
metrics = utv.evaluate(adata, cluster_edges, label, 'velocity')
# Latent time estimation
scv.pl.scatter(adata, color='latent_time', color_map='gnuplot', size=20)
# Phase portraits for individual genes (experimental)
utv.pl.plot_range(gene_name, adata, velo, show_ax=True, time_metric='latent_time')