-
Supported operating systems: currently, Linux and macOS only.
-
Clone GitHub repository:
git clone https://github.com/amkozlov/cellphy
- Install dependencies
CellPhy is using bundled RAxML-NG to perform tree inference, hence this core functionality is available without installing any dependencies. However, some additional features (tree visualization, format conversion etc.) rely on external software. So in order to have a fully functional pipeline, it is highly recommended to install following packages:
- bcftools
- R environment
- R packages: ggtree and its dependencies (treeio, ape...)
Your installation options depend on the system:
a) For Ubuntu and RedHat/CentOS Linux with root access, we provide a script for automatic dependency installation:
cd cellphy
sudo ./install.sh
b) Under macOS, you can use homebrew:
brew install r
brew install bcftools
c) In a cluster environment, chances are the required packages are already installed and can be loaded with the module command, e.g.:
module load bcftools R
- Check that everything works:
./cellphy.sh
Standard analysis with default parameters:
./cellphy.sh input.vcf
General syntax:
./cellphy.sh [COMMAND] [options] input.vcf
COMMAND:
-
FULLFull analysis: Thorough tree search + bootstrapping + mutation mapping (default) -
SEARCHThorough tree search (20 starting trees) + mutation mapping -
FASTFast tree search (single starting tree) + mutation mapping
Options:
-
-aUse approximate 10-state model (~2x faster) -
-g FILETab-delimited list of SNVs for mapping, with respective gene names (example) -
-m MODELEvolutionary model definition in RAxML-NG format
NOTE: partitioned models are not supported at the moment! -
-o OUTGROutgroup taxon list (comma-separated), e.g.-o healthyor-o H1,H2 -
-p PREFIXPrefix for output files (default: input filename w/o extension) -
-rREDO mode: overwrite all result files -
-t THREADSNumber of threads to use (default: autodetect) -
-ySkip mutation mapping -
-zSkip tree plotting (no PDF/SVG output)
For some advanced usage examples, please see tutorial.
When using CellPhy, please cite this paper:
Alexey Kozlov, João M Alves, Alexandros Stamatakis, and David Posada (2022) CellPhy: accurate and fast probabilistic inference of single-cell phylogenies from scDNA-seq data Genome Biol 23, 37 doi: 10.1186/s13059-021-02583-w
When using tree visualization features (e.g., mutation map), please additionally cite ggtree:
G Yu, DK Smith, H Zhu, Y Guan, TTY Lam (2017) ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution. 2017, 8(1):28-36. doi: 10.1111/2041-210X.12628