CellCommander

Common single-cell analysis tasks, runnable from the command line. This tool is still under active development, so the dev branch is the best place to find the latest updates and working code.

Why?

Single-cell analysis is often characterized by “kitchen sink” modeling (i.e., throwing everything at your data and seeing what sticks). Many of the most popular data analysis steps and methods are captured in these online books:

Though frameworks like scverse, Bioconductor, Seurat and Scanpy have made many of these methods easily accessible, the methods are still difficult to assemble into pipelines that can be run automatically across many samples.

My goal was to build a tool that automates testing out pipelines end-to-end on a variety of data types.

Below are the modules currently available in CellCommander, which can be thought of as the building blocks of a single-cell analysis pipeline. All modules have the following in common:

An input argument, usually a path to a cell x feature matrix or fragment file
An output directory argument, specifying where the module's output will be saved
A log file that records the history of the commands that were run
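To make the shared module shape concrete, here is a minimal sketch of a command line entry point with an input path, an output directory, and a log file. The names `--input`, `--outdir`, and `example-module` are hypothetical placeholders, not CellCommander's actual arguments; run `cellcommander <module> --help` for the real ones.

```python
import argparse
import logging
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical argument names, chosen only for illustration.
    parser = argparse.ArgumentParser(prog="example-module")
    parser.add_argument("--input", required=True,
                        help="Path to a cell x feature matrix or fragment file")
    parser.add_argument("--outdir", required=True,
                        help="Directory where the module's output is saved")
    return parser


def run_module(argv: list[str]) -> Path:
    """Parse arguments, create the output directory, and log the command."""
    args = build_parser().parse_args(argv)
    outdir = Path(args.outdir)
    outdir.mkdir(parents=True, exist_ok=True)

    log_path = outdir / "example-module.log"
    logger = logging.getLogger("example-module")
    logger.setLevel(logging.INFO)
    handler = logging.FileHandler(log_path)
    logger.addHandler(handler)
    try:
        # Record the command so the run can be reproduced later.
        logger.info("command: example-module %s", " ".join(argv))
        logger.info("input: %s", args.input)
        # ... the module's actual work would go here ...
    finally:
        logger.removeHandler(handler)
        handler.close()
    return log_path
```

Calling `run_module(["--input", "matrix.h5ad", "--outdir", "results"])` would create `results/` and write the command history to `results/example-module.log`.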

`qc`

⭐ This is a command line module for performing initial QC and filtering of your cell x feature matrix. Use `cellcommander qc --help` to see all of the available options.

`--modes ["rna", "atac"]`

Determines what metrics should be calculated for the data.

So far, I’m liking SnapATAC2’s and ArchR’s way of calculating QC metrics; Muon may not be the best option here. One possible addition is an `--atac_qc_tool` option, where either of these two could in principle be used to load the files for initial QC.

`--filtering_strategy ["mad", "threshold"]`

Determines how cell outliers are calculated

So far, MAD-based filtering seems to keep a lot of low-quality cells.
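The difference between the two strategies can be sketched on a toy vector of per-cell total counts. This is an illustration under stated assumptions, not CellCommander's implementation: the function names, the cutoff of 5 MADs, and the fixed lower bound of 500 are all hypothetical choices.

```python
from statistics import median


def mad_outliers(values, n_mads=5.0):
    """Flag values more than n_mads median absolute deviations from the median."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    return [abs(v - med) > n_mads * mad for v in values]


def threshold_outliers(values, low=None, high=None):
    """Flag values outside fixed, user-chosen bounds."""
    return [
        (low is not None and v < low) or (high is not None and v > high)
        for v in values
    ]


# Toy total counts per cell: five typical cells and three low-count cells.
total_counts = [900, 1000, 1100, 950, 1050, 200, 300, 250]

mad_flags = mad_outliers(total_counts)
threshold_flags = threshold_outliers(total_counts, low=500)
```

In this toy case the low-count cells inflate the MAD itself, so `mad_flags` marks nothing as an outlier, while the fixed threshold flags the three low-count cells. That is one way a data-driven cutoff can end up keeping low-quality cells when they make up a large share of the sample.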

`remove-background`

⭐ This is a command line module for removing background signal from your cell x feature matrix. Use `cellcommander remove-background --help` to see all of the available options.

Currently hardcoded to 4 markers and 4 groups
Change initial clustering
Input must be a single-modality (RNA) AnnData object
Adds 2 obs columns (pre- and post-SoupX clustering)
Adds SoupX counts and raw counts to layers
Moves SoupX counts to .X
Plots pre- and post-removal UMAPs and clusterings
Saves the SoupX object as an RDS file
Saves statistics derived from the SoupX run in a pickle file

`detect-doublets`

⭐ This is a command line module for identifying and filtering out doublets from your cell x feature matrix.

`--method ["scrublet", "scDblFinder", "amulet", "cellranger", "consensus"]`

Determines what algorithm is used for doublet detection
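The README does not say how the `consensus` option combines calls from the individual methods. One plausible reading is a vote across methods, sketched below; the function name `consensus_doublets` and the `min_votes` parameter are assumptions made for illustration.

```python
def consensus_doublets(calls_by_method, min_votes=2):
    """Call a cell a doublet when at least min_votes methods flag it.

    calls_by_method maps a method name to a list of booleans, one per cell,
    with cells in the same order for every method.
    """
    methods = list(calls_by_method)
    n_cells = len(calls_by_method[methods[0]])
    if any(len(calls_by_method[m]) != n_cells for m in methods):
        raise ValueError("all methods must provide one call per cell")
    return [
        sum(bool(calls_by_method[m][i]) for m in methods) >= min_votes
        for i in range(n_cells)
    ]


# Toy calls for four cells from three methods.
calls = {
    "scrublet": [True, False, True, False],
    "scDblFinder": [True, False, False, False],
    "amulet": [True, True, False, False],
}
consensus = consensus_doublets(calls, min_votes=2)
```

Raising `min_votes` makes the consensus stricter (fewer cells removed); setting it to 1 takes the union of all methods.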

`normalize`

⭐ This is a command line module for normalizing your cell x feature matrix.

`--method ["log1p", "scran", "pearson", "depth", "sctransform", "tfidf"]`

Determines how the matrix is normalized
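As a rough illustration of two of the listed options, the sketch below applies depth normalization and then a log1p transform to one cell's counts. It assumes that `depth` means scaling each cell to a common total and that `log1p` means depth scaling followed by `log(1 + x)`; the function names and the `target_sum` default of 10,000 are illustrative and may not match what CellCommander does internally.

```python
import math


def depth_normalize(counts, target_sum=10_000):
    """Scale one cell's counts so that they sum to target_sum."""
    total = sum(counts)
    if total == 0:
        # An empty cell has nothing to scale; return zeros unchanged.
        return [0.0 for _ in counts]
    return [c * target_sum / total for c in counts]


def log1p_normalize(counts, target_sum=10_000):
    """Depth-normalize, then apply log(1 + x) to each value."""
    return [math.log1p(v) for v in depth_normalize(counts, target_sum)]


cell = [1, 1, 2]
scaled = depth_normalize(cell, target_sum=4)
logged = log1p_normalize(cell, target_sum=4)
```

Depth scaling removes differences in sequencing depth between cells, and the log transform compresses the range so that highly expressed features dominate downstream steps less.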

`select-features`

⭐ This is a command line module for selecting columns of interest from an input cell x feature matrix.
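The README does not state which selection criteria the module supports. A common choice in single-cell workflows is to keep the most variable features, sketched here in plain Python; the function name and the variance criterion are assumptions for illustration only.

```python
from statistics import pvariance


def top_variable_features(matrix, feature_names, n_top=2):
    """Keep the n_top columns with the highest variance across cells.

    matrix is a list of rows (cells); each row has one value per feature.
    """
    columns = list(zip(*matrix))
    variances = [pvariance(col) for col in columns]
    ranked = sorted(range(len(feature_names)),
                    key=lambda j: variances[j], reverse=True)
    keep = sorted(ranked[:n_top])  # preserve the original column order
    kept_names = [feature_names[j] for j in keep]
    kept_matrix = [[row[j] for j in keep] for row in matrix]
    return kept_names, kept_matrix


matrix = [
    [1, 0, 5],
    [1, 10, 5],
    [1, 20, 6],
]
names, subset = top_variable_features(matrix, ["A", "B", "C"], n_top=2)
```

Here feature `A` is constant across cells, so it is dropped and `B` and `C` are kept.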

`reduce-dimensions`

⭐ This is a command line module for creating a dimensionality reduced version of an input cell x feature matrix.

Will always compute a UMAP and some initial clustering on this dimensionality reduction

`--method ["scanpy_default", "seurat_default", "seurat_sctransform", "signac_default", "lsi"]`

Determines what dimensionality reduction method to run

`annotate`

⭐ This is a command line module for annotating the cells with cell identity and state either manually or using prior knowledge.

`integrate`

⭐ This is a command line module for horizontally integrating multiple input cell x feature matrices.

`joint-integrate`

⭐ This is a command line module for vertically integrating multiple input cell x feature matrices.
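The terms horizontal and vertical refer to what the input matrices share. In horizontal integration (`integrate`) the inputs share features but contain different cells, for example batches of the same assay. In vertical integration (`joint-integrate`) the inputs share cells but measure different features, for example RNA and ATAC from the same cells. The sketch below shows only how the data line up in each case, not an integration algorithm; the dictionary layout and function names are hypothetical.

```python
def combine_horizontal(a, b):
    """Stack cells from two matrices, keeping only the features both share."""
    b_features = set(b["features"])
    shared = [f for f in a["features"] if f in b_features]

    def subset(m):
        idx = [m["features"].index(f) for f in shared]
        return [[row[i] for i in idx] for row in m["X"]]

    return {
        "cells": a["cells"] + b["cells"],
        "features": shared,
        "X": subset(a) + subset(b),
    }


def combine_vertical(rna, atac):
    """Join features from two modalities, keeping only the cells both share."""
    atac_cells = set(atac["cells"])
    shared = [c for c in rna["cells"] if c in atac_cells]
    rows = [
        rna["X"][rna["cells"].index(c)] + atac["X"][atac["cells"].index(c)]
        for c in shared
    ]
    return {
        "cells": shared,
        "features": rna["features"] + atac["features"],
        "X": rows,
    }


batch1 = {"cells": ["c1", "c2"], "features": ["g1", "g2"], "X": [[1, 2], [3, 4]]}
batch2 = {"cells": ["c3"], "features": ["g2", "g3"], "X": [[5, 6]]}
horizontal = combine_horizontal(batch1, batch2)

rna = {"cells": ["c1", "c2"], "features": ["g1"], "X": [[7], [8]]}
atac = {"cells": ["c2", "c1"], "features": ["p1"], "X": [[0], [1]]}
vertical = combine_vertical(rna, atac)
```

In the horizontal case the result grows in cells; in the vertical case it grows in features, with rows matched by cell name rather than by position.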

`recipes`

⭐ This is a command line module for running pipelines of other tools.

`snapatac2`

[ ]

A note on interoperability

https://scverse-tutorials.readthedocs.io/en/latest/notebooks/scverse_data_interoperability.html
https://scverse-tutorials.readthedocs.io/en/latest/index.html
10/29/2023 Brainstorm - Ideas

TODO

When a pipeline step needs tweaking, come back to this and improve the documentation
Add an environment spec or Singularity containers for running

adamklie/CellCommander
