BiocPy

Facilitate Bioconductor Workflows in Python

BiocPy is an effort to bring Bioconductor's core data structures and representations to Python. Foundational structures such as BiocFrame and GenomicRanges act as the building blocks for larger and more complex representations. For example, container classes like SummarizedExperiment, SingleCellExperiment, and MultiAssayExperiment represent single- or multi-omic experimental data and metadata.

Note

Selected packages

For a complete list of all packages, visit the GitHub:BiocPy repository.

Core Representations

  • BiocFrame: Bioconductor-like data frames in Python. (GitHub, Docs)
  • IRanges: Python implementation of the IRanges package to support interval arithmetic. (GitHub, Docs)
  • GenomicRanges: Container class to represent genomic locations and support genomic analysis. (GitHub, Docs, BioC)
  • SummarizedExperiment: Container class to represent genomic experiments, following Bioconductor's SummarizedExperiment. (GitHub, Docs)
  • SingleCellExperiment: Container class to represent single-cell experiments, following Bioconductor's SingleCellExperiment. (GitHub, Docs)
  • MultiAssayExperiment: Container class to represent multiple experiments and assays performed over a set of samples, following Bioconductor's MultiAssayExperiment package. (GitHub, Docs)
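
To make "interval arithmetic" concrete, here is a minimal plain-Python sketch of two typical interval operations: testing whether two ranges overlap, and merging overlapping ranges. It is not the IRanges or GenomicRanges API. The names `overlaps` and `merge_intervals` and the half-open `[start, end)` convention are assumptions made for this illustration only.

```python
from typing import List, Tuple

# Half-open interval [start, end); a convention chosen for this sketch,
# not necessarily the one used by the IRanges package.
Interval = Tuple[int, int]


def overlaps(a: Interval, b: Interval) -> bool:
    """Return True if the two half-open intervals share at least one position."""
    return a[0] < b[1] and b[0] < a[1]


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping or directly adjacent intervals into a sorted, minimal list."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Current interval touches or overlaps the previous one: extend it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged
```

For real analyses, refer to the package documentation for the operations it actually provides.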

Analysis Packages

  • scranpy: Python bindings to single-cell analysis methods from libscran and related C++ libraries. (GitHub, Docs)
  • singler: Python bindings to the SingleR algorithm to annotate cell types from known references. (GitHub, Docs)

Interoperability with R

  • rds2py: Read RDS files directly in Python, supporting Bioconductor's SummarizedExperiment and SingleCellExperiment, in addition to matrices, data frames, and vectors. (GitHub, Docs)

Utility Packages

  • BiocUtils: Common utilities for use across packages, mostly to mimic convenient aspects of base R. (GitHub, Docs)
  • mopsy: Helper functions to perform row or column operations over NumPy and SciPy matrices, providing an interface similar to base R matrix methods and matrixStats methods. (GitHub, Docs)
  • pyBiocFileCache: File system-based cache for resources & metadata. (GitHub, Docs)

Installation

All packages in the BiocPy ecosystem are published to PyPI. Use the biocpy wrapper to install the core packages:

pip install biocpy

Individual packages can also be installed separately. Refer to each package's documentation for more details.
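
After installing, it can be useful to check which packages are present in the current environment. The sketch below uses only the standard library. The distribution names in `CANDIDATES` are assumptions derived from the package names listed above and may not match the exact names on PyPI. `installed_versions` is a hypothetical helper, not part of BiocPy.

```python
from importlib import metadata
from typing import Dict, List, Optional

# Assumed distribution names (lowercased package names from the list above);
# verify against PyPI before relying on them.
CANDIDATES = [
    "biocpy",
    "biocframe",
    "iranges",
    "genomicranges",
    "summarizedexperiment",
    "singlecellexperiment",
    "multiassayexperiment",
]


def installed_versions(names: List[str]) -> Dict[str, Optional[str]]:
    """Map each distribution name to its installed version, or None if it is absent."""
    versions: Dict[str, Optional[str]] = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions(CANDIDATES).items():
        print(f"{name}: {version or 'not installed'}")
```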


Interested in contributing? Check out the developer guide.

Repositories

Showing 10 of 28 repositories
  • IRanges: IRanges in Python. (Python, MIT, updated Jan 5, 2025)
  • SingleCellExperiment: Container class for single-cell experiments. (Python, MIT, updated Jan 2, 2025)
  • SummarizedExperiment: Container class for genomic experiments. (Python, MIT, updated Jan 2, 2025)
  • BiocUtils: Miscellaneous utilities for BiocPy, mostly to mimic base functionality in R. (Python, MIT, updated Jan 2, 2025)
  • scrnaseq: Collection of public single-cell RNA-seq datasets. (Python, MIT, updated Jan 2, 2025)
  • BiocFrame: Bioconductor-like data frames. (Python, MIT, updated Jan 2, 2025)
  • assorthead: Assorted C++ headers, vendored into a Python package. (Python, MIT, updated Dec 31, 2024)
  • cellarr: Store collections of experimental data based on TileDB. (Python, MIT, updated Dec 27, 2024)
  • rds2py: Read RDS files, in Python. (Python, MIT, updated Dec 24, 2024)
  • GenomicRanges: Container class to represent genomic locations and support genomic analysis. (Python, MIT, updated Dec 23, 2024)
