Oxbow makes genomic data ready for high-performance analytics.
Oxbow is a genomic data I/O library that models and translates next-generation sequencing (NGS) file formats into Apache Arrow representations. One direct application of this unification layer is the ability to access conventional NGS files as in-memory or larger-than-memory data frames in Python, R, and more. The project is organized as a multi-package monorepo with three main components:
- rs-oxbow (oxbow/) - A reusable, pure Rust library providing core parsing and streaming functionality
- py-oxbow (py-oxbow/) - Python bindings built with PyO3/maturin
- r-oxbow (r-oxbow/) - R bindings built with rextendr (minimal, under development)
Data I/O is handled entirely in Rust, with rich high-level features exposed via Python and R bindings.
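To make the idea of translating a row-oriented genomic file into a columnar representation concrete, here is a minimal sketch in plain Python. It is not Oxbow's API and does not use Arrow itself: the `bed_to_columns` helper, the `BED_TEXT` sample, and the four-column layout are all assumptions made for illustration. It only shows the row-to-column reshaping that a columnar format such as Arrow relies on.

```python
from io import StringIO

# Hypothetical three-record BED snippet (tab-separated: chrom, start, end, name).
BED_TEXT = "chr1\t100\t200\tfeatA\nchr1\t150\t300\tfeatB\nchr2\t0\t50\tfeatC\n"


def bed_to_columns(handle):
    """Translate row-oriented BED records into a column-oriented dict.

    Illustrative helper only; it is not part of Oxbow.
    """
    columns = {"chrom": [], "start": [], "end": [], "name": []}
    for line in handle:
        line = line.rstrip("\n")
        # Skip blank lines, comments, and track/browser header lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        chrom, start, end, name = line.split("\t")[:4]
        columns["chrom"].append(chrom)
        columns["start"].append(int(start))
        columns["end"].append(int(end))
        columns["name"].append(name)
    return columns


columns = bed_to_columns(StringIO(BED_TEXT))

# A column-wise computation touches only the fields it needs.
total_span = sum(end - start for start, end in zip(columns["start"], columns["end"]))
print(total_span)
```

Once records are laid out as one typed sequence per field, handing them to a data frame library is largely a matter of wrapping those columns, which is the role the Arrow representation plays in the description above.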
Read the latest documentation.
Learn more from our blog post.
Want to contribute? See the contributing guide.