A bioinformatics extension of 🤗 Datasets library, built for ML applications on biological and omics data, offering easy integration of metadata and low-code data management tools.
-
Updated
Nov 16, 2024 - Python
A bioinformatics extension of 🤗 Datasets library, built for ML applications on biological and omics data, offering easy integration of metadata and low-code data management tools.
Imitating Keras to Deepen understanding.
ncRNA identification, annotation and functional prediction
Recode a DNA sequence to match a target amino‑acid sequence with the minimal number of nucleotide changes, preserving the reading frame. Includes a CLI and Python API.
A simple Python script for preprocessing FASTA and FASTQ files, performing reverse complement, trimming, and adaptor removal operations with comprehensive base composition statistics and quality score handling.
A simple tool to generate hierarchical clustering trees from nucleotide sequences. Supports a number of distance metrics and clustering algorithms. Includes a large testset of SARSCOV2 genomes.
Add a description, image, and links to the bioinfo topic page so that developers can more easily learn about it.
To associate your repository with the bioinfo topic, visit your repo's landing page and select "manage topics."