Biofilter-LOKI 3.0.0 is a lightweight, command-line–driven knowledge base builder designed to support BioBin and other legacy Biofilter workflows.
This version preserves the traditional LOKI architecture, while modernizing the codebase and deployment for current environments.
Biofilter-LOKI 3.0.0 was built to:
- Maintain full compatibility with BioBin
- Preserve the classic LOKI data model
- Provide a simple CLI-based workflow
- Support HPC module deployments
- Enable rapid database builds for analysis pipelines
LOKI (Library Of Knowledge Integration) is the knowledge ingestion engine behind Biofilter.
It builds a SQLite knowledge database by integrating multiple biological data sources, such as:
- SNP ↔ Gene
- Gene ↔ Pathway
- Gene ↔ Ontology
- Identifier mappings across databases
┌──────────────┐
│ Biofilter │ ← primary CLI
└──────────────┘
│
▼
┌─────────────────────┐
│ SQLite Knowledge DB │
│ (LOKI schema) │
└─────────────────────┘
▲
│
┌──────────────┐
│ loki-build │ ← ingestion engine
└──────────────┘
▲
│
External Sources
Key characteristics:
- SQLite backend
- Immutable batch loads
- No entity-level curation
- Optimized for downstream queries
Depending on build options, Biofilter-LOKI can ingest:
- dbSNP
- Entrez Gene
- Gene Ontology (GO)
- Pathways (KEGG / Reactome, if enabled)
- Chain files (genome build liftover)
- Identifier mappings
The available sources depend on how the package was built and deployed.
pip install biofilter-lokiBasic example:
loki-build \
--knowledge loki.db \
--load dbsnp entrez goUpdate existing database:
loki-build \
--knowledge loki.db \
--updateBuild from an archive:
loki-build \
--from-archive loki_sources.tar.gz \
--knowledge loki.db| Option | Description |
|---|---|
--knowledge |
Output SQLite database |
--load |
Load specific sources |
--update |
Update existing DB |
--from-archive |
Load from source archive |
--to-archive |
Save source archive |
--no-optimize |
Skip DB optimization |
--verbose |
Verbose logging |
Run loki-build --help for full details.
Once a LOKI knowledge database has been built, the biofilter command
can be used to query and inspect its contents.
Check version information:
biofilter --versionDisplay general help and available subcommands:
biofilter --helpList available data sources loaded into the database:
biofilter \
--knowledge loki.db \
--snp-file snps.txt \
--source kegg \
--annotate position_label snp position gene upstream downstream \
--report-invalid-input \
--report-configuration \
--overwrite \
--prefix outcomes_prefix \
--ucsc-build-version 19 \Where snps.txt contains one rsID per line.
Note: Available subcommands depend on which data sourcesmwere loaded during database construction.
- 📘 User Manual (PDF)
Biofilter Manual — Version 3.0.0
Developed and maintained by the Ritchie Lab University of Pennsylvania
Distributed under the original Biofilter license.
See LICENSE file for details.