Code and derived parameters for the manuscript: A multi-component power-law penalty corrects distance bias in single-cell co-accessibility and deep-learning chromatin interaction predictions.
This repository provides a framework to correct the systematic distance bias found in proxy data of 3D genome architecture.
The method takes biased proxy data and a reference Hi-C dataset as input. It fits a multi-component power-law model to the Hi-C data to derive a penalty function.
git clone [https://github.com/jlab-code/polymer-penalty.git](https://github.com/jlab-code/polymer-penalty.git)
cd polymer-penaltypip install -r requirements.txtUse our parameters derived for Soybean, Rice, and Maize.
- Place your co-accessibility scores in the
data/directory. - Open the Jupyter Notebook:
scripts/apply_correction.ipynb. - Select your target species:
# USER: Select species and model type
SPECIES = "Soybean"
USE_GLOBAL_CONSENSUS = True - Run all cells.
Use our GMM-pipeline to generate a custom model for any species.
- Place your Hi-C loops in
.bedpeformat in thedata/directory. - Open the Jupyter Notebook:
scripts/get_penalty_function.ipynb. - Update the data path:
hic_path = "../data/your_new_species_HiC.bedpe"- Run all cells.
A multi-component power-law penalty corrects distance bias in single-cell co-accessibility and deep-learning chromatin interaction predictions. (2026).