Here, we introduce CODE (Chain of Diffusion Embedding), a latent-space trajectory–based metric, and CONFIDE, an integrated score that jointly quantify topological and energetic frustrations underlying unreliable AlphaFold3 predictions. You can use these tools to detect hallucination in structure prediction, as loss functions to enhance de novo binder design, and also for virtual screening.
Inspired by the topological frustration theory in protein folding, we reformulated the diffusion embedding trajectories of the AF3 series structure predictors into CODE, a metric that captures topological frustration overlooked by the conventional confidence metric pLDDT. We propose CONFIDE, a unified framework integrating topological frustration and pLDDT-represented energetic frustration to comprehensively characterize the protein folding energy landscape.
- Python 3.10
- Required Python packages (see Installation section)
-
For basic installation instructions, please refer to the Boltz-1 repository.
-
Clone the repository:
git clone https://github.com/zjgao02/CONFIDE.git
cd CONFIDEFirst, you need to construct YAML files for Boltz1 predict. You can refer to 'examples/prot_no_msa.yaml'.
Next, for saving the CODE trajectory, you can modify the path in
CONFIDE/src/boltz/data/write/writer.py
Line 82 in b436e7c
Then you can run inference using Boltz with:
boltz predict input_path --out_dir output_path --cache ./You can get the confidence score directly from the Boltz output, which represents the energy frustration.
After that you can run the code.py code to perform energy combination analysis in CONFIDE:
python code.pyIf you have any questions, please feel free to contact the authors.
- Zijun Gao (zjgao24@cse.cuhk.edu.hk)
- Chunbin Gu (guchunbin200888@gmail.com)
- Changyu Hsieh (kimhsieh@zju.edu.cn)
