[Upload in progress, please be patient]
Scripts and files associated with the protocols reported in:
Accelerating protein design by scaling experimental characterization
Recent advances in de novo protein design have greatly outpaced standard protein biochemistry workflows, and experimental testing has been a bottleneck in the validation of new designs and methodologies. Here, we describe experimental and computational workflows to address the issues of scale, speed and reproducibility of common in vitro protein testing methods, enabling at least an order of magnitude increase in throughput while reducing wetlab time. Semi-Automated Protein Production (SAPP) is a rapid, modular, scalable and cost-effective protocol, enabling up to milligram-scale protein production, and standardized characterization including yield, dispersity, and oligomeric state of hundreds of designs per day, at the cost-equivalent of a few DNA oligos per construct. End-to-end protocol execution takes 48 hours, with about 6 hours spent benchside using mostly standard laboratory equipment. This protocol has become the standard at our institute, providing critical experimental validation for dozens of projects spanning tens of thousands of designs. Since at least 80% of SAPP's total cost comes from synthetic DNA, we also developed a scalable demultiplexing protocol (DMX) to leverage oligo pools as input DNA, providing a further 5-fold reduction in costs, enabling >1000 designs to be purified and characterized in arrayed, clonal format at a cost of $5 per construct. By reframing standard molecular biology practices and orchestrating wetlab workflows with partial automation instead of complex end-to-end robotics, these protocols should be widely adoptable, accelerating protein design.
Content:
JB: contains scripts/files for automated DNA fragment generation for SAPP.SAPP: contains scripts/files for automated analysis of protein expression/purification data generated with the SAPP protocol(s).DMX: contains the scripts/files for demultiplexing oligo pools.TMSC_apptainer.def: definition for building the apptainer to run the different scripts
We recommend building the above apptainer and execute the scritps/notebooks with them to resolve any dependency issues you may encounter.
If you use this code in your research, please cite the following paper:
@article{qian2025_SAPP_DMX,
title = {Accelerating protein design by scaling experimental characterization},
author = {Qian, Jason and Milles, Lukas F. and Wicky, Basile I. M. and Motmaen, Amir and Li, Xinting and Kibler, Ryan D. and Stewart, Lance and Baker, David},
url = {https://www.biorxiv.org/content/10.1101/2025.08.05.668824v1},
doi = {10.1101/2025.08.05.668824},
journal = {bioRxiv},
publisher={Cold Spring Harbor Laboratory},
year = {2025},
}