Welcome to this Biopython workshop!
We will learn how to use Biopython for sequence manipulation, filtering, writing, and BLASTing!
There are three options to run the workshop code:
-
Click on the desired workshop day link in the files section
-
Open the main github directory in google colab and select the notebook (.ipynb) you wish to run.
-
Click on the green Code button and choose Download Zip, unzip the folder on your computer, open Jupyter Lab (through Anaconda Navigator or the command line), navigate to the folder you just downloaded, and open the appropriate notebook.
-
Day 1 – Introduction to strings, Biopython, and Biopython sequences
-
Day 2 – Opening, closing, and saving sequence files with Biopython
-
Day 3 – More sequence modification and data extraction
-
Day 4 – Extracting and storing sequence data, working with GenBank files
-
Day 5 – BLAST-ing against the NCBI database
In the main directory you will find lesson notebooks, answer notebooks, example data, and annotated images.
There are five jupyter notebooks, one for each day, with lessons and questions.
There are five jupyter notebooks, one for each day, with the answers to all the exercises in the lessons.
Different data files of extension .fasta, .fastq, .gbk, and .xml
Examples of biological sequence data types and how Biopython reads them. Useful companion for lessons.
Northwestern University Information Technology Research Computing Services
Special thanks to Colby Witherup Wood for their assistance