This repository contains the code and analysis for the blog post "Exploring Blood-Based Biosurveillance, Part 3: The Blood Virome."
- R (version 4.0.0 or higher)
- AWS CLI
- Quarto (latest version)
- Required R packages (will be automatically installed using pacman):
- tidyverse
- RColorBrewer
- scales
- ggbeeswarm
- ggpubr
- grid
- gridExtra
- ggforce
- extrafont
- cowplot
- pacman
main.qmd
: Main Quarto document containing the analysis code and figure generationscripts/aux_plot-theme.R
: Helper script for consistent ggplot2 theme settingsscripts/download_data.sh
: Bash script to download required data from AWS S3
-
Clone this repository:
git clone https://github.com/naobservatory/the-blood-virome.git cd the-blood-virome
-
Download the data from AWS S3:
mkdir mgs-analysis cd mgs-analysis chmod +x ../scripts/download_data.sh ../scripts/download_data.sh
Hence your directory structure should look like this:
├── the-blood-virome (this repo) ├── figures/ # Generated figures ├── scripts/ # Helper scripts ├── resources/ # Additional resources ├── main.qmd # Main Quarto document ├── mgs-analysis/ # Source data directories ├── aydillo2022 ├── cebriamendoza2021 ├── mengyi2023 ├── oconnell2023 ├── thijssen2023 └── thompson2023
-
Install required R packages:
install.packages("pacman") pacman::p_load(tidyverse, RColorBrewer, scales, ggbeeswarm, ggpubr, grid, gridExtra, ggforce, extrafont, cowplot)
-
Render the analysis document (this will take a while):
quarto render main.qmd
alternatively, you can open the document in VSCode or another IDE that supports Quarto and interactively run the code chunks (this is what we recommend).
The analysis will generate multiple figures in the figures/
directory and create an HTML document with the complete analysis.
If you use this analysis in your work, please cite:
Bhasin, H., McLaren, M., & Justen, L. (2025, February 7). Exploring Blood-Based Biosurveillance, Part 3: The Blood Virome. Nucleic Acid Observatory Blog. https://naobservatory.org/blog/exploring-blood-biosurveillance-part3
For questions or feedback, please contact Lennart Justen at lenni@securebio.org or Harmon Bhasin at harmon@securebio.org.