Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
23 commits
Select commit Hold shift + click to select a range
77ecf62
fix: remove .snakemake dir from clean
corneliusroemer May 5, 2025
0da48e0
Add default snakemake profile for ergonomics
corneliusroemer May 5, 2025
ba93a59
Add basic "running the workflow" to README
corneliusroemer May 5, 2025
9585334
Fix precommit exclude
corneliusroemer May 5, 2025
d9c8599
precommit fixes
corneliusroemer May 5, 2025
82b0263
snakemake ergonomics
corneliusroemer May 5, 2025
f3a7772
nextclade readme edit
corneliusroemer May 5, 2025
29a9cc0
Remove .snakemake dir from clean
corneliusroemer May 5, 2025
72a1eb9
force `rm -rf` is a typo away from bad things, maybe unnecessary as w…
corneliusroemer May 5, 2025
3fc85d0
E712 has false positives for pandas so let's just disable
corneliusroemer May 5, 2025
14ecd2b
Fix Nextclade tree title, add extra (free) metadata to coloring
corneliusroemer May 5, 2025
3d0e98f
Print export output, helpful for debugging
corneliusroemer May 5, 2025
9dbb8ac
fix forgotten trailing \
corneliusroemer May 5, 2025
495d298
Systematic logging to logfile and stdout
corneliusroemer May 5, 2025
acf9166
don't clean .snakemake with snakemake
corneliusroemer May 5, 2025
f16bedc
add default profile
corneliusroemer May 5, 2025
c8d3527
fix missing trailing \
corneliusroemer May 5, 2025
51611fe
Add sample sequences to nextclade defaults to make the dataset proper…
corneliusroemer May 5, 2025
6e61385
Add how to test the nextclade dataset
corneliusroemer May 5, 2025
6264414
Try different clade E1 tsv
corneliusroemer May 5, 2025
337ebee
General phylogenetic improvements
corneliusroemer May 6, 2025
846bcf0
Clades tweaks
corneliusroemer May 6, 2025
ff3f4b4
Fix precommit
corneliusroemer May 6, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .pre-commit-config.yaml
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
default_language_version:
python: python3
exclude: '\.(tsv|fasta|gb)$|^ingest/vendored/|'
exclude: '\.(tsv|fasta|gb)$|^ingest/vendored/'
repos:
- repo: https://github.com/pre-commit/sync-pre-commit-deps
rev: v0.0.3
Expand Down
14 changes: 14 additions & 0 deletions ingest/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,20 @@ If you have another data source or private data that needs to be
formatted for the phylogenetic workflow, then you can use a similar
workflow to curate your own data.

## Running the workflow

You can run the workflow with default parameters using the `nextstrain` cli:

```sh
nextstrain build .
```

or at your own risk using `snakemake` directly (if you've got all the typical Nextstrain workflow dependencies):

```sh
snakemake
```

## Config

The config directory contains all of the default configurations for
Expand Down
3 changes: 1 addition & 2 deletions ingest/Snakefile
Original file line number Diff line number Diff line change
Expand Up @@ -14,15 +14,14 @@ include: "rules/curate.smk"
rule clean:
params:
targets = [
".snakemake",
"benchmarks",
"data",
"logs",
"results",
]
shell:
"""
rm -rfv {params.targets}
rm -rv {params.targets}
"""

# Import custom rules provided via the config.
Expand Down
3 changes: 3 additions & 0 deletions ingest/profiles/default/config.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cores: all
printshellcmds: true
ri: true
19 changes: 19 additions & 0 deletions nextclade/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,3 +23,22 @@ FIXME CITE WHO PAPER HERE.
* If any changes are needed to the clade-defining mutations, add
changes to `defaults/clades.tsv` and re-run the workflow
* Repeat as needed

## Using local ingest data

By default, this workflow pulls starting sequences and metadata from Nextstrain's AWS s3 bucket.
If you want to use data from a local ingest run, just copy the data over:

```sh
cp -r ../ingest/results data
```

## Testing the dataset locally

You can test the dataset locally by serving it, e.g. with `npx serve`:

```sh
npx serve ./dataset --cors
```

And then navigating to the following URL in your browser: <https://clades.nextstrain.org/?dataset-url=http://localhost:3000>
3 changes: 1 addition & 2 deletions nextclade/Snakefile
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,6 @@ include: "rules/assemble_dataset.smk"
rule clean:
params:
targets = [
".snakemake",
"auspice",
"benchmarks",
"data",
Expand All @@ -26,5 +25,5 @@ rule clean:
]
shell:
"""
rm -rfv {params.targets}
rm -rv {params.targets}
"""
42 changes: 41 additions & 1 deletion nextclade/defaults/auspice_config.json
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
{
"title": "Real-time tracking of rubella virus full genome virus evolution",
"title": "Rubella E1 gene Nextclade reference tree",
"maintainers": [
{
"name": "John SJ Anderson",
Expand All @@ -20,6 +20,11 @@
"title": "Genotype",
"type": "categorical"
},
{
"key": "date",
"title": "Collection Date",
"type": "temporal"
},
{
"key": "region",
"title": "Region",
Expand All @@ -29,6 +34,41 @@
"key": "country",
"title": "Country",
"type": "categorical"
},
{
"key": "division",
"title": "Division",
"type": "categorical"
},
{
"key": "location",
"title": "Location",
"type": "categorical"
},
{
"key": "length",
"title": "Length",
"type": "continuous"
},
{
"key": "date_released",
"title": "Date Released",
"type": "temporal"
},
{
"key": "author",
"title": "Author",
"type": "categorical"
},
{
"key": "host",
"title": "Host",
"type": "categorical"
},
{
"key": "institution",
"title": "Institution",
"type": "categorical"
}
],
"geo_resolutions": ["country", "region"],
Expand Down
1 change: 0 additions & 1 deletion nextclade/defaults/colors.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -81,4 +81,3 @@ clade 1G #DFA43B
clade 1H #E68434
clade 1I #E2582C
clade 1J #DB2823

1 change: 0 additions & 1 deletion nextclade/defaults/nextclade-dataset/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,4 +37,3 @@ This dataset supports:

Read more about Nextclade datasets in the Nextclade documentation:
<https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html>

Loading