Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add new tutorial for deciphering viral populations using SNV and baculovirus isolates (Variant analysis) #5700

Open
wants to merge 47 commits into
base: main
Choose a base branch
from
Open
Changes from 1 commit
Commits
Show all changes
47 commits
Select commit Hold shift + click to select a range
558ba7d
Initial commit: Basic tutorial structure and content added
wennj Jan 10, 2025
96f9542
Minor tutorial update.
wennj Jan 13, 2025
5fae409
VCF to table transformation, snv specificity and visualization framew…
wennj Jan 13, 2025
f3edc33
VCF to table transformation added.
wennj Jan 14, 2025
02c69c2
First version of the complete tutorial
wennj Jan 16, 2025
e6e8e8b
First student review: Adjustments and feedback integration
wennj Jan 17, 2025
6f5c87e
Conclusion added.
wennj Jan 17, 2025
5e42df7
Add wennj as contributor.
wennj Jan 17, 2025
a8ee5a9
Merge remote-tracking branch 'upstream/main'
wennj Jan 17, 2025
cc836bf
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
10311ac
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
d85f994
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
2163765
Update tutorial.md - empty reference removed
wennj Jan 18, 2025
0a5c432
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
02fc2f3
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
89adfa9
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
1f92a42
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
5a7e8ed
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
069a16d
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
1cd5641
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
e0e01b1
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
69d35b4
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
025d154
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
671e684
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
35538ee
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
ce7e921
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
b78b757
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
af10933
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 18, 2025
0706c5a
Add outdated suggestions from Pull Request comments
wennj Jan 18, 2025
856c544
Update CONTRIBUTORS.yaml
wennj Jan 21, 2025
2600bad
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 21, 2025
7071118
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 21, 2025
a6cff50
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 21, 2025
f546250
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 21, 2025
edfbae9
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
wennj Jan 21, 2025
0a5e4ec
Update tutorial.md - Time estimation changed.
wennj Jan 21, 2025
d79277b
Replace old workflow with a correct formatted workflow file
wennj Jan 22, 2025
ffa783c
Update and clean up of bibliography: citations updated, fixed and sor…
wennj Jan 22, 2025
cad9c7d
Question on the occurrence of alternative nucleotides revised.
wennj Jan 23, 2025
09fbae1
Isolate/sample issue resolved and wording improved.
wennj Jan 24, 2025
1006bbb
Input description in hand-on boxes updated for better guidance.
wennj Jan 24, 2025
d185ddc
Merge branch 'main' into main
wennj Jan 27, 2025
4fade29
Fix most CI complaints and a few minor things
wm75 Jan 28, 2025
1b33ecd
Try to restore Gemfile.lock
wm75 Jan 28, 2025
8a18b71
CI issues and wording.
wennj Jan 30, 2025
d8824c6
Workflow test data added.
wennj Jan 30, 2025
72b7750
Filter on column 25 corrected.
wennj Jan 31, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Update topics/variant-analysis/tutorials/baculovirus-isolate-variatio…
…n/tutorial.md

Co-authored-by: Wolfgang Maier <maierw@posteo.de>
  • Loading branch information
wennj and wm75 authored Jan 21, 2025
commit f546250d2078b0c4b88c82e2abc744d3ee9818ae
Original file line number Diff line number Diff line change
Expand Up @@ -611,7 +611,7 @@ As a result, we get a SNV plot that shows the relative frequency of the first al

# SNV Specificity Determination

Now we come to the last but most complex section of this tutorial - determining SNV specificities. We have seen that CpGV-V15 is a mixed isolate. Now we want to find out which isolates were mixed and how this SNV pattern can be explained. In the next step, we will consider SNV positions as markers and look for SNV positions that are only variable for one or more sequenced isolates of CpGV. If an SNV position is only variable for CpGV-S (i.e. the relative frequency is greater than 0; `REL.ALT > 0`) but is equal to zero for the other isolates, then this position is a marker for CpGV-S, or specific for CpGV-S. Since we cannot perform this determination by hand for all positions, I have written a small program implemented in the **Text reformatting with awk** tool that does the work for us.
Now we come to the last but most complex section of this tutorial - determining SNV specificities. We have seen that CpGV-V15 is a mixed isolate. Now we want to find out which isolates were mixed and how this SNV pattern can be explained. In the next step, we will consider SNV positions as markers and look for SNV positions that are only variable for one or more sequenced isolates of CpGV. If a SNV position is only variable for CpGV-S (i.e. the relative frequency is greater than 0; `REL.ALT > 0`) but is equal to zero for the other isolates, then this position is a marker for CpGV-S, or specific for CpGV-S. Since we cannot perform this determination by hand for all positions, I have written a small program implemented in the **Text reformatting with awk** tool that does the work for us.

Before we get started, I would like to explain specificity in more detail using an example. First, we decide that we want to determine the specificities for the SNV positions for the following isolates:
* CpGV-M
Expand Down