Skip to content

Commit cc009f6

Browse files
committed
limiting to contigs over 1000bp
1 parent db15af9 commit cc009f6

File tree

1 file changed

+10
-0
lines changed

1 file changed

+10
-0
lines changed

Workshops/COMBINE_WA_2024.md

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -642,6 +642,16 @@ And now we plot _all_ the raw data.
642642
sns.lineplot(data=melted_df, x='Sample', y='Depth', hue='contig', legend=False)
643643
```
644644

645+
## Filter for longer contigs
646+
647+
We filter our data frame so that we only consider contigs > 1000 bp. You can change this limit if you wish.
648+
649+
```
650+
minlength = 1000
651+
longcontigs = seqlengths[seqlengths['length'] > minlength].index
652+
dfs = dfs[dfs.index.isin(longcontigs)]
653+
```
654+
645655
## Calculate the corrrelations
646656

647657
Now that we have the contigs and their average depth across the samples, we calculate a pairwise correlation between all contigs and all other contigs.

0 commit comments

Comments
 (0)