Skip to content

Commit 9a9101d

Browse files
Update README.md
added read stepper
1 parent d53135b commit 9a9101d

File tree

1 file changed

+6
-1
lines changed

1 file changed

+6
-1
lines changed

README.md

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -240,7 +240,12 @@ cd <path to out directory containing genome.island.merge.gff and islander.gff fi
240240
cd <path to out directory containing resolve output>; typing.pl resolve3.gff
241241
```
242242

243-
243+
### Optional ReadStepper: Examining the flank of a query sequence in readset data
244+
245+
ReadStepper is a simple grep-trim-sort algorithm that allows the user to examine how raw short (eg. Illumina) sequencing reads inform a given flank of a query sequence. This is useful for examining challenging regions of an assembly, for example at repetitive regions or the closure of a circular contig. The user inputs a short query sequence (a 20-mer is usually adequate) and one or more paths to a fastq file. By default the downstream flank of the query is examined, but this can be switched to the upstream flank. The first step is to grep for the query sequence and its reverse complement among the reads, orienting all hit read sequences to the sense of the query. The second step is trim the query sequence and its unexamined flank from all reads. Then query-flanking sequences are alphabetically sorted. If the flanking sequence is single-copy, the sorted sequences should form a single horn with some noise due to occasional sequence errors. If the query sequence is from a repetitive sequence with two different right flanks, two horns will be produced (as in Fig. S1 of the reference).
246+
247+
Reference: Hudson CM, Bent ZW, Meagher RJ, Williams KP. 2014. Resistance determinants and mobile genetic elements of an NDM-1 encoding Klebsiella pneumoniae strain. PLoS One 9:e99209
248+
244249

245250
## Output files
246251
There are numerous output files generated from TIGER and Islander pipeline. Many of these are only needed if you plan to rerun the software. This guide will describe the 4 main output files

0 commit comments

Comments
 (0)