Skip to content

Assembly metrics from fasta file

Latest
Compare
Choose a tag to compare
@biofcallejas biofcallejas released this 14 Apr 19:29
2d69ed4

This script will summarize the nucleotide content in a fasta file by calculating:

Total sequences
Total bases
Total Ns (undefined bases)
Total Gaps (strings of undefined bases longer than 5)
Shortest sequence
Longest sequence
Mean length
N50
Overall G+C content
Min G+C per sequence
Max G+C per sequence