Releases: terrimporter/18SClassifier
18S Classifier v4.1.1
New release to activate Zenodo DOI.
18S Classifier v4.1
These files are ready to be used with the RDP classifier 2.12 and trained to make assignments at the genus rank. This dataset was created from the SILVA 138 SSURef Nr99 release. Sequences without a genus rank rank assignment from SILVA were excluded.
Place holder taxa were inserted for ranks that were missing from the SILVA file, ex. Eukrayota_undef. This was done to maintain a file format easily parsed by downstream programs.
Total decompressed file size is 269 Mb.
18S Classifier v4.1-ref
These are the files used to train the 18S v4.1 classifier that are provided for reference only. Created from the SILVA 138 SSU Ref Nr99 dataset. SILVA sequences without a genus rank assignment were discarded.
Place holder taxa were inserted for ranks that were missing from the SILVA file, ex. Eukrayota_undef. This was done to maintain a file format easily parsed by downstream programs.
Total decompressed file size is 71Mb.
18S Classifier v4.0
These files are ready to be used with the RDP classifier 2.12 and trained to make assignments at the genus rank. This dataset was created from the SILVA 138 SSURef Nr99 release. Sequences without a genus rank rank assignment from SILVA were excluded. Total decompressed file size is 269 Mb.
18S Classifier v4.0-ref
These are the files used to train the 18S v4.0 classifier that are provided for reference only. Created from the SILVA 138 SSU Ref Nr99 dataset. SILVA sequences without a genus rank assignment were discarded.
Total decompressed file size is 70Mb.
18S Classifier v3.2
This version was updated to include a few invasive species of interest even though they were not originally included in the SILVA 132 SSURef Nr99 release. The sequences were mined from GenBank.
Included here are the files ready to be used with the RDP classifier, no further training needed. Total decompressed file size is 208Mb .
18S Classifier v3.2-ref
This version was updated to include a few invasive species of interest even though they were not originally included in the SILVA 132 SSURef Nr99 release. The sequences were mined from GenBank.
Included here are the original taxonomy and FASTA files used to train this version of the 18S classifier. Total decompressed file size is 16Mb.
18S Classifier v3.1
Fixes punctuation errors in sequence headers and taxonomy that breaks the classifier.
18S Classifier v3.1-ref
Fixes punctuation errors in sequence headers and taxonomy that breaks the classifier.
mydata_training.tar.gz
18S Classifier v3.0
These files are ready to be used with the RDP classifier 2.12 . This dataset was created from the SILVA 132 SSURef Nr99 release. Species with more than one taxonomic lineage were excluded.