A failure of the classification of a reference genome #603

Open
biotengwk opened this issue Sep 14, 2024 · 1 comment
Labels
error Help required for a GTDB-Tk error.

Comments

biotengwk commented Sep 14, 2024

Dear authors, @pchaumeil

Hi! My name is Wenkai Teng, and I'm a postdoctoral researcher from China. Over the past few days I have been classifying a group of genomes with GTDB-Tk (v2.4.0) and the latest reference database (GTDB release 220). However, one genome that is itself part of GTDB r220, RS_GCF_002158865.1 (classified there as Comamonas_E serinivorans), could not be classified. The result was:

'GCF_002158865.1 Unclassified Bacteria ... Insufficient number of amino acids in MSA (3.1%)'

My colleague tried a different version of GTDB-Tk and got a similar result. Could you please help me find out why this happens?

Thanks in advance,

Wenkai Teng

@biotengwk biotengwk added the error Help required for a GTDB-Tk error. label Sep 14, 2024
biotengwk (author) commented

I think I have found it.

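When Prodigal is run on GCF_002158865.1, translation table 11 (-g 11) should be used. However, GTDB-Tk used -g 4. Under table 4, UGA is read as tryptophan instead of a stop codon, so the gene calls are wrong, which would explain why so few marker-gene amino acids ended up in the MSA.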
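
For anyone who wants to check this on their own genome, here is a minimal Python sketch of the comparison. It builds a Prodigal command for a given translation table (using the real -i, -g, -f, -o, -a and -q options), computes coding density from Prodigal's GFF output, and applies a table-selection rule. The function names, the output file naming, and the thresholds in pick_table are assumptions for illustration only; they are modeled on my understanding of the coding-density heuristic GTDB-Tk uses and should be checked against the GTDB-Tk source before relying on them.

```python
def prodigal_command(fasta_path, table, out_prefix):
    """Build a Prodigal command line for one translation table.

    The output file names are hypothetical; only the flags are Prodigal's.
    """
    return [
        "prodigal",
        "-i", fasta_path,                           # input nucleotide FASTA
        "-g", str(table),                           # translation table (4 or 11 here)
        "-f", "gff",                                # write gene coordinates as GFF
        "-o", f"{out_prefix}.table{table}.gff",
        "-a", f"{out_prefix}.table{table}.faa",     # translated proteins
        "-q",                                       # suppress stderr chatter
    ]


def coding_density(gff_text, genome_length):
    """Fraction of genome bases covered by CDS features in a GFF file.

    Overlapping genes on the same contig are merged so bases are counted once.
    """
    intervals = []
    for line in gff_text.splitlines():
        fields = line.split("\t")
        if line.startswith("#") or len(fields) < 5 or fields[2] != "CDS":
            continue
        intervals.append((fields[0], int(fields[3]), int(fields[4])))

    covered = 0
    current = None  # (contig, start, end) of the interval being merged
    for contig, start, end in sorted(intervals):
        if current and contig == current[0] and start <= current[2]:
            current = (contig, current[1], max(current[2], end))
        else:
            if current:
                covered += current[2] - current[1] + 1
            current = (contig, start, end)
    if current:
        covered += current[2] - current[1] + 1
    return covered / genome_length


def pick_table(density_4, density_11, min_gain=0.05, min_density_4=0.70):
    """Choose table 4 only if it clearly beats table 11 (assumed thresholds)."""
    if density_4 - density_11 > min_gain and density_4 > min_density_4:
        return 4
    return 11
```

Each command can be run with subprocess.run(cmd, check=True), once with table 4 and once with table 11, and the two GFF files passed to coding_density. If the rule picks table 4 for a genome that should use table 11, one possible workaround is to predict proteins yourself with -g 11 and hand them to GTDB-Tk with its --genes option, which skips gene calling.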
