Skip to content

Troubleshooting GTDB-Tk Database Installation and Environment Configuration #596

Open
@soojunglee98

Description

@soojunglee98

Environment

  • Installed via pip (include the output of pip list)
  • [0] Using a conda environment (include the output of conda list && conda list --revisions)
  • Using a Docker container (include the IMAGE ID of the container)

Debugging information

  • [0 ] gtdbtk.log has been included (drag and drop the file to upload).
  • Genomes have been included (if possible, and there are few).

Additional comments

As mentioned on this website, I installed the conda environment and tried to download the database.

When I run download.sh, it says:

"Cannot write to '/home/spotgiet/miniconda3/envs/gtdbtk-2.1.1/share/gtdbtk-2.1.1/db/gtdbtk_r207_v2_data.tar.gz' (No space left on device)."

So, I manually downloaded the database to another directory due to space limitations in my home directory. However, I encountered this error:


================================================================================
                                     ERROR                                      
________________________________________________________________________________

          The 'GTDBTK_DATA_PATH' environment variable is not defined.           

            Please set this variable to your reference data package.            
               https://github.com/Ecogenomics/GTDBTk#installation               
================================================================================

================================================================================
                                     ERROR                                      
________________________________________________________________________________

           The GTDB-Tk reference data does not exist or is corrupted.           
                GTDBTK_DATA_PATH=/path/to/unarchived/gtdbtk/data                

   Please compare the checksum to those provided in the download repository.    
          https://github.com/Ecogenomics/GTDBTk#gtdb-tk-reference-data          
================================================================================

So again, as mentioned on the website, I activated my conda environment and tried to run:

conda env config vars set GTDBTK_DATA_PATH="/scratch/raskin_root/raskin0/shared_data/Soojung_Sarah/gtdb_tk/release220"

But it keeps saying: "To make your changes take effect, please reactivate your environment." even though my conda environment is already activated. Any suggestions? Thank you so much!

Metadata

Metadata

Assignees

No one assigned

    Labels

    errorHelp required for a GTDB-Tk error.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions