Skip to content

knowmics-lab/BacExplorer

Repository files navigation

BacExplorer

Welcome to BacExplorer, a bacterial analysis tool with an user-friendly GUI.

BACEXPLORER

BacExplorer is a software system for the analysis of Microbial data. Its aim is to annotate Bacterial genome with information about Antimicrobial resistance, virulence factor, plasmids, serotypings and so on. The analysis is developed with the workflow management system Snakemake. BacExplorer permits to analyze both raw fastq data and fasta.

immagine

Please read the following guide for a correct installation and usage.

SYSTEM REQUIREMENTS:

Installation:

Internet connection is required to fetch the Docker image and external resources.

LINUX USERS: the package xdg-open is required. Make sure to install it with the following command:

sudo apt install xdg-utils

Analysis:

It can be performed both with .fasta and .fastq files, with the following requirements:

  • at least 36 GB of storage space;
  • FASTA: at least 8 GB of RAM to analyze one sample;
  • FASTQ: at least 8 GB of RAM to analyze one sample;

Storage:

BacExplorer comes with a large amount of data due to the many databases of the softwares used for the analyses. The whole app data is found in the following directories:

  • Linux Users: /home/< user >/.config/BacExplorer
  • MacOS Users: ~/Library/Application Support/BacExplorer
  • Windows Users: C:\Users< user >\AppData\Roaming\BacExplorer

INSTALLATION

An UNIX environment is required to make Snakemake able to perform. To ensure this, the entire Snakemake pipeline runs into a Docker container, whether your OS is Linux, macOS or Windows Professional.

Step 1 (OPTIONAL): Install Docker

Docker is essential to run the application, so for the first usage you will be able to check whether Docker or Docker Desktop is already installed on your machine. In case it is not, the link to Docker will be provided to you. In case you still want to download it before getting started with the application itself, check the version for your platform here: https://www.docker.com/.

Step 2: Download the tool

Visit the following link and download the latest release: https://github.com/knowmics-lab/BacExplorer/releases. You will find the one that suits your OS in the Assets menu.

Step 3: Run Docker and start BacExplorer

Docker or Docker Desktop must be running throughout the whole usage of the application.

Step 4: Home page

You are now finished with the installation process and ready for your first analysis. Be sure to click on Go to setup if this is your first usage. Read the following paragraph and let us guide you through all the steps.

App setup

SETUP

  1. If you have not downloaded Docker yet, click on Check. If not found, the system will provide you the link to download the correct version for your platform.
  2. Start Docker or Docker Desktop.
  3. Run the Environment Setup. It will take some time to automatically set up the container and eventually run it. The entire process, which requires no manual intervention, consists of the following steps:
    • pull the Docker image of the container;
    • download external resources. Databases will be stored in the app data directory, BacExplorer/snakemake/resources, and other tools will be stored in BacExplorer/snakemake/tools. Please do not move any of these directories, since they are mounted on the container as a volume;
    • create the container and start it;
    • update databases inside the container. Once the Environment Setup is finished, you can move on to the analysis page.

INSERIRE IMMAGINE DEL SETUP COMPLETATO

Docker setup

USAGE

  1. Input
    The user should put the input files in a specific folder, where the outputs are going to be saved. FASTQ file formats need to be either for single-end
  • .fastq.gz
  • .fq.gz for paired-end
  • _1.fastq.gz _2.fastq.gz
  • _R1.fastq.gz _R2.fastq.gz
  • _R1_L001.fastq.gz _R2_L001.fastq.gz
  • _1.fq.gz _2.fq.gz
  • _R1.fq.gz _R2.fq.gz
  • _R1_L001.fq.gz _R2_L001.fq.gz Fasta file extension should be
  • .fasta
  • .fa
  • .fna
  • .fsa
  1. Parameters Setting
    The user needs to set the following parameters for the analysis:
  • The analysis name, this will also be the name of the HTML report;
  • If all the samples belongs to the same Genus and Species, it is possible to specify them. In this case, the kraken2 taxonomy analysis will be skipped;
  • Identity and Coverage value for the filtering of AMR and virulence factor results. The default parameter is 90% for both;
  • The folder with the data to be analyzed.

Parameters_setting

  1. Output organization Inside the input folder the system will create an output folder with several subfolders:
  • abricate
  • abricate_ecoli
  • agrvate
  • amrfinder
  • ClermonTyping
  • ectyper
  • emmtyper
  • fasta_output - only in fastq analysis
  • file
  • fimtyper
  • genomad
  • hicap
  • kleborate
  • kleborate_escherichia
  • kraken2
  • legsta
  • lissero
  • meningotype
  • mlst
  • ngmaster
  • pasty
  • pbptyper
  • quality_assessment/fastqc - only in fastq analysis
  • quality_assessment/quast
  • sccmec
  • shigatyper
  • shigeifinder
  • spatyper
  • trim - only in fastq analysis
  • virulencefinder
  • Report.html
  1. Report page

immagine

immagine

immagine

Test

FASTQ

To test BacExplorer with fastq samples it is possible to download:

FASTA

Fasta samples to test BacExplorer can be found in "test_data" folder in this repository.

Software and Database

Software

Database

Citation

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages