Skip to content

omicscode/eVaiutilities

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

eVaiutilities:

  • Analyzing and identifying genomic variants is particularly helpful in case of linking the disease onset to the genomic predictions. Genomic variants play a key role in the identification and characterization of the disease onset and also allow for interlinking genomic variability and hypothesis testing. eVai, which is a variant annotation platform provides annotation of the genomic variants using several backhand open source databases and tools. eVaiutilities proposed in this paper, is a data management software for the analysis of the eVai output files. It allows the analysis of the genomic variants further such as analyzing the multiple genomic annotated variants, reference and alternate allele, enabling coordinate search, coordinate search with specified variants and annotation search across a large number of population. The availability of the command line parameters allows for a large scale analysis across the several population files.

Role of eVaiutilities:

  • eVaiutilities is used for the population scale analysis the eVai results. It supports all versions of eVai and uses RUST version rustc 1.89.0 (29483883e 2025-08-04).
  • It analyzes the eVai output files for variant annotation to filtering, preparating user reports and database and annotator and more. See the example folder for the older and the newer version files.
  • It provides the population scale variant searching given a folder according to the variant, annotation, transcript, gene, and coordinates.
  • It uses this version of the human genome from GENCODE
  • It summaries several stats and plots figures for the variant analysis.

DOI

  • Scientific Rust

  • In this new version, it plots the variants accumulated also.

 cargo build
___  \ \   / /   __ _  (_) | | | | | |_  (_) | | (_) | |_  (_)   ___   ___
/ _ \  \ \ / /   / _` | | | | | | | | __| | | | | | | | __| | |  / _ \ / __|
|  __/   \ V /   | (_| | | | | |_| | | |_  | | | | | | | |_  | | |  __/ \__ \
\___|    \_/     \__,_| |_|  \___/   \__| |_| |_| |_|  \__| |_|  \___| |___/


annotating and analyzing eVai results.

   ************************************************
   Gaurav Sablok,
   Email: codeprog@icloud.com
   ************************************************

Usage: eVaiutilities <COMMAND>

Commands:
variant-analyzer                analyzer for the variants
variant-filter                  filter the variants
variant-database                create variant database
gtf-analyze                     analyze the corresponding gtf
variant-seq                     prepare the variant seq annotation
download-genome                 download the human genome
acmg-transcript                 variant-transcriptids
sequence-profile                sequence profiling
population-variant-search       search for the variant across population
coordinate-search               search according to coordinates
coordinate-search-variant       search according to coordinates and variant
annotation-search               search according to annotation
pathogenicity-filter            pathogenicity filter
population-variant-searcholder  search for the variant across population older version
coordinate-searcholder          search according to coordinates older version
coordinate-searc-variantholder  search according to coordinates and variant older version
annotation-searcholder          search according to annotation older version
pathogenicity-filterolder       pathogenicity filter older version
transcript-search               search by the specific transcript
transcript-searcholder          search by the specific transcript older
alt-allele                      search for the ref allele
alt-ref-allele                  search according to ref allele and alt allele
alt-allele-older                search for the ref allele older version
alt-ref-allele-older            search according to ref allele and alt allele older version
variant-plotter                 accumulate all variants for the plots
help                            Print this message or the help of the given subcommand(s)

Options:
-h, --help     Print help
-V, --version  Print version

 variant-analyzer ./testfiles/single-population/sample1.tsv ./single/sampletest.tsv
 variant-filter ./testfiles/single-population/sample1.tsv ./single/sampletest.tsv TTT
 gtf-analyze ./testfiles/single-population/samplegtf.gtf
 variant-seq ./testfiles/single-population/sample1.tsv ./single/sample.fasta
 download-genome yes
 acmg-transcript ./testfiles/single-population/sample1.tsv
 sequence-profile ./testfiles/single-population/sample1.tsv ./data-repo/sample.fasta 10 10
 population-variant-search ./testfiles/newversion-population G samplename
 coordinate-search ./testfiles/newversion-population 14464 16495 samplename
 annotation-search ./testfiles/newversion-population AL645608.6 samplename
 pathogenicity-filter ./testfiles/newversion-population 0.0 1.5 samplename
 population-variant-searcholder ./testfiles/oldversion-population G samplename
 coordinate-searcholder ./testfiles/oldversion-population 14464 16495 samplename
 annotation-searcholder ./testfiles/oldversion-population WASH7P samplename
 pathogenicity-filterolder ./testfiles/oldversion-population 0 1.5 samplename
 transcript-search ./testfiles/newversion-population ENST00000635509.2 samplename
 alt-allele ./testfiles/newversion-population T samplename
 alt-allele-older ./testfiles/oldversion-population T samplename
 alt-ref-allele ./testfiles/newversion-population A T samplename
 alt-ref-allele-older ./testfiles/oldversion-population A T samplename
 variant-plotter ./testfiles/variant-accumulation
  • Below you will find a detailed usage of the eVaiutilities.

  • The variant analyzer allows you to merge all the information present in the ACMG file and the TSV file for the annotate variants into a single file and removes the redundant information.This allows for a single file analysis.

Usage: eVaiutilities variant-analyzer <ACMGFILE> <TSVFILE>
Arguments:
  <ACMGFILE>  provide the path to the acmg file
  <TSVFILE>   provide the path to the tsv file

variant-analyzer ./testfiles/single-population/sample1.tsv ./single/sampletest.tsv
  • This option allows for the filtering of the variants according to a defined variant and allows for the filtering of the variant from a single annotated variant. It takes a ACMG and TSV file and a variant which needs to be filtered.
Usage: eVaiutilities variant-filter <ACMGFILE> <TSVFILE> <VARIANT>
Arguments:
  <ACMGFILE>  provides the path to the acmg file
  <TSVFILE>   provides the path to the tsv file
  <VARIANT>   provide the variant

variant-filter ./testfiles/single-population/sample1.tsv ./single/sampletest.tsv TTT

  • This option allows for the analysis of the corresponding GTF and gives you the corresponding annotated exons of the annotated genes.It collates all the exons present in the gene. analyze the corresponding gtf
Usage: eVaiutilities gtf-analyze <GTFFILE>
Arguments:
  <GTFFILE>  path to the gtf file

gtf-analyze ./testfiles/single-population/samplegtf.gtf
  • This gives you the annotated sequence associated with the variants given the fasta sequence. prepare the variant seq annotation.
Usage: eVaiutilities variant-seq <ACMGFILE> <FASTAFILE>
Arguments:
  <ACMGFILE>   provide the ACMG file
  <FASTAFILE>  provide fasta file

variant-seq ./testfiles/single-population/sample1.tsv ./single/sample.fasta
  • This option allows you to download the latest version of the genome from the GENCODE.
Usage: eVaiutilities download-genome <INPUT>
Arguments:
  <INPUT>
 download-genome yes
  • This option gives you the associated transcript ids with the variants in the ACMG transcripts.
Usage: eVaiutilities acmg-transcript <ACMGFILE>
Arguments:
  <ACMGFILE>  provide the ACMG file
acmg-transcript ./testfiles/single-population/sample1.tsv
  • This allows you to search for the variant and then extract the upstream and downstream information associated with the particular variant.
Usage: eVaiutilities sequence-profile <ACMGFILE> <FASTAFILE> <UPSTREAM> <DOWNSTREAM> <VARIANT>
Arguments:
  <ACMGFILE>    provide the acmg file
  <FASTAFILE>   provide the fasta file
  <UPSTREAM>    upstream location
  <DOWNSTREAM>  downstream location
  <VARIANT>     variant
sequence-profile ./testfiles/single-population/sample1.tsv ./data-repo/sample.fasta 10 10
  • This allows you to search for a particular variant across a population. This takes a directory containing all the ACMG files correponding to a particular population.
Usage: eVaiutilities population-variant-search <ACMGDIR> <VARIANT> <NAME>
Arguments:
  <ACMGDIR>  provide the acmg directory
  <VARIANT>  provide the variant
  <NAME>     analysis name
population-variant-search ./testfiles/newversion-population G samplename
  • This option allows you to search for a variant between the coordinates. In the above example, we provide a directory containing the ACMG files and then search for the variant between these coordinates across all those files and output a filename with the same samplename.
Usage: eVaiutilities coordinate-search <ACMGDIR> <START> <END> <NAME>
Arguments:
  <ACMGDIR>  provide the acmg directory
  <START>    provide the coordinate
  <END>      provide the end coordinate
  <NAME>     analysis name
coordinate-search ./testfiles/newversion-population 14464 16495 samplename
  • This allows you to search for a variant with the specificed directory containing all the ACMG files and then associated with the specific transcript id across all those files.
Usage: eVaiutilities annotation-search <ACMGDIR> <GENENAME> <NAME>
Arguments:
  <ACMGDIR>   provide the acmg directory
  <GENENAME>  search the annotation
  <NAME>      analysis name
annotation-search ./testfiles/newversion-population AL645608.6 samplename
  • This allows you to filter a directory with the ACMG files and you to have provide the range of the pathogenicity filter with the start and the end value and the sample name as the file to be written.
Usage: eVaiutilities pathogenicity-filter <ACMGDIR> <START> <END> <NAME>
Arguments:
  <ACMGDIR>  provide the acmg dir
  <START>    start value
  <END>      end value
  <NAME>     analysis name
pathogenicity-filter ./testfiles/newversion-population 0.0 1.5 samplename
  • This allows you to search for a particular variant across a population. This takes a directory containing all the ACMG files correponding to a particular population. This option is for the older version of the eVai.
Usage: eVaiutilities population-variant-searcholder <ACMGDIR> <VARIANT> <NAME>
Arguments:
  <ACMGDIR>  provide the acmg directory
  <VARIANT>  provide the variant
  <NAME>     analysis name
population-variant-searcholder ./testfiles/oldversion-population G samplename
  • This option allows you to search for a variant between the coordinates. In the above example, we provide a directory containing the ACMG files and then search for the variant between these coordinates across all those files and output a filename with the same samplename. This supports the older version of the eVai.
Usage: eVaiutilities coordinate-searcholder <ACMGDIR> <START> <END> <NAME>
Arguments:
  <ACMGDIR>  provide the acmg directory
  <START>    provide the coordinate
  <END>      provide the end coordinate
  <NAME>     analysis name
coordinate-searcholder ./testfiles/oldversion-population 14464 16495 samplename
  • This allows you to search for a variant with the specificed directory containing all the ACMG files and then associated with the specific transcript id across all those files. This supports the older version of the eVai.
Usage: eVaiutilities annotation-searcholder <ACMGDIR> <GENENAME> <NAME>
Arguments:
  <ACMGDIR>   provide the acmg directory
  <GENENAME>  search the annotation
  <NAME>      analysis name
annotation-searcholder ./testfiles/oldversion-population WASH7P samplename
  • This allows you to filter a directory with the ACMG files and you to have provide the range of the pathogenicity filter with the start and the end value and the sample name as the file to be written. This supports the older version.
Usage: eVaiutilities pathogenicity-filterolder <ACMGDIR> <START> <END> <NAME>
Arguments:
  <ACMGDIR>  provide the acmg dir
  <START>    start
  <END>      end
  <NAME>     analysis name
pathogenicity-filterolder ./testfiles/oldversion-population 0 1.5 samplename
  • This option allows you to search for a specific transcript across the entire population and takes a ACMG folder and the transcript name and a ouptufile name.
Usage: eVaiutilities transcript-search <ACMGDIR> <TRANSCRIPT> <NAME>
Arguments:
  <ACMGDIR>     provide the acmg directory
  <TRANSCRIPT>  provide the transcript name
  <NAME>        name of the analysis
transcript-search ./testfiles/newversion-population ENST00000635509.2 samplename
  • This option allows for the filtering of the entire population study and takes a directory containing ACMG files and a ref allele to look for and a output file name.
Usage: eVaiutilities alt-allele <ACMGDIR> <REFALLELE> <NAME>
Arguments:
  <ACMGDIR>    provide the acmg directory
  <REFALLELE>  provide the alt allele
  <NAME>       provide the analysis name
alt-allele ./testfiles/newversion-population T samplename
  • This option allows for the filtering of the entire population study and takes a directory containing ACMG files and a ref allele to look for and a output file name. This supports the older version of the eVai.
Usage: eVaiutilities alt-allele-older <ACMGDIR> <REFALLELE> <NAME>
Arguments:
  <ACMGDIR>    provide the acmg directory
  <REFALLELE>  provide the alt allele
  <NAME>       provide the analysis name
alt-allele-older ./testfiles/oldversion-population T samplename
  • This option allows you to search for a ref allele that has a specific alternate allele across a entire population and takes a directory containing ACMG files and the ref and the alternate allele to search for and an output filename to which the information should be written.
Usage: eVaiutilities alt-ref-allele <ACMGDIR> <REFALLELE> <ALTALLELE> <NAME>
Arguments:
  <ACMGDIR>    provide the acmd directory
  <REFALLELE>  provide the ref allele
  <ALTALLELE>  provide the alt allele
  <NAME>       provide the analysis name
 alt-ref-allele ./testfiles/newversion-population A T samplename
  • This option allows you to search for a ref allele that has a specific alternate allele across a entire population and takes a directory containing ACMG files and the ref and the alternate allele to search for and an output filename to which the information should be written. This supports the older version.
Usage: eVaiutilities alt-ref-allele-older <ACMGDIR> <REFALLELE> <ALTALLELE> <NAME>
Arguments:
  <ACMGDIR>    provide the acmd directory
  <REFALLELE>  provide the ref allele
  <ALTALLELE>  provide the alt allele
  <NAME>       provide the analysis name
 alt-ref-allele-older ./testfiles/oldversion-population A T samplename
  • This option allows you to plot all the variants across the entire population. It takes a folder containing the folder containing the annotated variants and then accumulate all the variants across that study and plots a bar plot to show the frequency of the variants across that study.
Usage: eVaiutilities variant-plotter <PATHFOLDER>
Arguments:
  <PATHFOLDER>  provide the path to the folder
variant-plotter ./testfiles/variant-accumulation
  • To install windows version:
rustup component add llvm-tools
rustup target add x86_64-pc-windows-msvc
git clone https://github.com/IBCHgenomic/eVaiutilities.git
cd ensemblcov
cargo xwin build --target x86_64-pc-windows-msvc

Gaurav Sablok
codeprog@icloud.com

Packages

No packages published

Contributors 3

  •  
  •  
  •