- Analyzing and identifying genomic variants is particularly helpful in case of linking the disease onset to the genomic predictions. Genomic variants play a key role in the identification and characterization of the disease onset and also allow for interlinking genomic variability and hypothesis testing. eVai, which is a variant annotation platform provides annotation of the genomic variants using several backhand open source databases and tools. eVaiutilities proposed in this paper, is a data management software for the analysis of the eVai output files. It allows the analysis of the genomic variants further such as analyzing the multiple genomic annotated variants, reference and alternate allele, enabling coordinate search, coordinate search with specified variants and annotation search across a large number of population. The availability of the command line parameters allows for a large scale analysis across the several population files.
- eVaiutilities is used for the population scale analysis the eVai results. It supports all versions of eVai and uses RUST version rustc 1.89.0 (29483883e 2025-08-04).
- It analyzes the eVai output files for variant annotation to filtering, preparating user reports and database and annotator and more. See the example folder for the older and the newer version files.
- It provides the population scale variant searching given a folder according to the variant, annotation, transcript, gene, and coordinates.
- It uses this version of the human genome from GENCODE
- It summaries several stats and plots figures for the variant analysis.
-
In this new version, it plots the variants accumulated also.
cargo build
___ \ \ / / __ _ (_) | | | | | |_ (_) | | (_) | |_ (_) ___ ___
/ _ \ \ \ / / / _` | | | | | | | | __| | | | | | | | __| | | / _ \ / __|
| __/ \ V / | (_| | | | | |_| | | |_ | | | | | | | |_ | | | __/ \__ \
\___| \_/ \__,_| |_| \___/ \__| |_| |_| |_| \__| |_| \___| |___/
annotating and analyzing eVai results.
************************************************
Gaurav Sablok,
Email: codeprog@icloud.com
************************************************
Usage: eVaiutilities <COMMAND>
Commands:
variant-analyzer analyzer for the variants
variant-filter filter the variants
variant-database create variant database
gtf-analyze analyze the corresponding gtf
variant-seq prepare the variant seq annotation
download-genome download the human genome
acmg-transcript variant-transcriptids
sequence-profile sequence profiling
population-variant-search search for the variant across population
coordinate-search search according to coordinates
coordinate-search-variant search according to coordinates and variant
annotation-search search according to annotation
pathogenicity-filter pathogenicity filter
population-variant-searcholder search for the variant across population older version
coordinate-searcholder search according to coordinates older version
coordinate-searc-variantholder search according to coordinates and variant older version
annotation-searcholder search according to annotation older version
pathogenicity-filterolder pathogenicity filter older version
transcript-search search by the specific transcript
transcript-searcholder search by the specific transcript older
alt-allele search for the ref allele
alt-ref-allele search according to ref allele and alt allele
alt-allele-older search for the ref allele older version
alt-ref-allele-older search according to ref allele and alt allele older version
variant-plotter accumulate all variants for the plots
help Print this message or the help of the given subcommand(s)
Options:
-h, --help Print help
-V, --version Print version
variant-analyzer ./testfiles/single-population/sample1.tsv ./single/sampletest.tsv
variant-filter ./testfiles/single-population/sample1.tsv ./single/sampletest.tsv TTT
gtf-analyze ./testfiles/single-population/samplegtf.gtf
variant-seq ./testfiles/single-population/sample1.tsv ./single/sample.fasta
download-genome yes
acmg-transcript ./testfiles/single-population/sample1.tsv
sequence-profile ./testfiles/single-population/sample1.tsv ./data-repo/sample.fasta 10 10
population-variant-search ./testfiles/newversion-population G samplename
coordinate-search ./testfiles/newversion-population 14464 16495 samplename
annotation-search ./testfiles/newversion-population AL645608.6 samplename
pathogenicity-filter ./testfiles/newversion-population 0.0 1.5 samplename
population-variant-searcholder ./testfiles/oldversion-population G samplename
coordinate-searcholder ./testfiles/oldversion-population 14464 16495 samplename
annotation-searcholder ./testfiles/oldversion-population WASH7P samplename
pathogenicity-filterolder ./testfiles/oldversion-population 0 1.5 samplename
transcript-search ./testfiles/newversion-population ENST00000635509.2 samplename
alt-allele ./testfiles/newversion-population T samplename
alt-allele-older ./testfiles/oldversion-population T samplename
alt-ref-allele ./testfiles/newversion-population A T samplename
alt-ref-allele-older ./testfiles/oldversion-population A T samplename
variant-plotter ./testfiles/variant-accumulation
-
Below you will find a detailed usage of the eVaiutilities.
-
The variant analyzer allows you to merge all the information present in the ACMG file and the TSV file for the annotate variants into a single file and removes the redundant information.This allows for a single file analysis.
Usage: eVaiutilities variant-analyzer <ACMGFILE> <TSVFILE>
Arguments:
<ACMGFILE> provide the path to the acmg file
<TSVFILE> provide the path to the tsv file
variant-analyzer ./testfiles/single-population/sample1.tsv ./single/sampletest.tsv
- This option allows for the filtering of the variants according to a defined variant and allows for the filtering of the variant from a single annotated variant. It takes a ACMG and TSV file and a variant which needs to be filtered.
Usage: eVaiutilities variant-filter <ACMGFILE> <TSVFILE> <VARIANT>
Arguments:
<ACMGFILE> provides the path to the acmg file
<TSVFILE> provides the path to the tsv file
<VARIANT> provide the variant
variant-filter ./testfiles/single-population/sample1.tsv ./single/sampletest.tsv TTT
- This option allows for the analysis of the corresponding GTF and gives you the corresponding annotated exons of the annotated genes.It collates all the exons present in the gene. analyze the corresponding gtf
Usage: eVaiutilities gtf-analyze <GTFFILE>
Arguments:
<GTFFILE> path to the gtf file
gtf-analyze ./testfiles/single-population/samplegtf.gtf
- This gives you the annotated sequence associated with the variants given the fasta sequence. prepare the variant seq annotation.
Usage: eVaiutilities variant-seq <ACMGFILE> <FASTAFILE>
Arguments:
<ACMGFILE> provide the ACMG file
<FASTAFILE> provide fasta file
variant-seq ./testfiles/single-population/sample1.tsv ./single/sample.fasta
- This option allows you to download the latest version of the genome from the GENCODE.
Usage: eVaiutilities download-genome <INPUT>
Arguments:
<INPUT>
download-genome yes
- This option gives you the associated transcript ids with the variants in the ACMG transcripts.
Usage: eVaiutilities acmg-transcript <ACMGFILE>
Arguments:
<ACMGFILE> provide the ACMG file
acmg-transcript ./testfiles/single-population/sample1.tsv
- This allows you to search for the variant and then extract the upstream and downstream information associated with the particular variant.
Usage: eVaiutilities sequence-profile <ACMGFILE> <FASTAFILE> <UPSTREAM> <DOWNSTREAM> <VARIANT>
Arguments:
<ACMGFILE> provide the acmg file
<FASTAFILE> provide the fasta file
<UPSTREAM> upstream location
<DOWNSTREAM> downstream location
<VARIANT> variant
sequence-profile ./testfiles/single-population/sample1.tsv ./data-repo/sample.fasta 10 10
- This allows you to search for a particular variant across a population. This takes a directory containing all the ACMG files correponding to a particular population.
Usage: eVaiutilities population-variant-search <ACMGDIR> <VARIANT> <NAME>
Arguments:
<ACMGDIR> provide the acmg directory
<VARIANT> provide the variant
<NAME> analysis name
population-variant-search ./testfiles/newversion-population G samplename
- This option allows you to search for a variant between the coordinates. In the above example, we provide a directory containing the ACMG files and then search for the variant between these coordinates across all those files and output a filename with the same samplename.
Usage: eVaiutilities coordinate-search <ACMGDIR> <START> <END> <NAME>
Arguments:
<ACMGDIR> provide the acmg directory
<START> provide the coordinate
<END> provide the end coordinate
<NAME> analysis name
coordinate-search ./testfiles/newversion-population 14464 16495 samplename
- This allows you to search for a variant with the specificed directory containing all the ACMG files and then associated with the specific transcript id across all those files.
Usage: eVaiutilities annotation-search <ACMGDIR> <GENENAME> <NAME>
Arguments:
<ACMGDIR> provide the acmg directory
<GENENAME> search the annotation
<NAME> analysis name
annotation-search ./testfiles/newversion-population AL645608.6 samplename
- This allows you to filter a directory with the ACMG files and you to have provide the range of the pathogenicity filter with the start and the end value and the sample name as the file to be written.
Usage: eVaiutilities pathogenicity-filter <ACMGDIR> <START> <END> <NAME>
Arguments:
<ACMGDIR> provide the acmg dir
<START> start value
<END> end value
<NAME> analysis name
pathogenicity-filter ./testfiles/newversion-population 0.0 1.5 samplename
- This allows you to search for a particular variant across a population. This takes a directory containing all the ACMG files correponding to a particular population. This option is for the older version of the eVai.
Usage: eVaiutilities population-variant-searcholder <ACMGDIR> <VARIANT> <NAME>
Arguments:
<ACMGDIR> provide the acmg directory
<VARIANT> provide the variant
<NAME> analysis name
population-variant-searcholder ./testfiles/oldversion-population G samplename
- This option allows you to search for a variant between the coordinates. In the above example, we provide a directory containing the ACMG files and then search for the variant between these coordinates across all those files and output a filename with the same samplename. This supports the older version of the eVai.
Usage: eVaiutilities coordinate-searcholder <ACMGDIR> <START> <END> <NAME>
Arguments:
<ACMGDIR> provide the acmg directory
<START> provide the coordinate
<END> provide the end coordinate
<NAME> analysis name
coordinate-searcholder ./testfiles/oldversion-population 14464 16495 samplename
- This allows you to search for a variant with the specificed directory containing all the ACMG files and then associated with the specific transcript id across all those files. This supports the older version of the eVai.
Usage: eVaiutilities annotation-searcholder <ACMGDIR> <GENENAME> <NAME>
Arguments:
<ACMGDIR> provide the acmg directory
<GENENAME> search the annotation
<NAME> analysis name
annotation-searcholder ./testfiles/oldversion-population WASH7P samplename
- This allows you to filter a directory with the ACMG files and you to have provide the range of the pathogenicity filter with the start and the end value and the sample name as the file to be written. This supports the older version.
Usage: eVaiutilities pathogenicity-filterolder <ACMGDIR> <START> <END> <NAME>
Arguments:
<ACMGDIR> provide the acmg dir
<START> start
<END> end
<NAME> analysis name
pathogenicity-filterolder ./testfiles/oldversion-population 0 1.5 samplename
- This option allows you to search for a specific transcript across the entire population and takes a ACMG folder and the transcript name and a ouptufile name.
Usage: eVaiutilities transcript-search <ACMGDIR> <TRANSCRIPT> <NAME>
Arguments:
<ACMGDIR> provide the acmg directory
<TRANSCRIPT> provide the transcript name
<NAME> name of the analysis
transcript-search ./testfiles/newversion-population ENST00000635509.2 samplename
- This option allows for the filtering of the entire population study and takes a directory containing ACMG files and a ref allele to look for and a output file name.
Usage: eVaiutilities alt-allele <ACMGDIR> <REFALLELE> <NAME>
Arguments:
<ACMGDIR> provide the acmg directory
<REFALLELE> provide the alt allele
<NAME> provide the analysis name
alt-allele ./testfiles/newversion-population T samplename
- This option allows for the filtering of the entire population study and takes a directory containing ACMG files and a ref allele to look for and a output file name. This supports the older version of the eVai.
Usage: eVaiutilities alt-allele-older <ACMGDIR> <REFALLELE> <NAME>
Arguments:
<ACMGDIR> provide the acmg directory
<REFALLELE> provide the alt allele
<NAME> provide the analysis name
alt-allele-older ./testfiles/oldversion-population T samplename
- This option allows you to search for a ref allele that has a specific alternate allele across a entire population and takes a directory containing ACMG files and the ref and the alternate allele to search for and an output filename to which the information should be written.
Usage: eVaiutilities alt-ref-allele <ACMGDIR> <REFALLELE> <ALTALLELE> <NAME>
Arguments:
<ACMGDIR> provide the acmd directory
<REFALLELE> provide the ref allele
<ALTALLELE> provide the alt allele
<NAME> provide the analysis name
alt-ref-allele ./testfiles/newversion-population A T samplename
- This option allows you to search for a ref allele that has a specific alternate allele across a entire population and takes a directory containing ACMG files and the ref and the alternate allele to search for and an output filename to which the information should be written. This supports the older version.
Usage: eVaiutilities alt-ref-allele-older <ACMGDIR> <REFALLELE> <ALTALLELE> <NAME>
Arguments:
<ACMGDIR> provide the acmd directory
<REFALLELE> provide the ref allele
<ALTALLELE> provide the alt allele
<NAME> provide the analysis name
alt-ref-allele-older ./testfiles/oldversion-population A T samplename
- This option allows you to plot all the variants across the entire population. It takes a folder containing the folder containing the annotated variants and then accumulate all the variants across that study and plots a bar plot to show the frequency of the variants across that study.
Usage: eVaiutilities variant-plotter <PATHFOLDER>
Arguments:
<PATHFOLDER> provide the path to the folder
variant-plotter ./testfiles/variant-accumulation
- To install windows version:
rustup component add llvm-tools
rustup target add x86_64-pc-windows-msvc
git clone https://github.com/IBCHgenomic/eVaiutilities.git
cd ensemblcov
cargo xwin build --target x86_64-pc-windows-msvc
Gaurav Sablok
codeprog@icloud.com