MIP_ORACLE

A software to filter and identify unique target regions with diagnostic significance in antimicrobial resistance genes, and various other pathogen genomes.

Molecular Inversion Probe(MIP)

Molecular Inversion Probes(MIPs) are single-stranded DNA molecules containing two complementary regions that flank the target DNA. These molecules often have a Fluorophore, DNA barcode, or Molecular tag for unique identification.

Rough Design Outline -

Start with all possible MIPs by moving along the strand one base pair at a time.
Design MIPs for both the forward and reverse strands so that we have the highest probability of binding and then proceed to filter them according to three user-specified criteria: Temperature GC Content Nucleotide Repeats
Create a database in MMseqs2 format to only include the human (host) genome. Then filter the MIPs by searching them against the host genome(human).
To increase the probability of the MIP binding to the correct target region, search them against the non-redundant nucleotides database.
Filter out any MIPs which do not match any other organisms.

Workflow

Demo -

Obtain sequences of interest in a FASTA format, and make sure the organism name is present in the definition line of each sequence.
Following this download all the program files and store them in the same directory as the FASTA file.
Fill out the requirements to filter MIPs in the config file provided. The MIPs within the ranges given will be accepted. ex. all MIPs with 45<temp<70 will be taken.

Run the shell script provided as so:

MIP_ORACLE.sh -i Trial_File -o trial_final_results -l mip_oracle -j /DATA/databases/blast/nt

nohup can also be used:

nohup MIP_ORACLE.sh -i Trial_File -o trial_final_results -l mip_oracle -j /DATA/databases/blast/nt > trial_log.out &

The following files will be generated:

The first file will contain all possible MIPs for the sequences provided.
The second and third files will contain Passable MIPs(The MIPs that met user requirements as per the config file), and Eliminated MIPs(MIPs that were filtered out).
The fourth file is the input file for MMseqs2 search containing arm1+target+arm2 sequences.
The fifth and sixth files are the MMseqs2 databases for only human sequences and other non-redundant nucleotides sequence format created from nt BLAST DB input .
The seventh file will contain the MIPs with no hits in the MMseqs2 DB, and the eighth file will have the filtered results. Lastly, the final result file will be generated with filtered MIPs.

Add files image

Requirements -

Nucleotide BLAST 2.12.0 + with the nt database.

Python 3.6 and the following python packages:

pandas=1.1.5
biopython=1.70
configparser
regex
xlsxwriter
openpyxl

Users can install the required packages through conda using the following command

conda create -n mip_oracle --file mip_oracle_env.txt

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
MIP_1.py		MIP_1.py
MIP_2.py		MIP_2.py
MIP_3.py		MIP_3.py
MIP_4.py		MIP_4.py
MIP_5.py		MIP_5.py
MIP_ORACLE.sh		MIP_ORACLE.sh
Organism Dictionary.xlsx		Organism Dictionary.xlsx
README.md		README.md
config.txt		config.txt
mip_oracle_env.txt		mip_oracle_env.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MIP_ORACLE

Table of contents

Molecular Inversion Probe(MIP)

Rough Design Outline -

Workflow

Demo -

Run the shell script provided as so:

Requirements -

About

Releases

Packages

Languages

ShwetaNagre/MIP_Oracle_by_MMseqs

Folders and files

Latest commit

History

Repository files navigation

MIP_ORACLE

Table of contents

Molecular Inversion Probe(MIP)

Rough Design Outline -

Workflow

Demo -

Run the shell script provided as so:

Requirements -

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages