Genie.jl Evolution Simulation Package

Genie.jl is a Julia package designed to simulate the evolution of multiple sequence alignments (MSAs) based on specified parameters and conditions. Its primary function, run_evolution, enables users to simulate evolutionary trajectories for amino acid or nucleotide codon sequences under complex interaction parameters.

The complete description of the algorithm is available at

Emergent time scales of epistasis in protein evolution
Leonardo Di Bari, Matteo Bisardi, Sabrina Cotogno, Martin Weigt, Francesco Zamponi; doi: https://www.pnas.org/doi/10.1073/pnas.2406807121

Please cite this article if you use Genie.jl!

Genie.jl Evolution Simulation Package

Genie.jl is a Julia package designed to simulate the evolution of multiple sequence alignments (MSAs) based on specified parameters and conditions. Its primary function, run_evolution, enables users to simulate evolutionary trajectories for amino acid or nucleotide codon sequences under complex interaction parameters.

Installing the Genie Package via Git Clone

To install the Genie package directly from its Git repository, follow these steps:

Open a terminal and run the following command to clone the Genie repository:

git clone https://github.com/spqb/Genie.jl.git

then enter in the Genie.jl folder and call Julia with

julia --project

once inside the Julia REPL install the dependencies

using Pkg
Pkg.activate(".")
Pkg.instantiate()

Running the Genie Package: Using the Example Notebook or Julia REPL

Once you have installed the Genie package, you can either use the example notebook provided in the examples folder or work directly from the Julia REPL with parallel processing.

Option 1: Using the Example Notebook

Navigate to the examples Folder: Locate the examples folder inside the Genie package directory. This folder contains an example Jupyter notebook designed to help you explore Genie’s features.

Option 2: working from the Julia REPL

Navigate to the Genie package folder: Open Julia in the local environment with n threads over which the MCMC samoling can be parallelized by doing

../julia-1.10.0/bin/julia --project=. --thread n

then you can directly copy each cell of the 'examples' folder in your terminal to explore Genie's features (remember to adjust the paths of the data).

Key Function: `run_evolution`

Overview

The run_evolution function simulates the evolution of a given multiple sequence alignment (MSA) over a specified number of steps. It uses a combination of Gibbs sampling and Metropolis sampling to evolve the sequences, supporting options for random initialization, codon usage bias, and saving intermediate MSAs at specified intervals.

Parameters

start_msa::Array{T,2}: Initial MSA as a 2D array where T can be either Int8 for amino acids or String for nucleotide codons. If amino acids are provided, the corresponding codons will be randomly sampled among those coding for the amino acids. The array dimensions must be (L, M), where L is the sequence length, and each column represents a different sequence.
h::Array{T,2}: A 2D array of size (q, L) representing the field parameters.
J::Array{T,4}: A 4D array of size (q, L, q, L) representing the coupling parameters.

Optional Parameters

N_steps::Int: Number of steps for the simulation (default is 100).
temp: Temperature parameter for the simulation (default is 1.0).
p: Probability for choosing Metropolis Sampling (default is 0.5). A value of 0 will use only Gibbs Sampling with single nucleotide mutations, while 1 will use only Metropolis Sampling with indels.
N_points::Union{Int, Nothing}: Number of points to save the MSA in logarithmic scale along the trajectory (default is nothing). Specify either N_points or each_step, but not both.
each_step::Union{Int, Nothing}: Interval to save the MSA every each_step steps along the trajectory (default is nothing). Specify either N_points or each_step, but not both.
rand_init::Bool: Whether to initialize sequences randomly (default is false).
q::Int: Number of unique amino acids in the sequences (default is 21).
codon_bias::Union{Nothing, Dict{String, Float64}}: Codon usage bias dictionary (default is nothing; assumes no codon bias).
verbose::Bool: Whether to print progress information (default is false).

Returns

The function returns a named tuple containing the results of the simulation. The structure of the output depends on whether N_points or each_step is specified.

If N_points or each_step are not specified:
- msa::Array{Int8, 2}: Final MSA in amino acid form.
- msa_dna::Array{String, 2}: Final MSA in DNA format.
- codon_usage::Dict{String, Float64}: Codon usage dictionary used in the simulation.
- p::Float64: Probability of choosing Metropolis Sampling.
- temp::Float64: Temperature used in the simulation.
If either N_points or each_step are specified:
- step_msa::Array{Array{Int8, 2}, 1}: List of MSAs at different time points in amino acid format.
- msa_dna::Array{Array{String, 2}, 1}: List of MSAs in DNA format at different time points.
- codon_usage::Dict{String, Float64}: Codon usage dictionary used in the simulation.
- p::Float64: Probability of choosing Metropolis Sampling.
- temp::Float64: Temperature used in the simulation.
- steps::Array{Int, 1}: Steps at which MSAs were saved.

Description

The run_evolution function simulates the evolution of an initial MSA over a specified number of steps. It employs both Gibbs sampling and Metropolis sampling to generate evolved sequences, with options for random initialization, codon usage bias, and saving intermediate MSAs at specified intervals. The function is highly customizable and suitable for simulating complex evolutionary dynamics under various conditions.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
examples		examples
src		src
LICENSE		LICENSE
Manifest.toml		Manifest.toml
Project.toml		Project.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

Genie.jl Evolution Simulation Package

Genie.jl Evolution Simulation Package

Installing the Genie Package via Git Clone

Running the Genie Package: Using the Example Notebook or Julia REPL

Option 1: Using the Example Notebook

Option 2: working from the Julia REPL

Key Function: `run_evolution`

Overview

Parameters

Optional Parameters

Returns

Description

About

Uh oh!

Releases

Packages

Languages

Uh oh!

License

Uh oh!

spqb/Genie.jl

Folders and files

Latest commit

History

Repository files navigation

Genie.jl Evolution Simulation Package

Genie.jl Evolution Simulation Package

Installing the Genie Package via Git Clone

Running the Genie Package: Using the Example Notebook or Julia REPL

Option 1: Using the Example Notebook

Option 2: working from the Julia REPL

Key Function: run_evolution

Overview

Parameters

Optional Parameters

Returns

Description

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Key Function: `run_evolution`

Packages