Protocol for a phylogenomics study

A SEDA pipeline created in Compi that implements the "Protocol for a phylogenomics study" SEDA-based protocol. Created using the SEDA-Compi pipelines framework.

This protocol shows how to retrieve files and prepare datasets to be used in detailed phylogenomics studies. The given example concerns the use of mitochondrial genomes to pinpoint the most likely phylogenetic relationship between Rosaceae species, using a concatenated sequence approach.

Quick-start: running the pipeline with sample data

Download this ZIP and decompress it. The path where it is extracted will be referred to as the "working directory" (/path/to/working_dir).

Then, move to the working directory and simply run ./run.sh "$(pwd)" to execute the entire pipeline with the provided input files.

The input data for this protocol can also be obtained from NCBI by querying Rosaceae [Organisms] AND complete genome mitochondrion in the search field. Afterwards, download each mitochondrial genome individually, selecting Coding sequences in the Send to: option and choosing the FASTA Nucleotide file type. Please note that some species have more than one mitochondrial genome available, and some genomes may carry an UNVERIFIED prefix. To avoid redundant or misleading data, it is highly advisable to use the most recent genome available and to exclude unverified datasets.
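As a quick sanity check on manually downloaded files, the sketch below counts the sequences in each file of a directory and flags any file whose FASTA headers contain the word UNVERIFIED. The helper name check_fasta_dir is hypothetical and not part of the pipeline. Whether the UNVERIFIED prefix actually shows up inside the downloaded coding-sequence headers is an assumption; it may only be visible in the NCBI search results, so treat an "ok" here as a hint rather than a guarantee.

```shell
# check_fasta_dir DIR
# Hypothetical helper (not part of the pipeline): prints one line per file
# with the file name, the number of FASTA headers, and a status flag.
check_fasta_dir() {
  dir="$1"
  for f in "$dir"/*; do
    [ -f "$f" ] || continue
    # Count header lines; grep -c prints 0 (and exits non-zero) when none match
    n="$(grep -c '^>' "$f" || true)"
    # Assumption: an unverified record carries "UNVERIFIED" in its header line
    if grep -q '^>.*UNVERIFIED' "$f"; then
      status="UNVERIFIED"
    else
      status="ok"
    fi
    printf '%s\t%s\t%s\n' "$(basename "$f")" "$n" "$status"
  done
}

# Usage (the path is illustrative):
#   check_fasta_dir /path/to/downloaded_genomes
```

A file reporting zero sequences, or the UNVERIFIED status, is a candidate for re-download or exclusion before it is used as pipeline input.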

To run specific tasks, an additional parameter can be passed to the run.sh script, for example ./run.sh "$(pwd)" "--single-task clustal-align" or ./run.sh "$(pwd)" "--until clustal-align".
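The invocations above can be collected in a small wrapper, sketched here. The names run_pipeline, WORKING_DIR and DRY_RUN are hypothetical and introduced only for illustration. By default the wrapper only prints the command it would run, so it can be tried without the pipeline installed. Note that the extra parameters are passed to run.sh as one quoted string, as in the examples above.

```shell
# Hypothetical wrapper around run.sh; WORKING_DIR and DRY_RUN are
# illustrative names, not options defined by the pipeline.
WORKING_DIR="${WORKING_DIR:-$(pwd)}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the command, 0 = execute it

run_pipeline() {
  # $1 (optional): extra parameters, passed to run.sh as ONE quoted string
  if [ "$DRY_RUN" = "1" ]; then
    if [ -n "${1:-}" ]; then
      printf './run.sh "%s" "%s"\n' "$WORKING_DIR" "$1"
    else
      printf './run.sh "%s"\n' "$WORKING_DIR"
    fi
  else
    # run.sh is expected to live in the working directory
    if [ -n "${1:-}" ]; then
      (cd "$WORKING_DIR" && ./run.sh "$WORKING_DIR" "$1")
    else
      (cd "$WORKING_DIR" && ./run.sh "$WORKING_DIR")
    fi
  fi
}

run_pipeline                                 # entire pipeline
run_pipeline "--single-task clustal-align"   # only the clustal-align task
run_pipeline "--until clustal-align"         # stop at the clustal-align task
```

Setting DRY_RUN=0 before calling run_pipeline executes the commands instead of printing them.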

Applying the protocol to other case studies

Applying the protocol to other case studies is easy; you only need to:

Put the protein FASTA files at input/rename-header-multipart_1/.
Edit the params/*.sedaParams files if you need to tune the operation parameters for your case study. These files can also be created and exported using the SEDA GUI.
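The two steps above can be sketched as a small shell helper. The function name stage_inputs and the .fasta/.fa file extensions are assumptions made for illustration; adjust the patterns to match how your input files are named.

```shell
# stage_inputs SRC_DIR WORKING_DIR
# Hypothetical helper: copies FASTA files into the pipeline input directory
# and lists the parameter files that may need tuning.
stage_inputs() {
  src="$1"
  wd="$2"
  dest="$wd/input/rename-header-multipart_1"
  mkdir -p "$dest" || return 1

  count=0
  # Assumption: input files end in .fasta or .fa
  for f in "$src"/*.fasta "$src"/*.fa; do
    [ -e "$f" ] || continue          # skip patterns that matched nothing
    cp "$f" "$dest/" || return 1
    count=$((count + 1))
  done
  echo "Copied $count FASTA file(s) to $dest"

  # Show the SEDA parameter files to review for the new case study
  ls "$wd"/params/*.sedaParams 2>/dev/null \
    || echo "No .sedaParams files found under $wd/params"
}

# Usage (paths are illustrative):
#   stage_inputs /path/to/my_fasta_files /path/to/working_dir
```

After staging the files and reviewing the listed parameter files, the pipeline can be run from the working directory as described in the quick-start section.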

Contributors

Made with contrib.rocks.
