Releases · Plant-Food-Research-Open/genepal · GitHub

06 Oct 21:33

GallVp

Version 0.4.0 Pre-release

Pre-release

What's Changed

`Added`

Added orthofinder_annotations param
Added FASTA_GFF_ORTHOFINDER sub-workflow
Added evaluation by BUSCO #41
Included common tax ids for eggnog mapper #27
Implemented hierarchical naming scheme: geneI.tJ, geneI.tJ.exonK, geneI.tJ.cdsK #19, #34
Now sorting list of bam and list of fastq before cat to avoid resume cache misses
Allowed BAM files for RNA evidence #3
Added GXF_FASTA_AGAT_SPADDINTRONS_SPEXTRACTSEQUENCES sub-workflow for splice type statistics #11
Changed orthofinder_annotations from FASTA/GFF to protein FASTA #43
Added param enforce_full_intron_support to turn on/off strict model purging by TSEBRA #21
Added param filter_liftoff_by_hints to evaluate liftoff models with TSEBRA to make sure they have the same level of evidence as BRAKER #28
Added a script to automatically check module version updates
Reduced BRAKER3 threads to 8 #55
Now the final annotations are stored in the annotations folder #53
Now a single fasta file can be directly specified for protein_evidence
eggnogmapper_db_dir is not a required parameter anymore
eggnogmapper_tax_scope is now set to 1 (root div) by default
Added a test profile based on public data
Added parameter add_attrs_to_proteins_fasta to enable/disable addition of decoded gff attributes to proteins fasta #58
Added a check for input assemblies. If an assembly is smaller than 1 MB (or 300KB in zipped format), the pipeline errors out before starting the downstream processes #47
Now REPEATMASKER GFF output is saved via CUSTOM_RMOUTTOGFF3 #54
Added benchmark column to the input sheet and used GFFCOMPARE to perform benchmarking #63
Added SEQKIT_RMDUP to detect duplicate sequence and wrap the fasta to 80 characters
Updated parameter section labels for annotation and post-annotation filtering #64
Updated modules and sub-workflows

`Fixed`

Fixed BRAKER spellings #36
Fixed liftoff failure when lifting off from a single reference #40
Added versions from GFF_STORE sub-workflows #33

`Dependencies`

NextFlow!>=23.04.4
nf-validation=1.1.3

`Deprecated`

Renamed external_protein_fastas param to protein_evidence
Renamed fastq param to rna_evidence
Renamed braker_allow_isoforms param to allow_isoforms
Moved liftoffID from gene level to mRNA/transcript level
Moved version_check.sh to .github/version_checks.sh
Removed dependency on https://github.com/kherronism/nf-modules.git for BRAKER3 and REPEATMASKER modules which are now installed from https://github.com/GallVp/nxf-components.git
Removed dependency on https://github.com/PlantandFoodResearch/nxf-modules.git
Now the final annotations are not stored in the final folder
Now BRAKER3 outputs are not saved by default #53 and saved under etc folder when enabled
Removed local profile. Local executor is the default when no executor is specified. Therefore, the local profile was not needed.
Removed CUSTOM_DUMPSOFTWAREVERSIONS

Full Changelog: 0.3.3...0.4.0

Assets 2

20 Jun 05:03

GallVp

Version 0.3.3 Pre-release

Pre-release

What's Changed

Full Changelog: 0.3.2...0.3.3

`Added`

Added a stub test to evaluate the case where an assembly is soft masked but has no annotations

`Fixed`

Fixed a bug where is_masked was ignored by the pipeline
Fixed a bug in param validation which allowed specification of braker_hints without braker_gff3

`Dependencies`

NextFlow!>=23.04.4
nf-validation=1.1.3

`Deprecated`

Assets 2

13 May 02:35

GallVp

Version 0.3.2 Pre-release

Pre-release

What's Changed

Now removing comments from FASTA file before feeding it to BRAKER by @GallVp in #32

Full Changelog: 0.3.1...0.3.2

Contributors

GallVp

Assets 2

09 May 23:40

GallVp

Version 0.3.1 Pre-release

Pre-release

What's Changed

Increased time limit for longer tasks by @GallVp in #30

Full Changelog: 0.3.0...0.3.1

Contributors

GallVp

Assets 2

30 Apr 01:19

GallVp

Version 0.3.0 Pre-release

Pre-release

What's Changed

Commit history: v0.2...0.3.0

`Added`

Added changelog and semantic versioning
Changed license to MIT
Updated .editorconfig
Moved .literature to test/ branch
Renamed pangene_local to local_pangene
Renamed pangene_pfr to pfr_pangene
Added versioning checking
Updated github workflow to use pre-commit instead of prettier and editorconfig check
Added central singularity cache dir for pfr config
Added SORTMERNA_INDEX before SORTMERNA
Fixed sample contamination bug introduced by file.simpleName
Now using empty files for stub testing in CI
Now BRAKER can be skipped by including BRAKER outputs from previous runs in the target_assemblies param
Added gffcompare to merge liftoff annotations
Renamed samplesheet param to fastq
Now using assemblysheet in combination with nf-validation for assembly input
Added nextflow_schema.json
Now using nf-validation to validate fastqsheet provided by params.fastq
Moved manifest.config and reporting_defaults.config content to nextflow.config
Now using a txt file for params.external_protein_fastas
Now using nf-validation for params.liftoff_annotations
Now using nf-validation for all the parameters
Added PURGE_BREAKER_MODELS sub-workflow
Added GFF_EGGNOGMAPPER sub-workflow
Now using a custom version of GFFREAD which supports meta and fasta
Now using TSEBRA to purge models which do not have full intron support from BRAKER hints
Added params eggnogmapper_evalue and eggnogmapper_pident
Added PURGE_NOHIT_BRAKER_MODELS sub-workflow
Now merging BRAKER and liftoff models before running eggnogmapper
Added GFF_MERGE_CLEANUP sub-workflow
Now using description field to store notes and textual annotations in the gff files
Now using mRNA in place of transcript in gff files
Now eggnogmapper_purge_nohits is set to false by default
Added GFF_STORE sub workflow
external_protein_fastas and eggnogmapper_db_dir are not mandatory parameters
Added contributors
Add a document for the pipeline parameters
Updated pfr_pangene and pfr/profile.config
Now using local tests/stub files for GitHub CI
Now removing iso-forms left by TSEBRA using AGAT_SPFILTERFEATUREFROMKILLLIST
Added pyproject.toml
Now using PFAMs from eggnog if description is '-'

`Fixed`

Removed liftoff models with valid_ORF=False
Updated license text to include 'Copyright (c) 2024 The New Zealand Institute for Plant and Food Research Limited'

`Dependencies`

NextFlow!>=23.04.4
nf-validation=1.1.3

`Deprecated`

Assets 2