Since a couple of years, many shortcomings of our assemblies became clearer. Once of the biggest issues concerns spatial information, mainly caused by the huge number of scaffolds and no idea about their actual position along chromosomes. Because of that we'll try to use a chromosome-level assembly to reorder our assemblies accordingly. Given that we have no clue about accuracy, we will try three different mappers: minimap2, mummer and lastal.
- Unzip JGI assembly with
gzip -d assemblies/Alyrata_384_v1_only1-8.fa.gz - Add A. halleri and A. lyrata assemblies in
assembliesfolder. You can download them here and here. Please rename the two assemblies toAhal_v2_2.faandAlyr_v2_2.fa. If you prefer other names, please modify all scripts inscripts/accordingly - Install Conda or miniconda3 and run
conda create env -f envs/mappers.yaml -n mappersto create a Conda environment with (almost) everything we need. - Clone the
RagTaggithub repository:git clone https://github.com/malonge/RagTag.git - Activate
mappersviaconda activate mappersand installRagTagviapython setup.py install(within theRagTagfolder)
First activate mappers (if you didn't already) via conda activate mappers, then to run all analyses please stay on the RemappingAssemblies/ folder and use:
sh scripts/minimap.sh
sh scripts/mummer.sh
sh scripts/lastal.sh
In the results folder, for each method and each assembly, there is a txt file with different scores for each contig of our assembly. Another agp file provides information about the inferred location in the JGI assembly.