Subject: Using QuarTeT GapFiller with Unplaced Scaffolds and Flanks from a related species. #42

Open
Isoris opened this issue Oct 19, 2024 · 5 comments

Comments

@Isoris

Isoris commented Oct 19, 2024

Hello,

I would like to use QuarTeT GapFiller to fill gaps in my Clarias macrocephalus assembly chromosomes using unplaced scaffolds. I have mapped these unplaced scaffolds to a closely related species, Clarias fuscus, and obtained 50 Mb of sequence with high accuracy. However, QuarTeT AssemblyMapper outputs the results with 100 bp gaps.

My aim is to use the contigs along with the flanking regions from Clarias fuscus, making them chimeric. This way, I can run QuarTeT GapFiller with the initial chromosomes from C. macrocephalus and these modified scaffolds that include flanks from C. fuscus. The idea is to provide the necessary homology for effective gap filling, as the original assembly lacks sufficient flank homology for direct gap filling.

Would it be possible to get the output with the reference flanks from AssemblyMapper to achieve this?

Thank you!

@Echoring
Collaborator

I'm not sure I got your point. Do you want AssemblyMapper, when it places two adjacent contigs, to not only connect them with 100 Ns but also merge in some flanking sequence from the reference? It seems hard to define the border in that case. If you want a longer flanking sequence that reaches the region homologous to the reference, could that be achieved by using a larger -f option?

@Isoris
Author

Isoris commented Oct 20, 2024 via email

@Echoring
Collaborator

[Image: QQ20241023-124238]
I'm still not sure. In this case (the alignment breaks a little way from the gap), do you want the red circle or the pink circle?

@Isoris
Author

Isoris commented Oct 23, 2024

Hello,

It may seem strange, but I would like to be able to specify a flanking length in the reference, extending outward from the alignment.
For instance, if contig A aligns to the reference at positions 6,000 to 10,000 and we specify an option such as -fo (flank outward) = 3000, the result would be a chimeric sequence: 3,000 bases from the reference at the 5' end + the aligned contig + 3,000 bases from the reference at the 3' end.

[Image]

This way, we can use the chimera to fill gaps in genome assemblies. Contigs are sometimes cut where there are repeats, but in a closely related species the repeat may already have been resolved, for example if that assembly was built with ultra-long reads and happened to include a read spanning the full region.
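
To make the request concrete, here is a minimal Python sketch of the chimera described above. It is only an illustration, not AssemblyMapper code: the function name `build_chimera` and its parameters are hypothetical, and it assumes 1-based inclusive reference coordinates and a contig that aligns on the forward strand.

```python
def build_chimera(ref_seq: str, contig_seq: str,
                  ref_start: int, ref_end: int, flank: int) -> str:
    """Return 5' reference flank + contig + 3' reference flank.

    Hypothetical helper. ref_start and ref_end are the 1-based, inclusive
    reference positions where the contig aligns (forward strand assumed).
    Flanks are clamped at the ends of the reference sequence.
    """
    if not (1 <= ref_start <= ref_end <= len(ref_seq)):
        raise ValueError("alignment coordinates fall outside the reference")
    if flank < 0:
        raise ValueError("flank length must be non-negative")

    # Reference bases immediately upstream of the alignment start.
    left_begin = max(0, ref_start - 1 - flank)
    left_flank = ref_seq[left_begin:ref_start - 1]

    # Reference bases immediately downstream of the alignment end.
    right_flank = ref_seq[ref_end:ref_end + flank]

    return left_flank + contig_seq + right_flank


# Usage matching the example above: a contig aligned at reference
# positions 6,000 to 10,000 with an outward flank of 3,000 bases.
# The sequences here are placeholders, not real data.
reference = "A" * 5999 + "C" * 4001 + "G" * 5000
contig = "T" * 4001
chimera = build_chimera(reference, contig, 6000, 10000, 3000)
```

With these inputs the 5' flank covers reference positions 3,000 to 5,999 and the 3' flank covers positions 10,001 to 13,000. A contig aligned on the reverse strand would need to be reverse-complemented first, which this sketch does not handle.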

@Echoring
Collaborator

I've made a quick attempt on the branch extract-ref-flanks.
I'm not sure it works properly. Would you like to try it?
