
Adding annotation on a gff file results in "uniqueID offset-1968619510" #1556

Open
@Jungal10

Description

I have a gff file that contains some info on the annotation of my genome. E.g:

##gff-version 3
##annot-version v2.1
##species Amaranthus hypochondriacus
Contig115|quiver|       phytozomev12    gene    5405    8135    .       -       .       ID=AH023784.v2.1;Name=AH023784
Contig115|quiver|       phytozomev12    mRNA    5405    8135    .       -       .       ID=AH023784-RA.v2.1;Name=AH023784-RA;pacid=38173840;longest=1;Parent=AH023784.v2.1
Contig115|quiver|       phytozomev12    CDS     8115    8135    .       -       0       ID=AH023784-RA.v2.1.CDS.1;Parent=AH023784-RA.v2.1;pacid=38173840

I have another file with extra information on the annotation, which I parsed and pasted into the 9th column of the gff file. I followed the same pattern as the existing attributes: attribute name, equals sign, value, and semicolon, e.g. "Panther=PTHR10668,PTHR10668:SF80;"

The new gff looks like this:

##species Amaranthus hypochondriacus    V2      V3      V4      V5      V6      V7      V8      V9
Contig177|quiver|       phytozomev12    CDS     231594  231692  .       +       0       ID=AH023665-RA.v2.1.CDS.7;Parent=AH023665-RA.v2.1;pacid=38166878 locus_name=AH018775;transcriptName=AH018775-RA;Pfam=PF05199,PF00732;Panther=PTHR10668,PTHR10668:SF80;KOG=;EC=1.1.3.20;KO=K17756;GO=GO:0055114,GO:0016614,GO:0050660;Best-hit-arabi-name=AT4G19380.1;arabi-symbol=;arabi-defline=Long-chain fatty alcohol dehydrogenase family protein locus_name=AH018775;transcriptName=AH018775-RA;Pfam=PF05199,PF00732;Panther=PTHR10668,PTHR10668:SF80;KOG=;EC=1.1.3.20;KO=K17756;GO=GO:0055114,GO:0016614,GO:0050660;Best-hit-arabi-name=AT4G19380.1;arabi-symbol=;arabi-defline=Long-chain fatty alcohol dehydrogenase family protein
Contig177|quiver|       phytozomev12    three_prime_UTR 231693  231975  .       +       .       ID=AH023665-RA.v2.1.three_prime_UTR.1;Parent=AH023665-RA.v2.1;pacid=38166878 locus_name=AH018775;transcriptName=AH018775-RA;Pfam=PF05199,PF00732;Panther=PTHR10668,PTHR10668:SF80;KOG=;EC=1.1.3.20;KO=K17756;GO=GO:0055114,GO:0016614,GO:0050660;Best-hit-arabi-name=AT4G19380.1;arabi-symbol=;arabi-defline=Long-chain fatty alcohol dehydrogenase family protein locus_name=AH018775;transcriptName=AH018775-RA;Pfam=PF05199,PF00732;Panther=PTHR10668,PTHR10668:SF80;KOG=;EC=1.1.3.20;KO=K17756;GO=GO:0055114,GO:0016614,GO:0050660;Best-hit-arabi-name=AT4G19380.1;arabi-symbol=;arabi-defline=Long-chain fatty alcohol dehydrogenase family protein
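
In the two feature lines above, the original attributes and the added ones are separated by a space rather than a semicolon, and the added block appears twice. For comparison, here is a minimal Python sketch of appending extra attributes to column 9 with semicolons between them. The function names `format_attributes` and `append_attributes` and the dictionary input are hypothetical, the choice to drop empty values such as `KOG=` is an assumption, and whether this formatting is what causes the display problem is not established here.

```python
from urllib.parse import quote


def format_attributes(extra):
    """Build a 'key=value;key=value' string from a dict (hypothetical helper)."""
    parts = []
    for key, value in extra.items():
        if value == "":
            # Assumption: skip attributes that have no value (e.g. "KOG=").
            continue
        # Percent-encode characters that are reserved inside a value
        # (';', '=', '&', '%', tab), but keep spaces, colons, and the
        # commas that separate multiple values.
        parts.append(f"{key}={quote(value, safe=' ,:/')}")
    return ";".join(parts)


def append_attributes(gff_line, extra):
    """Append extra attributes to column 9 of one GFF3 feature line."""
    line = gff_line.rstrip("\n")
    # Leave directives, comments, and blank lines untouched.
    if line.startswith("#") or not line.strip():
        return line
    cols = line.split("\t")
    if len(cols) != 9:
        raise ValueError(f"expected 9 tab-separated columns, got {len(cols)}")
    new = format_attributes(extra)
    if new:
        # Join with exactly one semicolon, whether or not the
        # existing attribute string already ends with one.
        cols[8] = cols[8].rstrip(";") + ";" + new
    return "\t".join(cols)


example = (
    "Contig177|quiver|\tphytozomev12\tCDS\t231594\t231692\t.\t+\t0\t"
    "ID=AH023665-RA.v2.1.CDS.7;Parent=AH023665-RA.v2.1;pacid=38166878"
)
extra = {
    "Pfam": "PF05199,PF00732",
    "KOG": "",
    "arabi-defline": "Long-chain fatty alcohol dehydrogenase family protein",
}
print(append_attributes(example, extra))
```

Because lines starting with `#` are returned unchanged, a directive such as `##species Amaranthus hypochondriacus` would not pick up extra columns when a whole file is processed line by line with this sketch.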

When I tried to use bin/flatfile-to-json.pl I got several error messages, so I opted to use the indexed file format instead to add the annotation.

My problem now is that instead of seeing all the attributes that I added to the 9th column of the gff file, I only see a single entry stating "uniqueID offset-1968619510" <img width="276" alt="Screenshot 2020-11-12 at 23 38 15" src="https://user-images.githubusercontent.com/49553532/99004909-25f6d480-2540-11eb-9354-8e02840fdee4.png">

Can you help me with this issue, please? Thank you
