Skip to content

N3 dump is very slow since rdflib >= 5.0.0 #1752

Open
@chabotsi

Description

Hi,

We are using rdflib to generate rdf dump in various format (xml, nt, n3) and we have noticed a huuuuge slow down with rdflib >= 5.0.0 when n3 serializing.

For the purpose of this issue, we have limited the export, and we have the following results. It took less than 12s with rdflib 4.2.2 and more than 10 minutes (!) with rdflib 5.0.0 (same results with rdflib 6).

The profiling suggests that with rdflib 5.0.0 the compute_qname() function costs a lot. On our small example, the startswith() method has been called more than 2 billion times !

The corresponding code may have been introduced by: #649

Any help or suggestion would be appreciated !
Thank you,


One can find the profiling results below.

  • with rdflib 4.2.2
    image

  • with rdflib 5.0.0
    image

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions