perf: improve n-quad parser using map-based lookups #84

michaeladler · 2025-07-01T13:48:35Z

Summary

Replace the O(n) per-triple equality check with an O(1) map-based set for each graph, improving parsing performance for large datasets (>1000 entries).

To make Quad hashable, all structs implementing the Node interface are now value-based (e.g., IRI implements the Node interface instead of *IRI).

Note: The Equal method from the Node interface is now unused and could be removed. However, I've kept it to maintain compatibility with existing downstream code.

Basic Example

No change in existing behavior.

Motivation

The motivation is (yet again) performance. Benchmark results from my machine show:

For 1000 objects: new implementation is 1.8s faster
For 2000 objects: 6.3s faster

Checks

Passes make test

Replace the O(n) per-triple equality check with an O(1) map-based set for each graph, improving parsing performance for large (>1000) datasets. To make Quad hashable, all structs implementing the Node interface are now value-based (e.g., IRI implements the Node interface instead of *IRI). Signed-off-by: Michael Adler <michael.adler@siemens.com>

kazarena

Looks good, thank you.

kazarena approved these changes Jul 14, 2025

View reviewed changes

kazarena merged commit 3fbc0ad into piprate:master Jul 14, 2025
6 checks passed

michaeladler deleted the perf/improve-n-quads-2 branch July 15, 2025 06:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: improve n-quad parser using map-based lookups #84

perf: improve n-quad parser using map-based lookups #84

Uh oh!

michaeladler commented Jul 1, 2025

Uh oh!

kazarena left a comment

Uh oh!

Uh oh!

Uh oh!

perf: improve n-quad parser using map-based lookups #84

perf: improve n-quad parser using map-based lookups #84

Uh oh!

Conversation

michaeladler commented Jul 1, 2025

Summary

Basic Example

Motivation

Checks

Uh oh!

kazarena left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!