Currently there are duplicate benchmark names, e.g. in set-benchmarks-set (and maybe in others as well).
This messes with some tooling which assumes names are unique per run.
One example:
union-disj_nn,6.1186233632740655e-6,...
union-disj_nn,6.147778407898306e-6,...
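
To check whether other benchmark suites are affected, something like the following could be run over the CSV output. This is only a rough sketch: it assumes the benchmark name is always the first CSV field, and `duplicate_names` is a hypothetical helper name, not part of any existing tooling. The third sample row is made up for illustration.

```python
import csv
from collections import Counter


def duplicate_names(lines):
    """Return the benchmark names that occur more than once, in first-seen order.

    Assumes each non-empty CSV row starts with the benchmark name.
    """
    rows = csv.reader(lines)
    counts = Counter(row[0] for row in rows if row)
    return [name for name, n in counts.items() if n > 1]


# The two rows from the report above, plus one hypothetical unique row.
sample = [
    "union-disj_nn,6.1186233632740655e-6,...",
    "union-disj_nn,6.147778407898306e-6,...",
    "some-other-benchmark,1.0e-6,...",  # hypothetical, for illustration only
]

print(duplicate_names(sample))
```

An open file object can be passed in place of `sample`, since `csv.reader` accepts any iterable of lines.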