add `lookuptables` for writing efficient lookup tables; make symbolRank, symbolName `O(1)` #18044

timotheecour · 2021-05-19T02:27:44Z

tiny self-contained module that defines efficient lookup tables; private for now
inspired by pseudorandom probing for hash collision #13418 (which was IMO wrongly reverted but that's a separate topic, see hashing collision: improve performance using linear/pseudorandom probing mix #13440 for performance benchmarks)
see benchmark in tests/benchmarks/tlookuptables.nim
makes the cost of symbolRank, symbolName O(1) (follows jsonutils: add customization for toJson via ToJsonOptions; generalize symbolName; add symbolRank #18029)

EDIT

it works but don't review yet, i'm improving the API

Varriount · 2021-05-19T09:48:56Z

Why does this need to be added directly to the standard library? Or alternately, why shouldn't this be an external package?
What differences does this have, compared to the other table types?

timotheecour · 2021-05-19T16:27:07Z

Why does this need to be added directly to the standard library? Or alternately, why shouldn't this be an external package?

because std/enumutils (itself used by std/jsonutils) depends on it, turning an O(N) API into O(1), and in future PRs other stdlib modules will depend on it. Note that the module is private for now (std/private/lookuptables).

What differences does this have, compared to the other table types?

it's 2 to 3 times faster than equivalent code using tables (see benchmarks in this PR)
it's a self-contained module with no dependency. So instead of depending on tables (which adds these recursive import dependencies: tables,hashes,math,bitops,fenv,algorithm, as shown by --processing:filenames), you only depend on lookuptables

Clyybber · 2021-05-19T16:52:31Z

There's a much simpler solution to symbolName if we remove symbolRank introduced in #18029 again:
Simply generate an enum type similar to T without the string representation overrides and then convert to that enum and call $ on it.

timotheecour · 2021-05-19T17:41:28Z

There's a much simpler solution to symbolName

the problem is not with ordinal enums (which don't need symbolRank), the problem is holey enums, for which your approach (which isn't really simpler btw) just cannot work efficiently (it'd be O(N) instead of O(1) as in this PR). And symbolRank (as well as lookuptables) is useful regardless of symbolName.

Clyybber · 2021-05-20T00:49:42Z

Please reread my proposal. It would work with holey enums and would be O(1) because afaik $ for enums is O(1).

timotheecour · 2021-05-20T00:53:00Z

Please reread my proposal. It would work with holey enums and would be O(1) because afaik $ for enums is O(1).

please read implementation of reprEnum

# ugh we need a slow linear search:

Clyybber · 2021-05-20T10:23:37Z

# ugh we need a slow linear search:

I see, but then reprEnum should be improved instead of only improving symbolName/symbolRank.

Varriount · 2021-05-21T08:05:30Z

So why can't these improvements be applied to the tables module?

arnetheduck · 2021-06-25T07:34:03Z

because X depends on it,

The solution here is to move X out of the library as well - not introduce Y which soon will be used to motivate including A, B and C as well.

c-blake · 2021-06-26T11:14:20Z

If I literally just take his benchmark and impl and compile with -d:nimIntHash1 then I get that Table is faster than this proposal. 88% the time of this new table in genData1 and 66% the time in genData2 on an i7-6700k, nim-devel, gcc-11. Just one cpu/backend/person. A more impulsive person might say "Table is 1.5X faster!"

Even if this lookuptables work did help (and my experiment above suggests it may well hurt), how many enum values are there usually? Is converting to strings in a fast & furious mode even common outside some json microbenchmark? Why is this an optimization so important that it needs a new table? In general, having the whole stdlib captive to any one person's json microbenchmarks seems like a truly terrible plan.

stale · 2022-06-28T00:43:14Z

This pull request has been automatically marked as stale because it has not had recent activity. If you think it is still a valid PR, please rebase it on the latest devel; otherwise it will be closed. Thank you for your contributions.

timotheecour changed the title ~~add lookuptables for writing efficient lookup tables; make symbolRank, symbolName O(1)~~ add lookuptables for writing efficient lookup tables; make symbolRank, symbolName O(1) May 19, 2021

timotheecour force-pushed the pr_lookuptable branch 2 times, most recently from f1c1594 to f421881 Compare May 19, 2021 02:34

add efficient std/private/lookuptables

ada3678

timotheecour force-pushed the pr_lookuptable branch from f421881 to ada3678 Compare May 19, 2021 02:44

This was referenced Jul 6, 2021

typetraits: add rangeof(T), a shortcut for low(T)..high(T) #15232

Closed

warn on enum conversions #18430

Closed

stale bot added the stale Staled PR/issues; remove the label after fixing them label Jun 28, 2022

stale bot closed this Jul 31, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

add `lookuptables` for writing efficient lookup tables; make symbolRank, symbolName `O(1)` #18044

add `lookuptables` for writing efficient lookup tables; make symbolRank, symbolName `O(1)` #18044

Uh oh!

timotheecour commented May 19, 2021 •

edited

Loading

Uh oh!

Varriount commented May 19, 2021

Uh oh!

timotheecour commented May 19, 2021 •

edited

Loading

Uh oh!

Clyybber commented May 19, 2021

Uh oh!

timotheecour commented May 19, 2021 •

edited

Loading

Uh oh!

Clyybber commented May 20, 2021 •

edited

Loading

Uh oh!

timotheecour commented May 20, 2021 •

edited

Loading

Uh oh!

Clyybber commented May 20, 2021

Uh oh!

Varriount commented May 21, 2021

Uh oh!

arnetheduck commented Jun 25, 2021

Uh oh!

c-blake commented Jun 26, 2021

Uh oh!

stale bot commented Jun 28, 2022

Uh oh!

Uh oh!

Uh oh!

add lookuptables for writing efficient lookup tables; make symbolRank, symbolName O(1) #18044

add lookuptables for writing efficient lookup tables; make symbolRank, symbolName O(1) #18044

Uh oh!

Conversation

timotheecour commented May 19, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

EDIT

Uh oh!

Varriount commented May 19, 2021

Uh oh!

timotheecour commented May 19, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Clyybber commented May 19, 2021

Uh oh!

timotheecour commented May 19, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Clyybber commented May 20, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

timotheecour commented May 20, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Clyybber commented May 20, 2021

Uh oh!

Varriount commented May 21, 2021

Uh oh!

arnetheduck commented Jun 25, 2021

Uh oh!

c-blake commented Jun 26, 2021

Uh oh!

stale bot commented Jun 28, 2022

Uh oh!

Uh oh!

add `lookuptables` for writing efficient lookup tables; make symbolRank, symbolName `O(1)` #18044

add `lookuptables` for writing efficient lookup tables; make symbolRank, symbolName `O(1)` #18044

timotheecour commented May 19, 2021 •

edited

Loading

timotheecour commented May 19, 2021 •

edited

Loading

timotheecour commented May 19, 2021 •

edited

Loading

Clyybber commented May 20, 2021 •

edited

Loading

timotheecour commented May 20, 2021 •

edited

Loading