Skip to content

FLOAT_LITERAL_CACHE is dangerous and a source of leaks #1993

Open
@fingolfin

Description

When coding GAP functions that contain float constants, these constants are coded either as "lazy" or "eager".

Eager constants are using a specific type of floats (e.g. the built-in machine floats). These are coded by evaluating the constant into float object, which is stored in EAGER_FLOAT_LITERAL_CACHE, and then the index where it was stored is coded into the function. Example:

gap> f:=x->42.0_;;
gap> EAGER_FLOAT_LITERAL_CACHE;
[ 42. ]

Lazy constants are re-evaluated if the user sets a different default floats type using SetFloats. These, too, are assigned a unique global index; when the float constant expression is evaluated, the result is cached in FLOAT_LITERAL_CACHE; this cache is emptied each time SetFloats is called. Example:

gap> g:=x->23.0;;
gap> FLOAT_LITERAL_CACHE;
Error, Variable: 'FLOAT_LITERAL_CACHE' must have a value
not in any function at *stdin*:2
gap> g(1);
23.
gap> FLOAT_LITERAL_CACHE;
[ ,,,, 23. ]

There are several problems with this: First off, exposing these caches allows for crashes and inconsistent computations. While exposing them may have been helpful when developing this feature, it provides a major risk. E.g. we can trivially crash GAP:

gap> EAGER_FLOAT_LITERAL_CACHE:="house";; f:=x->1.0_;;
Assertion failed: (IS_PLIST(EAGER_FLOAT_LITERAL_CACHE)), function CodeEagerFloatExpr, file src/code.c, line 1907.
Abort trap: 6

Moreover, in HPC-GAP, EAGER_FLOAT_LITERAL_CACHE is an atomic list and not immutable, so wonderful things like this are possible:

gap> f:=x->42.0_;;
gap> EAGER_FLOAT_LITERAL_CACHE;
[ 42. ]
gap> EAGER_FLOAT_LITERAL_CACHE[1]:=23.;
23.
gap> Display(f);
function ( x )
    return 42.0_;
end
gap> f(1);
23.

The same is possible with FLOAT_LITERAL_CACHE even in regular GAP, although there we could at least in principle mark it as immutable.

Another problem is that GAP functions are objects like any other, and hence can be garbage collected. But they contain no reference to the lazy or eager floats they contain. Hence the need for us to use the caches. But that also means that if a function is GCed, then any float constant it referenced will stay around for ever. I.e. we have a leak.

The first part of this issue can be resolved by not exposing these caches to the user. That's rather easy to do. To solve the second issue, here are some thoughts:

For the eager float constants, we could do the following: first, add a new entry to struct BodyHeader, say Obj floatcache. Then when coding a body (T_BODY), any eager floats get assigned to that cache. Drawback is that this causes some overhead for every function which has a body, even if they don't use floats.

For the lazy ones, we need a way to store them in a way that still makes them flushable. So perhaps FLOAT_LITERAL_CACHE could become an (atomic) record (which we abuse as a "sparse list" here). Then we add a custom free function for T_BODY, which iterates over the syntax tree stored in the T_BODY, and for each float cache index i in it, it unbinds FLOAT_LITERAL_CACHE.(i).
(This is somewhat related to issue #1889, I guess). Or, to have it a bit simpler, we could also store the indices for the lazy constants separately, perhaps even in the same floatcache. I.e. each floatcache entry will either be an eagerly evaluated floatean, or else a a positive immediate integer which is an "index" into FLOAT_LITERAL_CACHE.

Metadata

Assignees

No one assigned

    Labels

    kind: bugIssues describing general bugs, and PRs fixing themtopic: kernel

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions