FLOAT_LITERAL_CACHE is dangerous and a source of leaks

When coding GAP functions that contain float constants, these constants are coded either as "lazy" or "eager".

Eager constants are using a specific type of floats (e.g. the built-in machine floats). These are coded by evaluating the constant into float object, which is stored in `EAGER_FLOAT_LITERAL_CACHE`, and then the index where it was stored is coded into the function. Example:
```
gap> f:=x->42.0_;;
gap> EAGER_FLOAT_LITERAL_CACHE;
[ 42. ]
```
Lazy constants are re-evaluated if the user sets a different default floats type using `SetFloats`. These, too, are assigned a unique global index; when the float constant expression is evaluated, the result is cached in `FLOAT_LITERAL_CACHE`; this cache is emptied each time `SetFloats` is called. Example:
```
gap> g:=x->23.0;;
gap> FLOAT_LITERAL_CACHE;
Error, Variable: 'FLOAT_LITERAL_CACHE' must have a value
not in any function at *stdin*:2
gap> g(1);
23.
gap> FLOAT_LITERAL_CACHE;
[ ,,,, 23. ]
```

There are several problems with this: First off, exposing these caches allows for crashes and inconsistent computations. While exposing them may have been helpful when developing this feature, it provides a major risk. E.g. we can trivially crash GAP:
```
gap> EAGER_FLOAT_LITERAL_CACHE:="house";; f:=x->1.0_;;
Assertion failed: (IS_PLIST(EAGER_FLOAT_LITERAL_CACHE)), function CodeEagerFloatExpr, file src/code.c, line 1907.
Abort trap: 6
```
Moreover, in HPC-GAP, `EAGER_FLOAT_LITERAL_CACHE` is an atomic list and *not* immutable, so wonderful things like this are possible:
```
gap> f:=x->42.0_;;
gap> EAGER_FLOAT_LITERAL_CACHE;
[ 42. ]
gap> EAGER_FLOAT_LITERAL_CACHE[1]:=23.;
23.
gap> Display(f);
function ( x )
    return 42.0_;
end
gap> f(1);
23.
```
The same is possible with `FLOAT_LITERAL_CACHE` even in regular GAP, although there we could at least in principle mark it as immutable.


Another problem is that GAP functions are objects like any other, and hence can be garbage collected. But they contain no reference to the lazy or eager floats they contain. Hence the need for us to use the caches. But that also means that if a function is GCed, then any float constant it referenced will stay around for ever. I.e. we have a leak.

The first part of this issue can be resolved by not exposing these caches to the user. That's rather easy to do. To solve the second issue, here are some thoughts:

For the eager float constants, we could do the following: first, add a new entry to `struct BodyHeader`, say `Obj floatcache`. Then when coding a body (`T_BODY`), any eager floats get assigned to that cache. Drawback is that this causes some overhead for *every* function which has a body, even if they don't use floats. 

For the lazy ones, we need a way to store them in a way that still makes them flushable. So perhaps `FLOAT_LITERAL_CACHE` could become an (atomic) record (which we abuse as a "sparse list" here). Then we add a custom free function for `T_BODY`, which iterates over the syntax tree stored in the `T_BODY`, and for each float cache index `i` in it, it unbinds `FLOAT_LITERAL_CACHE.(i)`.
(This is somewhat related to issue #1889, I guess). Or, to have it a bit simpler, we could also store the indices for the lazy constants separately, perhaps even in the same `floatcache`. I.e. each `floatcache` entry will either be an eagerly evaluated floatean, or else a a positive immediate integer which is an "index" into `FLOAT_LITERAL_CACHE`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FLOAT_LITERAL_CACHE is dangerous and a source of leaks #1993

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development