I am doing a few tests with sample_get_x towards vectorising it, see https://github.com/madgraph5/madgraph4gpu/issues/963 Apart from the issue reported in #968, I think I identified another two trivial but useful improvements in sample_get_x - One, some minor streamlining of xbin_min and xbin_max calculations seems to be useful - Two, I checked that in a case like CMS DY+3j #943, the function is most often called with xmin=0 or xmax=1, and it is possible to cache these values This is WIP to be confirmed.