Given we do a run-time compile of OpenCL kernels, "how hard can it be" to do things like 1. Dynamic-compiler format, that whips up a semi optimized OpenCL kernel and runs it. 2. An External mode, that wips up an OpenCL kernel for producing candidates. It's just a matter of really wanting to do it. Both ideas have tumbled around in my head for quite a while.