Open
Description
Hi there,
It would be great to have support for GrokAdamW optimizer but with low bit quantization.
You can check reference implementation: https://github.com/cognitivecomputations/grokadamw
It has shown promising results already.
Hi there,
It would be great to have support for GrokAdamW optimizer but with low bit quantization.
You can check reference implementation: https://github.com/cognitivecomputations/grokadamw
It has shown promising results already.