Open
Description
If I understand correctly, there are currently there are currently 4 interconnected boolean command line options relating to intgemm
--optimize
--optmize8
--intgemm-shift
--intgemm-shift-all
This does not make sense to me as options are either mutually exclusive (things can't be 8-bit and 16-bit at the same time), or imply one another, e.g. intgemm-shift implies intgemm-shift and intgemm-shift implies optimize8.
These four options should be combined into a single string-valued option --optimize with the values
- none or no (default; regular float computation)
- intgemm16 (implicit, so that --optimize works as before)
- intgemm8
- intgemm8-shift
- intgemm8-shift-all
which are then interpreted in cpu::Backed::configureDevice() in tensor/cpu/backend.h
This should happen sooner rather than later so that we don't have command line interface legacy issues. Easy to fix now, a nightmare to change later once people have started using these options.
cc: @kpu
Metadata
Assignees
Labels
No labels