Open
Description
With DAG divergence and RegBankSelect, theoretically, codegen should now have enough information to handle the promotion of uniform 16-bit ops directly. This was originally done as a workaround for having no uniformity info after the IR.
AMDGPUCodeGenPrepare awkwardly uses forward iteration, which just so happens to work around a defect in uniformity info where newly created values are assumed uniform. I currently have a problem that requires iterating in reverse like a normal combiner, and this is blocking the reversal