Support MX4 E3M0 format and add stochastic rounding #477

NicoleMayer · 2024-07-04T14:58:51Z

No description provided.

pytorch-bot · 2024-07-04T14:58:53Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/477

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

torchao/prototype/mx_formats/constants.py

vkuzo · 2024-07-05T16:15:57Z

torchao/prototype/mx_formats/custom_cast.py

@@ -21,6 +21,8 @@

 from torchao.prototype.mx_formats.constants import (
    DTYPE_FP4,
+    DTYPE_FP4_E2M1,


for all the changes in this file, can you rebase past #363 ?

and in the code after that PR, perhaps we can add stochastic rounding option to _f32_to_fpx_unpacked?

for rounding mode, how about something like this instead of a boolean?

class RoundingMode(enum.Enum): TIE_TO_EVEN = auto() # default STOCHASTIC = auto() # added in this PR def foo(..., rounding_mode=RoundingMode.TIE_TO_EVEN, ...): ...

vkuzo · 2024-07-05T16:17:42Z

test/prototype/mx_formats/test_e3m0.py

+@pytest.mark.parametrize("device", ["cuda", "cpu"])
+@pytest.mark.parametrize("sign", [1, -1])
+@pytest.mark.parametrize("use_stochastic_rounding", [False, True])
+def test_overflow_cast(hp_dtype, device, sign, use_stochastic_rounding):


can we add these tests to test/prototype/mx_formats/test_custom_cast.py to keep the testing of MX numerics in one place?

vkuzo · 2024-07-05T16:18:20Z

thanks for adding this! left some comments, mostly on rebasing past https://github.com/pytorch/ao/pull/363/files and code style

summerdengfb

The E3M0 numerics implementation looks good to me.

summerdengfb · 2024-07-06T00:29:49Z

torchao/prototype/mx_formats/custom_cast.py

+        denormal_x = denormal_x.view(torch.float)
+
+        # adjust the denormal values back
+        denormal_x -= min_normal


The SR code up to this line looks good to me.

…ctor

gau-nernst added 11 commits June 14, 2024 22:03

refactor custom fp cast

47f7bc1

add dequant

da17611

small formating

3345740

compile with fullgraph=True

2690b92

add fullgraph=true

8aa0146

undo

be77632

add another version

95f4582

fast path for mbits=1

dcd5a05

Merge branch 'pytorch:main' into custom_fpx

f61ff05

Merge branch 'pytorch:main' into custom_fpx

4ad065f

add back docstring

bd64efc

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 4, 2024

msaroufim requested a review from vkuzo July 4, 2024 17:36

vkuzo reviewed Jul 5, 2024

View reviewed changes

torchao/prototype/mx_formats/constants.py Outdated Show resolved Hide resolved

vkuzo reviewed Jul 5, 2024

View reviewed changes

summerdengfb reviewed Jul 6, 2024

View reviewed changes

NicoleMayer force-pushed the hanmei-e3m0-impl branch 3 times, most recently from 57165ff to 60fe4c3 Compare July 10, 2024 00:03

NicoleMayer and others added 3 commits July 9, 2024 17:16

add e3m0 support

ea3efa0

add stochastic rounding support for MX6 and MX4

fcce64e

add unit test for e3m0 and stochastic rounding

5326dce

NicoleMayer force-pushed the hanmei-e3m0-impl branch from 60fe4c3 to e202e6e Compare July 10, 2024 00:35

fix the subnormal part for stochastic rounding

dfdd8db

NicoleMayer force-pushed the hanmei-e3m0-impl branch from 55dfe1f to 58a9f01 Compare July 10, 2024 06:32

delete DTYPE_FP4 and use DTYPE_FP4_E2M1/DTYPE_FP4_E3M0 separately

45520d2

NicoleMayer force-pushed the hanmei-e3m0-impl branch from 58a9f01 to 45520d2 Compare July 10, 2024 06:34

NicoleMayer added 3 commits July 11, 2024 06:13

update RoundingMode API

67255c3

fix the bug for subnormal part

9a8575e

add rounding before calculating the largest power of 2 for scaling fa…

65ed552

…ctor

NicoleMayer closed this Jul 16, 2024

NicoleMayer deleted the hanmei-e3m0-impl branch July 17, 2024 06:16

NicoleMayer restored the hanmei-e3m0-impl branch July 17, 2024 06:16

yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024

Improve error messages and README for download errors (pytorch#477)

a37b0a8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support MX4 E3M0 format and add stochastic rounding #477

Support MX4 E3M0 format and add stochastic rounding #477

NicoleMayer commented Jul 4, 2024

pytorch-bot bot commented Jul 4, 2024 •

edited

Loading

vkuzo Jul 5, 2024

vkuzo Jul 5, 2024

vkuzo commented Jul 5, 2024

summerdengfb left a comment

summerdengfb Jul 6, 2024

Support MX4 E3M0 format and add stochastic rounding #477

Support MX4 E3M0 format and add stochastic rounding #477

Conversation

NicoleMayer commented Jul 4, 2024

pytorch-bot bot commented Jul 4, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/477

vkuzo Jul 5, 2024

Choose a reason for hiding this comment

vkuzo Jul 5, 2024

Choose a reason for hiding this comment

vkuzo commented Jul 5, 2024

summerdengfb left a comment

Choose a reason for hiding this comment

summerdengfb Jul 6, 2024

Choose a reason for hiding this comment

pytorch-bot bot commented Jul 4, 2024 •

edited

Loading