Add unique op #1547

a-gardner1 · 2024-05-15T22:03:21Z

Add support for exporting torch.unique following the conclusion of pytorch/pytorch#113118.
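For readers unfamiliar with the op, here is a minimal pure-Python sketch of the three results `torch.unique` can return for a 1-D input when sorting is enabled: the sorted unique values, the inverse indices, and the counts. The helper name `unique_1d` is hypothetical and exists only for illustration; it is not the onnxscript implementation added in this PR, and it ignores the `dim` argument.

```python
from typing import List, Sequence, Tuple


def unique_1d(values: Sequence[int]) -> Tuple[List[int], List[int], List[int]]:
    """Illustrative only: sorted unique values, inverse indices, and counts."""
    # Sorted unique values, as with sorted=True.
    uniques = sorted(set(values))
    # Map each unique value to its position in the sorted output.
    position = {v: i for i, v in enumerate(uniques)}
    # inverse[i] is the index into `uniques` of the i-th input element.
    inverse = [position[v] for v in values]
    # counts[j] is how many times uniques[j] occurs in the input.
    counts = [0] * len(uniques)
    for idx in inverse:
        counts[idx] += 1
    return uniques, inverse, counts


data = [3, 1, 3, 2, 1]
uniques, inverse, counts = unique_1d(data)
# The inverse indices reconstruct the input from the unique values.
reconstructed = [uniques[i] for i in inverse]
```

The length of `uniques` depends on the data rather than on the input shape alone, which is why exporting this op involves dynamic output shapes.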

onnxscript/function_libs/torch_lib/ops/core.py

codecov · 2024-05-15T22:26:27Z

Codecov Report

Attention: Patch coverage is 57.14286% with 18 lines in your changes missing coverage. Please review.

Project coverage is 72.25%. Comparing base (ddce766) to head (249b004).
Report is 1 commit behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| onnxscript/function_libs/torch_lib/ops/core.py | 57.14% | 16 Missing and 2 partials ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1547      +/-   ##
==========================================
- Coverage   72.28%   72.25%   -0.03%     
==========================================
  Files         217      217              
  Lines       29097    29138      +41     
  Branches     3455     3462       +7     
==========================================
+ Hits        21034    21055      +21     
- Misses       6935     6953      +18     
- Partials     1128     1130       +2

justinchuby

Thanks for your contribution! Could you follow the CLA bot's instruction to get that cleared?

a-gardner1 · 2024-05-15T22:29:42Z

> Thanks for your contribution! Could you follow the CLA bot's instruction to get that cleared?

Yea, I may have jumped the gun a bit. Working on officially getting permission from my employer.

a-gardner1 · 2024-05-16T21:58:31Z

@a-gardner1 please read the following Contributor License Agreement(CLA). If you agree with the CLA, please reply with the following information.
@microsoft-github-policy-service agree [company="{your company}"]
Options:

(default - no company specified) I have sole ownership of intellectual property rights to my Submissions and I am not making Submissions in the course of work for my employer.
@microsoft-github-policy-service agree
(when company given) I am making Submissions in the course of work for my employer (or my employer has intellectual property rights in my Submissions by contract or applicable law). I have permission from my employer to make Submissions and enter into this Agreement on behalf of my employer. By signing below, the defined term “You” includes me and my employer.
@microsoft-github-policy-service agree company="Microsoft"
Contributor License Agreement

@microsoft-github-policy-service agree [company="Radiance Technologies"]

@microsoft-github-policy-service agree company="Radiance Technologies"

a-gardner1 · 2024-05-16T21:59:47Z

@microsoft-github-policy-service agree company="Radiance Technologies"

tests/function_libs/torch_lib/ops_test_data.py

justinchuby · 2024-05-18T00:52:43Z

Thanks for completing the CLA. I will take a look next week

…port to succeed

Follow-up to #113118 and #124306. Developed in coordination with the solution to microsoft/onnxscript#1547.

This PR adds the missing fake tensor implementation for `aten.unique_dim`, thus enabling tracing and compilation of `torch.unique` when `dim` is not None.

Local testing has proceeded with the following simple script (provided that one has checked out the changes in microsoft/onnxscript#1547):

```python
import torch  # not imported in the original script, but used throughout
import onnx
import onnxruntime as ort
import logging
import numpy as np

# Export torch.unique along dim=0 with dynamic shapes and verbose diagnostics.
onnx_program = torch.onnx.dynamo_export(
    lambda x: torch.unique(x, dim=0, return_inverse=True),
    torch.arange(10),
    export_options=torch.onnx.ExportOptions(
        dynamic_shapes=True,
        diagnostic_options=torch.onnx.DiagnosticOptions(
            verbosity_level=logging.DEBUG)))
onnx_program.save("torch_unique.onnx")

# Run the exported program on the original example input.
onnx_inputs = onnx_program.adapt_torch_inputs_to_onnx(torch.arange(10))
onnx_outputs = onnx_program(*onnx_inputs)

# Validate the saved model.
loaded_onnx_program = onnx.load("torch_unique.onnx")
onnx.checker.check_model(loaded_onnx_program)

# Run the saved model in ONNX Runtime on random integer input.
ort_session = ort.InferenceSession("torch_unique.onnx")
inputs = np.random.randint(0, 10, 10)
print(f"Inputs: {inputs}")
outputs = ort_session.run(None, {"l_x_": inputs})
print(f"Outputs: {outputs}")
print("Success")
```

Co-authored-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: #126561
Approved by: https://github.com/ezyang

a-gardner1 · 2024-11-26T20:00:41Z

Circling back around to this @justinchuby. At the time, I had been waiting for you to resolve the bug that required a hacky workaround, but I realize that might not be clear.

There were also a couple of other potential unresolved bugs outside the scope of this PR, e.g., this comment.

How would you like to proceed?

justinchuby · 2024-11-26T22:33:54Z

Sorry for missing the clarity. I would suggest that you remove all the hacks so that the code is at its desirable state. If tests fail because of that, that’s ok. I will then go ahead to fix what’s needed. (After I’m back from vacation)

a-gardner1 · 2024-11-26T22:40:34Z

> Sorry for missing the clarity. I would suggest that you remove all the hacks so that the code is at its desirable state. If tests fail because of that, that’s ok. I will then go ahead to fix what’s needed. (After I’m back from vacation)

Sounds good. FYI, the hacks were removed in b8b4cb1. As a reminder, the unit tests within onnxscript pass(ed) without the hacks, but the full export from torch to ONNX via Dynamo fails. This script should reproduce the errors with torch==2.4 or later (any release that includes pytorch/pytorch#126561).

If I get a chance, I'll try to rebase this PR and resolve conflicts first.

kabyanil · 2024-12-23T08:59:04Z

I am implementing a CTC decoder class in PyTorch:

from typing import List

import torch


class GreedyCTCDecoder(torch.nn.Module):
    def __init__(self, labels, blank=0):
        super().__init__()
        self.labels = labels
        self.blank = blank

    def forward(self, emission: torch.Tensor) -> List[str]:
        """Given a sequence emission over labels, get the best path
        Args:
          emission (Tensor): Logit tensors. Shape `[num_seq, num_label]`.

        Returns:
          List[str]: The resulting transcript
        """
        indices = torch.argmax(emission, dim=-1)  # [num_seq,]
        indices = torch.unique_consecutive(indices, dim=-1)
        indices = [i for i in indices if i != self.blank]
        joined = "".join([self.labels[i] for i in indices])
        return joined.replace("|", " ").strip().split()


greedy_decoder = GreedyCTCDecoder(tokens)

I'm not able to export this class to ONNX. Here is my error:

/usr/local/lib/python3.10/dist-packages/torch/onnx/_internal/_exporter_legacy.py:116: UserWarning: torch.onnx.dynamo_export only implements opset version 18 for now. If you need to use a different opset version, please register them with register_custom_op.
  warnings.warn(
---------------------------------------------------------------------------
DynamicOutputShapeException               Traceback (most recent call last)
/usr/local/lib/python3.10/dist-packages/torch/_dynamo/utils.py in run_node(tracer, node, args, kwargs, nnmodule)
   2131             if op == "call_function":
-> 2132                 return node.target(*args, **kwargs)
   2133             elif op == "call_method":

53 frames
DynamicOutputShapeException: aten.unique_consecutive.default

The above exception was the direct cause of the following exception:

RuntimeError                              Traceback (most recent call last)
RuntimeError: Failed running call_function <function boolean_dispatch.<locals>.fn at 0x7ba8c8f9dd80>(*(FakeTensor(..., size=(s0,), dtype=torch.int64),), **{'dim': -1}):
aten.unique_consecutive.default

During handling of the above exception, another exception occurred:

Unsupported                               Traceback (most recent call last)
Unsupported: dynamic shape operator: aten.unique_consecutive.default; Operator does not have a meta kernel that supports dynamic output shapes, please report an issue to PyTorch

from user code:
   File "<ipython-input-43-c1d03e7f78a6>", line 16, in forward
    indices = torch.unique_consecutive(indices, dim=-1)

Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information


The above exception was the direct cause of the following exception:

OnnxExporterError                         Traceback (most recent call last)
/usr/local/lib/python3.10/dist-packages/torch/onnx/_internal/_exporter_legacy.py in dynamo_export(model, export_options, *model_args, **model_kwargs)
   1231             f"Please report a bug on PyTorch Github: {_PYTORCH_GITHUB_ISSUES_URL}"
   1232         )
-> 1233         raise errors.OnnxExporterError(message) from e
   1234 
   1235 

OnnxExporterError: Failed to export the model to ONNX. Generating SARIF report at 'report_dynamo_export.sarif'. SARIF is a standard format for the output of static analysis tools. SARIF logs can be loaded in VS Code SARIF viewer extension, or SARIF web viewer (https://microsoft.github.io/sarif-web-component/). Please report a bug on PyTorch Github: https://github.com/pytorch/pytorch/issues

How can I resolve this?
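For context on the error above, here is a minimal pure-Python sketch of what `unique_consecutive` computes on a 1-D sequence, and of the collapse-then-drop-blanks step in the `forward` method. The function names `unique_consecutive_1d` and `greedy_ctc_collapse` are hypothetical, chosen for illustration; this is not a fix for the export failure, only a way to see why the operator's output shape is data-dependent.

```python
from typing import List, Sequence


def unique_consecutive_1d(values: Sequence[int]) -> List[int]:
    """Illustrative only: drop every element equal to its predecessor."""
    out: List[int] = []
    for v in values:
        # Keep a value only when it starts a new run.
        if not out or out[-1] != v:
            out.append(v)
    return out


def greedy_ctc_collapse(indices: Sequence[int], blank: int = 0) -> List[int]:
    """Collapse repeated indices, then remove the blank index."""
    return [i for i in unique_consecutive_1d(indices) if i != blank]


# Two inputs of the same length produce outputs of different lengths.
short = unique_consecutive_1d([1, 1, 0, 2, 2, 2])
full = unique_consecutive_1d([1, 2, 3, 4, 5, 6])
decoded = greedy_ctc_collapse([1, 1, 0, 2, 2, 2])
```

Because the output length cannot be derived from the input shape alone, tracing with fake tensors needs a meta kernel that supports dynamic output shapes, which is what the `Unsupported` message reports as missing for `aten.unique_consecutive.default`.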

tests/function_libs/torch_lib/extra_opinfo.py

justinchuby · 2025-03-07T01:08:08Z

@a-gardner1 sorry for the delay. I think this PR is now good to merge (started the merge)

github-advanced-security bot found potential problems May 15, 2024

github-advanced-security bot found potential problems May 15, 2024

a-gardner1 marked this pull request as draft May 15, 2024 22:27

justinchuby reviewed May 15, 2024

justinchuby added the module: torchlib Related to the torch/aten function lib in development label May 15, 2024

a-gardner1 commented May 16, 2024

a-gardner1 mentioned this pull request May 17, 2024

Add fake impl for aten.unique_dim pytorch/pytorch#126561

Closed

a-gardner1 force-pushed the wip-113118-add-unique-ops branch from 453783f to b528a6a Compare May 17, 2024 20:35

a-gardner1 marked this pull request as ready for review May 17, 2024 20:35

a-gardner1 commented May 17, 2024

justinchuby self-assigned this May 18, 2024

github-advanced-security bot found potential problems May 18, 2024
