
[WIP] Introduce benchgc, a benchmark tool for graph compiler #60


Closed
wants to merge 3 commits

Conversation

WangJialei-A
Contributor

@WangJialei-A WangJialei-A commented May 10, 2024

Try to resolve #58

Tasks

  • Build MLIR GC library
  • Integrate MLIR Python binding
  • Pipeline Execution
  • Upload MLIR Cases
  • Pytorch correctness check references
  • Data filling for benchmark
  • Compare threshold & strategy

@kurapov-peter
Contributor

What is this? Why do we introduce yet another graph representation, and how is it related to correctness?

@ZhennanQin
Contributor

What is this? Why do we introduce yet another graph representation, and how is it related to correctness?

It's not another graph representation. This benchmark tool accepts an .mlir file written in the oneDNN graph dialect, compiles it, and validates it against a PyTorch reference. The tool will validate correctness in the nightly and weekly tests.
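The compile-and-validate flow described above can be sketched roughly as follows. This is an illustrative sketch, not benchgc's actual API: `validate`, `allclose`, and the stub kernels are hypothetical names, and plain Python lists stand in for the compiled MLIR module's output and the PyTorch reference tensors.

```python
# Hypothetical sketch of the validate-against-reference step. In the real
# tool, `compiled_fn` would be the kernel compiled from the .mlir case and
# `reference_fn` the PyTorch eager reference; both are stubbed here.

def allclose(out, ref, rtol=1e-5, atol=1e-8):
    """Elementwise |out - ref| <= atol + rtol * |ref| (torch.allclose-style)."""
    return all(abs(o - r) <= atol + rtol * abs(r) for o, r in zip(out, ref))

def validate(compiled_fn, reference_fn, inputs, rtol=1e-5):
    out = compiled_fn(*inputs)       # result from the compiled module
    ref = reference_fn(*inputs)      # reference result
    return allclose(out, ref, rtol=rtol)

# usage: an "add" op whose compiled side carries small rounding noise
compiled = lambda a, b: [x + y + 1e-9 for x, y in zip(a, b)]
reference = lambda a, b: [x + y for x, y in zip(a, b)]
assert validate(compiled, reference, ([1.0, 2.0], [3.0, 4.0]))
```

The per-dtype/per-op threshold strategy mentioned in the task list would plug into the `rtol`/`atol` choice here.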

@WangJialei-A WangJialei-A added this to the Functional llama2 milestone May 11, 2024
@WangJialei-A WangJialei-A force-pushed the wangjial/benchgc branch 10 times, most recently from a638a37 to deff617 on May 14, 2024 05:43
@kurapov-peter
Contributor

It's not another graph representation.

It is; there's a graph definition and all the ops. I also don't see why we would need one for compilation and validation.

@WangJialei-A
Contributor Author

It's not another graph representation.

It is; there's a graph definition and all the ops. I also don't see why we would need one for compilation and validation.

Hi, @kurapov-peter
The graphs and operator definitions you see all originate from the existing Graph API's JSON format; we have not introduced any new definitions.

To conduct correctness tests, the testing tool needs to understand the entire graph's topology and the operators involved in order to populate appropriate data and set correct thresholds. Since the test cases are in MLIR format, which is quite low-level and makes it hard to discern the topology directly, we need to translate the MLIR format into the Graph API's JSON format so that the testing tool can behave correctly.

Considering that complex MLIR files will be exported from the framework side, and that MLIR files lack readability, BenchGC will provide a feature to convert MLIR format into Graph API JSON format. This will help developers use visualization tools like Netron to understand the MLIR files and see what kind of kernels the framework is actually running.
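To illustrate what such an export could look like, here is a hand-written example loosely modeled on the oneDNN Graph serialized JSON layout (a list of ops with input/output logical tensors, where edges are implied by shared tensor ids). The field names and values below are illustrative assumptions, not the tool's actual output format.

```python
# Hypothetical MLIR-to-JSON export result for a tiny matmul+relu graph.
# Ops reference logical tensors by id; op 0's output (id 2) feeds op 1.
import json

graph = {
    "version": "3.0.0",
    "graph": [
        {
            "id": 0,
            "name": "matmul_0",
            "kind": "MatMul",
            "inputs": [{"id": 0, "dtype": "f32", "shape": [128, 512]},
                       {"id": 1, "dtype": "f32", "shape": [512, 64]}],
            "outputs": [{"id": 2, "dtype": "f32", "shape": [128, 64]}],
        },
        {
            "id": 1,
            "name": "relu_0",
            "kind": "ReLU",
            "inputs": [{"id": 2, "dtype": "f32", "shape": [128, 64]}],
            "outputs": [{"id": 3, "dtype": "f32", "shape": [128, 64]}],
        },
    ],
}

# Walking this structure is what would let the tool choose data fillers
# and thresholds per op kind, and what a viewer could render as a graph.
print(json.dumps(graph, indent=2))
```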

@kurapov-peter
Contributor

Considering that complex MLIR files will be exported from the framework side, and that MLIR files lack readability, BenchGC will provide a feature to convert MLIR format into Graph API JSON format.

Why not use JSON as input in the first place then?

@WangJialei-A
Contributor Author

Considering that complex MLIR files will be exported from the framework side, and that MLIR files lack readability, BenchGC will provide a feature to convert MLIR format into Graph API JSON format.

Why not use JSON as input in the first place then?

@kurapov-peter
Although it is possible to obtain a JSON-formatted graph when calling Graph Compiler through the Graph API, if Graph Compiler integrates with other components in the future, such as Torch MLIR, it would no longer be possible to export the graph in JSON format.

@kurapov-peter
Contributor

@kurapov-peter Although it is possible to obtain a JSON-formatted graph when calling Graph Compiler through the Graph API, if Graph Compiler integrates with other components in the future, such as Torch MLIR, it would no longer be possible to export the graph in JSON format.

You wouldn't be able to use the same flow, since something like torch-mlir would not map to the representation you are introducing. This is not an argument in favor of having a Python graph API.

@WangJialei-A
Contributor Author

@kurapov-peter
I guess I've figured out where our point of disagreement lies. The current goal is only to verify the correctness of a single operator, so you might feel that introducing a Python-expressed Graph API is unnecessary. With a single operator, many things can be simplified. However, BenchGC is not designed for a single operator; single-operator verification is just a special case. Its future goal is to support the verification of complex partitions containing many ops. At that time, the corresponding MLIR test cases will also be very complex, and without the Graph API Json as a bridge, it would be difficult to translate them into PyTorch execution.

Using JSON as the input for test cases is unacceptable. Our library is an MLIR library, and the envisioned goal of BenchGC is to verify any MLIR case based on the OneDNN Graph Dialect. If JSON were used as the input and then translated into MLIR format, it could not express all the possibilities that the MLIR format can represent.

@kurapov-peter
Contributor

At that time, the corresponding MLIR test cases will also be very complex, and without the Graph API Json as a bridge, it would be difficult to translate them into PyTorch execution.

I don't think I follow. You either test PyTorch end-to-end or test a specific optimization/pattern/whatever via MLIR input (or one of the existing representations like onednn graph api or json). Introducing another representation is redundant.

Using JSON as the input for test cases is unacceptable. Our library is an MLIR library, and the envisioned goal of BenchGC is to verify any MLIR case based on the OneDNN Graph Dialect.

This path is specific to oneDNN. If we are to target generic cases, the proposed Python interface should not rely on oneDNN semantics (which it does now). It should also not be part of a benchmarking tool.

@WangJialei-A
Contributor Author

WangJialei-A commented May 28, 2024

At that time, the corresponding MLIR test cases will also be very complex, and without the Graph API Json as a bridge, it would be difficult to translate them into PyTorch execution.

I don't think I follow. You either test PyTorch end-to-end or test a specific optimization/pattern/whatever via MLIR input (or one of the existing representations like onednn graph api or json). Introducing another representation is redundant.

Using JSON as the input for test cases is unacceptable. Our library is an MLIR library, and the envisioned goal of BenchGC is to verify any MLIR case based on the OneDNN Graph Dialect.

This path is specific to oneDNN. If we are to target generic cases, the proposed Python interface should not rely on oneDNN semantics (which it does now). It should also not be part of a benchmarking tool.

@kurapov-peter
I am completely confused by your reply. Can you present your plan for the validation work in #58? That will help us understand where we differ.

@aregm

aregm commented Jul 2, 2024

Why are you adding benchgc into the sources? Why do we need it vendored here rather than as a third-party dependency?

@WangJialei-A
Contributor Author

New implementation at #161.

Development

Successfully merging this pull request may close these issues.

Correctness check for single op