Support inference for GCU #14142

EnflameGCU · 2024-11-01T09:11:07Z

配置GCU设备的推理流程

tools/infer/utility.py

jzhang533 · 2024-11-04T02:29:50Z

tools/infer/utility.py

@@ -41,6 +41,7 @@ def init_args():
    parser.add_argument("--use_xpu", type=str2bool, default=False)
    parser.add_argument("--use_npu", type=str2bool, default=False)
    parser.add_argument("--use_mlu", type=str2bool, default=False)
+    parser.add_argument("--use_gcu", type=str2bool, default=False)


please include a short message to help user understand its purpose.
To me, at the first glance, I am guessing gcu stands for GraphCore Unit, realized it's Enflame's device after searching.

Other arguments(i.e.: uie_xpu, use_npu, use_mlu) also need help message to clarify the respective devices, but could be addressed in separate PRs.

OK, thanks for pointing that out. I'll add comments to GCU first.

I am afraid that our description of other arguments may not be accurate, and they may need to be described by the developers of the corresponding hardware manufacturers.

I am afraid that our description of other arguments may not be accurate, and they may need to be described by the developers of the corresponding hardware manufacturers.

agree

jzhang533 · 2024-11-04T06:11:01Z

tools/infer/utility.py

+    parser.add_argument(
+        "--use_gcu", type=str2bool, default=False
+    )  # Use Enflame GCU(General Compute Unit)


Suggested change

parser.add_argument(

"--use_gcu", type=str2bool, default=False

) # Use Enflame GCU(General Compute Unit)

parser.add_argument(

"--use_gcu", type=str2bool, default=False,

help="Use Enflame GCU(General Compute Unit)",

)

Thank you for your suggestion.

jzhang533 · 2024-11-04T06:12:28Z

tools/infer/utility.py

@@ -41,6 +41,7 @@ def init_args():
    parser.add_argument("--use_xpu", type=str2bool, default=False)
    parser.add_argument("--use_npu", type=str2bool, default=False)
    parser.add_argument("--use_mlu", type=str2bool, default=False)
+    parser.add_argument("--use_gcu", type=str2bool, default=False)


I am afraid that our description of other arguments may not be accurate, and they may need to be described by the developers of the corresponding hardware manufacturers.

agree

cuicheng01 · 2024-11-11T12:00:01Z

tools/infer/utility.py

@@ -293,6 +299,28 @@ def create_predictor(args, mode, logger):
            config.enable_custom_device("mlu")
        elif args.use_xpu:
            config.enable_xpu(10 * 1024 * 1024)
+        elif args.use_gcu:  # for Enflame GCU(General Compute Unit)
+            import paddle_custom_device.gcu.passes as gcu_passes


Should we first check whether the paddle_custom_device package is present? If it’s not, remind the user to install it first?

Thank you for your suggestion.

nepeplwu · 2024-11-11T12:14:06Z

tools/infer/utility.py

+            if paddle.framework.use_pir_api():
+                config.enable_new_ir(True)
+                config.enable_new_executor(True)
+                kPirGcuPasses = gcu_passes.inference_passes(


Why do we need to build passes based on "PaddleOCR" or "PaddleXXX"?

We need to optimize performance based on hardware features and even network structure, and the processing here is to increase this flexibility.

GreatV requested a review from jzhang533 November 1, 2024 10:31

GreatV reviewed Nov 1, 2024

View reviewed changes

tools/infer/utility.py Outdated Show resolved Hide resolved

EnflameGCU force-pushed the support_gcu branch from 6f3a1aa to f2269b9 Compare November 1, 2024 11:09

jzhang533 reviewed Nov 4, 2024

View reviewed changes

EnflameGCU force-pushed the support_gcu branch from f2269b9 to e293196 Compare November 4, 2024 03:34

jzhang533 reviewed Nov 4, 2024

View reviewed changes

EnflameGCU force-pushed the support_gcu branch from e293196 to 893ec61 Compare November 4, 2024 06:32

cuicheng01 reviewed Nov 11, 2024

View reviewed changes

nepeplwu reviewed Nov 11, 2024

View reviewed changes

[GCU] Support inference for GCU

9fef9a9

EnflameGCU force-pushed the support_gcu branch from 893ec61 to 9fef9a9 Compare November 12, 2024 06:51

paddle-bot bot added the contributor label Nov 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support inference for GCU #14142

Support inference for GCU #14142

EnflameGCU commented Nov 1, 2024 •

edited

Loading

jzhang533 Nov 4, 2024

EnflameGCU Nov 4, 2024

jzhang533 Nov 4, 2024

jzhang533 Nov 4, 2024

EnflameGCU Nov 4, 2024

jzhang533 Nov 4, 2024

cuicheng01 Nov 11, 2024

EnflameGCU Nov 12, 2024

nepeplwu Nov 11, 2024

EnflameGCU Nov 12, 2024

Support inference for GCU #14142

Are you sure you want to change the base?

Support inference for GCU #14142

Conversation

EnflameGCU commented Nov 1, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

EnflameGCU commented Nov 1, 2024 •

edited

Loading