[RFC, NPU] NPU benchmark - as a baseline for follow-up tasks. #1159

@zheliuyu

Motivation

This issue reports the current state of the NPU benchmarks and sets the direction for the next phase of performance optimization.

Evaluation criterion: each kernel's speedup ratio over the huggingface/torch baseline should be greater than 1.
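To make the criterion concrete, here is a minimal sketch of how it could be checked. It assumes the speedup ratio is the baseline latency divided by the Liger kernel latency, which the issue does not state explicitly. The function names, kernel names, and latencies are hypothetical placeholders, not measured results.

```python
def speedup(baseline_ms: float, liger_ms: float) -> float:
    """Speedup over the baseline (assumed definition: baseline time / Liger time)."""
    if baseline_ms <= 0 or liger_ms <= 0:
        raise ValueError("latencies must be positive")
    return baseline_ms / liger_ms


def failing_kernels(results: dict[str, tuple[float, float]]) -> list[str]:
    """Return the kernels whose speedup is not strictly greater than 1."""
    return sorted(
        name
        for name, (baseline_ms, liger_ms) in results.items()
        if speedup(baseline_ms, liger_ms) <= 1.0
    )


# Placeholder latencies in milliseconds as (baseline, liger); not measured data.
example = {"kernel_a": (2.0, 1.0), "kernel_b": (1.0, 1.25)}
print(failing_kernels(example))
```

Under this definition, any kernel returned by `failing_kernels` would be a candidate for the follow-up optimization tasks.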

Env

Test machine environment

  • platform: https://www.autodl.com/
  • NPU: Atlas 900 A2 PoD(64G)
  • HDK=25.2.0
  • CANN=8.5.0
  • CPU: 24 vCPU Kunpeng-920
  • OS: Ubuntu 22.04

Software dependencies

  • python=3.10.8
  • Liger-Kernel=0.7.0, Commit ID: 781083b
  • torch=2.6.0
  • torch_npu=2.6.0
  • torchvision=0.21.0
  • triton-ascend=3.2.0
  • transformers=5.2.0
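When reproducing these numbers, it may help to record the installed versions next to the results. The following sketch uses only the standard library. The distribution names in `PACKAGES` are assumptions about how each package was installed and may need adjusting. HDK and CANN are not Python packages, so they are not covered here.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names are assumptions; adjust them to match your installation.
PACKAGES = [
    "liger-kernel",
    "torch",
    "torch-npu",
    "torchvision",
    "triton-ascend",
    "transformers",
]


def installed_versions(packages):
    """Map each distribution name to its installed version, or 'not installed'."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = "not installed"
    return found


for name, ver in installed_versions(PACKAGES).items():
    print(f"{name}={ver}")
```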
