Generating onnx file for the inference of Mamba?

Dear @tridao , @albertfgu ,

It looks like it is not straightforward to generate onnx file due to following reason using torch.onnx.export: 

1) It looks like the underlying scan operator is implemented in the triton 
2) We need the recursive version of scan for the inference which I believe is located starting line 119 (lines 119 to 133 where 133 is the return)  as shown here:  https://github.com/state-spaces/mamba/blob/main/mamba_ssm/modules/mamba_simple.py#L119 

Above two prevents (based on my understanding) generating onnx file.  It would be great to have onnx file for the inference part for the smallest model. 

**Any suggestions how we can generate onnx file for the inference?** (also for training separately)?    



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Generating onnx file for the inference of Mamba? #200

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Generating onnx file for the inference of Mamba? #200

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions