
Passing arguments through iron.jit decorator #2213


Merged
merged 27 commits into main from ypapadop/decorator-arg-passthrough
Apr 25, 2025

Conversation

ypapadop-amd
Collaborator

@ypapadop-amd ypapadop-amd commented Apr 18, 2025

This PR allows arguments to be passed through the iron.jit decorator to the called function.

Closes #2211
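
For readers new to the pattern, here is a tiny, self-contained Python sketch of what "passing arguments through the decorator" means. This is a toy stand-in, not the actual iron.jit implementation (which also builds, compiles, and launches the design on the device).

import functools

def jit(fn):
    # Toy stand-in for iron.jit: whatever the caller passes is forwarded
    # unchanged to the wrapped function.
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        # The real decorator would compile the design here before launching;
        # this sketch only demonstrates the argument flow.
        return fn(*args, **kwargs)
    return wrapper

@jit
def vector_vector_add(input0, input1, output):
    for i, (a, b) in enumerate(zip(input0, input1)):
        output[i] = a + b

out = [0, 0, 0]
vector_vector_add([1, 2, 3], [4, 5, 6], out)
print(out)  # [5, 7, 9]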

@ypapadop-amd ypapadop-amd requested a review from mawad-amd April 18, 2025 18:42
@ypapadop-amd ypapadop-amd added the enhancement New feature or request label Apr 18, 2025
@jgmelber
Collaborator

I like this!

@mawad-amd
Collaborator

mawad-amd commented Apr 18, 2025

I don't think this will work, unfortunately. The operator () (i.e., vector_vector_add(device_map[args.device], input0, input1, output)) expects Tensors that match what is inside the runtime sequence, because these will be passed to the kernel. I need to add some unit tests.

def __call__(self, *args):
"""
Allows the kernel to be called as a function with the provided arguments.
Parameters:
args (IRON Tensors): Arguments to pass to the kernel.
"""
opcode = 3
kernel_args = []
for tensor in args:
if not hasattr(tensor, "buffer_object"):
raise TypeError(
f"Expected Tensor with .buffer_object(), got {type(tensor)}"
)
kernel_args.append(tensor.buffer_object())
h = self.__kernel(opcode, self.__insts_buffer_bo, self.__n_insts, *kernel_args)
r = h.wait()
if r != xrt.ert_cmd_state.ERT_CMD_STATE_COMPLETED:
raise Exception(f"Kernel returned {r}")

@mawad-amd
Collaborator

mawad-amd commented Apr 18, 2025

I like the iron.jit though (w/o arguments). I tried to implement it previously but had issues.

@ypapadop-amd
Collaborator Author

I don't think this will work, unfortunately. The operator () (i.e., vector_vector_add(device_map[args.device], input0, input1, output)) expects Tensors that match what is inside the runtime sequence, because these will be passed to the kernel. I need to add some unit tests.


You mean because of the device arg? That can be fixed if we always assume that the first argument is the device (or make it positional).

@mawad-amd
Collaborator

mawad-amd commented Apr 19, 2025

Particularly for the device, I think it's best if we do things like:

@iron.jit
def kernel():
    device = iron.get_current_device()

iron.set_device(device)

Or via contexts:

@iron.jit
def kernel():
    device = iron.get_current_device()

with iron.device('npu'):
    kernel()
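
As a minimal, plain-Python sketch of how such a current-device API could be implemented (a module-level default plus a context-manager override); the names mirror the hypothetical iron.set_device / iron.get_current_device / iron.device calls above, and this is an assumption for illustration, not IRON's actual implementation:

import contextlib

_current_device = "npu"  # assumed default

def set_device(dev):
    global _current_device
    _current_device = dev

def get_current_device():
    return _current_device

@contextlib.contextmanager
def device(dev):
    # Temporarily override the current device, restoring it on exit.
    global _current_device
    prev, _current_device = _current_device, dev
    try:
        yield
    finally:
        _current_device = prev

with device("npu2"):
    print(get_current_device())  # npu2
print(get_current_device())      # npu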

But it's likely we will need to pass more arguments to the kernel that we can't set as globals. We could possibly start adding annotations (e.g., iron.kernel_argument) or something like that.

I think it's best if we just meet and discuss to keep the momentum going. I will schedule something. We can talk more on Tuesday.

@jgmelber
Collaborator

We can talk more on Tuesday.

Sounds good to me

@ypapadop-amd
Collaborator Author

I can see us controlling not just which device, but also how many tiles to use, etc., which I think makes it equivalent to what numba does with kernel invocation (https://numba.readthedocs.io/en/stable/cuda/kernels.html).

But I'd like us to think about what's appropriate for our devices, not how the GPU world converged around CUDA.

@ypapadop-amd ypapadop-amd force-pushed the ypapadop/decorator-arg-passthrough branch from 7dacfb7 to f6b8e2a Compare April 21, 2025 16:05
@mawad-amd
Collaborator

I like the syntax of kernel[configs](kernel_arg) because then users won't be confused about kernel arguments, their order, and how they relate to the runtime sequence. Eventually, we want the user to get a nice error when they mess that up (the asserts you added are great too, but should eventually be done by the compiler).
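
A rough, self-contained sketch of how a kernel[configs](kernel_arg) call syntax can be expressed in Python via __getitem__; this is illustrative only, with made-up config contents, and not necessarily what was merged:

class Kernel:
    def __init__(self, fn):
        self.fn = fn
        self.config = {}

    def __getitem__(self, config):
        # kernel[configs] attaches launch configuration and returns the kernel,
        # keeping configuration separate from the runtime-sequence arguments.
        self.config = config
        return self

    def __call__(self, *args):
        print(f"launching with config={self.config}")
        return self.fn(*args)

@Kernel
def vector_vector_add(a, b):
    return [x + y for x, y in zip(a, b)]

print(vector_vector_add[{"device": "npu"}]([1, 2], [3, 4]))  # [4, 6]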

I also like your config argument, although I would prefer it to be the last argument. I wonder if we can get rid of tensor_ty = np.ndarray[(num_elements,), np.dtype[dtype]] and other numpy arrays and reuse the input tensors too (I'm not sure how the ObjectFifos would work with that, though).

@ypapadop-amd
Collaborator Author

Here's how numba does it: https://numba.pydata.org/numba-doc/dev/roc/examples.html

Triton: https://triton-lang.org/main/getting-started/tutorials/03-matrix-multiplication.html#sphx-glr-getting-started-tutorials-03-matrix-multiplication-py (I think they infer the device from the inputs). In Triton, functions decorated with @triton.jit can also be passed as parameters to other functions, which is very convenient.

@ypapadop-amd
Collaborator Author

Here's the separation I have in my head:

  1. Function arguments should be passed to function parameters, e.g., def vector_vector_add(input0, input1, output) should be called with vector_vector_add(input0, input1, output).
  2. Attributes that affect how the code is compiled, e.g., is_placed, should go as decorator arguments, e.g., @iron.jit(is_placed=True). I think that also includes any type constraints, like numba does (@roc.jit('(float32[:,:], float32[:,:], float32[:,:])') in https://numba.pydata.org/numba-doc/dev/roc/examples.html) but I'm not sure if we need something like this right now.
  3. Attributes that affect run-time behaviour, e.g., whether or not to use a cached version, where the cache is, which device we target, etc., should be passed some other way, e.g., through the config variable, a singleton that is visible upon import, etc.

The distinction between 2 and 3 is that 2 has to do with the code that follows (think of C++ strong typing), whereas 3 is more about where and how that code will be called.
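
To make the separation concrete, here is a toy, self-contained sketch in plain Python. The names (jit, is_placed, set_default_device) are modelled on the discussion above, not the merged API: compile-time attributes go to the decorator, run-time behaviour lives in module-level state, and function arguments pass straight through.

import functools

_default_device = "npu"            # (3) run-time state, set outside the call

def set_default_device(dev):
    global _default_device
    _default_device = dev

def jit(is_placed=False):          # (2) compile-time attribute as a decorator argument
    def decorate(fn):
        @functools.wraps(fn)
        def wrapper(*args):        # (1) function arguments pass straight through
            print(f"compiling for {_default_device}, is_placed={is_placed}")
            return fn(*args)
        return wrapper
    return decorate

@jit(is_placed=True)
def vector_vector_add(a, b):
    return [x + y for x, y in zip(a, b)]

set_default_device("npu2")
print(vector_vector_add([1, 2], [3, 4]))  # [4, 6]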

@ypapadop-amd ypapadop-amd force-pushed the ypapadop/decorator-arg-passthrough branch from 6ad02a4 to e69c83e Compare April 24, 2025 14:20
@ypapadop-amd
Collaborator Author

I added an iron.set_default_device() / iron.get_default_device() pair that eliminates the config parameter. However, column_id is still a problem to express via a config.

@jgmelber
Collaborator

jgmelber commented Apr 24, 2025

I added an iron.set_default_device() / iron.get_default_device() pair that eliminates the config parameter. However, column_id is still a problem to express via a config.

column_id should be removed. The NPU runtime firmware handles this well. This is an artifact from supporting vck5000 in this example previously.

@jgmelber
Collaborator

😄 5737615

@mawad-amd
Collaborator

This is looking awesome; thanks, Yiannis, for the improvements. And thanks for fixing the tensor random initialization bug too. I suggest adding a comment in the code on why the column id is 0, for readers (I didn't know about that). Other than that, the PR is looking great.

@jgmelber
Collaborator

the PR is looking great

Agreed!

dev = NPU2()

# Define tensor types
# Define tensor shape
data_height = 3
Collaborator Author

This is something I haven't yet figured out. The tensors are 1D, yet they are described as 2D here, and I don't have a mechanism to pass additional arguments.

This is something we discussed today with @SamuelBayliss and @erwei-xilinx for a different use-case.

Collaborator

I think the pointer is the only thing that really matters. My understanding is that the tensor shape is only used by a verifier, checking whether the wrap-and-stride access pattern goes out of bounds. The amount of data being copied is encoded in the TensorAccessPattern's sizes.

Collaborator Author

Still, we lack the mechanism to pass this information in. Another example would be a function that can generate either vector-add or vector-sub depending on one extra argument. That argument can't be part of the call arguments, since those are supposed to be tensors only. How do we pass it in?
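
To make the question concrete, here is a plain-Python sketch of such a generator. It is illustrative only; binding the extra, non-tensor argument with a closure is just one generic possibility, not something adopted in this PR.

import operator

def make_vector_op(op):
    # One generator, two designs: the extra, non-tensor argument `op` selects
    # whether the produced function adds or subtracts.
    def vector_op(input0, input1, output):
        for i, (a, b) in enumerate(zip(input0, input1)):
            output[i] = op(a, b)
    return vector_op

vector_add = make_vector_op(operator.add)
vector_sub = make_vector_op(operator.sub)

out = [0, 0]
vector_add([4, 5], [1, 2], out)
print(out)  # [5, 7]
vector_sub([4, 5], [1, 2], out)
print(out)  # [3, 3]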

ypapadop-amd and others added 6 commits April 24, 2025 21:34
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@ypapadop-amd ypapadop-amd force-pushed the ypapadop/decorator-arg-passthrough branch from 758cd3e to 14fd5ff Compare April 25, 2025 02:34
Co-authored-by: Muhammad Awad <112003944+mawad-amd@users.noreply.github.com>
@ypapadop-amd ypapadop-amd marked this pull request as ready for review April 25, 2025 17:49
Co-authored-by: Yiannis Papadopoulos <Yiannis.Papadopoulos@amd.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Muhammad Awad <112003944+mawad-amd@users.noreply.github.com>
@ypapadop-amd
Collaborator Author

This is looking awesome; thanks, Yiannis, for the improvements. And thanks for fixing the tensor random initialization bug too. I suggest adding a comment in the code on why the column id is 0, for readers (I didn't know about that). Other than that, the PR is looking great.

What should the comment say here? That 0 implies the runtime figures out placement? @jgmelber

@jgmelber
Collaborator

I thought column_id was removed?

@ypapadop-amd
Collaborator Author

I thought column_id was removed?

It was; I'm just asking if an additional comment is needed.

@jgmelber
Collaborator

It was; I'm just asking if an additional comment is needed.

I don't think so

@mawad-amd mawad-amd left a comment

Thanks, Yiannis!

@ypapadop-amd ypapadop-amd added this pull request to the merge queue Apr 25, 2025
Merged via the queue into main with commit ba43849 Apr 25, 2025
51 checks passed
@ypapadop-amd ypapadop-amd deleted the ypapadop/decorator-arg-passthrough branch April 25, 2025 20:33
Labels
enhancement New feature or request
Development

Successfully merging this pull request may close these issues.

Passing arguments to the JIT'd function
4 participants