Open
Description
As title, the model definition is here: https://github.com/pytorch/executorch/blob/main/examples/models/llama2/llama_transformer.py
There are two parts not lowered to ANE, including embedding and kv cache update
As title, the model definition is here: https://github.com/pytorch/executorch/blob/main/examples/models/llama2/llama_transformer.py
There are two parts not lowered to ANE, including embedding and kv cache update