Skip to content

llama model is not fully lowered to ANE (coreml backend) #4091

Open
@cccclai

Description

@cccclai

As title, the model definition is here: https://github.com/pytorch/executorch/blob/main/examples/models/llama2/llama_transformer.py

There are two parts not lowered to ANE, including embedding and kv cache update

Metadata

Metadata

Assignees

No one assigned

    Labels

    module: coremlIssues related to Apple's Core ML delegation and code under backends/apple/coreml/triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions