[FEATURE] Static Inference Support for RawC and GGML #228

Open
@emrecakmakyurdu

Description

Feature Request

Describe the Feature

Static inference is not supported in the RawC and GGML backends: they currently fall back to dynamic execution even when all inputs are constant. Supporting static inference would allow such operations to be pre-computed at compile time, improving performance.

Motivation

This feature would eliminate the need for dynamic execution of constant computations, improving efficiency and reducing runtime overhead whenever constant inputs are supplied.

Proposed Solution

1. RawC Backend

  • Develop Python wrapper functions that execute supported operations directly on the RawC backend when static inputs are supplied (a rough sketch follows below).
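
As a rough sketch, such a wrapper could load the compiled RawC operations through `ctypes` and run them eagerly on the constant data. The library name `librawc_ops.so` and the C signature of `add` below are assumptions for illustration, not the actual build artifacts:

```python
import ctypes

import numpy as np

# Hypothetical shared library built from the RawC sources; the real name
# and build path depend on how the RawC backend is compiled.
_lib = ctypes.CDLL("./librawc_ops.so")

# Assumed C signature: void add(const float *a, const float *b, float *out, int n);
_lib.add.argtypes = [
    ctypes.POINTER(ctypes.c_float),
    ctypes.POINTER(ctypes.c_float),
    ctypes.POINTER(ctypes.c_float),
    ctypes.c_int,
]
_lib.add.restype = None


def static_add(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Run the RawC `add` kernel eagerly on constant inputs."""
    a = np.ascontiguousarray(a, dtype=np.float32)
    b = np.ascontiguousarray(b, dtype=np.float32)
    out = np.empty_like(a)
    _lib.add(
        a.ctypes.data_as(ctypes.POINTER(ctypes.c_float)),
        b.ctypes.data_as(ctypes.POINTER(ctypes.c_float)),
        out.ctypes.data_as(ctypes.POINTER(ctypes.c_float)),
        a.size,
    )
    return out
```

One such wrapper per supported operation (or a single dispatch table keyed by operation name) should be enough to cover the static-inference path.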

2. GGML Backend

  • Use the RawC backend operations as the basis for the computations.
  • Convert GGML arrays to plain C arrays before passing them to these functions, and convert the results back to GGML arrays.
  • In GGML code generation, bypass tensor creation and graph marking for statically inferred keys by assigning these keys directly to the output (see the sketch after this list).
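
A minimal sketch of how this could look on the Python side, assuming the GGML backend can expose tensor data as NumPy-compatible buffers and reusing the RawC wrappers from the previous sketch. The names `RAWC_STATIC_OPS`, `infer_static`, and the `make_tensor` / `mark_in_graph` callables are placeholders, not existing APIs:

```python
from typing import Callable, Dict

import numpy as np

# Hypothetical registry of the ctypes wrappers sketched for the RawC
# backend, keyed by operation name, e.g. {"add": static_add}.
RAWC_STATIC_OPS: Dict[str, Callable[..., np.ndarray]] = {}


def infer_static(op: str, *ggml_inputs) -> np.ndarray:
    """Evaluate `op` eagerly through the RawC wrappers.

    The inputs are assumed to be convertible to contiguous float32 buffers;
    the real conversion depends on the GGML backend's array type, and the
    result would be wrapped back into a GGML array by the backend.
    """
    c_inputs = [np.ascontiguousarray(x, dtype=np.float32) for x in ggml_inputs]
    return RAWC_STATIC_OPS[op](*c_inputs)


def assign_output(key, cache, static_values, make_tensor, mark_in_graph):
    """Code-generation step: skip tensor creation for statically inferred keys."""
    if key in static_values:
        # The value is already known, so no ggml tensor is created and the
        # node is never marked in the compute graph.
        cache[key] = static_values[key]
    else:
        tensor = make_tensor(key)
        mark_in_graph(tensor)
        cache[key] = tensor
```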

Alternatives Considered

An alternative approach for the GGML backend would be to build a separate dynamic library that manages the GGML flow for tensor operations. However, this would require allocating a GGML context and memory buffer for every static inference call, potentially offsetting the performance benefits.
