[Feature]: Support Multimodality, image understanding #147

@christ-tt

Description

Is your feature request related to a problem?

We want to be able to send image and audio inputs in chat, not just text.

Describe the Solution you'd like

Support an encoder; modify Request and other runtime features to carry multimodal inputs; implement image understanding model definitions.
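
To make the Request change concrete, here is a rough sketch of one possible shape. Everything in it is an assumption for discussion only: `MultimodalRequest`, `ImageInput`, `split_prompt`, and the `<image>` placeholder are hypothetical names and are not taken from this project's code. It only shows how a request could carry images alongside text and how the runtime could recover the interleaved order before handing the images to an encoder.

```python
from dataclasses import dataclass, field
from typing import List, Union

# Hypothetical marker for where an image sits in the prompt (an assumption).
IMAGE_PLACEHOLDER = "<image>"


@dataclass
class ImageInput:
    """Raw encoded image bytes plus a MIME type (hypothetical type)."""
    data: bytes
    mime_type: str = "image/png"


@dataclass
class MultimodalRequest:
    """A text prompt with zero or more attached images (hypothetical type)."""
    prompt: str
    images: List[ImageInput] = field(default_factory=list)

    def validate(self) -> None:
        # Each placeholder in the prompt must have exactly one attached image.
        n = self.prompt.count(IMAGE_PLACEHOLDER)
        if n != len(self.images):
            raise ValueError(
                f"prompt has {n} image placeholders but "
                f"{len(self.images)} images were attached"
            )


def split_prompt(request: MultimodalRequest) -> List[Union[str, ImageInput]]:
    """Return text segments and images interleaved in prompt order."""
    request.validate()
    parts = request.prompt.split(IMAGE_PLACEHOLDER)
    out: List[Union[str, ImageInput]] = []
    for i, text in enumerate(parts):
        if text:
            out.append(text)
        if i < len(request.images):
            out.append(request.images[i])
    return out
```

With a shape like this, text segments would go through the tokenizer as they do today, while each `ImageInput` would go to the encoder, and the model definition would decide how the two are merged.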

Alternatives Considered (Optional)

No response

Additional Context (Optional)

No response

Metadata

Labels

enhancement (New feature or request)
