Skip to content

[FEATURE] Support Multi-Modal input to __call__ #212

Open
@zastrowm

Description

@zastrowm

Problem Statement

I would like strands to support passing in more datatypes to Agent __call__ so that the images, audio, etc can be passed into the agent

Proposed Solution

This should support a ToolResult as input

agent = Agent()
agent(ToolResult(...))

This should also allow for an empty input to invoke, and it will invoke the current messages array.

agent = Agent(messages=[...])
agent()

Use Case

Passing in images or text or both would allow responding to agents needing more than just text

Alternatives Solutions

No response

Additional Context

No response

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions