Skip to content

[Roadmap]: Gemini Integration #2387

Open

Description

Tip

Want to get involved?

We'd love it if you did! Please get in contact with the people assigned to this issue, or leave a comment. See general contributing advice here too.

Intro

Gemini is a Google GenAI product that offers another set of GenAI models for users. Since December 2023, our "gemini" branch has seen significant community interest, including the integration of Gemma with autogen earlier this year. We're now excited to bring the experimental Gemini branch into AutoGen's main release. While we currently offer Gemini chat completion and vision models, this roadmap aims to unlock more potential of Google Gemini within AutoGen, laying out TODOs, feature requests, and future directions.

The Roadmap

Merged PR: #2360

Future:

  • GenerationConfig Parameters: temperature, max_tokens, etc.
  • Multiple Responses: generate several responses at the same time.
  • More Accurate Cost/Token Calculation. Great feedback from @joshkyh
  • Function Calling.
  • JSON Output: structure outputs in JSON.
  • Image Generation.
  • Other Modalities: audio/video generation and processing.
  • Safety Settings: user-customizable safety controls. Feedback from @marklysze chat.send_message("hi", stream=stream, safety_settings=safety). documentation
  • Multi-turn Vision Chat: improve conversational context for image tasks.
  • Streaming: real-time output from the client.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

Labels

0.2Issues which were filed before re-arch to 0.4needs-triageroadmapIssues related to roadmap of AutoGen

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions