[Roadmap]:  Gemini Integration

> [!TIP]
> ## Want to get involved?
> We'd love it if you did! Please get in contact with the people assigned to this issue, or leave a comment. See general contributing advice [here](https://microsoft.github.io/autogen/docs/Contribute) too.


### Intro

Gemini is a Google GenAI product that offers another set of GenAI models for users. Since December 2023, our "gemini" branch has seen significant community interest, including the integration of Gemma with autogen earlier this year.  We're now excited to bring the experimental Gemini branch into AutoGen's main release. While we currently offer Gemini chat completion and vision models, this roadmap aims to unlock more potential of Google Gemini within AutoGen, laying out TODOs, feature requests, and future directions.

### The Roadmap

Merged PR: https://github.com/microsoft/autogen/pull/2360


Future:
- GenerationConfig Parameters: temperature, max_tokens, etc.
- Multiple Responses: generate several responses at the same time.
- More Accurate Cost/Token Calculation. Great feedback from @joshkyh
- Function Calling.
- JSON Output: structure outputs in JSON. 
- Image Generation.
- Other Modalities: audio/video generation and processing.
- Safety Settings: user-customizable safety controls. Feedback from @marklysze `chat.send_message("hi", stream=stream, safety_settings=safety)`. [documentation](https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/configure-safety-attributes)
- Multi-turn Vision Chat: improve conversational context for image tasks. 
- Streaming: real-time output from the client.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Roadmap]: Gemini Integration #2387

BeibinLi
openedon Apr 15, 2024

Want to get involved?

Intro

The Roadmap

Assignees

Labels

Type

Projects

Milestone

Relationships

Development