Dealing with new model behaviors #56

Open
MrCsabaToth opened this issue Sep 30, 2024 · 3 comments
Assignees
Labels
enhancement New feature or request

Comments

@MrCsabaToth
Member

MrCsabaToth commented Sep 30, 2024

This is partly about the firebase-vertexai upgrade (#53) and partly about the new release of the -002 stable models. Function calling behavior and other things seem to have changed.

@MrCsabaToth MrCsabaToth added the enhancement New feature or request label Sep 30, 2024
@MrCsabaToth MrCsabaToth self-assigned this Sep 30, 2024
@MrCsabaToth
Member Author

New behavior: function calling changes

  1. I switched over to the -002 models explicitly, and the function calling behavior changed. In the past (and in the submission demo) I could simply ask "What will be the weather tomorrow" or "What will be the weather next week". The model assumed (correctly) that I implicitly meant the weather at my current location, and it resolved "tomorrow" and "next week" relative to the current date/time.
  2. The new model is very literal and picky; it doesn't treat anything as implied. Before answering, it asks whether I'll stay at my current location tomorrow (or next week). It also doesn't seem to be aware of the current date/time, so that needs to be stuffed into the prompt.
  3. The new models also cannot work out how to query the weather for "next week". The schema of the weather tool allows for start and end date parameters. The old model could easily deduce and fill in these two parameters to obtain next week's weather. The new model is lame and gives up; it somehow isn't able to figure out how to achieve this. It simply states that it can obtain the weather for a specific day but not for a week, which is false.
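
For illustration, here is a minimal Python sketch of the two workarounds hinted at above: stuffing the current date/time and location into the prompt, and resolving "next week" into a start and end date up front. The function names, the preamble wording, and the reading of "next week" as the next Monday through Sunday are all assumptions for this sketch, not code from this project.

```python
from datetime import date, datetime, timedelta


def next_week_range(today: date) -> tuple[date, date]:
    """Return (Monday, Sunday) of the calendar week after the one containing `today`.

    Assumption: "next week" means the next Monday-to-Sunday week.
    """
    days_until_next_monday = 7 - today.weekday()  # weekday(): Monday == 0
    start = today + timedelta(days=days_until_next_monday)
    return start, start + timedelta(days=6)


def context_preamble(now: datetime, latitude: float, longitude: float) -> str:
    """Build a text block to prepend to the prompt (the wording is hypothetical)."""
    return (
        f"Current date and time: {now.isoformat(timespec='minutes')}\n"
        f"Current user location: latitude {latitude:.4f}, longitude {longitude:.4f}\n"
        "Unless the user says otherwise, assume questions refer to this location "
        "and are relative to this date."
    )
```

The two dates returned by `next_week_range` could then be supplied as the weather tool's start and end date parameters, or spelled out in the prompt so the model does not have to derive them itself.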

@MrCsabaToth
Member Author

Another breaking change: function calling stopped working altogether, failing with the error "Please ensure that function call turn comes immediately after a user turn or after a function response turn."
Others are dealing with this too (GD Community Gemini API thread): https://discord.com/channels/1009525727504384150/1289794849003802735
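
The error message suggests that the chat history sent to the model has to follow a strict turn order. As a debugging aid, a small checker like the Python sketch below can locate the first turn that breaks that order. The turn labels (`"user"`, `"model_text"`, `"function_call"`, `"function_response"`) are a hypothetical simplification and not the SDK's own types, and this is not necessarily the fix applied in the referenced commits.

```python
from typing import List, Optional


def first_misplaced_function_call(turns: List[str]) -> Optional[int]:
    """Return the index of the first function call turn that breaks the ordering rule.

    `turns` holds one hypothetical label per history entry: "user",
    "model_text", "function_call", or "function_response".
    The rule checked here is the one quoted in the error message: a function
    call turn must come immediately after a user turn or a function response turn.
    """
    allowed_before = {"user", "function_response"}
    for i, kind in enumerate(turns):
        if kind != "function_call":
            continue
        if i == 0 or turns[i - 1] not in allowed_before:
            return i
    return None  # every function call turn is correctly placed
```

Running this over the history just before each request would show whether, for example, a plain model text turn has slipped in between the user's question and the function call.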

MrCsabaToth added a commit that referenced this issue Sep 30, 2024
MrCsabaToth added a commit that referenced this issue Sep 30, 2024
MrCsabaToth added a commit that referenced this issue Sep 30, 2024
…after a user turn or after a function response turn." problem #53 #56
@MrCsabaToth
Member Author

So function calling is kind of solved, although it is really nondeterministic. For example, for the "What's the weather tomorrow" question, about half of the time the model states that it has no means to obtain the weather, as if it couldn't see the weather tool at all. The other half of the time it just works.

The latest issue is that the model flat out states that it cannot process images. Pro seems to stick to this every time, whereas Flash can process the image (for example, it correctly OCRs the text on the food packaging) but then continues by saying it cannot handle images. Say what?

MrCsabaToth added a commit that referenced this issue Nov 12, 2024
…atform.endpoints.predict' denied on resource '//aiplatform.googleapis.com/projects/open-mmpa/locations/us-central1/publishers/google/models/gemini-1.5-flash-002' (or it may not exist)." #56