Direct any model (named without `claude-`) to the OpenAI backend. #14

namin · 2024-10-06T04:24:24Z

The old logic (OpenAI backend only for `gpt-` names) directed any OpenAI-compatible server model configuration to the Claude backend, as it was picked by default for any model named without `gpt-`.

It is easier to just switch the default backend, rather than considering whether the environment variable `OPENAI_BASE_URL` is set.

I confirmed the fix on the following run:

- Use Ollama: `export OPENAI_BASE_URL="http://localhost:11434/v1"`
- Set dummy OpenAI key: `export OPENAI_API_KEY="ollama"`
- Run README command with extra `agent` parameters: `aide agent.code.model="qwen2.5" agent.feedback.model="qwen2.5" data_dir="example_tasks/house_prices" goal="Predict the sales price for each house" eval="Use the RMSE metric between the logarithm of the predicted and observed values."`
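For readers who want the routing change spelled out, here is a minimal sketch of the before and after behavior described above. The function names `pick_backend_old` and `pick_backend_new` and the string labels are hypothetical, and the prefix check is an assumption about how the name test is done; this is not the project's actual code.

```python
# Hypothetical sketch of the backend selection described in this PR.
# Names and return values are illustrative, not taken from the codebase.

def pick_backend_old(model: str) -> str:
    """Old rule: only `gpt-` names go to OpenAI; everything else defaults to Claude."""
    return "openai" if model.startswith("gpt-") else "anthropic"


def pick_backend_new(model: str) -> str:
    """New rule: only `claude-` names go to Claude; everything else defaults to OpenAI."""
    return "anthropic" if model.startswith("claude-") else "openai"


if __name__ == "__main__":
    # A model served by an OpenAI-compatible server such as Ollama.
    for name in ("qwen2.5", "gpt-4-turbo", "claude-3-5-sonnet"):
        print(name, pick_backend_old(name), pick_backend_new(name))
```

Under the old rule a name such as `qwen2.5` falls through to the Claude backend, which is the misrouting reported here; under the new rule it reaches the OpenAI backend, where `OPENAI_BASE_URL` can point it at a local server.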


ZhengyaoJiang · 2024-10-22T10:04:59Z

Thanks for the contribution!

ZhengyaoJiang merged commit d9a3340 into WecoAI:main Oct 22, 2024

dexhunter mentioned this pull request Oct 23, 2024

📝 Update README to include local llm usage #17

Merged
