
Sameerlite
Collaborator

Title

Fix Vertex AI embeddings JSON serialization error and add PSC endpoint support

Relevant issues

Fixes LIT-1096

Pre-Submission checklist

  • I have added testing in the tests/litellm/ directory (adding at least 1 test is a hard requirement)
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible; it only solves 1 specific problem

Type

🐛 Bug Fix
🆕 New Feature

Changes

This PR adds comprehensive support for Vertex AI Private Service Connect (PSC) endpoints, allowing users to supply custom api_base URLs for both completion and embedding requests. This enables access to privately deployed Vertex AI models through internal network endpoints.

Key Features Added

  1. PSC Endpoint URL Construction: Enhanced _check_custom_proxy() to properly construct full PSC URLs with the format:

    {api_base}/v1/projects/{project}/locations/{location}/endpoints/{model}:{endpoint}
    
  2. Numeric Model ID Support: Modified routing logic to ensure numeric endpoint IDs (common for custom deployments) properly use the HTTP-based handler that respects api_base.

  3. Comprehensive Parameter Passing: Updated all Vertex AI handlers to pass necessary parameters (vertex_project, vertex_location, vertex_api_version) for proper PSC URL construction.

  4. Bug Fix: Fixed a pre-existing JSON serialization bug in Vertex AI embeddings where non-serializable objects were being passed to TypedDict constructors.
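The URL format from item 1 can be sketched as follows. This is a hypothetical standalone helper for illustration only, not the actual `_check_custom_proxy()` implementation; the parameter names mirror the placeholders in the format string above.

```python
def build_psc_url(api_base: str, project: str, location: str,
                  model: str, endpoint: str) -> str:
    """Construct a full PSC URL in the format
    {api_base}/v1/projects/{project}/locations/{location}/endpoints/{model}:{endpoint}
    (illustrative sketch; the real logic lives in _check_custom_proxy())."""
    return (
        f"{api_base}/v1/projects/{project}/locations/{location}"
        f"/endpoints/{model}:{endpoint}"
    )

url = build_psc_url("http://10.96.32.8", "my-project-id", "us-central1",
                    "1234567890", "generateContent")
# url == "http://10.96.32.8/v1/projects/my-project-id/locations/us-central1/endpoints/1234567890:generateContent"
```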

Technical Changes

Core URL Construction (litellm/llms/vertex_ai/vertex_llm_base.py)

  • Enhanced _check_custom_proxy() to detect PSC endpoints and construct full URL paths
  • Added logic to handle both PSC endpoints and standard proxy configurations
  • Updated function signatures to accept additional Vertex AI parameters

Routing Logic (litellm/llms/vertex_ai/common_utils.py)

  • Modified get_vertex_ai_model_route() to route numeric model IDs with api_base to the HTTP-based handler
  • Ensures PSC endpoints use the correct code path that respects custom api_base
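The routing decision can be sketched like this. It is an assumption-laden simplification of `get_vertex_ai_model_route()` (the function name and bare-ID input are the only details taken from this PR description; the predicate below is illustrative):

```python
from typing import Optional

def routes_to_http_handler(model_id: str, api_base: Optional[str]) -> bool:
    """Illustrative sketch of the routing rule: a purely numeric endpoint
    ID combined with a custom api_base should take the HTTP-based handler,
    since that is the code path that respects api_base (and thus PSC URLs)."""
    return model_id.isdigit() and api_base is not None

# Numeric endpoint ID with a PSC api_base -> HTTP-based handler
assert routes_to_http_handler("1234567890", "http://10.96.32.8")
# Named model without a custom api_base -> default routing
assert not routes_to_http_handler("gemini-1.5-pro", None)
```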

Handler Updates

Updated all Vertex AI handlers to pass required parameters:

  • vertex_gemma_models/main.py
  • vertex_model_garden/main.py
  • context_caching/vertex_ai_context_caching.py
  • batches/handler.py

Bug Fix (litellm/llms/vertex_ai/vertex_embeddings/transformation.py)

  • Fixed JSON serialization issue by filtering optional_params to only include valid TypedDict fields
  • Prevents ClientSession and other non-serializable objects from being passed to JSON serialization
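The filtering approach can be sketched as follows. The TypedDict name and its fields are hypothetical stand-ins, not the actual types in transformation.py; the point is that only declared TypedDict keys survive, so non-serializable extras (such as an aiohttp ClientSession) never reach JSON serialization:

```python
from typing import TypedDict

class VertexEmbeddingParams(TypedDict, total=False):
    # Hypothetical subset of valid fields, for illustration only
    auto_truncate: bool
    task_type: str

def filter_to_typeddict_fields(optional_params: dict) -> dict:
    """Keep only keys that are declared TypedDict fields, dropping
    anything else (e.g. a ClientSession) before JSON serialization."""
    allowed = VertexEmbeddingParams.__annotations__.keys()
    return {k: v for k, v in optional_params.items() if k in allowed}

params = filter_to_typeddict_fields(
    {"task_type": "RETRIEVAL_QUERY", "aiohttp_session": object()}
)
# params == {"task_type": "RETRIEVAL_QUERY"}
```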

Usage Example

```python
import litellm

# PSC endpoint configuration
response = litellm.completion(
    model="vertex_ai/1234567890",  # Numeric endpoint ID
    messages=[{"role": "user", "content": "Hello"}],
    api_base="http://10.96.32.8",  # PSC endpoint
    vertex_project="my-project-id",
    vertex_location="us-central1",
)

# Embeddings are also supported
response = litellm.embedding(
    model="bge-small-en-v1.5",
    input=["Hello", "World"],
    api_base="http://10.96.32.8",
    vertex_project="my-project-id",
    vertex_location="us-central1",
)
```

Or specify in config.yaml:

```yaml
model_list:
  - model_name: bge-small-en-v1.5
    litellm_params:
      model: vertex_ai/1234567890
      api_base: http://10.96.32.8  # Your PSC IP
      vertex_project: my-project-id
      vertex_location: us-central1
```
[screenshot: new test passing locally]


vercel bot commented Oct 10, 2025

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Preview | Comments | Updated (UTC) |
| ------- | ---------- | ------- | -------- | ------------- |
| litellm | Ready | Preview | Comment | Oct 10, 2025 9:21am |
