Skip to content

Vertex AI

Google Cloud Vertex AI platform for accessing Gemini and other models.

Configuration

PROVIDER=vertexai
PROJECT=your-gcp-project-id
LOCATION=us-central1
MODEL=gemini-1.5-pro
TEMPERATURE=0.3

Key Parameters

Parameter Description Default
PROJECT GCP project ID (required) -
LOCATION GCP region (required) -
MODEL Model to use -
TEMPERATURE Randomness (0.0-1.0) 0.3
TOP_P Nucleus sampling 1.0
TOP_K Top-k sampling -
MAX_TOKENS Max response tokens 1024
TIMEOUT Request timeout (seconds) 60

Features

  • ✅ Streaming responses
  • ✅ Function calling
  • ✅ MCP support
  • ✅ Vision capabilities
  • ✅ Enterprise features
  • ✅ Multi-modal support

Authentication

Vertex AI requires Google Cloud authentication. Set up authentication using one of these methods:

# Application Default Credentials
gcloud auth application-default login

# Service Account Key
export GOOGLE_APPLICATION_CREDENTIALS=/path/to/service-account.json

# Workload Identity (for GKE)
# Configured automatically in GKE environment

Important Notes

  • Based on Gemini provider implementation
  • Requires GCP project and location configuration
  • Automatic authentication via Google Cloud SDK
  • Enterprise-grade security and compliance
  • Regional model availability may vary
  • Billing through Google Cloud Console