Vertex AI¶
Google Cloud Vertex AI platform for accessing Gemini and other models.
Configuration¶
PROVIDER=vertexai
PROJECT=your-gcp-project-id
LOCATION=us-central1
MODEL=gemini-1.5-pro
TEMPERATURE=0.3
Key Parameters¶
Parameter | Description | Default |
---|---|---|
PROJECT |
GCP project ID (required) | - |
LOCATION |
GCP region (required) | - |
MODEL |
Model to use | - |
TEMPERATURE |
Randomness (0.0-1.0) | 0.3 |
TOP_P |
Nucleus sampling | 1.0 |
TOP_K |
Top-k sampling | - |
MAX_TOKENS |
Max response tokens | 1024 |
TIMEOUT |
Request timeout (seconds) | 60 |
Features¶
- ✅ Streaming responses
- ✅ Function calling
- ✅ MCP support
- ✅ Vision capabilities
- ✅ Enterprise features
- ✅ Multi-modal support
Authentication¶
Vertex AI requires Google Cloud authentication. Set up authentication using one of these methods:
# Application Default Credentials
gcloud auth application-default login
# Service Account Key
export GOOGLE_APPLICATION_CREDENTIALS=/path/to/service-account.json
# Workload Identity (for GKE)
# Configured automatically in GKE environment
Important Notes¶
- Based on Gemini provider implementation
- Requires GCP project and location configuration
- Automatic authentication via Google Cloud SDK
- Enterprise-grade security and compliance
- Regional model availability may vary
- Billing through Google Cloud Console