Skip to content

Gemini 3.5 Flash

provider: 'vertex' · model: 'gemini-3-5-flash'

Configuration

ts
{
  provider: 'vertex',
  model: 'gemini-3-5-flash',
  extra: {
    project: 'my-gcp-project',
    location: 'us-central1',
  },
  timeoutMs: 30_000,  // optional
}

See Google Vertex AI for authentication and region setup.

Specifications

Max input tokens1,000,000
Max output tokens65,536
Temperature0 (fixed — not configurable)

Cost

Cost
Input tokens$1.50 / million
Output tokens$9.00 / million

Costs are estimates based on Vertex AI pricing. Actual billing depends on your GCP contract and region.

Timeout

timeoutMs applies per individual LLM call within a batch. If unset, no explicit timeout is applied beyond the provider's default.