Gemini 3.5 Flash

provider: 'vertex' · model: 'gemini-3-5-flash'

Configuration

{
  provider: 'vertex',
  model: 'gemini-3-5-flash',
  extra: {
    project: 'my-gcp-project',
    location: 'us-central1',
  },
  timeoutMs: 30_000,  // optional
}

See Google Vertex AI for authentication and region setup.

Specifications


Max input tokens	1,000,000
Max output tokens	65,536
Temperature	`0` (fixed — not configurable)

Cost

	Cost
Input tokens	$1.50 / million
Output tokens	$9.00 / million

Costs are estimates based on Vertex AI pricing. Actual billing depends on your GCP contract and region.

Timeout

timeoutMs applies per individual LLM call within a batch. If unset, no explicit timeout is applied beyond the provider's default.

Google Vertex AI
LLM models overview

LLM models

Providers

Models

Finding workflow

Axioms

Layers and boundaries

Hidden coupling (connascence)

Shared meaning

Order-dependent code

Paired algorithms

Git history

Enrichers

Insights

Gemini 3.5 Flash

Configuration

Specifications

Cost

Timeout

Layers and boundaries

Hidden coupling (connascence)

Shared meaning

Order-dependent code

Paired algorithms

Git history

Gemini 3.5 Flash ​

Configuration ​

Specifications ​

Cost ​

Timeout ​

Related ​

Gemini 3.5 Flash

Configuration

Specifications

Cost

Timeout

Related