Gemini Embedding Models
The gemini-embedding-2-preview model supports embedding across multiple modalities (text, image, video, and audio) through a single unified endpoint.
| Input Type | Supported Formats |
|---|---|
| Text | Plain string or structured object |
| Image | GCS URI, HTTPS URL, base64, data URI |
| Video | GCS URI, HTTPS URL, base64 |
| Audio | GCS URI, HTTPS URL, base64 |
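As a rough illustration of the table above, the sketch below labels a media reference string by format and checks it against the formats listed for each modality. The helper names (`classify_reference`, `check_input`) and the format labels are hypothetical and not part of any SDK, and the base64 check is only a heuristic based on the standard alphabet and padding.

```python
import re

# Formats accepted per media modality, copied from the table above.
SUPPORTED_FORMATS = {
    "image": {"gcs_uri", "https_url", "base64", "data_uri"},
    "video": {"gcs_uri", "https_url", "base64"},
    "audio": {"gcs_uri", "https_url", "base64"},
}

# Standard base64 alphabet with up to two padding characters.
_BASE64_RE = re.compile(r"[A-Za-z0-9+/]+={0,2}")


def classify_reference(value: str) -> str:
    """Return a format label for a media reference (hypothetical helper)."""
    if value.startswith("gs://"):
        return "gcs_uri"
    if value.startswith("https://"):
        return "https_url"
    if value.startswith("data:"):
        return "data_uri"
    # Heuristic only: length is a multiple of 4 and the alphabet matches.
    if value and len(value) % 4 == 0 and _BASE64_RE.fullmatch(value):
        return "base64"
    raise ValueError(f"unrecognized media reference: {value[:30]!r}")


def check_input(modality: str, value: str) -> str:
    """Raise ValueError if the table does not list this format for the modality."""
    fmt = classify_reference(value)
    if fmt not in SUPPORTED_FORMATS[modality]:
        raise ValueError(f"{modality} input does not support {fmt}")
    return fmt
```

For example, `check_input("image", "data:image/png;base64,AAAA")` passes, while the same data URI submitted as `"video"` is rejected, because the table lists data URIs only for images.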
Optional parameters: task_type, dimensions
Embedding Text
Embedding Images
Embedding Videos
Embedding Audio
Multimodal Embedding (Mixed Inputs)
You can combine multiple input types in a single request.
Setting Task Type and Dimensions
You can optionally specify task_type and dimensions to control the embedding behavior.
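A minimal sketch of how the two optional settings might be attached to a request, assuming a dictionary-shaped payload. The function `build_embedding_request` and the `model` and `input` field names are hypothetical and used only for illustration; only `task_type`, `dimensions`, and the model name come from this page. Check your client library for the actual call shape.

```python
from typing import Any, Optional


def build_embedding_request(
    inputs: list[str],
    task_type: Optional[str] = None,
    dimensions: Optional[int] = None,
    model: str = "gemini-embedding-2-preview",
) -> dict[str, Any]:
    """Assemble a request dict, leaving out options the caller did not set.

    The payload shape here is an assumption, not a documented schema.
    """
    if dimensions is not None and dimensions <= 0:
        raise ValueError("dimensions must be a positive integer")
    request: dict[str, Any] = {"model": model, "input": inputs}
    # Only include optional fields that were explicitly provided,
    # so the service defaults apply otherwise.
    if task_type is not None:
        request["task_type"] = task_type
    if dimensions is not None:
        request["dimensions"] = dimensions
    return request


# The task type value below is illustrative; accepted values are not listed here.
request = build_embedding_request(
    ["What is an embedding?"],
    task_type="RETRIEVAL_QUERY",
    dimensions=768,
)
```

Omitting unset options instead of sending explicit nulls keeps the request minimal and leaves default behavior to the service.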
Legacy Embedding Models
The following sections cover the older Vertex AI embedding models, such as textembedding-gecko@003 and multimodalembedding@001.

