Skip to main content
Google Cloud Documentation
Technology areas
  • AI and ML
  • Application development
  • Application hosting
  • Compute
  • Data analytics and pipelines
  • Databases
  • Distributed, hybrid, and multicloud
  • Generative AI
  • Industry solutions
  • Networking
  • Observability and monitoring
  • Security
  • Storage
Cross-product tools
  • Access and resources management
  • Costs and usage management
  • Infrastructure as code
  • Migration
  • SDK, languages, frameworks, and tools
/
Console
  • English
  • Deutsch
  • Español
  • Español – América Latina
  • Français
  • Indonesia
  • Italiano
  • Português
  • Português – Brasil
  • 中文 – 简体
  • 中文 – 繁體
  • 日本語
  • 한국어
Sign in
  • Generative AI on Vertex AI
  • Documentation
Start free
Guides API reference Vertex AI Cookbook Prompt gallery Resources FAQ Pricing
Google Cloud Documentation
  • Technology areas
    • More
    • Guides
    • API reference
    • Vertex AI Cookbook
    • Prompt gallery
    • Resources
    • FAQ
    • Pricing
  • Cross-product tools
    • More
  • Console
  • Discover
    • Overview of Generative AI on Vertex AI
    • Generative AI beginner's guide
    • Glossary
  • Get started
    • Get an API key
    • Configure application default credentials
    • API quickstart
    • Vertex AI Studio quickstart
    • Migrate from Google AI Studio to Vertex AI
    • Deploy your Vertex AI Studio prompt as a web application
    • Vertex AI Studio capabilities
    • Generate an image and verify its watermark using Imagen
    • Google GenAI libraries
    • Compatibility with OpenAI library
    • Vertex AI in express mode
    • Overview
    • Console tutorial
    • API tutorial
  • Select models
    • Model Garden
    • Overview of Model Garden
    • Use models in Model Garden
    • Test model capabilities
    • Supported models
    • Google Models
    • Overview
    • Gemini
      • Gemini 2.5 Pro
      • Gemini 2.5 Flash
      • Gemini 2.5 Flash Image
      • Gemini 2.5 Flash Live API
      • Gemini 2.5 Flash-Lite
      • Gemini 2.0 Flash
      • Gemini 2.0 Flash-Lite
      • Vertex AI Model Optimizer
      • Migrate to the latest Gemini models
      • SDKs
    • Imagen
      • Imagen 3.0 Generate 002
      • Imagen 3.0 Generate 001
      • Imagen 3.0 Fast Generate 001
      • Imagen 3.0 Capability 001
      • Imagen 4.0 Generate
      • Imagen 4.0 Fast Generate
      • Imagen 4.0 Ultra Generate
      • Virtual Try-On Preview 08-04
      • Imagen product recontext preview 06-30
      • Migrate to Imagen 3
    • Veo
      • Veo 2
      • Veo 2 Preview
      • Veo 2 Experimental
      • Veo 3
      • Veo 3 Fast
      • Veo 3 preview
      • Veo 3 Fast preview
      • Veo 3.1 preview
      • Veo 3.1 Fast preview
    • Lyria
      • Lyria 2
    • Model versions
    • Managed models
    • Model as a Service (MaaS) overview
    • Partner models
      • Overview
      • Claude
        • Overview
        • Request predictions
        • Batch predictions
        • Prompt caching
        • Count tokens
        • Web search
        • Safety classifiers
        • Model details
        • Claude Sonnet 4.5
        • Claude Opus 4.1
        • Claude Haiku 4.5
        • Claude Opus 4
        • Claude Sonnet 4
        • Claude 3.7 Sonnet
        • Claude 3.5 Haiku
        • Claude 3 Haiku
      • Mistral AI
        • Overview
        • Model details
        • Mistral Medium 3
        • Mistral OCR (25.05)
        • Mistral Small 3.1 (25.03)
        • Codestral 2
    • Open models
      • Overview
      • Grant access to open models
      • Models
      • DeepSeek
        • Overview
        • DeepSeek-R1-0528
        • DeepSeek-V3.1
      • OpenAI
        • Overview
        • OpenAI gpt-oss-120b
        • OpenAI gpt-oss-20b
      • Qwen
        • Overview
        • Qwen 3 Next Instruct 80B
        • Qwen 3 Next Thinking 80B
        • Qwen 3 Coder
        • Qwen 3 235B
      • Embedding (e5)
        • Multilingual E5 Small
        • Multilingual E5 Large
      • Llama
        • Overview
        • Request predictions
        • Model details
        • Llama 4 Maverick
        • Llama 4 Scout
        • Llama 3.3
        • Llama 3.2
        • Llama 3.1 405b
        • Llama 3.1 70b
        • Llama 3.1 8b
      • Model deprecations (MaaS)
      • API
      • Call MaaS APIs for open models
      • Function calling
      • Thinking
      • Structured output
      • Batch prediction
    • Self-deployed models
    • Overview
    • Deploy models with custom weights
    • Google Gemma
      • Use Gemma
      • Tutorial: Deploy and inference Gemma (GPU)
      • Tutorial: Deploy and inference Gemma (TPU)
    • Llama
    • Use Hugging Face Models
    • Comprehensive guide to vLLM for Text and Multimodal LLM Serving (GPU)
    • vLLM TPU
    • Hex-LLM
    • xDiT
    • Tutorial: Deploy Llamma 3 models with SpotVM and Reservations
    • Model Garden notebooks
      • Tutorial: Optimize model performance with advanced features in Model Garden
  • Build
    • Prompt design
    • Introduction to prompting
    • Prompting strategies
      • Overview
      • Give clear and specific instructions
      • Use system instructions
      • Include few-shot examples
      • Add contextual information
      • Structure prompts
      • Compare prompts
      • Instruct the model to explain its reasoning
      • Break down complex tasks
      • Experiment with parameter values
      • Prompt iteration strategies
    • Task-specific prompt guidance
      • Design multimodal prompts
      • Design chat prompts
      • Design medical text prompts
    • Capabilities
    • Safety
      • Overview
      • Responsible AI
      • System instructions for safety
      • Configure content filters
      • Gemini for safety filtering and content moderation
      • Abuse monitoring
      • Process blocked responses
    • Text and code generation
      • Text generation
      • System instructions
      • Function calling
      • Structured output
      • Content generation parameters
      • Code execution
      • Medical text
    • Image generation
      • Gemini
      • Generate images with Gemini
      • Edit images with Gemini
      • Imagen
      • Imagen overview
      • Generate images using text prompts
      • Verify an image watermark
      • Configure Imagen parameters
        • Configure Responsible AI safety settings
        • Use prompt rewriter
        • Set text prompt language
        • Configure aspect ratio
        • Set output resolution
        • Omit content using a negative prompt
        • Generate deterministic images
      • Generate images for retail and e-commerce
        • Generate Virtual Try-On images
        • Recontextualize product images
      • Edit images
        • Overview
        • Insert objects into an image using inpaint
        • Remove objects from an image using inpaint
        • Expand the content of an image using outpaint
        • Replace the background of an image
        • Edit using Personalization
        • Edit images using text prompts
      • Customize images
        • Subject customization
        • Style customization
        • Controlled Customization
        • Instruct Customization
      • Upscale an image
      • Prompt and image attribute guide
      • Base64 encode and decode files
      • Responsible AI and usage guidelines for Imagen
      • Legacy features
        • Migrate to Imagen 3
        • Get image descriptions using visual captioning
        • Use Visual Question Answering
        • Get video descriptions using Imagen
    • Video generation
      • Introduction to Veo
      • Generate Veo videos from text prompts
      • Generate Veo videos from an image
      • Generate Veo videos using first and last video frames