Omit content using a negative prompt | Generative AI on Vertex AI

Skip to main content

Technology areas

AI and ML
Application development
Application hosting
Compute
Data analytics and pipelines
Databases
Distributed, hybrid, and multicloud
Generative AI
Industry solutions
Networking
Observability and monitoring
Security
Storage

Cross-product tools

Access and resources management
Costs and usage management
Infrastructure as code
Migration
SDK, languages, frameworks, and tools

/

Console

English
Deutsch
Español
Español – América Latina
Français
Indonesia
Italiano
Português
Português – Brasil
中文 – 简体
中文 – 繁體
日本語
한국어

Sign in

Generative AI on Vertex AI
Documentation

Start free

Guides API reference Vertex AI Cookbook Prompt gallery Resources FAQ Pricing

Technology areas
- More
Cross-product tools
- More
Console

Discover
Get started
Select models
- Model Garden
- Overview of Model Garden
- Use models in Model Garden
- Test model capabilities
- Supported models
- Google Models
- Overview
- Gemini
- Imagen
- Veo
- Lyria
  - Lyria 2
- Model versions
- Managed models
- Model as a Service (MaaS) overview
- Partner models
  - Overview
  - Claude
    Overview
    Request predictions
    Batch predictions
    Prompt caching
    Count tokens
    Web search
    Safety classifiers
    Model details
    Claude Sonnet 4.5
    Claude Opus 4.1
    Claude Haiku 4.5
    Claude Opus 4
    Claude Sonnet 4
    Claude 3.7 Sonnet
    Claude 3.5 Haiku
    Claude 3 Haiku
  - Mistral AI
    Overview
    Model details
    Mistral Medium 3
    Mistral OCR (25.05)
    Mistral Small 3.1 (25.03)
    Codestral 2
- Open models
  - Overview
  - Grant access to open models
  - Models
  - DeepSeek
    Overview
    DeepSeek-R1-0528
    DeepSeek-V3.1
  - OpenAI
    Overview
    OpenAI gpt-oss-120b
    OpenAI gpt-oss-20b
  - Qwen
    Overview
    Qwen 3 Next Instruct 80B
    Qwen 3 Next Thinking 80B
    Qwen 3 Coder
    Qwen 3 235B
  - Embedding (e5)
    Multilingual E5 Small
    Multilingual E5 Large
  - Llama
    Overview
    Request predictions
    Model details
    Llama 4 Maverick
    Llama 4 Scout
    Llama 3.3
    Llama 3.2
    Llama 3.1 405b
    Llama 3.1 70b
    Llama 3.1 8b
  - Model deprecations (MaaS)
  - API
  - Call MaaS APIs for open models
  - Function calling
  - Thinking
  - Structured output
  - Batch prediction
- Self-deployed models
- Overview
- Deploy models with custom weights
- Google Gemma
- Llama
- Use Hugging Face Models
- Comprehensive guide to vLLM for Text and Multimodal LLM Serving (GPU)
- vLLM TPU
- Hex-LLM
- xDiT
- Tutorial: Deploy Llamma 3 models with SpotVM and Reservations
- Model Garden notebooks
  - Tutorial: Optimize model performance with advanced features in Model Garden
Build
- Prompt design
- Introduction to prompting
- Prompting strategies
- Task-specific prompt guidance
- Capabilities
- Safety
- Text and code generation
- Image generation
- Video generation