Overview
Use standard embeddings for independent text embedding (queries, documents, and semantic search) where each text is self-contained.Models
| Model | Dimensions | Context | MRL | Quantization | Price ($/1M tokens) |
|---|---|---|---|---|---|
pplx-embed-v1-0.6b | 1024 | 32K | Yes | INT8/BINARY | $0.004 |
pplx-embed-v1-4b | 2560 | 32K | Yes | INT8/BINARY | $0.03 |
Basic Usage
Generate embeddings for a list of texts:Response
Response
Semantic Search Example
Build a simple semantic search system:Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
input | string | array[string] | Yes | - | Text(s) to embed. Max 512 texts per request. Each input must not exceed 32K tokens. Total tokens must not exceed 120,000. Empty strings are not allowed. |
model | string | Yes | - | Model identifier: pplx-embed-v1-0.6b or pplx-embed-v1-4b |
dimensions | integer | No | Full | Matryoshka dimension (128-1024 for 0.6b, 128-2560 for 4b) |
encoding_format | string | No | base64_int8 | Output encoding: base64_int8 (signed int8) or base64_binary (packed bits) |
Related Resources
Contextualized Embeddings
Document-aware embeddings for chunks that share context.
Best Practices
Batch processing, caching, and RAG patterns.