id | str | "meta-llama/Llama-2-70b-chat-hf" | The id of the DeepInfra model to use |
name | str | "DeepInfra" | The name of the model |
provider | str | "DeepInfra" | The provider of the model |
api_key | Optional[str] | None | The API key for DeepInfra (defaults to the `DEEPINFRA_API_KEY` environment variable) |
base_url | str | "https://api.deepinfra.com/v1/openai" | The base URL for the DeepInfra API |
retries | int | 0 | Number of retries to attempt before raising a ModelProviderError |
delay_between_retries | int | 1 | Delay between retries, in seconds |
exponential_backoff | bool | False | If True, the delay between retries doubles after each retry |
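
The three retry parameters interact, so a small illustration of the resulting wait schedule may help. The sketch below is not the library's implementation: `retry_delays` is a hypothetical helper, and it assumes the first retry waits `delay_between_retries` seconds and that doubling starts from the second retry onward.

```python
from typing import List


def retry_delays(
    retries: int = 0,
    delay_between_retries: int = 1,
    exponential_backoff: bool = False,
) -> List[int]:
    """Return the wait (in seconds) before each retry attempt.

    Hypothetical helper for illustration only; the argument names and
    defaults mirror the table above, the schedule itself is an assumption.
    """
    delays = []
    delay = delay_between_retries
    for _ in range(retries):
        delays.append(delay)
        if exponential_backoff:
            # Assumed behavior: double the wait after every retry.
            delay *= 2
    return delays


print(retry_delays(retries=3))                            # [1, 1, 1]
print(retry_delays(retries=3, exponential_backoff=True))  # [1, 2, 4]
```

With the default `retries=0` the schedule is empty, which corresponds to a failed request raising immediately without any retry.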