Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.nebius.com/llms.txt

Use this file to discover all available pages before exploring further.

You can monitor computing and storage resources of your Serverless AI endpoints and jobs on the dashboards in the Nebius AI Cloud web console. To do so, go to the page of the job or endpoint you would like to review and switch to the Metrics tab. Use the dashboards to monitor current resource utilization, get information to schedule quota increases and quickly identify anomalies. In case of endpoint and job issues, the dashboards help the Nebius support team investigate the issues. Data for the dashboards is collected automatically. For more information about metrics collection, see Monitoring agent on Compute virtual machines.

Explore the dashboard

Usage data for endpoints and jobs becomes available 5–10 minutes after a resource is created. Use time filters to view a specific period of usage. By default, the data is refreshed every 15 seconds. You can configure this interval to the right of the time filters.

Metrics

Serverless AI uses the same metrics as Compute containers over virtual machines. See the full list of metrics in Monitoring virtual machines in Nebius AI Cloud and Monitoring volumes in Nebius AI Cloud.