Nebius AI Cloud Observability is a unified solution for monitoring the performance, health and behavior of your cloud resources and workloads. It brings alerts, metrics and logs together in a single place, so you can detect issues early and investigate them without switching between tools.

The starting point is the Observability Overview page in the web console. It acts as a single point of entry that surfaces the active alerts and links out to detailed views. The Overview page shows three main sections:
- Firing alerts: the alerts that are currently active in your project, so you can immediately spot issues that need attention. For instructions on how to create alerts, see Setting up alerts.
- Metrics: key performance indicators for your resources, such as CPU, memory, disk, network and GPU utilization. You can view them on preconfigured dashboards or stream them to Grafana® and Prometheus for deeper analysis. For more information, see Metrics in Nebius AI Cloud.
- Logs: the most recent logs from your services, so you can investigate behavior and debug issues. You can also export logs or ingest your own logs for unified analysis. For more information, see Logs in Nebius AI Cloud.
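Once metrics reach a Prometheus server, they can be read back over the standard Prometheus HTTP API. The following is a minimal sketch under stated assumptions, not a description of a Nebius endpoint: the server address, the metric name `example_gpu_utilization` and its labels are all hypothetical, and the response is a hand-written sample in the Prometheus instant-query format rather than real data.

```python
from urllib.parse import urlencode

# Assumption: address of a Prometheus server that you operate yourself.
# This is a placeholder, not a Nebius AI Cloud endpoint.
PROMETHEUS_URL = "http://localhost:9090"


def build_instant_query_url(base_url: str, promql: str) -> str:
    """Build a URL for the Prometheus HTTP API instant-query endpoint."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': promql})}"


def extract_values(response: dict) -> dict:
    """Map each series' sorted label set to its latest value."""
    if response.get("status") != "success":
        raise ValueError(f"query failed: {response.get('error', 'unknown error')}")
    values = {}
    for series in response["data"]["result"]:
        labels = tuple(sorted(series["metric"].items()))
        _timestamp, value = series["value"]  # Prometheus returns the value as a string
        values[labels] = float(value)
    return values


# Hand-written sample in the Prometheus instant-query response format.
# The metric name and labels are made up for illustration.
sample = {
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {
                "metric": {"__name__": "example_gpu_utilization", "instance": "vm-1"},
                "value": [1700000000.0, "0.82"],
            },
        ],
    },
}

url = build_instant_query_url(PROMETHEUS_URL, "example_gpu_utilization")
latest = extract_values(sample)
```

In practice you would fetch `url` with an HTTP client and pass the decoded JSON to `extract_values`. The actual metric names and the way metrics are delivered to Prometheus are covered in Metrics in Nebius AI Cloud.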
Explore the Observability documentation:

- Supported services: see the list of Nebius AI Cloud services and resources that provide metrics, alerts and logs.
- Alerts: set up alerts on key resources and critical metrics.
- Metrics: view and analyze resource metrics on dashboards, or integrate with Prometheus and Grafana.
- Logs: collect, export and analyze Nebius AI Cloud logs, or ingest your own.
- Tracing: deliver and view traces from your Managed Service for Kubernetes® applications.
The Grafana Labs Marks are trademarks of Grafana Labs, and are used with Grafana Labs’ permission. We are not affiliated with, endorsed or sponsored by Grafana Labs or its affiliates.