API monitoring with Grafana Cloud metrics

Zato integrates with Grafana Cloud to let you track the health of your APIs and integrations in real-time.

You can monitor system resources, push custom metrics from your services, and build dashboards to visualize performance trends. You can also set up alerts on the metrics that matter to you, so you know about issues before they become problems.

What you get

API performance - How many req/s your services process
System metrics - CPU, memory, disk, filesystem, network and load statistics from your Zato containers
Custom metrics and pre-built dashboards - Push your own metrics from services and make use of pre-built dashboard templates

Prerequisites

To configure the integration, you need:

A Grafana Cloud account (any tier will work fine)
Your Grafana Cloud instance ID
An API key with metrics:write permissions
Your endpoint URL

To obtain these credentials:

Log in to your Grafana Cloud account at grafana.com
Go to My Account → Grafana Cloud portal
Click on your stack name (by default, it's the same as your subdomain on grafana.com)
Navigate to OpenTelemetry section
Note down the Instance ID and endpoint URL
Click Generate now in the "Password / API Token" field, while making sure it has metrics:write permissions (which should be among the default ones already)

Configuring Grafana Cloud in Zato

To configure the integration:

Go to Monitoring → Grafana Cloud in your Zato dashboard
Slide the toggle to enable the integration
Enter your Instance ID from Grafana Cloud
Enter your API key
Enter your Endpoint URL (e.g. https://otlp-gateway-prod-us-central-0.grafana.net/otlp)
Click Test connection to verify your credentials
Click Save to apply the configuration
The system will restart components to apply the changes

Testing the connection

Before saving, click Test connection to verify that Zato can reach your Grafana Cloud endpoint with the provided credentials.

If the test fails, you will see the error message returned by Grafana Cloud. Common issues include:

Invalid credentials - double-check your Instance ID and API key
Wrong endpoint - ensure you're using the correct endpoint from the OpenTelemetry section
Network issues - verify your container can reach the internet

Viewing service metrics

Every time a service is invoked, Zato automatically increments a counter metric in Grafana Cloud. This happens without any code changes on your part.

Service metrics use the naming pattern zato_service_<service_name>_total where dots and hyphens in the service name are replaced with underscores. The _total suffix is added automatically by Grafana.

For example, a service named demo.ping will appear in Grafana Cloud as zato_service_demo_ping_total.

To find your service metrics:

Go to Grafana Cloud (which is something like https://example.grafana.net, not https://grafana.com)
Go to Explore in the left sidebar
Select the data source for your services - it will have "prom" in its name (short for Prometheus)

Switch to the Code mode (check the yellow arrow in the screenshot below)
Search for zato_service_
You will see all your services listed

For instance, use the rate() function to calculate requests per second:

rate(zato_service_demo_ping_total[1m])

This shows requests per second averaged over 1 minute. Adjust the interval as needed ([5m], [30s], etc.).

Building a dashboard

You can create dashboards to visualize service performance:

Go to Dashboards → New → New Dashboard
Add a new panel
Use a query like rate(zato_service_demo_ping_total[1m]) to show requests per second
Add more panels for other services you want to monitor
Save the dashboard

This lets you track request rates, spot traffic patterns, and identify load trends across your API platform.

Pushing custom metrics from services

Use self.metrics.push to send custom metrics from your services to Grafana Cloud. This lets you track business-level indicators alongside system metrics.

Available methods

Your services have access to two metrics methods:

self.metrics.push(name, value) - Sets a gauge metric to a specific value. Use this for values that can go up or down, like queue depths, temperatures, or percentages.
self.metrics.incr(name, value=1) - Increments a counter metric. Use this for values that only increase, like total requests processed or errors encountered.

Basic example

# -*- coding: utf-8 -*-

from zato.server.service import Service

class FlightBoardingStatus(Service):

    def handle(self):

        # Get boarding data from the request
        passengers_boarded = self.request.payload['passengers_boarded']
        passengers_total = self.request.payload['passengers_total']

        # Calculate boarding percentage
        boarding_percentage = (passengers_boarded / passengers_total) * 100

        # Push metrics to Grafana Cloud
        self.metrics.push('airport.flight.passengers_boarded', passengers_boarded)
        self.metrics.push('airport.flight.boarding_percentage', boarding_percentage)

        # Continue with business logic
        self.response.payload = {'status': 'ok', 'boarding_percentage': boarding_percentage}

How to find your custom metrics in Grafana Cloud

After pushing metrics from your service, follow these steps to locate them:

Go to your Grafana Cloud instance (e.g. https://example.grafana.net)
Click Explore in the left sidebar
Select your Prometheus data source (it will have "prom" in its name)
Switch to Code mode using the toggle in the query editor
Type the name of your metric, for example airport_flight_passengers_boarded
Click Run query to see the data

Remember that dots in metric names are automatically converted to underscores by Grafana. So airport.flight.passengers_boarded becomes airport_flight_passengers_boarded.

Building a dashboard for custom metrics

To create a dashboard that displays your custom metrics:

In Grafana Cloud, go to Dashboards in the left sidebar
Click New → New Dashboard
Click Add visualization
Select your Prometheus data source
In the query editor, switch to Code mode
Enter your metric name, for example: airport_flight_boarding_percentage
Set the panel title to something descriptive like "Boarding progress"
Click Apply to add the panel
Click Save (disk icon) to save your dashboard

Filtering by service name

Every metric pushed through self.metrics.push or self.metrics.incr includes a service label with the name of the service that sent it. This lets you filter metrics by service.

For example, if you have multiple services pushing the same metric name:

airport_flight_passengers_boarded{service="my.boarding.service"}

Other use cases

You can build metrics fpr any domain where you need to track business or operational metrics in real-time.

For instance:

In cybersecurity, services processing events from SIEM systems can push metrics for failed authentication attempts, high-severity incidents, current threat levels, and blocked IP counts - letting security teams build dashboards that show attack patterns and response effectiveness.
In telecommunications, services receiving data from network equipment can push metrics for active calls per switch, signal strength, dropped call counts, bandwidth utilization, and SMS queue depths - giving network operations teams visibility into capacity, quality degradation, and congestion patterns.

Importing pre-built dashboards

Since Zato exports metrics using OpenTelemetry Protocol (OTLP), you can use any OTLP-compatible dashboard template from the Grafana community. This gives you access to hundreds of various existing dashboards without additional configuration.

To import a dashboard:

In Grafana Cloud, go to Dashboards → New → Import
Enter the dashboard ID or search by name
Select the dashboard and click Import
Choose your Prometheus data source
The dashboard will display your Zato container metrics

Useful pre-built dashboards

These community dashboards work with Zato's OTLP metrics:

OpenTelemetry Host Metrics (ID: 20376) - CPU, memory, disk, network, and filesystem metrics adapted from Node Exporter format. Good starting point for container monitoring.
OTEL Host Metrics (ID: 23319) - Alternative host metrics dashboard with a different layout.
OpenTelemetry Collector (ID: 15983) - Monitors the telemetry pipeline itself, useful for debugging metric delivery issues.

Using environment variables to persist configuration

To automatically configure Grafana Cloud when your container starts, using environment variables, as below:

docker run -it \
  -e Zato_Grafana_Cloud_Instance_ID=123456 \
  -e Zato_Grafana_Cloud_API_Key=your-api-key-here \
  -e Zato_Grafana_Cloud_Endpoint=https://otlp-gateway-prod-us-central-0.grafana.net/otlp \
  zatosource/zato-4.1

Environment variables take precedence over dashboard configuration. This is useful for:

Automated DevOps deployments
Kubernetes configurations
CI/CD pipelines where credentials are injected at runtime