Cloud Monitoring supports the Knative metrics listed in this document.
-
For a general explanation of the entries in the tables, including information about values like
DELTA
andGAUGE
, see Metric types .To chart or monitor metric types with values of type
STRING
, you must use Monitoring Query Language (MQL), and you must convert the value into a numeric value. For information about MQL string-conversion methods, seeString
. -
For information about the units used in the metric lists, see the
unit
field in theMetricDescriptor
reference. -
For information about statements of the form “Sampled every x seconds” and “After sampling, data is not visible for up to y seconds”, see Additional information: metadata .
-
The resource-hierarchy level tells you if the metric is written at the project, organization, or folder level(s). When the level is not specified in the metric descriptor, the metric writes at the project level by default.
- For pricing information, see Cloud Monitoring pricing summary .
- For information about the meaning of launch stages such as
GA
(General Availability) andBETA
(Preview), see Product launch stages .
Knative metrics
Metrics from Knative .
The following list was last generated at 2025-09-05 02:07:49 UTC. For more information about this process, see About the lists .
knative
Metrics from Knative components. Launch stages of these metrics: BETA
The "metric type" strings in this table must be prefixed
with knative.dev/
. That prefix has been
omitted from the entries in the table.
When querying a label, use the metric.labels.
prefix; for
example, metric.labels. LABEL
=" VALUE
"
.
Display name
Monitored resources
Labels
serving/activator/request_count
BETA
(project)
Activator Request Count
response_code_class
:
One of '1xx', '2xx', '3xx', '4xx', or '5xx' according to response code of the request. If any other code is returned, it should be reported as '5xx'.response_code
:
(INT64)
Response code of the request according to protocol.num_tries
:
(INT64)
The number of tries (retry on 503s only) it takes to get a response from a reserve revision.serving/activator/request_latencies
BETA
(project)
Activator Request Latencies
response_code_class
:
One of '1xx', '2xx', '3xx', '4xx', or '5xx' according to response code of the request. If any other code is returned, it should be reported as '5xx'.response_code
:
(INT64)
Response code of the request according to protocol.serving/autoscaler/actual_pods
BETA
(project)
Actual Pod Count
serving/autoscaler/desired_pods
BETA
(project)
Desired Pod Count
serving/autoscaler/panic_mode
BETA
(project)
Panic Mode
serving/autoscaler/panic_request_concurrency
BETA
(project)
Observed Panic Concurrency
serving/autoscaler/requested_pods
BETA
(project)
Requested Pod Count
serving/autoscaler/stable_request_concurrency
BETA
(project)
Observed Stable Concurrency
serving/autoscaler/target_concurrency_per_pod
BETA
(project)
Target Concurrency Per Pod
serving/revision/request_count
BETA
(project)
Request Count
response_code_class
:
One of '1xx', '2xx', '3xx', '4xx', or '5xx' according to response code of the request. If any other code is returned, it should be reported as '5xx'.response_code
:
(INT64)
Response code of the request according to protocol.serving/revision/request_latencies
BETA
(project)
Request Latencies
response_code_class
:
One of '1xx', '2xx', '3xx', '4xx', or '5xx' according to response code of the request. If any other code is returned, it should be reported as '5xx'.response_code
:
(INT64)
Response code of the request according to protocol.Generated at 2025-09-05 02:07:49 UTC.