Preview
This feature is subject to the "Pre-GA Offerings Terms" in the General Service Terms section of the Service Specific Terms. Pre-GA features are available "as is" and might have limited support. For more information, see the launch stage descriptions.
This document describes Agent Engine Threat Detection and its detectors.
Agent Engine Threat Detection is a built-in service of Security Command Center that helps you detect and investigate potential attacks on AI agents that are deployed to Vertex AI Agent Engine Runtime. If the Agent Engine Threat Detection service detects a potential attack, the service generates a finding in Security Command Center in near-real time.
Agent Engine Threat Detection monitors the supported AI agents and detects the most common runtime threats. Runtime threats include the execution of malicious binaries or scripts, container escapes, reverse shells, and the use of attack tools within the agent's environment.
In addition, control-plane detectors from Event Threat Detection analyze various audit logs (including Identity and Access Management, BigQuery, and Cloud SQL logs) and Vertex AI Agent Engine logs (stdout and stderr) to detect suspicious activities. Control-plane threats include data exfiltration attempts, excessive permission denials, and suspicious token generation.
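As a minimal sketch, the following Python snippet shows one way to inspect the Agent Engine stdout and stderr logs that these detectors analyze, using the Cloud Logging client library (google-cloud-logging). The project ID is a placeholder, and the resource type in the filter is an assumption about how Agent Engine runtime logs are labeled; verify both against the log entries in your own environment.

```python
# A minimal sketch, not an official integration: read recent stdout/stderr
# log entries for agents on Vertex AI Agent Engine with the Cloud Logging
# Python client (google-cloud-logging).
from google.cloud import logging

client = logging.Client(project="my-project")  # placeholder project ID

# Assumed resource type for Agent Engine (Reasoning Engine) workloads;
# confirm it against your own log entries.
log_filter = (
    'resource.type="aiplatform.googleapis.com/ReasoningEngine" '
    'AND (log_id("stdout") OR log_id("stderr"))'
)

for entry in client.list_entries(filter_=log_filter, max_results=20):
    print(entry.timestamp, entry.payload)
```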
Benefits
Agent Engine Threat Detection offers the following benefits:
- Proactively reduce risk for AI workloads. Agent Engine Threat Detection helps you detect and respond to threats early by monitoring the behavior and environment of your AI agents.
- Manage AI security in a unified location. Agent Engine Threat Detection findings appear directly in Security Command Center. You have a central interface to view and manage threat findings alongside other cloud security risks.
How it works
Agent Engine Threat Detection collects telemetry from the hosted AI agents to analyze processes, scripts, and libraries that might indicate a runtime attack. The detection process works as follows:
- Agent Engine Threat Detection uses a watcher process to collect event information while the agentic workload is running. The watcher process can take up to one minute to start and collect information.
- Agent Engine Threat Detection analyzes the collected event information to determine whether an event indicates an incident. It uses natural language processing (NLP) to analyze Bash and Python scripts for malicious code.
- If Agent Engine Threat Detection identifies an incident, it reports the incident as a finding in Security Command Center.
- If Agent Engine Threat Detection doesn't identify an incident, it doesn't store any information.

All collected data is processed in memory and doesn't persist after analysis unless it is identified as an incident and reported as a finding.

For information about how to review Agent Engine Threat Detection findings in the Google Cloud console, see Review findings.
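As a programmatic alternative to the console workflow, the following minimal sketch uses the Security Command Center Python client (google-cloud-securitycenter) to list findings for two of the detector categories described later in this document. The organization ID is a placeholder, and the category values assume that finding categories match the module names in the Detectors section; check the Threat findings index for the exact category strings.

```python
# A minimal sketch of retrieving Agent Engine Threat Detection findings with
# the Security Command Center Python client (google-cloud-securitycenter).
from google.cloud import securitycenter

client = securitycenter.SecurityCenterClient()

# "-" matches findings from all sources in the organization.
parent = "organizations/123456789/sources/-"  # placeholder organization ID

# Assumed category strings; verify them in the Threat findings index.
finding_filter = (
    'category="AGENT_ENGINE_REVERSE_SHELL"'
    ' OR category="AGENT_ENGINE_CONTAINER_ESCAPE"'
)

for result in client.list_findings(
    request={"parent": parent, "filter": finding_filter}
):
    finding = result.finding
    print(finding.category, finding.event_time, finding.resource_name)
```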
Detectors
This section lists the runtime and control-plane detectors that monitor AI agents that are deployed to Vertex AI Agent Engine Runtime.
Runtime detectors
Agent Engine Threat Detection includes the following runtime detectors:
| Module name | Description |
|---|---|
| AGENT_ENGINE_ADDED_MALICIOUS_BINARY_EXECUTED | A process executed a binary that threat intelligence identifies as malicious. This binary was not part of the original agentic workload. This event strongly suggests that an attacker has control of the workload and is running malicious software. |
| AGENT_ENGINE_ADDED_MALICIOUS_LIBRARY_LOADED | A process loaded a library that threat intelligence identifies as malicious. This library was not part of the original agentic workload. This event suggests that an attacker likely has control of the workload and is running malicious software. |
| AGENT_ENGINE_BUILT_IN_MALICIOUS_BINARY_EXECUTED | A process executed a binary that threat intelligence identifies as malicious. This binary was part of the original agentic workload. This event might suggest that an attacker is deploying a malicious workload. For example, the attacker might have gained control of a legitimate build pipeline and injected the malicious binary into the agentic workload. |
| AGENT_ENGINE_CONTAINER_ESCAPE | A process running inside the container attempted to bypass container isolation by using known exploit techniques or binaries that threat intelligence identifies as potential threats. A successful escape can allow an attacker to access the host system and potentially compromise the entire environment. This action suggests that an attacker is exploiting vulnerabilities to gain unauthorized access to the host system or broader infrastructure. |
| AGENT_ENGINE_KUBERNETES_ATTACK_TOOL_EXECUTION | A process executed a Kubernetes-specific attack tool that threat intelligence identifies as a potential threat. This action suggests that an attacker has gained access to the cluster and is using the tool to exploit Kubernetes-specific vulnerabilities or configurations. |
| AGENT_ENGINE_LOCAL_RECONNAISSANCE_TOOL_EXECUTION | A process executed a local reconnaissance tool that is not typically part of the agentic workload and that threat intelligence identifies as a potential threat. This event suggests that an attacker is trying to gather internal system information, such as mapping the infrastructure, identifying vulnerabilities, or collecting data on system configurations. |
| AGENT_ENGINE_MALICIOUS_PYTHON_EXECUTED | A machine learning model identified executed Python code as malicious. An attacker can use Python to download tools or files into a compromised environment and execute commands without using binaries. The detector uses natural language processing (NLP) to analyze the content of the Python code. Because this approach isn't based on signatures, the detector can identify both known and novel malicious Python code. |
| AGENT_ENGINE_MODIFIED_MALICIOUS_BINARY_EXECUTED | A process executed a binary that threat intelligence identifies as malicious. This binary was part of the original agentic workload but was modified at runtime. This event suggests that an attacker might have control of the workload and is running malicious software. |
| AGENT_ENGINE_MODIFIED_MALICIOUS_LIBRARY_LOADED | A process loaded a library that threat intelligence identifies as malicious. This library was part of the original agentic workload but was modified at runtime. This event suggests that an attacker has control of the workload and is running malicious software. |
| AGENT_ENGINE_MALICIOUS_SCRIPT_EXECUTED | A machine learning model identified executed Bash code as malicious. An attacker can use Bash to download tools or files into a compromised environment and execute commands without using binaries. The detector uses NLP to analyze the content of the Bash code. Because this approach isn't based on signatures, the detector can identify both known and novel malicious Bash code. |
| AGENT_ENGINE_MALICIOUS_URL_OBSERVED | Agent Engine Threat Detection observed a malicious URL in the argument list of a running process. The detector compares observed URLs against the lists of unsafe web resources that the Google Safe Browsing service maintains. If you believe that a URL was incorrectly classified as a phishing site or malware, report the issue at Reporting Incorrect Data. |
| AGENT_ENGINE_REVERSE_SHELL | A process started with stream redirection to a remote connected socket. A reverse shell allows an attacker to communicate from a compromised workload to an attacker-controlled machine. The attacker can then command and control the workload, for example, as part of a botnet. |
| AGENT_ENGINE_UNEXPECTED_CHILD_SHELL | A process that doesn't normally invoke shells unexpectedly spawned a shell process. The detector monitors process executions and generates a finding when a known parent process spawns a shell unexpectedly. |
Control-plane detectors
This section describes the control-plane detectors from Event Threat Detection that are specifically designed for AI agents deployed to Vertex AI Agent Engine Runtime. Event Threat Detection also has detectors for general AI-related threats.
These control-plane detectors are enabled by default. You manage these detectors the same way you do other Event Threat Detection detectors. For more information, see Use Event Threat Detection.
AGENT_ENGINE_IAM_ANOMALOUS_BEHAVIOR_SERVICE_ACCOUNT_GETS_OWN_IAM_POLICY

An identity associated with an AI agent deployed to Vertex AI Agent Engine was used to investigate the roles and permissions associated with that same service account.

Findings are classified as High or Medium severity, depending on the sensitivity of the roles granted. For more information, see Sensitive IAM roles and permissions.
AGENT_ENGINE_BIG_QUERY_EXFIL_VPC_PERIMETER_VIOLATION
AGENT_ENGINE_BIG_QUERY_EXFIL_TO_EXTERNAL_TABLE

Detects the following scenarios of BigQuery data exfiltration initiated by an agent deployed to Vertex AI Agent Engine:

- Resources owned by the protected organization were saved outside of the organization, including copy or transfer operations. This scenario corresponds to the AGENT_ENGINE_BIG_QUERY_EXFIL_TO_EXTERNAL_TABLE finding type and has High severity.
- Attempts were made to access BigQuery resources that are protected by VPC Service Controls. This scenario corresponds to the AGENT_ENGINE_BIG_QUERY_EXFIL_VPC_PERIMETER_VIOLATION finding type and has Low severity.
AGENT_ENGINE_CLOUDSQL_EXFIL_EXPORT_TO_PUBLIC_GCS
AGENT_ENGINE_CLOUDSQL_EXFIL_EXPORT_TO_EXTERNAL_GCS

Detects the following scenarios of Cloud SQL data exfiltration initiated by an agent deployed to Vertex AI Agent Engine:

- Live instance data was exported to a Cloud Storage bucket outside of the organization.
- Live instance data was exported to a Cloud Storage bucket that is owned by the organization and is publicly accessible.

For project-level activations of the Security Command Center Premium tier, this finding is available only if the Standard tier is enabled in the parent organization. Findings are classified as High severity by default.
AGENT_ENGINE_BIG_QUERY_EXFIL_TO_CLOUD_STORAGE

Detects the following scenarios of BigQuery data extraction initiated by an agent deployed to Vertex AI Agent Engine:

- A BigQuery resource owned by the protected organization was saved, through extraction operations, to a Cloud Storage bucket outside the organization.
- A BigQuery resource owned by the protected organization was saved, through extraction operations, to a publicly accessible Cloud Storage bucket owned by that organization.

For project-level activations of the Security Command Center Premium tier, this finding is available only if the Standard tier is enabled in the parent organization. Findings are classified as Low severity by default.
AGENT_ENGINE_EXCESSIVE_FAILED_ATTEMPT
AGENT_ENGINE_SUSPICIOUS_TOKEN_GENERATION_IMPLICIT_DELEGATION

The iam.serviceAccounts.implicitDelegation permission was misused to generate access tokens from a more privileged service account through a Vertex AI Agent Engine. Findings are classified as Low severity by default.

AGENT_ENGINE_SUSPICIOUS_TOKEN_GENERATION_CROSS_PROJECT_OPENID

The iam.serviceAccounts.getOpenIdToken IAM permission was used across projects through a Vertex AI Agent Engine. This finding isn't available for project-level activations. Findings are classified as Low severity by default.
AGENT_ENGINE_SUSPICIOUS_TOKEN_GENERATION_CROSS_PROJECT_ACCESS_TOKEN

The iam.serviceAccounts.getAccessToken IAM permission was used across projects through an AI agent deployed to Vertex AI Agent Engine. This finding isn't available for project-level activations. Findings are classified as Low severity by default.
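Because these findings surface in Security Command Center like any other threat findings, you can route them to downstream tooling. The following sketch shows one possible approach, not a documented procedure: it creates a Pub/Sub notification config whose filter matches finding categories that contain the AGENT_ENGINE_ substring. The organization ID, topic, and config ID are placeholders, and the filter assumes that the finding categories for these detectors contain that substring.

```python
# A sketch of one possible approach: route AGENT_ENGINE_* findings to
# Pub/Sub with a Security Command Center notification config.
from google.cloud import securitycenter

client = securitycenter.SecurityCenterClient()

notification_config = {
    "description": "Agent Engine Threat Detection findings",
    # Placeholder topic; create it and grant SCC publish access first.
    "pubsub_topic": "projects/my-project/topics/agent-engine-findings",
    # ":" is the substring operator in SCC filter syntax; this assumes the
    # finding categories contain "AGENT_ENGINE_".
    "streaming_config": {"filter": 'category : "AGENT_ENGINE_"'},
}

response = client.create_notification_config(
    request={
        "parent": "organizations/123456789",  # placeholder organization ID
        "config_id": "agent-engine-threats",
        "notification_config": notification_config,
    }
)
print(response.name)
```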
What's next
- Learn how to use Agent Engine Threat Detection.
- Learn how to use Event Threat Detection.
- Learn how to respond to AI threat findings.
- Refer to the Threat findings index.

