Gemini Enterprise Agent Platform RAG Engine overview

This page describes what Gemini Enterprise Agent Platform RAG Engine is and how it works.

Description Console
To learn how to use the Vertex AI SDK to run Gemini Enterprise Agent Platform RAG Engine tasks, see the RAG quickstart for Python . Try Gemini Enterprise Agent Platform RAG Engine

Overview

Gemini Enterprise Agent Platform RAG Engine, a component of the Gemini Enterprise Agent Platform Platform, facilitates Retrieval-Augmented Generation (RAG). Gemini Enterprise Agent Platform RAG Engine is also a data framework for developing context-augmented large language model (LLM) applications. Context augmentation occurs when you apply an LLM to your data. This implements retrieval-augmented generation (RAG).

A common problem with LLMs is that they don't understand private knowledge, that is, your organization's data. With Gemini Enterprise Agent Platform RAG Engine, you can enrich the LLM context with additional private information, because the model can reduce hallucination and answer questions more accurately.

By combining additional knowledge sources with the existing knowledge that LLMs have, a better context is provided. The improved context along with the query enhances the quality of the LLM's response.

The following image illustrates the key concepts to understanding Gemini Enterprise Agent Platform RAG Engine.

Agent Platform RAG key
concepts

These concepts are listed in the order of the retrieval-augmented generation (RAG) process.

  1. Data ingestion: Intake data from different data sources. For example, local files, Cloud Storage, and Google Drive.

  2. Data transformation : Conversion of the data in preparation for indexing. For example, data is split into chunks.

1. Embedding : Numerical representations of words or pieces of text. These numbers capture the semantic meaning and context of the text. Similar or related words or text tend to have similar embeddings, which means they are closer together in the high-dimensional vector space.

  1. Data indexing: Gemini Enterprise Agent Platform RAG Engine creates an index called a corpus . The index structures the knowledge base so it's optimized for searching. For example, the index is like a detailed table of contents for a massive reference book.

  2. Retrieval: When a user asks a question or provides a prompt, the retrieval component in Gemini Enterprise Agent Platform RAG Engine searches through its knowledge base to find information that is relevant to the query.

  3. Generation: The retrieved information becomes the context added to the original user query as a guide for the generative AI model to generate factually grounded and relevant responses.

Supported regions

Gemini Enterprise Agent Platform RAG Engine is supported in the following regions:

Region Location Description Launch stage
us-central1
Iowa v1 and v1beta1 versions are supported. Allowlist, GA
us-east4
Virginia v1 and v1beta1 versions are supported. Allowlist, GA
us-east1
Moncks Corner, SC v1 and v1beta1 versions are supported. Allowlist, Preview
europe-west3
Frankfurt, Germany v1 and v1beta1 versions are supported. GA
europe-west4
Eemshaven, Netherlands v1 and v1beta1 versions are supported. GA
asia-east1
Taiwan v1 and v1beta1 versions are supported. Preview
asia-northeast1
Tokyo v1 and v1beta1 versions are supported. Preview
asia-northeast3
Seoul v1 and v1beta1 versions are supported. Preview
asia-south1
Mumbai v1 and v1beta1 versions are supported. Preview
asia-southeast1
Singapore v1 and v1beta1 versions are supported. Preview
europe-central2
Warsaw v1 and v1beta1 versions are supported. Preview
europe-north1
Finland v1 and v1beta1 versions are supported. Preview
europe-southwest1
Madrid v1 and v1beta1 versions are supported. Preview
europe-west1
Belgium v1 and v1beta1 versions are supported. Preview
europe-west2
London v1 and v1beta1 versions are supported. Preview
europe-west6
Zürich v1 and v1beta1 versions are supported. Preview
europe-west8
Milan v1 and v1beta1 versions are supported. Preview
europe-west9
Paris v1 and v1beta1 versions are supported. Preview
us-east5
Columbus, OH v1 and v1beta1 versions are supported. Preview
us-south1
Dallas, TX v1 and v1beta1 versions are supported. Preview
us-west1
Oregon v1 and v1beta1 versions are supported. Preview
us-west4
Las Vegas, NV v1 and v1beta1 versions are supported. Preview
  • us-central1 , us-east1 , and us-east4 are changed to Allowlist . If you'd like to experiment with Gemini Enterprise Agent Platform RAG Engine, try other regions.

Delete Gemini Enterprise Agent Platform RAG Engine

For more information about deleting a Gemini Enterprise Agent Platform RAG Engine, see the following:

What's next

Create a Mobile Website
View Site in Mobile | Classic
Share by: