Resource: CachedContent
A resource used in LLM queries for users to explicitly specify what to cache and how to cache.
name
string
Immutable. Identifier. The server-generated resource name of the cached content Format: projects/{project}/locations/{location}/cachedContents/{cachedContent}
displayName
string
Optional. Immutable. The user-generated meaningful display name of the cached content.
model
string
Immutable. The name of the publisher model to use for cached content. Format: projects/{project}/locations/{location}/publishers/{publisher}/models/{model}
systemInstruction
object ( Content
)
Optional. Input only. Immutable. Developer set system instruction. Currently, text only
contents[]
object ( Content
)
Optional. Input only. Immutable. The content to cache
tools[]
object ( Tool
)
Optional. Input only. Immutable. A list of Tools
the model may use to generate the next response
toolConfig
object ( ToolConfig
)
Optional. Input only. Immutable. Tool config. This config is shared for all tools
createTime
string ( Timestamp
format)
Output only. Creatation time of the cache entry.
A timestamp in RFC3339 UTC "Zulu" format, with nanosecond resolution and up to nine fractional digits. Examples: "2014-10-02T15:01:23Z"
and "2014-10-02T15:01:23.045123456Z"
.
updateTime
string ( Timestamp
format)
Output only. When the cache entry was last updated in UTC time.
A timestamp in RFC3339 UTC "Zulu" format, with nanosecond resolution and up to nine fractional digits. Examples: "2014-10-02T15:01:23Z"
and "2014-10-02T15:01:23.045123456Z"
.
expiration
Union type
expiration
can be only one of the following:expireTime
string ( Timestamp
format)
timestamp of when this resource is considered expired. This is always provided on output, regardless of what was sent on input.
A timestamp in RFC3339 UTC "Zulu" format, with nanosecond resolution and up to nine fractional digits. Examples: "2014-10-02T15:01:23Z"
and "2014-10-02T15:01:23.045123456Z"
.
ttl
string ( Duration
format)
Input only. The TTL for this resource. The expiration time is computed: now + TTL.
A duration in seconds with up to nine fractional digits, ending with ' s
'. Example: "3.5s"
.
JSON representation |
---|
{ "name" : string , "displayName" : string , "model" : string , "systemInstruction" : { object ( |
Tool
Tool details that the model may use to generate response.
A Tool
is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model. A Tool object should contain exactly one type of Tool (e.g FunctionDeclaration, Retrieval or GoogleSearchRetrieval).
functionDeclarations[]
object ( FunctionDeclaration
)
Optional. Function tool type. One or more function declarations to be passed to the model along with the current user query. Model may decide to call a subset of these functions by populating FunctionCall
in the response. user should provide a FunctionResponse
for each function call in the next turn. Based on the function responses, Model will generate the final response back to the user. Maximum 128 function declarations can be provided.
retrieval
object ( Retrieval
)
Optional. Retrieval tool type. System will always execute the provided retrieval tool(s) to get external knowledge to answer the prompt. Retrieval results are presented to the model for generation.
googleSearchRetrieval
object ( GoogleSearchRetrieval
)
Optional. GoogleSearchRetrieval tool type. Specialized retrieval tool that is powered by Google search.
codeExecution
object ( CodeExecution
)
Optional. CodeExecution tool type. Enables the model to execute code as part of generation. This field is only used by the Gemini Developer API services.
JSON representation |
---|
{ "functionDeclarations" : [ { object ( |
Retrieval
Defines a retrieval tool that model can call to access external knowledge.
disableAttribution
(deprecated)
boolean
Optional. Deprecated. This option is no longer supported.
source
Union type
source
can be only one of the following:vertexAiSearch
object ( VertexAISearch
)
Set to use data source powered by Vertex AI Search.
vertexRagStore
object ( VertexRagStore
)
Set to use data source powered by Vertex RAG store. user data is uploaded via the VertexRagDataService.
JSON representation |
---|
{ "disableAttribution" : boolean , // source "vertexAiSearch" : { object ( |
VertexAISearch
Retrieve from Vertex AI Search datastore for grounding. See https://cloud.google.com/products/agent-builder
datastore
string
Required. Fully-qualified Vertex AI Search data store resource id. Format: projects/{project}/locations/{location}/collections/{collection}/dataStores/{dataStore}
JSON representation |
---|
{ "datastore" : string } |
VertexRagStore
Retrieve from Vertex RAG Store for grounding.
ragCorpora[]
(deprecated)
string
Optional. Deprecated. Please use ragResources instead.
ragResources[]
object ( RagResource
)
Optional. The representation of the rag source. It can be used to specify corpus only or ragfiles. Currently only support one corpus or multiple files from one corpus. In the future we may open up multiple corpora support.
similarityTopK
(deprecated)
integer
Optional. Number of top k results to return from the selected corpora.
vectorDistanceThreshold
(deprecated)
number
Optional. Only return results with vector distance smaller than the threshold.
JSON representation |
---|
{
"ragCorpora"
:
[
string
]
,
"ragResources"
:
[
{
object (
|
RagResource
The definition of the Rag resource.
ragCorpus
string
Optional. RagCorpora resource name. Format: projects/{project}/locations/{location}/ragCorpora/{ragCorpus}
ragFileIds[]
string
Optional. ragFileId. The files should be in the same ragCorpus set in ragCorpus field.
JSON representation |
---|
{ "ragCorpus" : string , "ragFileIds" : [ string ] } |
GoogleSearchRetrieval
Tool to retrieve public web data for grounding, powered by Google.
dynamicRetrievalConfig
object ( DynamicRetrievalConfig
)
Specifies the dynamic retrieval configuration for the given source.
JSON representation |
---|
{
"dynamicRetrievalConfig"
:
{
object (
|
DynamicRetrievalConfig
Describes the options to customize dynamic retrieval.
mode
enum ( Mode
)
The mode of the predictor to be used in dynamic retrieval.
dynamicThreshold
number
Optional. The threshold to be used in dynamic retrieval. If not set, a system default value is used.
JSON representation |
---|
{
"mode"
:
enum (
|
Mode
The mode of the predictor to be used in dynamic retrieval.
Enums | |
---|---|
MODE_UNSPECIFIED
|
Always trigger retrieval. |
MODE_DYNAMIC
|
Run retrieval only when system decides it is necessary. |
CodeExecution
This type has no fields.
Tool that executes code generated by the model, and automatically returns the result to the model.
See also [ExecutableCode]and [CodeExecutionResult] which are input and output to this tool.
ToolConfig
Tool config. This config is shared for all tools provided in the request.
functionCallingConfig
object ( FunctionCallingConfig
)
Optional. Function calling config.
JSON representation |
---|
{
"functionCallingConfig"
:
{
object (
|
FunctionCallingConfig
Function calling config.
mode
enum ( Mode
)
Optional. Function calling mode.
allowedFunctionNames[]
string
Optional. Function names to call. Only set when the Mode is ANY. Function names should match [FunctionDeclaration.name]. With mode set to ANY, model will predict a function call from the set of function names provided.
JSON representation |
---|
{
"mode"
:
enum (
|
Mode
Function calling mode.
Enums | |
---|---|
MODE_UNSPECIFIED
|
Unspecified function calling mode. This value should not be used. |
AUTO
|
Default model behavior, model decides to predict either function calls or natural language response. |
ANY
|
Model is constrained to always predicting function calls only. If "allowedFunctionNames" are set, the predicted function calls will be limited to any one of "allowedFunctionNames", else the predicted function calls will be any one of the provided "functionDeclarations". |
NONE
|
Model will not predict any function calls. Model behavior is same as when not passing any function declarations. |
UsageMetadata
metadata on the usage of the cached content.
totalTokenCount
integer
Total number of tokens that the cached content consumes.
textCount
integer
Number of text characters.
imageCount
integer
Number of images.
videoDurationSeconds
integer
Duration of video in seconds.
audioDurationSeconds
integer
Duration of audio in seconds.
JSON representation |
---|
{ "totalTokenCount" : integer , "textCount" : integer , "imageCount" : integer , "videoDurationSeconds" : integer , "audioDurationSeconds" : integer } |
Methods |
|
---|---|
|
Creates cached content, this call will initialize the cached content in the data storage, and users need to pay for the cache data storage. |
|
Deletes cached content |
|
Gets cached content configurations |
|
Lists cached contents in a project |
|
Updates cached content configurations |