Migrate from OpenAI SDK to Gen AI SDK

This page explains how to migrate code designed for the OpenAI SDK to the Google Gen AI SDK to utilize Gemini models on Vertex AI.

Migration Overview

An accompanying notebook demonstrates a practical migration from the openai library to the google-genai library.

API & Syntax Mapping

The following table compares the core components, methods, and parameters of the OpenAI SDK with the Gen AI SDK.

| Feature | OpenAI SDK (`openai`) | Gen AI SDK (`google-genai`) |
|---|---|---|
| Client initialization | `client = OpenAI(api_key=...)` | `client = genai.Client(vertexai=True, ...)` |
| Generation method | `client.chat.completions.create` | `client.models.generate_content` |
| Streaming method | `stream=True` (parameter) | `client.models.generate_content_stream` (method) |
| User input | `messages=[{"role": "user", "content": "..."}]` | `contents="..."` (str) or `contents=[...]` (list) |
| System instructions | `messages=[{"role": "system", "content": "..."}]` | `config=types.GenerateContentConfig(system_instruction=...)` |
| Response access | `response.choices[0].message.content` | `response.text` |
| Chat history | Manual list management (`messages.append`) | `client.chats.create()` (stateful object) |
| Max tokens | `max_tokens` | `max_output_tokens` (inside `config`) |
| Temperature | `temperature` | `temperature` (inside `config`) |
| JSON mode | `response_format={"type": "json_object"}` | `response_mime_type="application/json"` (inside `config`) |

Installation and Setup

Uninstall the OpenAI library and install the Gen AI SDK.

```shell
pip uninstall openai
pip install google-genai
```

Authentication and Initialization

While OpenAI uses an API key, Vertex AI uses Identity and Access Management (IAM) credentials (Application Default Credentials). You must explicitly set your project ID and location.
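In a local development environment, Application Default Credentials are typically created with the gcloud CLI (assuming it is installed and you have access to the project):

```shell
# Log in and write Application Default Credentials to the well-known local path
gcloud auth application-default login
```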

OpenAI SDK:

```python
from openai import OpenAI

# Relies on OPENAI_API_KEY environment variable
client = OpenAI()
```

Google Gen AI SDK:

```python
from google import genai

# Use vertexai=True to use the Vertex AI platform
client = genai.Client(
    vertexai=True,
    project='your-project-id',
    location='us-central1',
)
```
Tip: You can also set environment variables to initialize the client without arguments, similar to how the OpenAI client reads the API key from the environment.

Set GOOGLE_GENAI_USE_VERTEXAI, GOOGLE_CLOUD_PROJECT, and GOOGLE_CLOUD_LOCATION, as shown:

```shell
export GOOGLE_GENAI_USE_VERTEXAI=true
export GOOGLE_CLOUD_PROJECT='your-project-id'
export GOOGLE_CLOUD_LOCATION='global'
```

Once configured, you can initialize the client without passing parameters:

```python
from google import genai

client = genai.Client()
```

Code Examples

The following code samples show the differences between the OpenAI SDK and Google Gen AI SDK for common tasks.

Single-turn text generation

The following code samples show how to generate text. Note that in the Google Gen AI SDK, system instructions are handled as a configuration parameter rather than a message role in the input list.

OpenAI SDK:

```python
response = client.chat.completions.create(
    model="gpt-4",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain quantum physics."}
    ]
)
print(response.choices[0].message.content)
```

Google Gen AI SDK:

```python
from google.genai import types

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="Explain quantum physics.",
    config=types.GenerateContentConfig(
        system_instruction="You are a helpful assistant."
    )
)
print(response.text)
```
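When migrating existing call sites, it can help to factor the message-list translation into one place. The helper below is a hypothetical sketch (not part of either SDK), grounded in the mapping table above: system messages become `system_instruction`, and the remaining message contents become `contents`:

```python
def to_genai_args(messages):
    """Split an OpenAI-style messages list into the pieces the Gen AI SDK
    expects: a contents list and a system_instruction string (or None).
    Hypothetical migration helper, not part of either SDK."""
    system_parts = [m["content"] for m in messages if m["role"] == "system"]
    contents = [m["content"] for m in messages if m["role"] != "system"]
    system_instruction = "\n".join(system_parts) or None
    return contents, system_instruction

contents, system_instruction = to_genai_args([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain quantum physics."},
])
print(contents, system_instruction)
```

The returned values can then be passed straight to `generate_content(contents=..., config=types.GenerateContentConfig(system_instruction=...))`.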

Text generation with parameters

The following code samples show the differences in defining configuration parameters. In the Google Gen AI SDK, parameters like `temperature`, `max_output_tokens` (replacing `max_tokens`), and JSON output formatting are grouped into a `GenerateContentConfig` object.

OpenAI SDK:

```python
response = client.chat.completions.create(
    model="gpt-4",
    messages=[
        {"role": "user", "content": "List 3 types of apples in JSON."}
    ],
    temperature=0.7,
    max_tokens=1000,
    response_format={"type": "json_object"}
)
print(response.choices[0].message.content)
```

Google Gen AI SDK:

```python
from google.genai import types

config = types.GenerateContentConfig(
    temperature=0.7,
    max_output_tokens=1000,
    response_mime_type="application/json"
)

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="List 3 types of apples in JSON.",
    config=config
)
print(response.text)
```
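In JSON mode, `response.text` is a plain JSON string, so downstream code parses it with the standard library. A minimal sketch, using a stand-in string shaped like a typical model reply (hypothetical output, since the actual reply varies):

```python
import json

# Stand-in for response.text returned under response_mime_type="application/json"
response_text = '{"apples": ["Fuji", "Gala", "Honeycrisp"]}'

data = json.loads(response_text)
print(data["apples"][0])  # Fuji
```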

Chat (Multi-turn)

The following code samples show the differences in managing chat history. The Google Gen AI SDK simplifies this by providing a stateful chat object, whereas the OpenAI SDK requires manually appending messages to a list.

OpenAI SDK:

```python
# You must manually manage the list state
messages = [{"role": "user", "content": "Hi"}]

response = client.chat.completions.create(
    model="gpt-4",
    messages=messages
)

# Append the response to history manually
messages.append(response.choices[0].message)
messages.append({"role": "user", "content": "Next question"})

response2 = client.chat.completions.create(
    model="gpt-4",
    messages=messages
)
print(response2.choices[0].message.content)
```

Google Gen AI SDK:

```python
# The SDK manages history for you
chat = client.chats.create(
    model="gemini-2.5-flash",
    config=types.GenerateContentConfig(
        system_instruction="You are a helpful assistant."
    )
)

response1 = chat.send_message("Hi")
print(response1.text)

# History is retained automatically in the chat object
response2 = chat.send_message("Next question")
print(response2.text)
```
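Conceptually, the stateful chat object does the bookkeeping that the OpenAI example performs by hand. A toy stand-in (hypothetical, purely illustrative; the real model call is replaced by a stub function) makes the mechanism concrete:

```python
class MiniChat:
    """Toy illustration of a stateful chat object: each send_message call
    appends both the user turn and the model reply to an internal history."""

    def __init__(self, reply_fn):
        self.history = []
        self.reply_fn = reply_fn  # stands in for the model call

    def send_message(self, text):
        self.history.append({"role": "user", "content": text})
        reply = self.reply_fn(self.history)
        self.history.append({"role": "model", "content": reply})
        return reply

chat = MiniChat(lambda history: f"echo: {history[-1]['content']}")
chat.send_message("Hi")
chat.send_message("Next question")
print(len(chat.history))  # 4
```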

Streaming

The following code samples show the differences in streaming responses. The Google Gen AI SDK uses a dedicated method (`generate_content_stream`) rather than a boolean flag.

OpenAI SDK:

```python
stream = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Write a story."}],
    stream=True
)
for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")
```

Google Gen AI SDK:

```python
stream = client.models.generate_content_stream(
    model="gemini-2.5-flash",
    contents="Write a story."
)
for chunk in stream:
    print(chunk.text, end="")
```
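If you need the full reply as well as the incremental output, the chunk texts concatenate to the complete response. A minimal sketch with stub chunks (stand-ins for the objects the stream yields, each exposing a `.text` attribute):

```python
from types import SimpleNamespace

# Stub chunks shaped like those yielded by generate_content_stream
stream = [SimpleNamespace(text=t) for t in ["Once ", "upon ", "a time."]]

full_text = "".join(chunk.text for chunk in stream)
print(full_text)  # Once upon a time.
```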

What's next
