The latest Gemini models, like Gemini 3.1 Flash Image (Nano Banana 2), are available to use with Firebase AI Logic! Learn more.
Gemini 2.0 Flash and Flash-Lite models will shut down on June 1, 2026. To avoid service disruption, update to a newer model like gemini-2.5-flash-lite. Learn more.
Build hybrid experiences in Android apps with on-device and cloud-hosted models
You can build AI-powered Android apps and features with hybrid inference using Firebase AI Logic. Hybrid inference runs inference with on-device models when available and seamlessly falls back to cloud-hosted models otherwise (and vice versa).
Reach more of your audience by accommodating on-device model availability and internet connectivity.
Supported capabilities and features for on-device inference
On-device inference only supports single-turn text generation (not chat), with streaming or non-streaming output. It supports the following text-generation capabilities:
These get started steps describe the required general setup for any supported
prompt request that you want to send.
Step 1: Set up a Firebase project and connect your app to Firebase
Sign in to the Firebase console, and then select your Firebase project.
Don't already have a Firebase project?
If you don't already have a Firebase project, click the button to create a
new Firebase project, and then use either of the following options:
Option 1: Create a wholly new Firebase project (and its underlying Google Cloud project automatically) by entering a new project name in the first step of the workflow.
Option 2: "Add Firebase" to an existing Google Cloud project by clicking Add Firebase to Google Cloud project (at the bottom of the page).
In the first step of the workflow, start entering theproject nameof
the existing project, and then select the project from the displayed list.
Complete the remaining steps of the on-screen workflow to create a Firebase
project. Note that when prompted, you do not need to set up Google Analytics to use the Firebase AI Logic SDKs.
Click Get started to launch a guided workflow that helps you set up the required APIs and resources for your project.
Set up your project to use a "Gemini API" provider.
We recommend getting started with the Gemini Developer API. At any point, you can always set up the Vertex AI Gemini API (and its requirement for billing).
For the Gemini Developer API, the console will enable the required APIs and create a Gemini API key in your project. Do not add this Gemini API key into your app's codebase. Learn more.
If prompted in the console's workflow, follow the on-screen instructions to
register your app and connect it to Firebase.
Continue to the next step in this guide to add the SDK to your app.
Step 2: Add the required SDKs
The Firebase AI Logic SDK for Android (firebase-ai), along with the Firebase AI Logic On-Device SDK (firebase-ai-ondevice), provides access to the APIs for interacting with generative models.
In your module (app-level) Gradle file (like <project>/<app-module>/build.gradle.kts), add the dependencies for the Firebase AI Logic libraries for Android:
Kotlin
dependencies {
    // ... other androidx dependencies

    // Add the dependencies for the Firebase AI Logic libraries
    // Note that the on-device SDK is not yet included in the Firebase Android BoM
    implementation("com.google.firebase:firebase-ai:17.11.0")
    implementation("com.google.firebase:firebase-ai-ondevice:16.0.0-beta01")
}
Java
For Java, you need to add two additional libraries.
dependencies {
    // ... other androidx dependencies

    // Add the dependencies for the Firebase AI Logic libraries
    // Note that the on-device SDK is not yet included in the Firebase Android BoM
    implementation("com.google.firebase:firebase-ai:17.11.0")
    implementation("com.google.firebase:firebase-ai-ondevice:16.0.0-beta01")

    // Required for one-shot operations (to use `ListenableFuture` from Guava Android)
    implementation("com.google.guava:guava:31.0.1-android")

    // Required for streaming operations (to use `Publisher` from Reactive Streams)
    implementation("org.reactivestreams:reactive-streams:1.0.4")
}
Step 3: Check if the on-device model is available
Using FirebaseAIOnDevice, check if the on-device model is available, and download the model if it's not available.
Once downloaded, AICore will automatically keep the model updated. Check out the
notes after the snippet for more details about AICore and managing the
on-device model download.
Kotlin
val status = FirebaseAIOnDevice.checkStatus()
when (status) {
    OnDeviceModelStatus.UNAVAILABLE -> {
        Log.w(TAG, "On-device model is unavailable")
    }
    OnDeviceModelStatus.DOWNLOADABLE -> {
        FirebaseAIOnDevice.download().collect { status ->
            when (status) {
                is DownloadStatus.DownloadStarted ->
                    Log.w(TAG, "Starting download - ${status.bytesToDownload}")
                is DownloadStatus.DownloadInProgress ->
                    Log.w(TAG, "Download in progress ${status.totalBytesDownloaded} bytes downloaded")
                is DownloadStatus.DownloadCompleted ->
                    Log.w(TAG, "On-device model download complete")
                is DownloadStatus.DownloadFailed ->
                    Log.e(TAG, "Download failed ${status}")
            }
        }
    }
    OnDeviceModelStatus.DOWNLOADING -> {
        Log.w(TAG, "On-device model is being downloaded")
    }
    OnDeviceModelStatus.AVAILABLE -> {
        Log.w(TAG, "On-device model is available")
    }
}
Java
Checking for and downloading the model is not yet available for Java.
However, all other APIs and interactions in this guide are available for Java.
Note the following about downloading the on-device model:
The time it takes to download the on-device model depends on many factors,
including your network.
If your code uses an on-device model for its primary or fallback inference,
make sure the model is downloaded early in your app's lifecycle so that the
on-device model is available before your end-users encounter the code in your
app.
If the on-device model is not available when an on-device inference request is made, the SDK will not automatically trigger the download of the on-device model. The SDK will either fall back to the cloud-hosted model or throw an exception (see details about the behavior of inference modes).
AICore (an Android system service) manages which model and version is downloaded and keeps the model updated for you. Note that the device only has one model downloaded, so if another app on the device has previously successfully downloaded the on-device model, then this check will return that the model is available.
Latency optimization
To optimize the latency of the first inference call, you can have your app call warmup().
This loads the on-device model into memory and initializes runtime components.
Step 4: Initialize the service and create a model instance
Click your Gemini API provider to view provider-specific content and code on this page.
Set up the following before you send a prompt request to the model.
Initialize the service for your chosen API provider.
Create a GenerativeModel instance, and set the mode to one of the following. The descriptions here are very high-level, but you can learn details about the behavior of these modes in Set an inference mode.
PREFER_ON_DEVICE: Attempt to use the on-device model; otherwise, fall back to the cloud-hosted model.
ONLY_ON_DEVICE: Attempt to use the on-device model; otherwise, throw an exception.
PREFER_IN_CLOUD: Attempt to use the cloud-hosted model; otherwise, fall back to the on-device model.
ONLY_IN_CLOUD: Attempt to use the cloud-hosted model; otherwise, throw an exception.
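The fallback behavior of the four modes can be sketched as plain decision logic. This is a hypothetical illustration only, not the SDK's actual implementation; the enum constants mirror the InferenceMode names, and availability is modeled as two booleans for brevity (a real app would learn these from checkStatus() and from request failures):

```java
// Hypothetical sketch of how the four inference modes resolve, given
// whether the on-device model and the cloud-hosted model are usable.
// Mirrors the mode descriptions above; not the SDK's implementation.
enum Mode { PREFER_ON_DEVICE, ONLY_ON_DEVICE, PREFER_IN_CLOUD, ONLY_IN_CLOUD }

class ModeResolver {
    // Returns "ON_DEVICE" or "IN_CLOUD", or throws when the mode's
    // requirement cannot be met (the "throw an exception" cases above).
    static String resolve(Mode mode, boolean onDeviceAvailable, boolean cloudReachable) {
        switch (mode) {
            case PREFER_ON_DEVICE:
                if (onDeviceAvailable) return "ON_DEVICE";
                if (cloudReachable) return "IN_CLOUD";   // fall back to cloud
                throw new IllegalStateException("No model available");
            case ONLY_ON_DEVICE:
                if (onDeviceAvailable) return "ON_DEVICE";
                throw new IllegalStateException("On-device model unavailable");
            case PREFER_IN_CLOUD:
                if (cloudReachable) return "IN_CLOUD";
                if (onDeviceAvailable) return "ON_DEVICE"; // fall back on-device
                throw new IllegalStateException("No model available");
            default: // ONLY_IN_CLOUD
                if (cloudReachable) return "IN_CLOUD";
                throw new IllegalStateException("Cloud-hosted model unreachable");
        }
    }
}
```

In practice "cloud reachable" is only discovered when a request fails (for example, no connectivity), so the SDK resolves this per request rather than up front.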
Kotlin
// Using this SDK to access on-device inference is an Experimental release and requires opt-in
@OptIn(PublicPreviewAPI::class)
// ...

// Initialize the Gemini Developer API backend service
// Create a GenerativeModel instance with a model that supports your use case
// Set the inference mode (like PREFER_ON_DEVICE to use the on-device model if available)
val model = Firebase.ai(backend = GenerativeBackend.googleAI())
    .generativeModel(
        modelName = "MODEL_NAME",
        onDeviceConfig = OnDeviceConfig(mode = InferenceMode.PREFER_ON_DEVICE)
    )
Java
// Initialize the Gemini Developer API backend service
// Create a GenerativeModel instance with a model that supports your use case
// Set the inference mode (like PREFER_ON_DEVICE to use the on-device model if available)
GenerativeModel ai = FirebaseAI.getInstance(GenerativeBackend.googleAI())
    .generativeModel("MODEL_NAME", new OnDeviceConfig(InferenceMode.PREFER_ON_DEVICE));

// Use the GenerativeModelFutures Java compatibility layer which offers
// support for ListenableFuture and Publisher APIs
GenerativeModelFutures model = GenerativeModelFutures.from(ai);
Step 5: Send a prompt request to a model
This section shows you how to send various types of input to generate different
types of output, including:
Generate text from text-only input
Before trying this sample, make sure that you've completed the Get started section of this guide.
You can use generateContent() to generate text from a prompt that contains text:
Kotlin
// Imports + initialization of Gemini API backend service + creation of model instance

// Provide a prompt that contains text
val prompt = "Write a story about a magic backpack."

// To generate text output, call generateContent with the text input
val response = model.generateContent(prompt)
print(response.text)
Java
// Imports + initialization of Gemini API backend service + creation of model instance

// Provide a prompt that contains text
Content prompt = new Content.Builder()
    .addText("Write a story about a magic backpack.")
    .build();

// To generate text output, call generateContent with the text input
ListenableFuture<GenerateContentResponse> response = model.generateContent(prompt);
Futures.addCallback(response, new FutureCallback<GenerateContentResponse>() {
    @Override
    public void onSuccess(GenerateContentResponse result) {
        String resultText = result.getText();
        System.out.println(resultText);
    }

    @Override
    public void onFailure(Throwable t) {
        t.printStackTrace();
    }
}, executor);
Note that Firebase AI Logic also supports streaming of text responses using generateContentStream (instead of generateContent).
Generate text from text-and-image (multimodal) input
Before trying this sample, make sure that you've completed the Get started section of this guide.
You can use generateContent() to generate text from a prompt that contains text and up to one image file (Bitmap only), providing each input file's mimeType and the file itself.
Kotlin
// Imports + initialization of Gemini API backend service + creation of model instance

// Loads an image from the app/res/drawable/ directory
val bitmap: Bitmap = BitmapFactory.decodeResource(resources, R.drawable.sparky)

// Provide a prompt that includes the image specified above and text
val prompt = content {
    image(bitmap)
    text("What developer tool is this mascot from?")
}

// To generate text output, call generateContent with the prompt
val response = model.generateContent(prompt)
print(response.text)
Java
// Imports + initialization of Gemini API backend service + creation of model instance

Bitmap bitmap = BitmapFactory.decodeResource(getResources(), R.drawable.sparky);

// Provide a prompt that includes the image specified above and text
Content content = new Content.Builder()
    .addImage(bitmap)
    .addText("What developer tool is this mascot from?")
    .build();

// To generate text output, call generateContent with the prompt
ListenableFuture<GenerateContentResponse> response = model.generateContent(content);
Futures.addCallback(response, new FutureCallback<GenerateContentResponse>() {
    @Override
    public void onSuccess(GenerateContentResponse result) {
        String resultText = result.getText();
        System.out.println(resultText);
    }

    @Override
    public void onFailure(Throwable t) {
        t.printStackTrace();
    }
}, executor);
Note that Firebase AI Logic also supports streaming of text responses using generateContentStream (instead of generateContent).
What else can you do?
You can use various additional configuration options and capabilities for your
hybrid experiences:
Features not yet available for on-device inference
Because this is an experimental release, not all the capabilities of cloud-hosted models are available for on-device inference. The features listed in this section are not yet available for on-device inference. If you want to use any of these features, then we recommend using the ONLY_IN_CLOUD inference mode for a more consistent experience.
Generating structured output (like JSON or enums)
Generating text from image file input types other than Bitmap
(image loaded into memory)
Generating text from more than one image file
Generating text from audio, video, and document (like PDF) inputs
Generating images using Gemini or Imagen models
Providing files using URLs in multimodal requests. You must provide files as inline data to on-device models.
Sending requests that exceed 4000 tokens
(or approximately 3000 English words).
Multi-turn chat
Providing the model with tools to help it generate its response (like function calling, code execution, URL context, and grounding with Google Search)
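To stay under the token limit noted above, it can help to pre-check prompt size on the client before choosing an inference path. A rough sketch, using the ratio this page implies (4,000 tokens for approximately 3,000 English words, or about 4/3 tokens per word); the helper names are hypothetical, and real tokenization varies by model, so treat this only as a coarse heuristic:

```java
// Rough client-side estimate of whether a prompt might exceed the
// on-device token limit. Assumes ~4/3 tokens per English word, the
// ratio implied by "4000 tokens (or approximately 3000 English words)".
// Real tokenizers differ, so this is only a coarse pre-check.
class TokenEstimate {
    static final int ON_DEVICE_TOKEN_LIMIT = 4000;

    // Approximate token count from the whitespace-separated word count.
    static int estimateTokens(String prompt) {
        String trimmed = prompt.trim();
        if (trimmed.isEmpty()) return 0;
        int words = trimmed.split("\\s+").length;
        return (int) Math.ceil(words * 4.0 / 3.0);
    }

    // True if the prompt likely fits within the on-device limit.
    static boolean likelyFitsOnDevice(String prompt) {
        return estimateTokens(prompt) <= ON_DEVICE_TOKEN_LIMIT;
    }
}
```

A prompt that fails this check is a candidate for an inference mode that can use a cloud-hosted model instead.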
AI monitoring in the Firebase console does not show any data for on-device inference (including on-device logs). However, any inference that uses a cloud-hosted model can be monitored just like other inference via Firebase AI Logic.
Additional limitations
In addition to the above, on-device inference has the following limitations (learn more in the ML Kit documentation):
The end-user of your app must be using a supported device for on-device inference.
Your app can only run on-device inference when it's in the foreground.
Only English and Korean have been validated for on-device inference.
The maximum token limit for the entire on-device inference request is
4000 tokens. If your requests might exceed this limit, then make sure to
configure an inference mode that can use a cloud-hosted model.
We recommend avoiding on-device inference use cases that require long
output (more than 256 tokens).
AICore (an Android system service that manages the on-device models) enforces an inference quota per app. Making too many API requests in a short period will result in an ErrorCode.BUSY response. If you're receiving this error, consider using exponential backoff to retry the request. Also, ErrorCode.PER_APP_BATTERY_USE_QUOTA_EXCEEDED can be returned if an app exceeds a long-duration quota (for example, a daily quota).
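The exponential backoff suggested above can be sketched as a generic retry helper. This is a minimal illustration, not SDK code: the operation is a stand-in for your inference call, and in a real app you would catch only the "busy" error rather than every exception:

```java
import java.util.concurrent.Callable;

// Generic exponential-backoff retry helper, as suggested for
// ErrorCode.BUSY responses. The delay doubles on each failed attempt,
// up to a cap, and the last exception is rethrown when attempts run out.
class Backoff {
    static <T> T retry(Callable<T> operation, int maxAttempts,
                       long initialDelayMs, long maxDelayMs) throws Exception {
        long delay = initialDelayMs;
        Exception last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return operation.call();
            } catch (Exception e) {   // in a real app, catch only the busy error
                last = e;
                if (attempt == maxAttempts) break;
                Thread.sleep(delay);  // keep this off the main thread in an app
                delay = Math.min(delay * 2, maxDelayMs);
            }
        }
        throw last;
    }
}
```

In practice you would also add random jitter to each delay so that many clients hitting the quota don't retry in lockstep.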
Last updated 2026-04-17 UTC.