GitHub - google-gemini/cookbook: Examples and guides for using the Gemini API

Welcome to the Gemini API Cookbook

This cookbook provides a structured learning path for using the Gemini API, focusing on hands-on tutorials and practical examples.

Gemini 3.5 Flash: For the most recent updates on our latest generation, see the Get Started and thinking guides, which include migration guides.

Agents API: Create and run your custom agents using the new Antigravity agent!

🍌 Nano-Banana 2: Go bananas with our latest image generation model: Nano-Banana 2. Get started here with 512px, thinking, search and image grounding, and a ton of examples!

🎶 Lyria 3: Channel your inner composer with Lyria 3. Get started here with 30s clips and full song generation, image-to-music, and a ton of examples!

Navigating the Cookbook

This cookbook is organized into two main categories:

Quick Starts: Step-by-step guides covering both introductory topics ("Get Started") and specific API features.
Examples: Practical use cases demonstrating how to combine multiple features.

We also showcase Demos in separate repositories, illustrating end-to-end applications of the Gemini API.

What's New?

Here are the recent additions and updates to the Gemini API and the Cookbook:

1. Quick Starts

The quickstarts section contains step-by-step tutorials to get you started with Gemini and learn about its specific features.

To begin, you'll need:

A Google account.
An API key (create one in Google AI Studio).

We recommend starting with the following:

Authentication: Set up your API key for access.
Get Started: An introduction to Gemini models and the Gemini API, covering basic prompting and multimodal input.
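
As a rough illustration of the authentication step, here is a minimal sketch of reading an API key from the environment instead of hard-coding it. The variable name `GEMINI_API_KEY` and the helper `load_api_key` are assumptions made for this example; the Authentication quickstart remains the reference for the recommended setup.

```python
import os


def load_api_key(env=None, var_name="GEMINI_API_KEY"):
    """Return the API key stored in an environment variable.

    `var_name` is an assumed convention for this sketch, and `env` can be
    any mapping (it defaults to os.environ), which keeps the helper easy
    to test without touching the real environment.
    """
    env = os.environ if env is None else env
    key = env.get(var_name, "").strip()
    if not key:
        raise RuntimeError(
            f"{var_name} is not set. Create a key in Google AI Studio "
            "and export it before running the examples."
        )
    return key
```

Keeping the key out of source files makes it harder to commit it to version control by accident.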

Then, explore the other quickstart tutorials to learn about individual features:

2. Examples (Practical Use Cases)

These examples demonstrate how to combine multiple Gemini API features or third-party tools to build more complex applications.

3. Demos (End-to-End Applications)

These fully functional, end-to-end applications showcase the power of Gemini in real-world scenarios.

Gemini CLI: Open-source AI agent that brings the power of Gemini directly into your terminal
Gemini API quickstart: Python Flask App running with the Google AI Gemini API, designed to get you started building with Gemini's multi-modal capabilities
Multimodal Live API Web Console: React-based starter app for using the Multimodal Live API over a websocket
Fullstack Langgraph Quickstart: A fullstack application using a React frontend and a LangGraph-powered backend agent
Google AI Studio Starter Applets: A collection of small apps that demonstrate how Gemini can be used to create interactive experiences

Official SDKs

The Gemini API is a REST API. You can call it directly using tools like curl (see REST examples or the great Postman workspace), or use one of our official SDKs:
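
For the direct REST route mentioned above, the following is a minimal sketch that builds a `generateContent` request using only the Python standard library. The model ID `gemini-2.5-flash`, the `v1beta` path, and the helper names are assumptions chosen for illustration; check the REST examples for the model IDs and API version that currently apply.

```python
import json
import urllib.request

# Assumed base URL and API version for this sketch.
BASE_URL = "https://generativelanguage.googleapis.com/v1beta"


def build_generate_request(api_key, prompt, model="gemini-2.5-flash"):
    """Build (but do not send) a text-only generateContent request.

    The model ID is an assumed placeholder; pass the one you intend to use.
    """
    url = f"{BASE_URL}/models/{model}:generateContent"
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "x-goog-api-key": api_key,  # the key travels in a header
        },
        method="POST",
    )


def send(request):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `send(build_generate_request(api_key, "Hello"))` with a valid key should return a JSON object whose generated text normally sits under `candidates[0].content.parts[0].text`. The official SDKs wrap this same call and handle details such as retries and streaming.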

Get Help

Ask a question on the Google AI Developer Forum.

The Gemini API on Google Cloud Vertex AI

For enterprise developers, the Gemini API is also available on Google Cloud Vertex AI. See this repo for examples.

Contributing

Contributions are welcome! See CONTRIBUTING.md for details.

Thank you for developing with the Gemini API! We're excited to see what you create.