Models
Low-latency Gemini audio-to-audio Live API model for real-time dialogue.
Gemini 3.1 Flash text-to-speech model for controllable low-latency speech generation.
Google multimodal embedding model for text, image, video, audio, and PDFs.
Gemini 3.1 Flash image generation model for fast interactive image workflows.
Google native image generation model also known as Nano Banana Pro.
Google Imagen 4 fast image generation model.
Google Imagen 4 standard image generation model.
Google highest-quality Imagen 4 image generation model.
Google Gemini 1.5 Flash model for fast multimodal workloads with a 1M-token context window.
Google smaller Gemini 1.5 Flash variant for lower-cost, lower-latency multimodal workloads.
Google Gemini 1.5 Pro model for long-context multimodal tasks with up to 2M-token context.
Google Gemini 2.0 Flash model; deprecated by Google with migration recommended to newer Gemini models.
Google hybrid reasoning model with 1M-token context, optimized for price-performance and thinking workloads.
Google state-of-the-art Gemini 2.5 model for coding, complex reasoning, and long-context multimodal workloads.
Google most cost-efficient Gemini 3.1 model for high-volume agentic, translation, and simple processing workloads.
Preview release of Gemini 3.1 Flash-Lite for high-volume lightweight workloads.
Google Gemini 3.1 Pro Preview model for multimodal understanding, agentic workflows, and coding tasks.
Google speed-oriented Gemini 3.5 model with frontier intelligence, search, and grounding support.
Translate gemini-3.5-live-translate-preview Try it in Google AI Studio Our low-latency, real-time speech to speech translation model that supports 70+ languages.
Google Gemini 3 Flash preview model for fast multimodal workloads with thinking-token output pricing.
Preview models may change before becoming stable and have more restrictive rate limits.
Google Lyria 3 model for short music clips up to 30 seconds.
Google Lyria 3 model for full-song music generation.
Google embodied reasoning model for robotics and physical-world task planning.
Google Veo 3.1 video generation model priced by generated video seconds.
Google Veo 3.1 fast video generation model.
Google lower-cost Veo 3.1 Lite video generation model.