Gemini app gets AI music creation powered by Lyria 3 in beta

Google has added AI-powered music creation to the Gemini app. The feature uses Lyria 3, the latest generative music model from Google DeepMind, and is rolling out in beta. Continue reading “Gemini app gets AI music creation powered by Lyria 3 in beta”

Google rolls out Project Genie for creating interactive AI worlds

Google has started rolling out Project Genie, an experimental research prototype that allows users to create, explore, and remix interactive virtual worlds. Continue reading “Google rolls out Project Genie for creating interactive AI worlds”

Gemini 3 Flash gets Agentic Vision with code-based image analysis

Google DeepMind has introduced Agentic Vision in Gemini 3 Flash, a new capability that changes how the model understands images. Instead of analyzing visuals in a single, static pass, the model can now actively investigate images through step-by-step reasoning supported by code execution. Continue reading “Gemini 3 Flash gets Agentic Vision with code-based image analysis”

Google DeepMind unveils SIMA 2 with reasoning and goal-oriented AI capabilities

Google DeepMind introduced SIMA 2, the latest iteration of its generalist AI research, building on last year’s SIMA (Scalable Instructable Multiworld Agent). The original SIMA could follow instructions across multiple virtual environments, performing over 600 language-based tasks such as “turn left,” “climb the ladder,” and “open the map.” Continue reading “Google DeepMind unveils SIMA 2 with reasoning and goal-oriented AI capabilities”

Google rolls out Veo 3.1 in Flow for precise and immersive video editing

Google introduced Veo 3.1, enhancing its Flow platform with richer audio, improved realism, and stronger adherence to prompts. The update adds audio to existing features like Ingredients to Video, Frames to Video, and Extend, giving creators precise control over their scenes. Continue reading “Google rolls out Veo 3.1 in Flow for precise and immersive video editing”

Google DeepMind unveils Genie 3 with unsupervised learning for interactive 3D environments

Google DeepMind has announced Genie 3, an advanced world model capable of generating interactive 3D environments from just a single image. The model, trained without supervision or environment labels, allows users to control the character in a simulated world derived from the input image. Continue reading “Google DeepMind unveils Genie 3 with unsupervised learning for interactive 3D environments”

Google unveils Gemma 3 lightweight AI models for all devices

Google today introduced Gemma 3, a series of advanced, lightweight open models developed using the same research behind its Gemini 2.0 models. Clement Farabet, VP of Research at Google DeepMind, described them as “our most advanced, portable, and responsibly developed open models yet.” Continue reading “Google unveils Gemma 3 lightweight AI models for all devices”

Google unveils Veo 2 and Imagen 3 with advanced capabilities

Google on Monday unveiled two advanced AI models—Veo 2 for video generation and Imagen 3 for image generation—both designed to deliver state-of-the-art results. These models are now available through VideoFX, ImageFX, and the new Google Labs experiment, Whisk. Continue reading “Google unveils Veo 2 and Imagen 3 with advanced capabilities”

Google unveils ‘Gemma’ lightweight Open Models for AI development

Google has unveiled Gemma, a new series of open models designed to support developers and researchers in responsibly building AI systems. Continue reading “Google unveils ‘Gemma’ lightweight Open Models for AI development”