Google DeepMind unveils SIMA 2 with reasoning and goal-oriented AI capabilities

Google DeepMind introduced SIMA 2, the latest iteration of its generalist AI research, building on last year’s SIMA (Scalable Instructable Multiworld Agent). The original SIMA could follow instructions across multiple virtual environments, performing over 600 language-based tasks such as “turn left,” “climb the ladder,” and “open the map.”

Google rolls out Veo 3.1 in Flow for precise and immersive video editing

Google introduced Veo 3.1, enhancing its Flow platform with richer audio, improved realism, and stronger adherence to prompts. The update adds audio to existing features like Ingredients to Video, Frames to Video, and Extend, giving creators precise control over their scenes.

Google DeepMind unveils Genie 3 with unsupervised learning for interactive 3D environments

Google DeepMind has announced Genie 3, an advanced world model capable of generating interactive 3D environments from just a single image. The model, trained without supervision or environment labels, allows users to control the character in a simulated world derived from the input image.

Google unveils Gemma 3 lightweight AI models for all devices

Google today introduced Gemma 3, a series of advanced, lightweight open models developed using the same research behind its Gemini 2.0 models. Clement Farabet, VP of Research at Google DeepMind, described them as “our most advanced, portable, and responsibly developed open models yet.”

Google unveils Veo 2 and Imagen 3 with advanced capabilities

Google on Monday unveiled two advanced AI models, Veo 2 for video generation and Imagen 3 for image generation, both designed to deliver state-of-the-art results. These models are now available through VideoFX, ImageFX, and the new Google Labs experiment, Whisk.

Google unveils ‘Gemma’ lightweight Open Models for AI development

Google has unveiled Gemma, a new series of open models designed to support developers and researchers in responsibly building AI systems.