Google Deepmind | Fone Arena

Gemini 3.5 Flash gets native computer use for AI agents

Google has announced that computer use is now a built-in tool in Gemini 3.5 Flash, enabling developers to build AI agents that can interact across platforms, including browser, desktop, and mobile environments. Previously available only as a standalone Gemini 2.5 computer use model, the capability is now integrated directly into Gemini 3.5 Flash. Continue reading “Gemini 3.5 Flash gets native computer use for AI agents”

Google unveils DiffusionGemma open AI model with up to 4x faster text generation

Google has introduced DiffusionGemma, an experimental open-weight AI model that explores diffusion-based text generation. Released under the Apache 2.0 license, the 26-billion-parameter Mixture-of-Experts (MoE) model moves beyond the sequential token-by-token generation used by traditional autoregressive large language models, instead generating and refining entire blocks of text simultaneously. Continue reading “Google unveils DiffusionGemma open AI model with up to 4x faster text generation”

Google unveils Gemma 4 12B for local AI agents, coding, and multimodal reasoning

Google DeepMind has introduced Gemma 4 12B, a new open-weight multimodal model designed to bring agentic intelligence directly to laptops with mobile-first efficiency and advanced reasoning. Continue reading “Google unveils Gemma 4 12B for local AI agents, coding, and multimodal reasoning”

Google unveils TPU 8t and TPU 8i chips for agentic AI and reasoning workloads

At Google Cloud Next, Google announced its eighth-generation Tensor Processing Units (TPUs), introducing two purpose-built architectures: TPU 8t and TPU 8i. These chips are designed to support large-scale AI workloads, from model training and development to high-volume inference and agent-based systems. Continue reading “Google unveils TPU 8t and TPU 8i chips for agentic AI and reasoning workloads”

Gemini app gets AI music creation powered by Lyria 3 in beta

Google has added AI-powered music creation to the Gemini app. The feature uses Lyria 3, the latest generative music model from Google DeepMind, and is rolling out in beta. Continue reading “Gemini app gets AI music creation powered by Lyria 3 in beta”

Google rolls out Project Genie for creating interactive AI worlds

Google has started rolling out Project Genie, an experimental research prototype that allows users to create, explore, and remix interactive virtual worlds. Continue reading “Google rolls out Project Genie for creating interactive AI worlds”

Gemini 3 Flash gets Agentic Vision with code-based image analysis

Google DeepMind has introduced Agentic Vision in Gemini 3 Flash, a new capability that changes how the model understands images. Instead of analyzing visuals in a single, static pass, the model can now actively investigate images through step-by-step reasoning supported by code execution. Continue reading “Gemini 3 Flash gets Agentic Vision with code-based image analysis”

Google adds SynthID-based AI image verification to the Gemini app

Google is expanding its tools for identifying AI-generated content by bringing SynthID-based image verification directly to the Gemini app. The update is designed to give users clear context about whether an image was created or edited using Google’s AI models. Continue reading “Google adds SynthID-based AI image verification to the Gemini app”

Google DeepMind unveils SIMA 2 with reasoning and goal-oriented AI capabilities

Google DeepMind introduced SIMA 2, the latest iteration of its generalist AI research, building on last year’s SIMA (Scalable Instructable Multiworld Agent). The original SIMA could follow instructions across multiple virtual environments, performing over 600 language-based tasks such as “turn left,” “climb the ladder,” and “open the map.” Continue reading “Google DeepMind unveils SIMA 2 with reasoning and goal-oriented AI capabilities”

Google rolls out Veo 3.1 in Flow for precise and immersive video editing

Google introduced Veo 3.1, enhancing its Flow platform with richer audio, improved realism, and stronger adherence to prompts. The update adds audio to existing features like Ingredients to Video, Frames to Video, and Extend, giving creators precise control over their scenes. Continue reading “Google rolls out Veo 3.1 in Flow for precise and immersive video editing”

Google DeepMind unveils Genie 3 with unsupervised learning for interactive 3D environments

Google DeepMind has announced Genie 3, an advanced world model capable of generating interactive 3D environments from just a single image. The model, trained without supervision or environment labels, allows users to control the character in a simulated world derived from the input image. Continue reading “Google DeepMind unveils Genie 3 with unsupervised learning for interactive 3D environments”

Google unveils Gemma 3 lightweight AI models for all devices

Google today introduced Gemma 3, a series of advanced, lightweight open models developed using the same research behind its Gemini 2.0 models. Clement Farabet, VP of Research at Google DeepMind, described them as “our most advanced, portable, and responsibly developed open models yet.” Continue reading “Google unveils Gemma 3 lightweight AI models for all devices”

Google unveils Veo 2 and Imagen 3 with advanced capabilities

Google on Monday unveiled two advanced AI models—Veo 2 for video generation and Imagen 3 for image generation—both designed to deliver state-of-the-art results. These models are now available through VideoFX, ImageFX, and the new Google Labs experiment, Whisk. Continue reading “Google unveils Veo 2 and Imagen 3 with advanced capabilities”

Google unveils ‘Gemma’ lightweight Open Models for AI development

Google has unveiled Gemma, a new series of open models designed to support developers and researchers in responsibly building AI systems. Continue reading “Google unveils ‘Gemma’ lightweight Open Models for AI development”