Meta expands AWS partnership with large-scale deployment of graviton processors

Meta has finalized an agreement to deploy AWS Graviton processors at scale, marking a substantial expansion of its long-standing partnership with Amazon Web Services (AWS). The initial deployment will encompass tens of millions of Graviton cores, with provisions to scale further as Meta develops its next-generation artificial intelligence (AI) infrastructure.

This agreement highlights a broader industry shift in how AI infrastructure is architected, balancing the established use of hardware alongside newly optimized processors for emerging workloads.

The Shift Toward Agentic AI Workloads

While Graphics Processing Units (GPUs) remain foundational for training large AI models, the increasing prevalence of agentic AI—autonomous systems designed to reason, plan, and execute complex workflows—is generating massive demand for CPU-intensive infrastructure.

Meta is utilizing the Graviton deployment to support these specific agentic workloads, which include:

  • Real-time reasoning: Allowing autonomous systems to process and react to live data.
  • Code generation: Supporting the automated writing and structuring of software.
  • Search and orchestration: Coordinating complex, multi-step tasks that require managing billions of interactions simultaneously.

Purpose-built processors are currently viewed as the most efficient method for powering these CPU-bound operations at scale.

Technical Specifications: Graviton5 and AWS Nitro

To meet the demands of its frontier models, Meta’s infrastructure will rely heavily on the AWS Graviton5 processor. Engineered specifically for high-performance computing, the chips offer several structural upgrades over previous generations:

  • Core Architecture: Features 192 cores designed to handle continuous data processing.
  • Expanded Cache: Utilizes a cache that is five times larger than its predecessor, reducing inter-core communication delays by up to 33% and increasing overall bandwidth.
  • AWS Nitro System: The processors are built on the AWS Nitro System, integrating dedicated hardware and software to ensure high performance and security. This system enables bare-metal instances, granting Meta direct hardware access alongside familiar tools like the Elastic Network Adapter (ENA) and Amazon Elastic Block Store (EBS) to run virtual machines without performance degradation.
  • Elastic Fabric Adapter (EFA): Graviton5 instances support EFA, which facilitates the low-latency, high-bandwidth communication necessary to distribute large-scale AI tasks across multiple processors efficiently.
Energy Efficiency and Sustainability

As AI computation demands escalate globally, both cost management and environmental impact have become central to infrastructure planning.

AWS Graviton5 processors are manufactured using 3-nanometer chip technology. This smaller, more precise manufacturing process inherently yields more efficient processors. Because AWS oversees the entire pipeline—from fundamental chip design to server architecture integration—the hardware can be heavily optimized compared to off-the-shelf alternatives.

Consequently, Graviton5 delivers up to 25% better performance than the previous generation while maintaining leading energy efficiency. This allows organizations like Meta to scale their AI operations and deliver personalized experiences globally while remaining aligned with corporate sustainability targets.

Commenting on this, Nafea Bshara, Vice President and Distinguished Engineer, Amazon, said:

This isn’t just about chips; it’s about giving customers the infrastructure foundation, as well as data and inference services, to build AI that understands, anticipates, and scales efficiently to billions of people worldwide. Meta’s expanded partnership, deploying tens of millions of Graviton cores, shows what happens when you combine purpose-built silicon with the full AWS AI stack to power the next generation of agentic AI.

Santosh Janardhan, Head of Infrastructure, Meta, said:

As we scale the infrastructure behind Meta’s AI ambitions, diversifying our compute sources is a strategic imperative. AWS has been a trusted cloud partner for years, and expanding to Graviton allows us to run the CPU-intensive workloads behind agentic AI with the performance and efficiency we need at our scale.


Srivatsan Sridhar: Srivatsan Sridhar is a Mobile Technology Enthusiast who is passionate about Mobile phones and Mobile apps. He uses the phones he reviews as his main phone. You can follow him on Twitter and Instagram
Related Post