AWS Graviton5: A New Standard for Cloud Compute
Amazon Web Services has launched the M9g and M9gd instances, featuring the new Graviton5 processor, which doubles the core count to 192 per chip using TSMC’s 3nm fabrication process. According to official AWS documentation, the architecture includes 192 MB of L3 cache, DDR5-8800 memory, and PCIe Gen 6 support. These instances introduce the first formally verified hypervisor isolation in a production cloud environment, providing a mathematical guarantee of security for multi-tenant workloads.
Why AWS is positioning Graviton5 for “Agentic AI”
AWS is branding the Graviton5 as a purpose-built solution for agentic AI, real-time reasoning, and multi-step task orchestration. The company argues that modern agents require high-performance CPUs to manage large numbers of concurrent environments while keeping accelerators fed with data. However, industry observers suggest this is primarily a marketing shift for high-end general-purpose hardware. According to discussions on the r/hardware subreddit, developers note that these are standard ARM cores capable of handling any traditional database or web server task, regardless of the AI-focused branding.
Graviton5 chips are manufactured on a 3nm process, allowing AWS to pack 192 cores onto four chiplets, a significant increase in density over previous generations.
Performance Gains in Real-World Production
Early benchmarks from enterprise customers indicate substantial improvements over previous Graviton4 instances. Data shared by AWS customers during the preview period highlights the following gains:
- ClickHouse: Achieved a 36% performance boost compared to M8g instances without requiring code modifications.
- Honeycomb: Reported 36% better throughput per core across production observability workloads.
- HubSpot: Observed query duration drops of up to 60% after migrating MySQL databases to M9g hardware.
Airbnb’s principal performance engineer, Denis Sheahan, publicly described the hardware as some of the fastest EC2 instances the company has ever tested, according to AWS blog reports.
Security: The Role of Formal Verification
The sixth-generation Nitro System introduces the Nitro Isolation Engine, which uses formal verification to secure the hypervisor. Unlike traditional testing, formal verification uses mathematical proofs to establish that the isolation boundary between virtual machines is impenetrable. As noted by The New Stack, this provides a higher level of security for organizations running untrusted code or sensitive multi-tenant applications compared to standard cloud environments.
If you are managing high-demand inference or analytics workloads, consider waiting for the upcoming C9g (compute-optimized) or R9g (memory-optimized) variants in 2026, as these may offer better performance ceilings than the current general-purpose M9g instances.
Cost and Availability Trends
Migration decisions often hinge on the price-performance ratio. Analysis by DevelopersIO found that M9g on-demand rates in us-east-1 are 9% higher than M8g instances. When balanced against the 25% increase in compute performance, users can expect roughly 15% better value per dollar, though actual results depend on the specific workload. These instances are currently available in US East (N. Virginia, Ohio), US West (Oregon), and EU (Frankfurt).

Frequently Asked Questions
What is the main difference between Graviton5 and Graviton4?
Graviton5 doubles the core count to 192, utilizes a 3nm manufacturing process, and introduces PCIe Gen 6, resulting in a 25% compute performance improvement according to AWS.
Are these chips only for AI?
No. While AWS markets them for agentic AI, they are general-purpose ARM processors that perform significantly better on databases, web servers, and batch processing tasks.
What is formal verification in the Nitro System?
It is a security method that uses mathematical proofs to guarantee that virtual machines are isolated from one another, moving beyond traditional testing methods.
Are you planning to migrate your workloads to Graviton5? Share your thoughts or benchmarking experiences in the comments below, or subscribe to our weekly tech briefing for more deep dives into cloud infrastructure.
