The new reference architecture for AI on the cloud

A pattern has emerged

After thousands of deployments across the industry, the shape of a sound AI platform is no longer mysterious. It is a set of layers, each with a clear job, that you can build on your existing cloud rather than a single monolith.

Knowing the pattern lets you avoid the two common failures: a brittle prototype that cannot scale, and an over-built platform that ships nothing.

The layers that matter

At the base sits trustworthy data and pipelines. Above it, a retrieval layer that grounds models in your knowledge with sources. Then the models themselves, behind a routing layer so they stay swappable. Then the agent and orchestration logic that does the work. Wrapping all of it: guardrails, evaluations, and observability, plus the access controls security and legal require.

The discipline is to keep models swappable. Vendors and capabilities change every few months; an architecture that hard-wires one model ages badly.

Governed from day one

The teams that scale build observability and governance in from the first use case, not after an incident. Tracing, cost controls, evaluations, and audit trails are not a later phase. They are what lets the second, fifth, and twentieth use case ship quickly and safely on the same foundation.

The new reference architecture for AI on the cloud

A pattern has emerged

The layers that matter

Governed from day one

Key takeaways

Related perspectives

The AI blueprints worth building right now

Private AI: when to own your compute

Find your highest-value AI move in two minutes