Skip to content
Perspectives
Cloud 6 min read · April 2026

The new reference architecture for AI on the cloud

The pieces have settled into a pattern. Knowing the shape of a good AI platform is half the battle.

The new reference architecture for AI on the cloud

A pattern has emerged

After thousands of deployments across the industry, the shape of a sound AI platform is no longer mysterious. It is a set of layers, each with a clear job, that you can build on your existing cloud rather than a single monolith.

Knowing the pattern lets you avoid the two common failures: a brittle prototype that cannot scale, and an over-built platform that ships nothing.

The layers that matter

At the base sits trustworthy data and pipelines. Above it, a retrieval layer that grounds models in your knowledge with sources. Then the models themselves, behind a routing layer so they stay swappable. Then the agent and orchestration logic that does the work. Wrapping all of it: guardrails, evaluations, and observability, plus the access controls security and legal require.

The discipline is to keep models swappable. Vendors and capabilities change every few months; an architecture that hard-wires one model ages badly.

Governed from day one

The teams that scale build observability and governance in from the first use case, not after an incident. Tracing, cost controls, evaluations, and audit trails are not a later phase. They are what lets the second, fifth, and twentieth use case ship quickly and safely on the same foundation.

Key takeaways

  • A good AI platform is layered: data, retrieval, models, agents, guardrails, and observability.
  • Treat models as swappable behind a routing layer, not as the center of the design.
  • Build governance and observability in from the start, not as an afterthought.

Find your highest-value AI move in two minutes

Take the assessment. You get your readiness score and a clear place to start, then we turn it into a fixed-price plan in one working session.