Saga Pattern

A pattern for managing distributed transactions across multiple microservices by breaking them into a sequence of local transactions, each paired with a compensating action for rollback. Sagas maintain eventual consistency without distributed locks or two-phase commit.

In a microservices architecture, a single business operation often spans multiple services. Traditional distributed transactions using two-phase commit do not scale well and create tight coupling. The saga pattern instead coordinates a sequence of local transactions: if any step fails, previously completed steps execute compensating transactions to undo their effects. Sagas can be orchestrated by a central coordinator or choreographed through events.
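The orchestrated variant can be sketched in a few lines. This is a minimal illustration, not a production framework: `SagaStep` and `run_saga` are hypothetical names, and each step pairs a local transaction with the compensating action that undoes it.

```python
from dataclasses import dataclass
from typing import Any, Callable

@dataclass
class SagaStep:
    name: str
    action: Callable[[], Any]        # the local transaction
    compensate: Callable[[], None]   # undoes the action if a later step fails

def run_saga(steps: list[SagaStep]) -> bool:
    """Run steps in order; on failure, compensate completed steps in reverse."""
    completed: list[SagaStep] = []
    for step in steps:
        try:
            step.action()
            completed.append(step)
        except Exception:
            # Roll back: compensating transactions execute in reverse order,
            # mirroring how the completed steps were applied.
            for done in reversed(completed):
                done.compensate()
            return False
    return True
```

Compensating in reverse order matters: later steps may depend on the effects of earlier ones, so undoing them first keeps each compensation operating on a state it understands.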

For AI product teams, the saga pattern matters when AI operations span multiple services. Consider an AI-powered order fulfillment system: it might validate the order, generate a personalized recommendation for upsell, reserve inventory, process payment, and trigger delivery. If payment fails, the saga must compensate by releasing the reserved inventory and canceling the recommendation tracking. Growth teams encounter saga-like patterns in complex experiment workflows where experiment assignment, feature flag evaluation, and analytics recording must remain consistent. Understanding sagas helps teams design AI-driven automation that degrades gracefully when individual components fail rather than leaving the system in an inconsistent state.
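The order-fulfillment scenario above can be made concrete with a self-contained sketch. The service calls here are stand-ins (hypothetical functions that only append to a log), and payment is forced to fail so the compensation path runs:

```python
log: list[str] = []

# Stand-in service calls for the saga's local transactions and compensations.
def track_recommendation() -> None:
    log.append("recommendation tracked")

def cancel_recommendation() -> None:
    log.append("recommendation canceled")

def reserve_inventory() -> None:
    log.append("inventory reserved")

def release_inventory() -> None:
    log.append("inventory released")

def process_payment() -> None:
    raise RuntimeError("card declined")  # simulated failure

# Each saga step is an (action, compensation) pair, executed in order.
steps = [
    (track_recommendation, cancel_recommendation),
    (reserve_inventory, release_inventory),
    (process_payment, lambda: None),
]

completed = []
try:
    for action, compensation in steps:
        action()
        completed.append(compensation)
except RuntimeError:
    # Payment failed: undo completed steps in reverse order.
    for compensation in reversed(completed):
        compensation()

print(log)
# → ['recommendation tracked', 'inventory reserved',
#    'inventory released', 'recommendation canceled']
```

The end state is consistent: no inventory stays reserved and no recommendation tracking dangles, even though the business operation as a whole did not complete.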

Related Terms

Content Delivery Network

A geographically distributed network of proxy servers that caches and delivers content from locations closest to end users. CDNs reduce latency, improve load times, and absorb traffic spikes by serving content from edge nodes rather than a single origin server.

Edge Computing

A distributed computing paradigm that processes data closer to the source of generation rather than in a centralized data center. Edge computing reduces latency, conserves bandwidth, and enables real-time processing for latency-sensitive applications.

Serverless Computing

A cloud execution model where the provider dynamically manages server allocation and scaling. Developers deploy functions or containers without provisioning infrastructure, paying only for actual compute time consumed rather than reserved capacity.

Function as a Service

A serverless computing category where developers deploy individual functions that execute in response to events. FaaS platforms like AWS Lambda, Google Cloud Functions, and Azure Functions handle all infrastructure management, scaling each function independently.

Platform as a Service

A cloud computing model that provides a complete development and deployment environment without managing underlying infrastructure. PaaS offerings like Heroku, Vercel, and Google App Engine handle servers, storage, networking, and runtime configuration.

Infrastructure as a Service

A cloud computing model that provides virtualized computing resources over the internet. IaaS offerings like AWS EC2, Google Compute Engine, and Azure Virtual Machines give teams full control over servers, storage, and networking without owning physical hardware.