
Redis

An open-source, in-memory data structure store used as a cache, message broker, and database. Redis supports strings, hashes, lists, sets, sorted sets, and streams, providing sub-millisecond latency for read and write operations.

Redis stores data in memory, making it orders of magnitude faster than disk-based databases for read-heavy workloads. Beyond simple key-value caching, Redis supports complex data structures, pub/sub messaging, Lua scripting, and transactions. For production deployments, Redis Cluster shards data across nodes for horizontal scaling, while Redis Sentinel provides monitoring and automatic failover for non-clustered instances.
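To make the data structures above concrete, here is a minimal sketch of how strings, hashes, and sorted sets behave. MiniRedis is a hypothetical in-memory stand-in written for this example so the snippet runs without a server; the method names mirror the corresponding Redis commands (SET/GET, HSET/HGETALL, ZADD/ZREVRANGE), and real code would issue them through a client library such as redis-py against a running instance.

```python
class MiniRedis:
    """Hypothetical in-memory stand-in for a Redis client (illustration only).

    In production you would use a real client, e.g. redis-py:
    r = redis.Redis(host="localhost", port=6379), with the same command names.
    """

    def __init__(self):
        self.data = {}

    # Strings: plain key -> value
    def set(self, key, value):
        self.data[key] = value

    def get(self, key):
        return self.data.get(key)

    # Hashes: key -> {field: value}, like a per-key record
    def hset(self, key, field, value):
        self.data.setdefault(key, {})[field] = value

    def hgetall(self, key):
        return dict(self.data.get(key, {}))

    # Sorted sets: key -> {member: score}, read back ordered by score
    def zadd(self, key, mapping):
        self.data.setdefault(key, {}).update(mapping)

    def zrevrange(self, key, start, stop, withscores=False):
        ranked = sorted(self.data.get(key, {}).items(),
                        key=lambda kv: kv[1], reverse=True)
        sliced = ranked[start:stop + 1]  # Redis ranges are stop-inclusive
        return sliced if withscores else [member for member, _ in sliced]


r = MiniRedis()
r.set("session:42", "user-token")                              # string
r.hset("user:42", "plan", "pro")                               # hash field
r.zadd("leaderboard", {"alice": 310, "bob": 220, "carol": 450})  # sorted set
print(r.zrevrange("leaderboard", 0, 1))  # top two members by score
```

The sorted-set read illustrates why the structure suits ranking workloads: members come back already ordered by score, so fetching a top-N slice is a single command rather than a sort on the application side.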

For AI product teams, Redis serves multiple critical roles. It caches model inference results to avoid redundant computation, stores feature vectors for real-time model serving, manages session data for personalized experiences, and powers rate limiting for API endpoints. Growth teams use Redis for real-time analytics dashboards, experiment assignment storage that must be both fast and consistent, and event counting for trigger-based automation. The sorted set data structure is particularly useful for leaderboards, trending content algorithms, and priority queues that feed AI processing pipelines. Redis's combination of speed, versatility, and operational maturity makes it a foundational component in nearly every AI product's infrastructure stack.
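One of the roles above, rate limiting for API endpoints, can be sketched with the common fixed-window pattern: INCR a per-window counter key and EXPIRE it on first use so stale windows clean themselves up. The MiniRedis class below is a hypothetical stand-in (with an explicit `now` parameter so the example is deterministic), not the redis-py API; the `allow_request` helper and its parameters are illustrative names.

```python
import time


class MiniRedis:
    """Hypothetical in-memory stand-in modeling Redis INCR/EXPIRE semantics."""

    def __init__(self):
        self.data = {}
        self.expiry = {}

    def _evict(self, key, now):
        # Drop the key if its TTL has elapsed, as Redis does lazily on access.
        if key in self.expiry and now >= self.expiry[key]:
            self.data.pop(key, None)
            self.expiry.pop(key, None)

    def incr(self, key, now=None):
        now = time.time() if now is None else now
        self._evict(key, now)
        self.data[key] = self.data.get(key, 0) + 1
        return self.data[key]

    def expire(self, key, seconds, now=None):
        now = time.time() if now is None else now
        self.expiry[key] = now + seconds


def allow_request(client, api_key, limit=5, window=60, now=None):
    """Fixed-window limiter: count requests per (key, window) bucket."""
    now = time.time() if now is None else now
    bucket = f"rate:{api_key}:{int(now // window)}"  # one counter per window
    count = client.incr(bucket, now=now)
    if count == 1:
        client.expire(bucket, window, now=now)  # bucket expires with the window
    return count <= limit


r = MiniRedis()
results = [allow_request(r, "abc", limit=3, window=60, now=100.0)
           for _ in range(5)]
print(results)  # first 3 requests allowed, the rest denied
```

Against a real Redis instance, the INCR and EXPIRE calls should be made atomic, typically via a short Lua script or a MULTI/EXEC transaction as mentioned above, so a crash between the two commands cannot leave a counter without a TTL.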

Related Terms

Content Delivery Network

A geographically distributed network of proxy servers that caches and delivers content from locations closest to end users. CDNs reduce latency, improve load times, and absorb traffic spikes by serving content from edge nodes rather than a single origin server.

Edge Computing

A distributed computing paradigm that processes data closer to the source of generation rather than in a centralized data center. Edge computing reduces latency, conserves bandwidth, and enables real-time processing for latency-sensitive applications.

Serverless Computing

A cloud execution model where the provider dynamically manages server allocation and scaling. Developers deploy functions or containers without provisioning infrastructure, paying only for actual compute time consumed rather than reserved capacity.

Function as a Service

A serverless computing category where developers deploy individual functions that execute in response to events. FaaS platforms like AWS Lambda, Google Cloud Functions, and Azure Functions handle all infrastructure management, scaling each function independently.

Platform as a Service

A cloud computing model that provides a complete development and deployment environment without managing underlying infrastructure. PaaS offerings like Heroku, Vercel, and Google App Engine handle servers, storage, networking, and runtime configuration.

Infrastructure as a Service

A cloud computing model that provides virtualized computing resources over the internet. IaaS offerings like AWS EC2, Google Compute Engine, and Azure Virtual Machines give teams full control over servers, storage, and networking without owning physical hardware.