Reserved Instance — Technology Wiki

Overview

Direct Answer

A Reserved Instance is a cloud pricing model in which users make an upfront commitment to use a specified amount of compute resources for a fixed term (typically 1–3 years) in exchange for substantial discounts compared to on-demand pricing. This model trades flexibility for cost savings.

How It Works

Users select a resource configuration—such as instance type, region, and operating system—and commit to a contractual term. The cloud provider reserves capacity and applies a discounted hourly rate for the committed period. Unused capacity remains the user's financial responsibility; conversely, usage beyond the reservation defaults to on-demand rates.

Why It Matters

Organisations with predictable, long-running workloads can reduce infrastructure costs by 40–70 percent, directly improving total cost of ownership. This model enables financial forecasting and budget certainty, critical for capacity planning in regulated industries and enterprises managing large-scale deployments.

Common Applications

Reserved Instances suit persistent database servers, application servers supporting production environments, and batch processing infrastructure. They are widely adopted in financial services for trading platforms, in healthcare for patient data systems, and in e-commerce for baseline traffic handling.

Key Considerations

Committing to multi-year terms introduces inflexibility; business changes, technology shifts, or workload migrations can render reservations obsolete. Practitioners must carefully forecast demand and consider hybrid strategies combining Reserved Instances with on-demand and spot instances for optimal cost–flexibility balance.

Related in Strategy & Economics

Multi-Cloud

A strategy using services from multiple cloud providers to avoid vendor lock-in and optimise capabilities.

Hybrid Cloud

An IT architecture combining on-premises infrastructure with public and private cloud services.

Cloud Security

The set of policies, technologies, and controls deployed to protect cloud-based systems, data, and infrastructure.

Identity and Access Management

A framework for managing digital identities and controlling user access to resources and systems.

Single Sign-On

An authentication scheme allowing users to log in once and gain access to multiple related systems.

OAuth

An open standard for token-based authentication and authorisation on the internet.

Cloud Database

A database service built, deployed, and accessed through a cloud platform, offering scalability and managed operations.

Cloud Governance

The policies, procedures, and tools for managing cloud resource usage, security, compliance, and costs.

Multi-Tenancy

A software architecture where a single instance serves multiple customers, with each tenant's data isolated and invisible to others.

Cloud Bursting

A configuration where an application runs in a private cloud and bursts into a public cloud when demand spikes.

Cloud-Native Database

A database designed from the ground up to operate in cloud environments with automatic scaling and high availability.

Serverless Database

A database service that automatically provisions, scales, and manages infrastructure on demand without manual server management.

More in Cloud Computing

AI Infrastructure

Service Models

The specialised hardware, software, and networking stack required to train and serve AI models at scale, including GPU clusters, high-bandwidth interconnects, and model serving frameworks.

Managed Service

Service Models

A cloud service where the provider handles infrastructure management, maintenance, updates, and monitoring.

gRPC

Architecture Patterns

A high-performance remote procedure call framework developed by Google using Protocol Buffers for serialisation.

Green Cloud Computing

Service Models

Cloud computing practices that minimise environmental impact through renewable energy usage, efficient cooling, workload consolidation, and carbon-aware scheduling of compute tasks.

Platform as a Service

Service Models

Cloud computing model that provides a platform for developers to build, deploy, and manage applications without managing infrastructure.

Content Delivery Network

Architecture Patterns

A distributed network of servers that delivers web content to users based on their geographic location.

Cloud Migration

Deployment & Operations

The process of moving data, applications, and workloads from on-premises infrastructure to cloud environments.

GraphQL

Architecture Patterns

A query language for APIs that lets clients request exactly the data they need in a single request.