Overview
Direct Answer
A Reserved Instance is a cloud pricing model in which users make an upfront commitment to use a specified amount of compute resources for a fixed term (typically 1–3 years) in exchange for substantial discounts compared to on-demand pricing. This model trades flexibility for cost savings.
How It Works
Users select a resource configuration—such as instance type, region, and operating system—and commit to a contractual term. The cloud provider reserves capacity and applies a discounted hourly rate for the committed period. Unused capacity remains the user's financial responsibility; conversely, usage beyond the reservation defaults to on-demand rates.
Why It Matters
Organisations with predictable, long-running workloads can reduce infrastructure costs by 40–70 percent, directly improving total cost of ownership. This model enables financial forecasting and budget certainty, critical for capacity planning in regulated industries and enterprises managing large-scale deployments.
Common Applications
Reserved Instances suit persistent database servers, application servers supporting production environments, and batch processing infrastructure. They are widely adopted in financial services for trading platforms, in healthcare for patient data systems, and in e-commerce for baseline traffic handling.
Key Considerations
Committing to multi-year terms introduces inflexibility; business changes, technology shifts, or workload migrations can render reservations obsolete. Practitioners must carefully forecast demand and consider hybrid strategies combining Reserved Instances with on-demand and spot instances for optimal cost–flexibility balance.
More in Cloud Computing
AI Infrastructure
Service ModelsThe specialised hardware, software, and networking stack required to train and serve AI models at scale, including GPU clusters, high-bandwidth interconnects, and model serving frameworks.
Managed Service
Service ModelsA cloud service where the provider handles infrastructure management, maintenance, updates, and monitoring.
gRPC
Architecture PatternsA high-performance remote procedure call framework developed by Google using Protocol Buffers for serialisation.
Green Cloud Computing
Service ModelsCloud computing practices that minimise environmental impact through renewable energy usage, efficient cooling, workload consolidation, and carbon-aware scheduling of compute tasks.
Platform as a Service
Service ModelsCloud computing model that provides a platform for developers to build, deploy, and manage applications without managing infrastructure.
Content Delivery Network
Architecture PatternsA distributed network of servers that delivers web content to users based on their geographic location.
Cloud Migration
Deployment & OperationsThe process of moving data, applications, and workloads from on-premises infrastructure to cloud environments.
GraphQL
Architecture PatternsA query language for APIs that lets clients request exactly the data they need in a single request.