Overview
Direct Answer
A spot instance is a cloud computing service that offers access to spare server capacity at discounted rates, typically 70–90% below on-demand pricing, in exchange for the possibility of termination when capacity is reclaimed. Cloud providers allocate these instances from unused infrastructure, making them ideal for workloads that tolerate interruption.
How It Works
Cloud providers maintain a pool of underutilised compute resources and offer them at auction-like pricing that fluctuates based on real-time supply and demand. When an organisation's bid price falls below the current market rate or when capacity is needed for higher-priority workloads, instances are terminated with minimal notice—typically two minutes. Users specify bid prices and interruption tolerance profiles to manage deployment risk.
Why It Matters
Organisations pursuing aggressive cost optimisation can reduce compute expenses substantially whilst maintaining performance for non-critical workloads. This addresses budget constraints in data processing, testing environments, and batch jobs where fault tolerance is inherent to the application architecture.
Common Applications
Typical use cases include machine learning model training, big data analytics processing, batch rendering for media production, and development-stage testing environments. Financial services firms use them for risk simulation, whilst research institutions leverage them for computationally intensive scientific modelling.
Key Considerations
Sudden interruptions require robust retry logic and state persistence mechanisms; unsuitable for stateful applications or time-sensitive transactional workloads. Cost savings must be weighed against engineering complexity required to handle interruption gracefully.
Cross-References(1)
More in Cloud Computing
Multi-Cloud Strategy
Strategy & EconomicsAn approach that distributes workloads across multiple cloud providers to avoid vendor lock-in, optimise costs, meet regulatory requirements, and improve resilience.
Reserved Instance
Strategy & EconomicsA cloud pricing model where users commit to a specific resource configuration for a term in exchange for discounted rates.
Container
InfrastructureA lightweight, portable software package that bundles application code with all its dependencies for consistent execution.
Hybrid Cloud
Strategy & EconomicsAn IT architecture combining on-premises infrastructure with public and private cloud services.
Content Delivery Network
Architecture PatternsA distributed network of servers that delivers web content to users based on their geographic location.
gRPC
Architecture PatternsA high-performance remote procedure call framework developed by Google using Protocol Buffers for serialisation.
Virtual Machine
InfrastructureA software emulation of a physical computer that runs an operating system and applications independently.
Terraform
Deployment & OperationsAn open-source infrastructure as code tool for building, changing, and versioning infrastructure safely and efficiently.