Spot Instance — Technology Wiki

Overview

Direct Answer

A spot instance is a cloud computing service that offers access to spare server capacity at discounted rates, typically 70–90% below on-demand pricing, in exchange for the possibility of termination when capacity is reclaimed. Cloud providers allocate these instances from unused infrastructure, making them ideal for workloads that tolerate interruption.

How It Works

Cloud providers maintain a pool of underutilised compute resources and offer them at auction-like pricing that fluctuates based on real-time supply and demand. When an organisation's bid price falls below the current market rate or when capacity is needed for higher-priority workloads, instances are terminated with minimal notice—typically two minutes. Users specify bid prices and interruption tolerance profiles to manage deployment risk.

Why It Matters

Organisations pursuing aggressive cost optimisation can reduce compute expenses substantially whilst maintaining performance for non-critical workloads. This addresses budget constraints in data processing, testing environments, and batch jobs where fault tolerance is inherent to the application architecture.

Common Applications

Typical use cases include machine learning model training, big data analytics processing, batch rendering for media production, and development-stage testing environments. Financial services firms use them for risk simulation, whilst research institutions leverage them for computationally intensive scientific modelling.

Key Considerations

Sudden interruptions require robust retry logic and state persistence mechanisms; unsuitable for stateful applications or time-sensitive transactional workloads. Cost savings must be weighed against engineering complexity required to handle interruption gracefully.

Cross-References(1)

Cloud Computing

Related in Service Models

Cloud Computing

The delivery of computing services — servers, storage, databases, networking, software — over the internet on demand.

Infrastructure as a Service

Cloud computing model providing virtualised computing resources like servers, storage, and networking over the internet.

Platform as a Service

Cloud computing model that provides a platform for developers to build, deploy, and manage applications without managing infrastructure.

Software as a Service

Cloud computing model that delivers software applications over the internet on a subscription basis.

Function as a Service

A serverless cloud computing model where individual functions are executed in response to events.

Serverless Computing

A cloud execution model where the provider dynamically allocates resources, charging only for actual compute time used.

Cloud-Native

An approach to building applications that fully exploit cloud computing advantages like elasticity, resilience, and automation.

Private Cloud

Cloud computing resources used exclusively by a single organisation, either on-premises or hosted by a third party.

Public Cloud

Cloud computing resources shared among multiple organisations and available to the general public over the internet.

Managed Service

A cloud service where the provider handles infrastructure management, maintenance, updates, and monitoring.

Cloud Cost Optimisation

Strategies and practices for minimising cloud computing expenses while maintaining performance and functionality.

Cloud Orchestration

The automated arrangement, coordination, and management of complex cloud computing systems and services.

More in Cloud Computing

AI Infrastructure

Service Models

The specialised hardware, software, and networking stack required to train and serve AI models at scale, including GPU clusters, high-bandwidth interconnects, and model serving frameworks.

API

Architecture Patterns

Application Programming Interface — a set of protocols and tools for building and integrating software applications.

Cloud Database

Strategy & Economics

A database service built, deployed, and accessed through a cloud platform, offering scalability and managed operations.

Internal Developer Portal

Deployment & Operations

A centralised web interface that provides developers with self-service access to infrastructure, services, documentation, and templates within their organisation.

Container

Infrastructure

A lightweight, portable software package that bundles application code with all its dependencies for consistent execution.

Cloud Migration

Deployment & Operations

The process of moving data, applications, and workloads from on-premises infrastructure to cloud environments.

Load Balancer

Infrastructure

A device or software that distributes network traffic across multiple servers to ensure no single server is overwhelmed.

Disaster Recovery as a Service

Deployment & Operations

A cloud computing model that enables the replication and recovery of infrastructure and data in the cloud.