Cloud ComputingService Models

Serverless Computing

Overview

Direct Answer

Serverless computing is a cloud execution model in which developers deploy code without provisioning or managing underlying infrastructure, with the provider automatically scaling resources and charging exclusively for actual execution time measured in milliseconds.

How It Works

Developers package functions or application code, which the cloud provider executes on-demand when triggered by events such as API requests, database changes, or scheduled timers. The platform abstracts infrastructure management, automatically scaling compute capacity across multiple isolated execution environments and terminating instances immediately after code execution completes.

Why It Matters

Organisations reduce operational overhead by eliminating server management responsibilities whilst optimising costs through per-millisecond billing rather than reserved capacity. This model accelerates time-to-market for event-driven workloads and enables teams to focus engineering effort on business logic rather than infrastructure operations.

Common Applications

Real-world use cases include RESTful API backends triggered by HTTP requests, image processing pipelines activated by file uploads to cloud storage, real-time data processing from streaming sources, and scheduled batch jobs. Financial services use this model for transaction processing, whilst media and entertainment organisations employ it for content transformation workflows.

Key Considerations

Cold start latency—the delay when initialising a new execution environment—can impact latency-sensitive applications. Organisations must also evaluate vendor lock-in risks, monitor distributed function behaviour across multiple invocations, and manage stateless architecture constraints.

More in Cloud Computing