Overview
Direct Answer
A vector database is a specialised data system optimised for storing and efficiently querying high-dimensional numerical embeddings—dense representations of text, images, or other unstructured data. Unlike traditional relational databases, it indexes and retrieves records based on semantic similarity rather than exact matching or structured relationships.
How It Works
Vector databases employ approximate nearest-neighbour (ANN) search algorithms, such as Hierarchical Navigable Small World (HNSW) graphs or product quantisation, to enable fast similarity lookups across millions of embeddings without exhaustive comparison. Data is organised using spatial indexing structures that partition high-dimensional space, allowing the system to retrieve semantically related records by computing distance metrics, typically cosine similarity or Euclidean distance, between query vectors and stored embeddings.
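The distance computation at the heart of this process can be sketched in a few lines. The example below is a minimal illustration using brute-force cosine similarity over a toy corpus; the function name and data are invented for this sketch, and a real vector database would replace the exhaustive scan with an ANN index such as HNSW, while the underlying similarity measure stays the same.

```python
import numpy as np

def cosine_top_k(query, embeddings, k=3):
    """Exact nearest-neighbour search by cosine similarity.

    Illustrative only: production systems avoid this exhaustive
    scan with ANN indexes, but compute the same distances."""
    # Normalise so a dot product equals cosine similarity.
    q = query / np.linalg.norm(query)
    e = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    scores = e @ q                     # one similarity per stored vector
    top = np.argsort(-scores)[:k]      # indices of the k best matches
    return top, scores[top]

# Toy corpus of four 4-dimensional "embeddings".
corpus = np.array([
    [1.0, 0.0, 0.0, 0.0],
    [0.9, 0.1, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
])
idx, sims = cosine_top_k(np.array([1.0, 0.05, 0.0, 0.0]), corpus, k=2)
# The two rows closest in direction to the query come back first.
```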
Why It Matters
As organisations deploy large language models and retrieval-augmented generation systems, the ability to rapidly search semantic meaning at scale becomes critical to system performance and cost efficiency. Vector databases eliminate the need for exhaustive similarity computations, significantly reducing latency and infrastructure overhead in production language model applications.
Common Applications
Applications include semantic search across document repositories, recommendation engines based on user behaviour embeddings, and retrieval components in retrieval-augmented generation pipelines for chatbots and enterprise question-answering systems. Image similarity search and anomaly detection in high-dimensional feature spaces represent additional use cases across computer vision workflows.
Key Considerations
Practitioners must balance indexing speed and memory footprint against query recall; aggressive approximation (coarser quantisation, sparser index graphs, fewer probed partitions) trades accuracy for speed. Integration with embedding generation pipelines requires careful management of vector dimensionality, encoding consistency across embedding model versions, and the cost of continuous re-indexing as data evolves.
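The speed/accuracy trade-off can be made concrete with a toy inverted-file (IVF-style) index: vectors are assigned to coarse cells, and queries probe only the nearest few cells, so some true neighbours are missed. Everything below (function names, cell counts, the synthetic data) is an invented sketch for illustration, not a production index.

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(size=(2000, 32)).astype(np.float32)
query = rng.normal(size=32).astype(np.float32)

def exact_top_k(q, x, k):
    """Ground truth: exhaustive Euclidean-distance scan."""
    d = np.linalg.norm(x - q, axis=1)
    return set(np.argsort(d)[:k])

def ivf_top_k(q, x, k, n_cells=16, n_probe=2):
    """Toy IVF index: cluster into coarse cells, probe only a few.

    Probing fewer cells is faster but can miss true neighbours
    that fall in unprobed cells -- the recall trade-off."""
    centroids = x[rng.choice(len(x), n_cells, replace=False)]
    assign = np.argmin(np.linalg.norm(x[:, None] - centroids, axis=2), axis=1)
    probe = np.argsort(np.linalg.norm(centroids - q, axis=1))[:n_probe]
    cand = np.where(np.isin(assign, probe))[0]   # search only probed cells
    d = np.linalg.norm(x[cand] - q, axis=1)
    return set(cand[np.argsort(d)[:k]])

exact = exact_top_k(query, data, 10)
approx = ivf_top_k(query, data, 10)
recall = len(exact & approx) / 10    # fraction of true neighbours recovered
```

Raising `n_probe` (or `n_cells` at build time) moves recall toward 1.0 at the cost of scanning more candidates per query, which is the knob real systems expose.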