Overview
Direct Answer
Question Answering (QA) is an NLP task in which a system identifies and extracts or generates precise answers to natural language questions, typically by retrieving and reasoning over provided source documents or knowledge bases. It requires both semantic understanding of the question and contextual matching or inference within reference material.
How It Works
QA systems employ two primary architectures: retrieval-based systems locate relevant document passages and extract answer spans using ranking and span prediction models, whilst generative systems use sequence-to-sequence models to synthesise answers from context. Modern approaches leverage transformer-based encoders to encode questions and documents jointly, then apply classification layers to identify answer boundaries or generate token sequences via decoding.
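The retrieval-based path described above can be sketched with plain Python. This is a deliberately minimal illustration: simple token overlap stands in for a learned ranking model, and a sentence-selection heuristic stands in for a trained span-prediction head; all function names here are illustrative, not from any particular library.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase word tokens; a stand-in for a real subword tokenizer."""
    return re.findall(r"[a-z0-9]+", text.lower())

def rank_passages(question, passages):
    """Score each passage by token overlap with the question (toy ranker)."""
    q = Counter(tokenize(question))
    scores = [sum((q & Counter(tokenize(p))).values()) for p in passages]
    # Return passages sorted by descending overlap score.
    return [p for _, p in sorted(zip(scores, passages), key=lambda x: -x[0])]

def extract_answer_sentence(question, passage):
    """Pick the sentence sharing the most tokens with the question,
    standing in for a learned span-prediction head."""
    sentences = re.split(r"(?<=[.!?])\s+", passage)
    q = set(tokenize(question))
    return max(sentences, key=lambda s: len(q & set(tokenize(s))))

passages = [
    "The transformer architecture was introduced in 2017. It relies on self-attention.",
    "Convolutional networks dominate image classification benchmarks.",
]
question = "When was the transformer architecture introduced?"
best = rank_passages(question, passages)[0]
print(extract_answer_sentence(question, best))
# → The transformer architecture was introduced in 2017.
```

A production system would replace both heuristics with transformer components: a dense or sparse retriever for ranking, and an encoder with start/end classification layers for span extraction.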
Why It Matters
QA automation reduces human effort in customer support, compliance documentation, and knowledge management, whilst improving response latency and consistency. Organisations value this capability for scaling information access across large corpora without proportional staffing increases, particularly in regulated industries where audit trails and accuracy are critical.
Common Applications
Applications span customer support chatbots answering product queries, medical information systems responding to clinical questions, legal document analysis for contract review, and educational platforms supporting student enquiries. Search engines and virtual assistants increasingly integrate QA to provide direct answers rather than document links.

Key Considerations
Performance degrades when answers require multi-hop reasoning across documents or when reference material is absent or contradictory. Domain-specific vocabulary, question paraphrasing, and the distinction between extractive and abstractive answer generation significantly influence system selection and expected accuracy.
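Expected accuracy for extractive systems is typically reported with two metrics: exact match and token-level F1. The sketch below implements both with simplified normalisation rules (lowercasing, punctuation and article removal) in the spirit of the SQuAD evaluation convention; the exact normalisation steps are an assumption of this example.

```python
import re
from collections import Counter

def normalise(text):
    """Lowercase, strip punctuation, and drop English articles."""
    text = re.sub(r"[^a-z0-9\s]", "", text.lower())
    return [t for t in text.split() if t not in {"a", "an", "the"}]

def exact_match(prediction, reference):
    """True only if the normalised token sequences are identical."""
    return normalise(prediction) == normalise(reference)

def token_f1(prediction, reference):
    """Harmonic mean of token precision and recall between answers."""
    pred, ref = normalise(prediction), normalise(reference)
    common = sum((Counter(pred) & Counter(ref)).values())
    if common == 0:
        return 0.0
    precision = common / len(pred)
    recall = common / len(ref)
    return 2 * precision * recall / (precision + recall)

print(exact_match("the year 2017", "2017"))         # → False ("year" differs)
print(round(token_f1("the year 2017", "2017"), 2))  # → 0.67
```

Token F1 gives partial credit when a predicted span overlaps the reference, which matters for abstractive systems whose paraphrased answers rarely match a reference span exactly.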
More in Natural Language Processing
Instruction Following (Semantics & Representation): The capability of language models to accurately interpret and execute natural language instructions, a core skill developed through instruction tuning and alignment training.
Vector Database (Core NLP): A database optimised for storing and querying high-dimensional vector embeddings for similarity search.
Semantic Similarity (Semantics & Representation): A measure of how closely the meanings of two text passages align, computed through embedding comparison and used in duplicate detection, search, and recommendation systems.
Chunking Strategy (Core NLP): The method of dividing long documents into smaller segments for embedding and retrieval, balancing context preservation with optimal chunk sizes for vector search accuracy.
Code Generation (Semantics & Representation): The automated production of source code from natural language specifications or partial code context, powered by large language models trained on programming repositories.
Speech Recognition (Speech & Audio): The technology that converts spoken language into text, also known as automatic speech recognition.
GloVe (Semantics & Representation): Global Vectors for Word Representation, an unsupervised learning algorithm for obtaining word vector representations from aggregated word co-occurrence statistics.
Prompt Injection (Semantics & Representation): A security vulnerability where malicious inputs manipulate a language model into ignoring its instructions or producing unintended outputs.