
AI Bias

Overview

Direct Answer

AI bias refers to systematic disparities in model predictions or outputs that disadvantage particular groups or outcomes, stemming from non-representative training data, encoded human prejudices, or algorithmic design choices that amplify historical inequities. Such systematic error is distinct from random model noise and propagates through downstream decisions.

How It Works

Bias emerges when training datasets reflect historical imbalances—for example, loan approval systems trained on decades of discriminatory lending practices. Algorithms optimise to minimise loss across aggregate populations, inadvertently learning to replicate or magnify disparities present in source data. Feature selection, sampling strategies, and loss function design further influence which groups experience worse performance or harmful outcomes.
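A minimal sketch of this dynamic, using synthetic data and scikit-learn (the group sizes, feature relationships, and seed are illustrative assumptions, not drawn from any real system): a classifier fitted to minimise pooled loss learns the majority group's pattern and performs near chance on the underrepresented group.

```python
# Sketch: aggregate loss minimisation producing a per-group performance gap.
# Assumes synthetic data; requires numpy and scikit-learn.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# 90% majority group, 10% minority group.
n_major, n_minor = 9000, 1000

# Majority labels depend on the first feature.
X_major = rng.normal(size=(n_major, 2))
y_major = (X_major[:, 0] + 0.2 * rng.normal(size=n_major) > 0).astype(int)

# Minority labels depend on the *second* feature, which the pooled fit
# largely ignores because the group contributes little to aggregate loss.
X_minor = rng.normal(size=(n_minor, 2))
y_minor = (X_minor[:, 1] + 0.2 * rng.normal(size=n_minor) > 0).astype(int)

X = np.vstack([X_major, X_minor])
y = np.concatenate([y_major, y_minor])

# Fit on the pooled data: the objective is aggregate, not per-group, loss.
model = LogisticRegression().fit(X, y)

print("majority accuracy:", model.score(X_major, y_major))
print("minority accuracy:", model.score(X_minor, y_minor))
# Expect a large gap (roughly 0.95 vs near chance on most seeds).
```

Note that no group label appears anywhere in the features: the gap is an artefact of optimising an aggregate objective over an imbalanced sample, which is why disparities can arise even when protected attributes are explicitly excluded.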

Why It Matters

Organisations face regulatory exposure under anti-discrimination law, operational risk from public backlash, and accuracy degradation in underrepresented segments. Financial services, healthcare, recruitment, and criminal justice systems experience material harm when biased models deny loans, misdiagnose conditions, reject qualified candidates, or influence sentencing recommendations.

Common Applications

Facial recognition systems exhibit higher error rates on darker skin tones; hiring algorithms have screened out female candidates; medical risk scores underestimate disease burden in Black patients; credit scoring models perpetuate lending disparities across protected groups.

Key Considerations

Detecting and correcting bias requires multi-stage governance—auditing training data composition, validating performance across demographic segments, and accepting that mitigation often involves accuracy-fairness tradeoffs. No single metric captures bias comprehensively across all stakeholder perspectives.
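As one illustration of segment-level validation, the sketch below computes two widely used group-fairness metrics from a set of predictions (the labels, predictions, and group assignments are hypothetical; in practice these checks run over held-out evaluation data):

```python
# Sketch: auditing predictions with two group-fairness metrics.
# Assumes binary labels/predictions and a group attribute per example.
import numpy as np

def demographic_parity_diff(y_pred, group):
    """Gap in positive-prediction rates across groups."""
    rates = [y_pred[group == g].mean() for g in np.unique(group)]
    return max(rates) - min(rates)

def equal_opportunity_diff(y_true, y_pred, group):
    """Gap in true-positive rates (recall) across groups."""
    tprs = []
    for g in np.unique(group):
        mask = (group == g) & (y_true == 1)
        tprs.append(y_pred[mask].mean() if mask.any() else np.nan)
    return max(tprs) - min(tprs)

# Hypothetical evaluation data:
y_true = np.array([1, 0, 1, 1, 0, 1, 0, 0])
y_pred = np.array([1, 1, 0, 1, 0, 0, 1, 0])
group  = np.array(["a", "a", "a", "a", "b", "b", "b", "b"])

print("demographic parity gap:", demographic_parity_diff(y_pred, group))   # 0.5
print("equal opportunity gap:", equal_opportunity_diff(y_true, y_pred, group))  # ~0.667
```

Demographic parity compares how often each group receives a positive prediction; equal opportunity compares how often truly positive cases are correctly identified. A model can satisfy one while violating the other, which is one concrete reason no single metric captures bias comprehensively.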
