Overview
Direct Answer
A confusion matrix is a square table that summarises the performance of a classification model by displaying the frequency of true positives, true negatives, false positives, and false negatives against actual class labels. It provides the raw data foundation for calculating derived performance metrics such as precision, recall, and F1-score.
How It Works
The matrix organises predictions into four cells: correct predictions (true positives and true negatives along the diagonal) and incorrect predictions (false positives and false negatives off-diagonal). Each row represents an actual class while each column represents a predicted class, allowing direct visual inspection of where the model makes systematic errors or excels by class type.
Why It Matters
Classification metrics derived from the matrix—such as sensitivity and specificity—enable practitioners to assess whether a model's errors are acceptable for the specific business context. In domains like medical diagnosis or fraud detection, understanding the distribution of error types is critical for regulatory compliance and risk management, as false positives and false negatives carry different operational costs.
Common Applications
The matrix is fundamental in evaluating binary and multiclass classifiers across medical imaging, credit risk assessment, spam detection, and disease screening programmes. It enables comparison of model performance before deployment and supports threshold optimisation when decision boundaries must be adjusted for production constraints.
Key Considerations
The matrix assumes balanced class representation; with severe class imbalance, derived metrics can be misleading without careful interpretation. Practitioners must select appropriate evaluation metrics from the matrix's four values based on domain-specific consequences of different error types rather than relying solely on overall accuracy.
More in Artificial Intelligence
AI Democratisation
Infrastructure & OperationsThe movement to make AI tools, knowledge, and resources accessible to non-experts and organisations of all sizes.
Ontology
Foundations & TheoryA formal representation of knowledge as a set of concepts, categories, and relationships within a specific domain.
Constraint Satisfaction
Reasoning & PlanningA computational approach where problems are defined as a set of variables, domains, and constraints that must all be simultaneously satisfied.
Causal Inference
Training & InferenceThe process of determining cause-and-effect relationships from data, going beyond correlation to establish causation.
AI Model Registry
Infrastructure & OperationsA centralised repository for storing, versioning, and managing trained AI models across an organisation.
Cognitive Computing
Foundations & TheoryComputing systems that simulate human thought processes using self-learning algorithms, data mining, pattern recognition, and natural language processing.
AI Governance
Safety & GovernanceThe frameworks, policies, and regulations that guide the responsible development and deployment of AI technologies.
Knowledge Graph
Infrastructure & OperationsA structured representation of real-world entities and the relationships between them, used by AI for reasoning and inference.