1.3 Responsible AI principles

Key Takeaways

Responsible-AI frameworks worldwide converge on a recurring set of principles despite differing wording.
Core principles include fairness, transparency/explainability, accountability, safety/robustness, privacy/security, human oversight, and contestability.
Fairness has multiple, mathematically incompatible definitions, so calling a system 'fair' requires an explicit, context-specific choice.
Human oversight is operationalized on a spectrum: human-in-the-loop, human-on-the-loop, and human-in-command.
Principles routinely conflict (e.g., accuracy vs. explainability, transparency vs. privacy), so responsible AI requires documented, context-specific balancing.

Last updated: July 2026

Where responsible-AI principles come from

Over the past several years a striking consensus has emerged across dozens of published AI ethics frameworks — from the OECD AI Principles and the UNESCO Recommendation to the NIST AI RMF's "trustworthy AI" characteristics and countless corporate codes. Although the wording varies, the same core principles recur. The AIGP expects you to know these principles, what each means operationally (how it shows up as a real control or practice), and — crucially — the tensions between them, because responsible AI is rarely about a single virtue in isolation. Do not memorize any one organization's exact list; instead learn the underlying ideas, because different frameworks group and name them differently while pointing at the same handful of concerns. The principles below are the ones that appear most consistently across the major sources the exam draws on.

The core principles

Fairness and non-discrimination

AI should treat people equitably and avoid unjustified, harmful bias against individuals or groups — especially on protected characteristics such as race, sex, age, or disability. Operationally this means testing training data and outputs for disparate impact, choosing appropriate fairness metrics, and remediating skew. A key subtlety: there are multiple, mathematically incompatible definitions of fairness (e.g., equal false-positive rates versus equal predictive value), so "fair" requires an explicit, context-specific choice.

Transparency and explainability

People affected by AI should be able to know that AI is being used and to obtain a meaningful explanation of how it works or why a particular decision was made. Transparency is broad disclosure (documentation, notices, model cards); explainability is the narrower technical ability to describe the factors behind an output. Operational practices include disclosure notices, model documentation, and interpretability techniques. Deep learning's opacity makes this hard.

Accountability

Someone must be answerable for an AI system's outcomes; responsibility cannot evaporate into "the algorithm did it." Operationally, accountability means assigning clear ownership, keeping documentation and audit trails, conducting impact assessments, and enabling redress. It underpins governance structures (roles, sign-offs, records) and is the principle most directly tied to legal liability.

Safety and robustness

AI should perform reliably, remain secure against manipulation, and avoid causing physical, psychological, or economic harm — including under unexpected or adversarial conditions. Robustness means the system degrades gracefully and resists attacks such as data poisoning or adversarial examples. Operational practices include rigorous testing, red-teaming, monitoring for drift, and fallback mechanisms.

Privacy and data governance

Because AI is data-hungry, it must respect privacy and sound data governance: a lawful basis for data, data minimization, security, and controls over how personal data is used in training and inference. This principle links AI governance back to established privacy law (such as the GDPR) and to security controls protecting both the data and the model.

Human oversight and human-centered values

Humans should remain meaningfully in control of consequential AI, and systems should serve human autonomy and well-being. This is often operationalized along a spectrum — human-in-the-loop (a person approves each decision), human-on-the-loop (a person monitors and can intervene), and human-in-command (humans set overall bounds). The EU AI Act mandates human oversight for high-risk systems.

Contestability and redress

Individuals affected by an AI-driven decision should be able to challenge it and seek a remedy. Operationally this means providing an appeal or review channel, a human re-examination path, and correction of errors — closely tied to transparency (you cannot contest what you cannot understand) and accountability (someone must receive and act on the challenge).

A compact way to remember the set:

Principle	Operational question it answers
Fairness	Does it treat groups equitably?
Transparency / explainability	Can we disclose and explain it?
Accountability	Who is answerable, with what records?
Safety / robustness	Does it work reliably and resist attack?
Privacy / security	Is data used lawfully and protected?
Human oversight	Can a person intervene or stop it?
Contestability	Can an affected person challenge it?

The tensions between principles

The reason governance is hard — and the reason the AIGP tests judgment rather than recitation — is that these principles frequently pull against each other:

Accuracy vs. explainability. The highest-performing deep models are the least interpretable; demanding full explainability may force a simpler, less accurate model.
Fairness vs. accuracy (and fairness vs. fairness). Correcting disparate impact can lower overall accuracy, and satisfying one fairness metric can mathematically violate another.
Transparency vs. privacy and security. Disclosing how a model works, or publishing training data, can leak personal data or expose the system to gaming and attack.
Personalization vs. privacy. More useful, tailored AI usually requires collecting more personal data.
Innovation/speed vs. oversight. Heavy human-in-the-loop review adds friction and cost the business may resist.

There is no universal formula for resolving these trade-offs; responsible AI requires context-specific balancing, documented reasoning, and stakeholder input. On the exam, a scenario that seems to make one principle absolute usually has a "best" answer that acknowledges the trade-off and reaches a proportionate, well-documented decision. Recognizing that principles coexist in tension — and that governance's job is to balance them deliberately rather than maximize any single one — is the central insight of this domain.

Test Your Knowledge

A governance team notes that the most accurate model available is a deep neural network whose decisions are very hard to explain to affected individuals. This dilemma most directly illustrates the tension between:

Privacy and security

Fairness and accountability

Accuracy and explainability

Innovation and cost

Test Your Knowledge

Which practice best operationalizes the principle of human oversight for a high-risk AI system?

Publishing the full training dataset online for maximum transparency

Configuring a human-in-the-loop review so a person can intervene before consequential decisions take effect

Encrypting the model parameters at rest

Increasing the size of the training data to improve accuracy

Up Next

1.4 Stakeholders, roles & the AI lifecycle

Continue learning

IAPP Artificial Intelligence Governance Professional

1Overview & AI Governance Foundations

2Laws & Regulations Applying to AI

3Standards & Frameworks

4Governing AI Development

5Governing AI Deployment & Use

6Operationalizing AI Governance

AIGP