A service must be restored within 45 minutes and can lose no more than 10 minutes of data. Which statement is correct?

RTO is 45 minutes and RPO is 10 minutes. RTO is the target time to restore service; RPO is the maximum acceptable data loss, measured in time. Reversing the two values is the most common error on this topic.

Hosts on a subnet must keep a single default gateway IP even if the active router fails. Which technology provides this?

A First Hop Redundancy Protocol (FHRP) such as VRRP or HSRP. An FHRP presents a shared virtual gateway IP backed by two or more routers, so clients keep using the same default gateway when the active router fails and a standby takes over.

Which exercise validates decisions and communication without necessarily moving production services to standby resources?

A tabletop test. A tabletop test gathers stakeholders to walk through a scenario, roles, and decisions on paper. It validates procedures and communication but does not prove the technical failover, which requires a failover or full-interruption test.

DR and HA Concepts: RPO, RTO, MTTR, MTBF, Sites, Failover, and Testing

Disaster Recovery, High Availability, and Testing

Network+ objective 3.3 ties business requirements to resilient design. The right architecture depends on how much downtime, data loss, cost, and complexity the organization can tolerate. The exam tests four buckets: the recovery metrics, recovery sites, HA designs, and testing types.

Recovery and reliability terms

Term	Meaning	Example
RPO	Recovery Point Objective — max acceptable data loss, measured in time	Lose no more than 15 minutes of transactions
RTO	Recovery Time Objective — target time to restore service	Restore VPN within 1 hour
MTTR	Mean Time To Repair/Recover — average restore time	45 minutes to swap a failed switch
MTBF	Mean Time Between Failures — expected reliability interval	A power supply rated 100,000 hours

The single most-tested distinction: RPO is about data (how far back in time the loss may reach), RTO is about downtime (how long the service may stay down). A nightly backup can lose up to 24 hours of data, so it only satisfies an RPO of 24 hours or longer; if the requirement is "lose no more than 5 minutes," nightly backup fails and you need frequent replication. RPO and RTO are requirements set by the business; MTTR and MTBF are measurements of how the equipment and the team actually perform, used to judge whether those requirements can be supported.
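The two kinds of numbers can be put side by side in a short sketch. It assumes that the worst-case data loss of a periodic backup equals the interval between backups, and it uses the commonly quoted steady-state availability formula MTBF / (MTBF + MTTR), which this guide does not itself state. The function names are illustrative, and the sample figures are borrowed from the table above purely as inputs.

```python
def worst_case_data_loss_minutes(backup_interval_minutes: float) -> float:
    """Assumption: a failure just before the next backup loses one full interval."""
    return backup_interval_minutes


def meets_rpo(backup_interval_minutes: float, rpo_minutes: float) -> bool:
    """True when the worst-case loss stays within the RPO."""
    return worst_case_data_loss_minutes(backup_interval_minutes) <= rpo_minutes


def availability(mtbf_hours: float, mttr_hours: float) -> float:
    """Steady-state availability: fraction of time up = MTBF / (MTBF + MTTR)."""
    return mtbf_hours / (mtbf_hours + mttr_hours)


if __name__ == "__main__":
    nightly = 24 * 60
    print(meets_rpo(nightly, rpo_minutes=5))   # False: nightly backup vs 5-minute RPO
    print(meets_rpo(1, rpo_minutes=5))         # True: a copy every minute
    # MTBF of 100,000 hours and MTTR of 45 minutes (0.75 hours), as sample inputs.
    print(availability(100_000, 0.75))
```

The first two functions work on business requirements (RPO), while the third works on measurements (MTBF, MTTR), which mirrors the split between requirements and measurements described above.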

Recovery sites

Site type	Readiness	Cost	Recovery speed
Cold	Space and power, little or no equipment	Lowest	Slowest (days)
Warm	Some hardware and connectivity, partial data	Medium	Moderate (hours)
Hot	Fully equipped, current data, rapid failover	Highest	Fastest (minutes)

A cold site fits a low-priority function with a long RTO; a hot site is required when both RTO and RPO are short. A cloud / DRaaS option can blur these lines by spinning up capacity on demand.

High availability designs

Design	Description	Note
Redundant component	Dual power supply, link, switch, firewall, ISP	Removes a single point of failure
Load balancing	Spreads requests across nodes	Scale and availability
Clustering	Multiple nodes act as one service	Supports failover
Active-active	All nodes carry production traffic	Efficient but complex
Active-passive	Standby waits for failover	Idle capacity, simpler planning
FHRP (VRRP / HSRP)	Redundant default gateway	Hosts keep one virtual gateway IP
Geographic redundancy	Service runs/recovers in another region	Survives site loss

HA reduces downtime from component failures; DR addresses larger disruptions such as site loss, mass corruption, or disaster. First Hop Redundancy Protocols — VRRP (open standard) and HSRP (Cisco) — present one virtual gateway IP so clients keep working when the active router fails.
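The failover behavior can be illustrated with a toy model. It assumes the election rule that VRRP and HSRP use in their common configurations: among the reachable routers, the one with the highest priority owns the virtual IP, and the higher interface IP breaks a tie. The sketch models no timers, advertisements, or preemption settings, and the router names and addresses are made up.

```python
from dataclasses import dataclass
from ipaddress import IPv4Address
from typing import List, Optional


@dataclass
class Router:
    name: str
    interface_ip: IPv4Address
    priority: int
    alive: bool = True


def elect_active(routers: List[Router]) -> Optional[Router]:
    """Pick the live router with the highest priority; higher IP breaks ties."""
    candidates = [r for r in routers if r.alive]
    if not candidates:
        return None
    return max(candidates, key=lambda r: (r.priority, r.interface_ip))


# Hypothetical virtual gateway IP that every host uses as its default gateway.
VIRTUAL_GATEWAY = IPv4Address("192.0.2.1")

r1 = Router("r1", IPv4Address("192.0.2.2"), priority=110)
r2 = Router("r2", IPv4Address("192.0.2.3"), priority=100)
group = [r1, r2]

before = elect_active(group)   # r1 has the higher priority
r1.alive = False               # the active router fails
after = elect_active(group)    # r2 takes over the same virtual IP

print(VIRTUAL_GATEWAY, "served by", before.name, "then", after.name)
```

The hosts' configuration never changes in this model: only the router answering for the virtual IP does.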

Failover and split-brain

Failover moves service from a failed component to a standby or peer and must define health checks, triggers, state synchronization, routing/DNS changes, and failback. Cluster designs must prevent split-brain, where two nodes both believe they are primary and accept conflicting writes — usually solved with a quorum or witness.
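A majority quorum can be sketched in a few lines. This assumes that every voter (node or witness) has exactly one vote and that a partition may act as primary only when it holds a strict majority of all voters. Real cluster managers add fencing and more elaborate voting schemes, which this sketch ignores.

```python
def has_quorum(votes_in_partition: int, total_voters: int) -> bool:
    """A partition may stay primary only with a strict majority of all voters."""
    return votes_in_partition > total_voters // 2


# Two data nodes plus one witness give three voters in total.
TOTAL_VOTERS = 3

# A network partition isolates node A; node B can still reach the witness.
partition_a = 1   # node A alone
partition_b = 2   # node B plus the witness

print(has_quorum(partition_a, TOTAL_VOTERS))  # False: A must stop accepting writes
print(has_quorum(partition_b, TOTAL_VOTERS))  # True: B continues as primary
```

Without the witness there would be two voters, and neither isolated node could reach a strict majority. That avoids conflicting writes but also stops the service, which is why an odd number of voters is the usual choice.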

Testing methods

Test	What happens
Tabletop	Stakeholders discuss a scenario and decisions
Walk-through	Team reviews procedures step by step
Simulation	Realistic inputs without full production impact
Failover test	Service is actively moved to standby
Restore test	Data/config is restored and validated
Full interruption	Production is intentionally stopped under control

A tabletop validates decisions and communication, not the technical failover — only a failover or full-interruption test proves the systems actually cut over. Testing must produce evidence: timings, gaps, failed assumptions, and updated runbooks. An untested plan is only a theory.
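The evidence from a failover or restore test can be checked against the targets mechanically. The sketch below is one possible way to do that. The record fields, the sample measurements, and the pass rule (measured values must not exceed the targets) are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DrillResult:
    name: str
    recovery_minutes: float    # measured time until the service was usable again
    data_loss_minutes: float   # measured gap between the last recovered data and the failure


def findings(result: DrillResult, rto_minutes: float, rpo_minutes: float) -> List[str]:
    """List every target the drill missed; an empty list means both were met."""
    missed = []
    if result.recovery_minutes > rto_minutes:
        missed.append(
            f"{result.name}: recovered in {result.recovery_minutes} min, RTO is {rto_minutes} min"
        )
    if result.data_loss_minutes > rpo_minutes:
        missed.append(
            f"{result.name}: lost {result.data_loss_minutes} min of data, RPO is {rpo_minutes} min"
        )
    return missed


# Hypothetical measurements from a failover test.
drill = DrillResult("failover test", recovery_minutes=42, data_loss_minutes=3)
for line in findings(drill, rto_minutes=30, rpo_minutes=5):
    print(line)
```

Each finding is the kind of gap that should feed back into the runbook before the next test.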

Practical scenario

An ordering system must recover within 30 minutes (RTO) and lose no more than 5 minutes of orders (RPO). That points to a hot site, or a warm site kept ready enough to cut over within 30 minutes, with frequent replication, FHRP or clustered gateways, tested failover, and current documentation. A nightly offline backup can lose up to 24 hours of orders, so it cannot meet the 5-minute RPO and would fail an audit against that requirement.

Why a hot site still needs backups

A frequent exam misconception is that real-time replication to a hot site removes the need for backups. It does not, because replication faithfully copies everything, including malicious encryption by ransomware, an accidental table drop, or a corrupted file. Within seconds the same damage exists at both sites, leaving no clean copy to restore from. Backups protect against a different threat: they provide point-in-time recovery to a moment before the corruption, and they satisfy the RPO only if they are taken at least as often as the RPO allows.

The resilient design therefore layers both — replication for fast failover against hardware and site loss, and versioned, ideally immutable or offline, backups against logical corruption and deletion.
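The difference between the two layers can be shown with a toy model in which a replica mirrors every change immediately while backups are kept as separate point-in-time copies. The data structures are simplified stand-ins and do not model any particular replication or backup product.

```python
import copy

primary = {"orders": ["A100", "A101"]}
replica = {}
snapshots = []  # (label, independent copy of the data) kept as separate versions


def replicate() -> None:
    """Mirror the primary's current state to the replica, good or bad."""
    replica.clear()
    replica.update(copy.deepcopy(primary))


def take_snapshot(label: str) -> None:
    """Store an independent point-in-time copy that later writes cannot touch."""
    snapshots.append((label, copy.deepcopy(primary)))


replicate()
take_snapshot("before-incident")

# Logical corruption: ransomware or an accidental drop overwrites the data.
primary["orders"] = ["<encrypted>"]
replicate()  # replication carries the damage to the second site

print(replica["orders"])           # the replica now holds the damaged data too

label, clean = snapshots[-1]
primary = copy.deepcopy(clean)     # restore from the copy taken before the incident
print(label, primary["orders"])
```

The replica protects against losing the primary site, while the snapshot is the only copy that still holds the data as it was before the corruption.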

Translating requirements into design

The exam often hands you an RTO and RPO and asks for the matching architecture. A long RTO and long RPO can be met cheaply with a cold site and nightly backups. A short RTO with a short RPO forces near-continuous replication, clustered or load-balanced services, FHRP gateway redundancy, and a warm or hot site with tested failover. The discipline is to let the business numbers drive the spend rather than over-engineering every service to active-active, which wastes money on functions that could tolerate hours of downtime.
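One way to express this mapping is a small lookup from the two numbers to a design tier. The thresholds below are assumptions chosen to echo the rough "days, hours, minutes" scale in the recovery sites table, and the function name is hypothetical. A real organization would set its own cutoffs.

```python
from typing import Tuple

HOUR = 60
DAY = 24 * HOUR


def recommend_design(rto_minutes: float, rpo_minutes: float) -> Tuple[str, str]:
    """Return (site tier, data protection), picking the cheapest tier that fits."""
    # Assumed cutoffs: a day or more fits a cold site, an hour or more a warm site.
    if rto_minutes >= DAY:
        site = "cold"
    elif rto_minutes >= HOUR:
        site = "warm"
    else:
        site = "hot"

    # Assumed cutoff: an RPO of a day or more can be met with nightly backups.
    if rpo_minutes >= DAY:
        data = "nightly backups"
    else:
        data = "replication at least every %g minutes, plus backups" % rpo_minutes
    return site, data


print(recommend_design(3 * DAY, DAY))  # a low-priority function
print(recommend_design(30, 5))         # the ordering system from the scenario
```

Because the function starts from the requirements and returns the cheapest tier that meets them, it never recommends more than the business numbers call for.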

Common exam traps

RTO is time to restore; RPO is acceptable data loss. Do not swap them.
Active-active is not always best — it adds cost and complexity; requirements drive design.
A hot site does not eliminate backups; replication can copy corruption or a deletion.
A tabletop test does not prove technical failover; only a failover/full-interruption test does.
MTTR and MTBF are measurements, not requirements; RPO and RTO are the requirements the business sets.
