Direct, Indirect, Product, Continuous, and Discontinuous Measurement
Key Takeaways
- Direct measurement records behavior as it occurs; indirect relies on reports and yields hypotheses, not precise data.
- Permanent-product measurement is valid only when the durable outcome is exclusively linked to the target response.
- Continuous measurement captures every response; discontinuous systems sample and carry known biases.
- Partial-interval overestimates (good for behaviors to decrease); whole-interval underestimates (good for behaviors to increase).
- Momentary time sampling is efficient for ongoing states but misses brief, high-rate responses.
Direct vs. Indirect, and the Permanent-Product Shortcut
Measurement methods differ in how close they sit to the behavior itself, and that distance determines how confidently a BCBA can interpret the data. Direct measurement observes and records behavior as it actually occurs, in real time, by a trained observer. Indirect measurement relies on after-the-fact reports: interviews, rating scales, checklists, or questionnaires completed by people who were present (or by the client). Indirect tools are efficient for screening and history but are filtered through memory, interpretation, and bias, so they yield hypotheses rather than precise dependent-variable data.
Product (permanent-product) measurement records the durable outcome a behavior leaves behind: completed math problems, dishes washed, words written, items broken. Product measurement is technically a form of direct measurement of a product and is appealing because the observer need not be present during the behavior. Its central caution is attribution: the product proves an outcome exists, not who produced it or how. If a parent helps with homework, the worksheet overstates the child's independent responding. Product data are valid only when the product is reliably and exclusively linked to the target response.
Continuous vs. Discontinuous Measurement
Continuous measurement attempts to capture every instance of the target response during the observation period; discontinuous measurement samples behavior, recording only during or at sampled moments. Continuous direct measures (event recording, timing) are the gold standard for precise decisions but cost observer attention. Discontinuous systems trade some accuracy for feasibility in busy, multi-client, or all-day settings.
The three interval-based discontinuous systems each carry a predictable directional bias that the exam tests relentlessly:
| System | What you score | Bias | Best for |
|---|---|---|---|
| Partial-interval recording (PIR) | Behavior at any point in the interval | Overestimates occurrence/duration | Behaviors to decrease |
| Whole-interval recording (WIR) | Behavior for the entire interval | Underestimates occurrence/duration | Behaviors to increase (e.g., on-task) |
| Momentary time sampling (MTS) | Behavior at the moment the interval ends | Can over- or under-estimate; less biased with continuous/long-duration behavior | Ongoing states across long periods |
A useful exam heuristic: partial-interval inflates, whole-interval deflates. Because PIR overestimates, scoring a behavior you want to reduce with PIR is conservative (it will not hide success); because WIR underestimates, it is the cautious choice for a behavior you want to increase (it will not overstate success). MTS is the most efficient for staff and least biased for behaviors that occur in long bouts, but it can miss brief, high-rate responses entirely.
Choosing a Method Under Real Constraints
Method selection is a validity decision constrained by feasibility. Pick the least biased method the setting can implement accurately, not the easiest by default.
- Use direct continuous measurement when the behavior is discrete, the decision is high-stakes, and an observer can attend.
- Use product measurement when the outcome is durable and exclusively tied to the response.
- Use indirect measurement for screening, history, and generating hypotheses, never as the sole basis for treatment-change decisions.
- Use discontinuous sampling when continuous recording is impractical, but choose the interval system whose bias is conservative for your goal and keep intervals short enough to represent the behavior.
The defensible answer on Domain C items usually pairs a practical constraint with a data-quality safeguard: it preserves the link between the measure and the clinical decision, then reduces observer effort (shorter intervals, simpler definitions, IOA checks) enough that staff can run it correctly. Precision still matters even when you sample; you sample because you must, not because accuracy is unimportant.
Why Interval Bias Happens, and How Interval Size Matters
Understanding why each interval system is biased prevents memorization errors under exam pressure. Partial-interval recording (PIR) scores the interval as an occurrence if the behavior happens at all, even for one second of a ten-second interval. Because a single brief response "claims" the entire interval, PIR systematically reports the behavior as occurring across more time than it truly did, hence overestimation of duration and occurrence.
The flip side is whole-interval recording (WIR): the behavior must persist for the entire interval to be scored, so any lapse, however brief, zeroes the interval. WIR therefore underestimates, because real but interrupted behavior fails to meet the all-or-nothing criterion.
Interval size modulates the bias. Longer intervals make PIR overestimate more severely (one short response captures a larger block of time) and make WIR underestimate more severely (it is harder to sustain behavior across a long interval). Shorter intervals shrink both biases and bring sampled estimates closer to continuous values, which is why best practice keeps intervals short enough to represent the behavior, often 10 to 30 seconds.
Momentary time sampling (MTS) is least biased for behaviors that occur in long, steady bouts, because a momentary check is likely to land inside a long episode; it is most error-prone for brief, high-rate behaviors that can occur entirely between sampling moments.
A practical exam frame ties bias direction to the goal direction:
- Reducing a behavior? PIR's overestimation is conservative, it will not falsely show success, so it is a defensible choice.
- Increasing a behavior (e.g., on-task)? WIR's underestimation is conservative, it will not overstate progress, so it is the cautious choice.
- Tracking an ongoing state across a long period with limited staff? MTS is the efficient, reasonably accurate option.
The wrong-answer logic to watch for: an option that pairs a system with the non-conservative direction (e.g., using PIR to document reduction of a behavior you must prove you reduced is fine, but using WIR to claim you reduced a behavior could hide remaining occurrences). Reasoning from bias direction, not memorized labels, keeps these items reliable even when the scenario is unfamiliar.
A BCBA wants to track a third-grader's independent multiplication accuracy at home but cannot observe during homework. The parent reports the child "does great." Which measurement approach is most defensible, and what is its key caution?
A team must reduce a student's hand-flapping and wants a sampling system that will NOT mask whether the behavior is still occurring. Which discontinuous method is the most conservative choice and why?
Which statement about momentary time sampling (MTS) is accurate?
A behavior analyst is choosing between continuous event recording and an interval system for a low-frequency but severe self-injurious behavior in a 1:1 setting. Which choice is most defensible?