Occurrence, Rate, Duration, Latency, IRT, Trials to Criterion, and Efficiency
Key Takeaways
- Count is a raw tally; rate adds time and is preferred when observation periods differ in length.
- Latency is time from an antecedent (SD) to response onset; IRT is time between consecutive responses.
- Magnitude measures force/intensity and usually needs a calibrated instrument or defined product.
- Percentage and trials to criterion are derivative; percentage ignores the number of opportunities.
- Efficiency weighs outcomes against time, effort, prompts, errors, and resources, not speed alone.
Count, Frequency, and Rate
The dimensional quantities of behavior are the measurable properties a response can have, and Domain C expects fluent matching between each dimension and the decision question it answers. Count is the simple tally of responses (e.g., 14 hits). Frequency in ABA is often used synonymously with count, though some texts reserve it for count per unit time. Rate explicitly divides count by observation time, yielding responses per minute or per hour (e.g., 14 hits / 20 min = 0.7 per min).
Rate is preferred whenever observation periods differ in length, because a raw count of 14 in a 10-minute session and 14 in a 60-minute session represent very different performance. Converting both to rate (1.4/min vs. 0.23/min) makes them comparable across days. The exam routinely embeds unequal session lengths specifically to test whether you convert count to rate before comparing.
Time-Based Dimensions: Duration, Latency, IRT, and Magnitude
Four additional dimensions describe the temporal and intensive properties of behavior:
| Dimension | Measures | Use when the question is | Example |
|---|---|---|---|
| Duration | Total time a response lasts | How long does behavior persist? | Tantrum lasts 6 min |
| Latency | Time from SD/cue to response onset | How quickly does behavior begin? | 22 s from "begin" to first word written |
| IRT | Time between the end of one response and the start of the next | How are responses spaced/paced? | 45 s between successive SIB |
| Magnitude | Force, intensity, or severity | How strong is the response? | dB level of a scream; force on a sensor |
Latency and IRT are easily confused. Latency is anchored to an antecedent event (the SD or instruction) and measures the delay before the first response. IRT is anchored to the previous response and measures the gap between consecutive responses. There is also a reciprocal relationship between rate and IRT: shorter average IRTs correspond to higher rates, and lengthening IRTs (e.g., via DRL) lowers rate. Magnitude often requires a calibrated instrument or a defined product because "intensity" is otherwise subjective.
Derivative Measures and Efficiency
Some measures are derived from the raw dimensions and answer comparative or acquisition questions:
- Percentage = (count of one event / total opportunities) x 100. It is dimensionless and useful for accuracy (percent correct) or proportion of intervals. Its weakness: percentage ignores the number of opportunities; 4/5 (80%) and 80/100 (80%) look identical but differ enough in stability to mislead. Avoid percentage when opportunities are few or variable.
- Trials to criterion counts the number of teaching trials or opportunities needed to reach a mastery standard, indexing acquisition efficiency across targets or procedures.
- Celeration describes the change in rate across time (rate of rate change) and is plotted on the Standard Celeration Chart; an x2 celeration means the rate doubles per unit time.
Efficiency is not merely speed. It asks whether socially meaningful behavior change is achieved with acceptable time, effort, errors, prompts, materials, and staff capacity. For acquisition programs, trials to criterion and rate of independent correct responding usually matter more than percent correct on a single day. For reduction programs, choose rate, duration, latency, or IRT to match the socially significant concern. On the exam, the answer often turns on the decision word: "how quickly" -> latency; "how long" -> duration; "how often" with unequal sessions -> rate; "how many opportunities to mastery" -> trials to criterion.
Fluency, Topography, Locus, and the Full Dimensional Picture
Beyond the headline dimensions, Cooper, Heron, and Heward describe behavior as having repeatability, temporal extent, and temporal locus, and these categories organize the dimensions cleanly. Repeatable properties (count, rate, celeration) describe how often a behavior recurs. Temporal extent (duration) describes how long a single instance lasts. Temporal locus (latency and IRT) describes when responses occur relative to other events.
Two further descriptive properties, topography (the physical form or shape of the response) and locus (where in the environment behavior occurs), are not quantitative dimensions but matter for definitions and for selecting what to measure.
Fluency, the combination of accuracy and speed, deserves special exam attention. A learner who answers math facts correctly but slowly is not fluent; fluency targets a rate of accurate responding (e.g., 40 correct digits per minute) rather than mere percent correct. This is why fluency-based instruction (precision teaching) plots count per minute rather than percentage: rate captures both whether and how quickly the learner responds, predicting retention, endurance, and application better than accuracy alone.
A frequent exam decision is choosing between rate and percentage for an acquisition program. Percentage answers "how accurate," but it caps at 100% and cannot show continued growth in fluency once accuracy is high; rate keeps rising and reveals whether the skill is becoming automatic. Consider these worked contrasts:
- A student is 100% accurate on letter sounds but takes 4 seconds each. Percentage says mastered; rate/fluency reveals the skill is not yet usable. Measure rate.
- A staff member wants to compare disruptions across a 12-minute and a 40-minute session. Count misleads; rate (per minute) makes them comparable.
- A clinician must show that responses are spacing out under a DRL. Rate trends down, but IRT most directly shows the lengthening gaps the procedure targets.
The through-line is the same one Domain C tests everywhere: pick the dimension whose mathematical behavior matches the clinical question, then defend why the convenient-but-wrong dimension would mislead the decision.
On Monday a client engaged in 30 disruptions during a 15-minute session; on Tuesday, 30 disruptions during a 60-minute session. What is the most accurate interpretation?
A BCBA wants to know how long it takes a student to begin cleaning up after the teacher says "time to clean up." Which dimension fits, and how does it differ from IRT?
Why might reporting only "80% correct" be misleading for a skill-acquisition target on a given day?
A DRL procedure is in place to lower a high-rate behavior by reinforcing longer gaps between responses. Which measure most directly captures the targeted change?