Momentum, Matching Law, Imitation, Observational Learning, and Emergent Relations
Key Takeaways
- Behavioral momentum describes resistance to change (persistence under disruption) as a function of the reinforcement density in a context - it is about durability, not just high rate.
- The matching law predicts that relative response allocation across concurrent alternatives matches the relative rate, magnitude, immediacy, and quality of reinforcement for those alternatives.
- Imitation requires a model, formal similarity between model and response, and an immediate temporal relation; the model is the controlling antecedent (a generalized imitative repertoire).
- Observational learning is broader than imitation: a learner changes behavior after watching another's behavior AND its consequences (vicarious reinforcement/punishment).
- Emergent (derived) relations appear without direct training; stimulus equivalence is demonstrated by reflexivity, symmetry, and transitivity, yielding untrained relations like A=C after training A=B and B=C.
Behavioral Momentum
Behavioral momentum refers to the persistence (resistance to change) of behavior when conditions disrupt responding. Borrowing from physics, behavior that has been reinforced in a dense (rich) reinforcement context has more 'mass' and is harder to slow down. The key idea is resistance to disruption, not simply how fast the behavior occurs at baseline.
The disruptors the exam looks for include extinction, distraction, satiation/pre-session access, increased response effort, and schedule thinning. If behavior persists more in one stimulus context than another because that context had a richer reinforcement history, momentum is the right concept.
Momentum has a practical clinical application: the high-probability (high-p) request sequence ('behavioral momentum' technique). Delivering several easy, likely-to-be-completed requests that earn reinforcement builds momentum, so the learner is more likely to comply with a following low-probability request. Note the two senses are related but distinct: the theory is about resistance to change; the high-p procedure is the applied tactic derived from it.
The Matching Law
The matching law (Herrnstein) describes how organisms allocate behavior across concurrently available alternatives in proportion to the relative reinforcement those alternatives provide. Behavior is distributed to match not just reinforcement rate but also magnitude, immediacy (delay), and quality.
Applied example: a learner who can request help from Staff A or Staff B will allocate more requests to whichever staff reinforces more frequently, faster, or with better-quality reinforcement. If problem behavior produces attention on a richer schedule than appropriate behavior does, the learner allocates more behavior to the problem response — which is why DRA arranges a richer, faster, higher-quality schedule for the appropriate alternative.
Matching is not always perfect. Departures include:
- Bias — consistent preference for one alternative beyond what reinforcement ratios predict (due to effort, response preference, etc.).
- Undermatching — less sensitivity to the reinforcement ratio than predicted (allocation closer to indifferent).
- Overmatching — greater sensitivity than predicted.
The principle says relative behavior tends toward relative reinforcement; effort, delay, and magnitude differences shape how closely the match holds.
Imitation and Observational Learning
Imitation occurs when a model (the controlling antecedent) evokes a response that has formal similarity to the model and follows it with a close temporal relation (immediately or very soon). A true imitative response is novel-controlled by the model: copying a teacher's just-demonstrated clap is imitation. A generalized imitative repertoire means the learner imitates new models without each one being separately trained.
Three defining features for the exam:
- A model is present and serves as the antecedent.
- The response has formal similarity to (looks like) the model.
- The response follows the model immediately/with low latency.
Observational learning is broader. The learner acquires new behavior or changes future responding after watching another person's behavior AND the consequences that behavior produced — this is vicarious reinforcement or vicarious punishment. If a peer is praised for putting materials away and the observer later does so without direct prompting, observational learning is involved. Imitation copies the behavior; observational learning is sensitive to the observed consequence, so a learner may NOT copy a behavior that was observed to be punished.
Emergent and Derived Relations
Emergent (derived) relations are relations that appear without direct training after other relations are taught. The flagship example is stimulus equivalence, demonstrated through three tests:
- Reflexivity — matching a stimulus to itself without training (A = A; generalized identity matching).
- Symmetry — if trained A → B, the untrained reverse B → A emerges.
- Transitivity — if trained A → B and B → C, the untrained A → C emerges.
When symmetry and transitivity combine, the learner derives C → A (the equivalence relation). This is why teaching a few relations (spoken word → picture, picture → written word) can yield many untrained relations, vastly improving instructional efficiency.
| Concept | Defining cue | Distractor to avoid |
|---|---|---|
| Momentum | Persistence under disruption (extinction, distraction, effort) | Confusing it with high baseline rate |
| Matching law | Allocation across alternatives by relative reinforcement | Assuming a perfect 1:1 match |
| Imitation | Immediate response with formal similarity to a present model | Calling it imitation when the response differs in form |
| Observational learning | Behavior change after seeing another's behavior AND its consequence | Ignoring the vicarious consequence |
| Emergent relations | Untrained relations after trained ones (symmetry, transitivity) | Crediting direct training for a derived relation |
Telling These Five Apart Under Exam Pressure
Because these principles share surface features (they all involve behavior changing), the fastest path to the right answer is to ask what is being manipulated or measured in the stem.
- If the stem disrupts an established behavior (introduces extinction, distraction, satiation, or higher effort) and asks why it persists, think behavioral momentum / resistance to change.
- If the stem offers two or more concurrent options and asks where behavior goes, think matching law — and check whether reinforcement differs in rate, immediacy, magnitude, or quality.
- If a model is present and the response looks like the model and follows it immediately, think imitation.
- If the learner changed after watching another person experience a consequence (and may NOT copy a punished behavior), think observational learning / vicarious consequences.
- If a new, untrained relation appears after teaching other relations, think emergent / derived relations (symmetry, transitivity, equivalence).
Clinical payoff worth remembering:
- Matching law explains why problem behavior wins. If problem behavior earns reinforcement on a richer, faster schedule than appropriate behavior, allocation favors the problem response. DRA works by reversing that ratio — making the appropriate alternative the richer, quicker, higher-quality option.
- Emergent relations make instruction efficient. Teaching a handful of relations (spoken word → picture, picture → text) can yield many untrained relations, so a strong plan deliberately arranges conditions for equivalence rather than teaching every relation by brute force.
- Observational learning extends teaching beyond direct trials — peers modeling reinforced behavior can establish responding in observers, which is the basis of many group and peer-mediated interventions.
A teacher delivers three quick, easy requests the student almost always completes (each earning praise), then immediately delivers a harder request the student usually refuses. The student now completes the hard request. This tactic is derived from which principle?
A learner can request a break from either of two aides. Aide A grants breaks quickly and reliably; Aide B grants them slowly and inconsistently. The learner directs far more break requests to Aide A. This allocation is BEST explained by:
A child is trained that the spoken word 'cat' goes with a picture of a cat (A-B), and that the picture of a cat goes with the printed word CAT (B-C). Without any direct training, the child then selects the printed word CAT when hearing the spoken word 'cat' (A-C). This emergent A-C relation demonstrates:
A student watches a classmate get reprimanded by the teacher for blurting out answers. Afterward, the observing student raises his hand instead of blurting. The change in the observer's behavior is BEST described as: