6.5 Avoiding Overfitting and False Patterns
Key Takeaways
- Overfitting happens when a candidate builds a rule around incidental details from too few examples.
- False patterns often come from visual salience, English assumptions, or answer-choice attraction.
- The best rule is usually the simplest one that explains all examples and transfers cleanly.
- Error review should identify whether the miss came from overfitting, underchecking, or feature confusion.
The danger of a rule that is too personal
Overfitting means building a rule that explains one example but fails on the wider pattern. In visual-symbolic practice, it often happens when a candidate latches onto a vivid detail. If the first image is a red triangle above a box, you might decide the label means red, triangle, above, or box. Only comparison tells you which rule is real.
A false pattern can feel convincing because the human brain is good at finding order. That strength becomes a weakness when evidence is thin. If one label appears with one image, many rules are possible. If the same label appears with different objects but the same relation, relation becomes stronger. If a new example breaks the rule, revise it quickly.
A simple rule is not always correct, but it is a good starting point. Prefer the rule with fewer unsupported assumptions. If "ka" appears before labels for three cups, three keys, and three stars, "ka = plural" is simpler than "ka = three objects that are portable and shown on the left." The second rule may fit the examples, but it adds details not required by the evidence.
Overfitting signals
| Signal | What it suggests | Better response |
|---|---|---|
| Rule uses many visual details | Too specific | Strip to the changing feature |
| Rule fits only one example | Weak evidence | Find another contrast |
| Rule ignores a contradiction | Confirmation bias | Revise the rule |
| Rule depends on English wording | Outside assumption | Follow the item system |
| Rule cannot handle new item | Poor transfer | Rebuild from examples |
Answer choices can encourage overfitting. A choice may include the symbol from the first example and look familiar. Familiarity is not proof. Ask whether that symbol tracks the tested feature across the set. If not, it is a distractor.
Underfitting is the opposite problem: using a rule that is too broad. If you decide "pa means position" but the item distinguishes above from below, your rule is not specific enough. Good reasoning finds the middle: specific enough to choose among answers, broad enough to transfer across irrelevant changes.
For DLAB preparation, this matters because the test is aptitude-focused. Public facts describe a standardized government test used for language-learning potential, not a test of existing foreign-language knowledge. Aptitude items often reward adapting to a new system. Overfitting blocks adaptation because it turns the first clue into a rigid story.
Build an error log with three columns: my rule, contradiction, corrected rule. For example: "I thought pa = circle; contradiction: pa also appeared with square; corrected rule: pa = above." This is more useful than writing only the right answer. It trains revision, which is central to fresh-rule reasoning.
Ethical practice also matters. Because official public detail is limited, use original drills and do not seek supposed real DLAB items. You can train overfitting control with any invented symbol system. The learning target is the reasoning process, not memorized protected content.
What is overfitting in visual-symbolic reasoning?
Practice-style: You thought "pa" meant circle, but a square above a box is also labeled with "pa." What correction is most reasonable?
Which error-log entry is most useful?