Chapter 4
Cognitive Load & Automaticity
Your brain's conscious workspace is shockingly small: roughly four items at a time. That is not a metaphor; it is a measured, replicated constraint of human cognition. Every complex skill you have ever learned began by overwhelming that tiny workspace, and only became manageable once the basics dropped below the threshold of conscious effort.
The Working Memory Bottleneck
Cognitive psychologist Nelson Cowan spent years refining our understanding of working memory, the mental scratchpad where you hold and manipulate information in real time. His 2010 review synthesized decades of research and converged on a clear number: most adults can actively maintain about four independent items in working memory at once. Not seven, as an earlier estimate suggested. Four.
This is why learning something new often feels overwhelming. When you encounter an unfamiliar concept, every component of that concept occupies a slot. If understanding a single idea requires you to simultaneously hold five or six pieces in mind (a definition, a formula, the meaning of each variable, and how they relate) you have already exceeded capacity. The result is confusion, not because the material is inherently difficult, but because your workspace ran out of room.
~4
items in working memory at once
capacity of long-term memory
The contrast with long-term memory is striking. While working memory is narrow and fleeting, long-term memory appears to have no practical upper bound. Nobody has ever been shown to "run out" of long-term storage. The entire challenge of learning, then, is not a storage problem. It is a transfer problem: how do you move knowledge from the four-slot bottleneck into the vast warehouse behind it?
While working memory is severely limited in capacity and duration, long-term memory can be considered to hold an essentially unlimited amount of information. The key to effective instruction is facilitating schema acquisition and automation, moving knowledge from the limited working memory into long-term memory.
Sweller, Ayres & Kalyuga (2011)
Automaticity: When Foundations Become Invisible
Watch a young child learning to read. They stare at each word, laboriously mapping each letter to its sound, sounding out syllables one at a time. By the time they reach the end of a sentence, they have forgotten how it began. They are decoding, but they are not comprehending, because decoding is consuming every slot in their working memory, leaving nothing for meaning.
Now think about how you read. The letters, the phonics, the word recognition: all of it happens below conscious awareness. You do not "see" individual letters any more than you consciously feel your fingers on the keyboard while typing. Those low-level operations have become automatic, and that automaticity is what frees your mind to engage with ideas, arguments, and narrative.
This pattern repeats in every domain. A medical student learning to read an ECG initially has to consciously recall what each wave represents, what normal intervals look like, and how to measure them. Each of those sub-tasks claims a working memory slot. But after enough practice, reading the basic rhythm becomes effortless, and the student can focus on spotting subtle abnormalities that would have been invisible before.
Higher-level thinking depends on lower-level skills becoming invisible. You cannot reason creatively about a problem if you are still struggling to recall the foundational knowledge that the reasoning depends on.
The implication is direct: if you want to do sophisticated thinking in any field, the prerequisite knowledge must be so well-practiced that it no longer requires conscious effort. Automaticity is not a luxury. It is the mechanism that unlocks the next level of understanding.
Chunking and Dual Coding
If working memory is limited to about four items, the natural question is: what counts as an "item"? This turns out to be flexible. Through a process called chunking, you can group several related pieces of information into a single unit, effectively expanding the reach of your four slots.
Consider a phone number: 8-6-7-5-3-0-9 is seven separate items, well beyond working memory limits. But most people naturally group it into something like 867-5309, two or three chunks instead of seven isolated digits. Experts in any field do the same thing at a larger scale. A skilled programmer does not think about individual syntax tokens; they think in patterns and idioms. Each pattern is a chunk that compresses many details into one working memory slot.
Another powerful strategy is dual coding: encoding the same information through both verbal and visual channels. Your brain processes language and images through partly separate systems, so when you pair a written explanation with a diagram, you create two independent memory traces rather than one. Research by Allan Paivio showed that information encoded through both channels is significantly easier to retrieve later, because there are more pathways leading back to it.
Chunking and dual coding are not separate tricks. They work together: the more you practice retrieving and connecting information, the larger and more robust your chunks become. Each chunk you build frees up working memory for the next layer of complexity, creating a virtuous cycle where learning accelerates the more you know.
In RemNote
Flashcards are fundamentally an automaticity tool. By repeatedly retrieving foundational knowledge at spaced intervals, you drive it below the threshold of conscious effort. Over time, the facts and relationships you review become the invisible scaffolding that lets you think about harder problems without running out of mental workspace.
Further Reading
  • Sweller, J. (1988). “Cognitive load during problem solving.” Cognitive Science.
  • Cowan, N. (2010). “The magical mystery four: How is working memory capacity limited?” Current Directions in Psychological Science.
  • Paivio, A. (1991). “Dual coding theory: Retrospect and current status.” Canadian Journal of Psychology.