Working Memory Capacity as a Dynamic Process

Simmering, Vanessa R; Perone, Sammy

doi:10.3389/fpsyg.2012.00567

HYPOTHESIS AND THEORY article

Front. Psychol., 07 January 2013

Sec. Developmental Psychology

Volume 3 - 2012 | https://doi.org/10.3389/fpsyg.2012.00567

Working memory capacity as a dynamic process

A commentary has been posted on this article:

Working memory capacity: the need for process task-analysis
1. Read general commentary

Vanessa R. Simmering¹*

Sammy Perone²

¹Department of Psychology, University of Wisconsin – Madison, Madison, WI, USA
²Department of Psychology, University of Iowa and Delta Center, Iowa City, IA, USA

A well-known characteristic of working memory (WM) is its limited capacity. The source of such limitations, however, is a continued point of debate. Developmental research is positioned to address this debate by jointly identifying the source(s) of limitations and the mechanism(s) underlying capacity increases. Here we provide a cross-domain survey of studies and theories of WM capacity development, which reveals a complex picture: dozens of studies from 50 papers show nearly universal increases in capacity estimates with age, but marked variation across studies, tasks, and domains. We argue that the full pattern of performance cannot be captured through traditional approaches emphasizing single causes, or even multiple separable causes, underlying capacity development. Rather, we consider WM capacity as a dynamic process that emerges from a unified cognitive system flexibly adapting to the context and demands of each task. We conclude by enumerating specific challenges for researchers and theorists that will need to be met in order to move our understanding forward.

Introduction

Working memory (WM) has been dubbed the heart of intelligent behavior (Necka, 1992). One hallmark of the WM system is its highly limited capacity. WM capacity limitations are reliably associated with cognitive functions such as language comprehension, reasoning, planning, fluid intelligence, and scholastic achievement (e.g., Conway et al., 2003). Accordingly, capacity limitations are thought to influence cognitive development in a vast number of ways, from acquiring basic categories during infancy (Oakes et al., 2008), to higher-level skills such as following multi-step instructions (Gathercole et al., 2008). WM deficits have been observed in clinical populations, including children diagnosed with attention-deficit/hyperactivity disorder (ADHD; Willcutt et al., 2005), autism (Steele et al., 2007), developmental coordination disorder (Alloway, 2007), and schizophrenia (Cullen et al., 2010), as well as children born preterm (Vicari et al., 2004). Moreover, children with low WM scores (i.e., below the 10th percentile for age group on standardized assessment tasks) exhibit cognitive problems, symptoms of inattentiveness and distractibility, and difficulty with problem solving in academic settings (Alloway et al., 2009). WM capacity impacts cognitive functioning and development in multiple domains and populations. Understanding WM capacity development, therefore, has implications for our scientific understanding of human cognition and may enable researchers to foster positive changes in atypically developing populations.

How WM operates as part of a larger cognitive and behavioral system is highly complex. Facing such complexity, we must ask: What is our goal in studying WM capacity over development? One of the most obvious applications is to develop interventions that may alleviate the broad ranging negative consequences associated with poor WM (Alloway et al., 2009). Unfortunately, effective interventions have proven challenging to generate. For example, Shipstead et al. (2012) reviewed studies assessing whether training with the Cogmed program leads to improvements in WM and other symptoms of ADHD. Across studies, Shipstead et al. (2012) found mixed results, with some training effects failing to replicate, and overall more support for narrow versus broad transfer (an issue we discuss further below; for related discussion). We propose that a central limitation of previous training studies is that they are not motivated by theories of WM that specify how this system organizes itself across contexts.

To design effective interventions, we must understand the cognitive processes and behaviors we study. In this way, we argue, our theories are failing us: through decades of research we have not yet developed a robust approach to improving WM capacity, much less reliably influencing the related behavioral consequences. As an attempt to help our science move beyond its current limitations, we review data and theory on WM capacity development, evaluate limitations of existing theoretical approaches, and propose that understanding the cognitive and behavioral processes we measure in the laboratory requires a deeper appreciation for the complexity of the system we study. The empirical database on WM capacity development has grown immensely during the past half century. Despite this growth, much remains to be understood about the mechanisms that underlie developmental changes in WM capacity (Cowan, 2010). In fact, one could argue that such this growth has painted an unclear picture of children’s performance in WM tasks. We propose that the time has come to rethink existing approaches to WM capacity and its development.

The impetus to reconsider our approaches to studying WM capacity is motivated by our survey of developmental changes in WM capacity. This survey highlighted key challenges to our understanding of WM capacity development, which we discuss in further detail below. Briefly, we found that capacity estimates¹ increase reliably over development in nearly every study, but vary substantially across tasks and domains when compared within age groups. Furthermore, tasks designed to tax WM more (by adding manipulation or processing demands) or less (by requiring only storage) do not result in straight-forward modulations of capacity. These facts raised five key questions that we will address in this paper, which we summarize briefly here.

Do Different Tasks Simply “tap” WM Capacity Differently?

We will show that WM capacity for children of the same age varies substantially across task contexts. One view of such cross-task variation in performance is that different tasks tap WM capacity differently. We disagree: in our view, capacity does not exist as a separable property of the cognitive system and therefore cannot be “tapped.” Instead, we view cross-task variation in performance as a signature of a dynamically self-organizing cognitive and behavioral system.

Might Task Differences Mask a Child’s Underlying WM Capacity?

Historically, WM capacity has been viewed as a competency that an individual possesses with improvement over development. From this view, task differences can mask children’s underlying competency. In our view, task and stimulus differences are external influences on the dynamic organization of the cognitive and behavioral systems involved in performing the task at hand. Indeed, we will review our own work showing that the number of items maintained in memory actually changes across contexts. We reject the competence-performance distinction because, in our view, competence cannot be measured and is therefore an unfalsifiable construct.

Can We Only Understand WM Capacity Development by Using the Same Task Across Age Groups?

The immense growth of the literature on WM capacity development has led to the use of numerous tasks and variations of the same task across research groups. This, in turn, has created the substantial variation in children’s capacity estimates reported and an unclear picture of the trajectory along which capacity develops, revealing how complex the task of understanding capacity development is. At first glimpse, these facts might suggest that understanding children’s behavior over development would be simplified by merely using the same task with multiple age groups. We will show, by contrast, that use of multiple tasks within an age group illuminates the dynamics of the cognitive and behavioral systems we study. Thus, we suggest that measuring the WM system in multiple contexts is more informative than assessing the same context across multiple time points in development.

Can Theories Incorporating Multiple, Simultaneously Developing Abilities Account for WM Capacity Development?

We will describe several theories of WM capacity development that attribute change to a single mechanistic source. We suggest that such views are unable to simultaneously account for the cross-task variation in children’s performance and universal increase in capacity estimates. If various single mechanisms are unable to provide an account of children’s performance in WM tasks and development, might the addition of several, simultaneously developing abilities do the job? For example, might improvements over development in both rehearsal and resistance to interference provide an adequate account of increases in capacity estimates? In our view, the answer is no. Too often, such theories still treat these cognitive abilities as separable and independent. We view performance in WM tasks as determined by multiple, reciprocally coupled cognitive and behavioral processes that are not linearly separable.

Are Existing Theories Simply Underspecified?

We will argue that existing theories cannot simultaneously account for the cross-task variation in children’s performance and the universal increase in capacity estimates. Might increased specificity enable existing theories to overcome their limitations? In our view, existing theories of WM capacity are not simply underspecified. Moreover, we view existing theories as qualitatively different from the types of theories needed to understand WM capacity and its development. We advocate a new way of thinking about what capacity is, how capacity develops, and our global approach to studying capacity.

Overview

This paper is divided into three main sections. In the first section, we present a survey of developmental changes in WM capacity in visual and verbal domains (for more in-depth reviews, see Dempster, 1981; Jenkins et al., 1999; Cowan et al., 2007). We then review theoretical explanations of WM capacity development, which have historically attributed changes in capacity to single, or separable, mechanistic source(s). Such explanations have limited applicability across tasks and domains. Our second section outlines our proposal that focusing on single mechanistic sources – or even multiple, separable sources – of WM capacity limits and its developmental change cannot explain cross-context variation. Alternatively, we propose that a systems approach is best suited for understanding both general improvements in WM capacity over development and specific variation across contexts at a given time point. In this section, we present two case studies illustrating one systems approach to WM capacity development. In our third section, we discuss the challenges we believe theories and empirical inquires of WM capacity must meet in order to acquire a deeper scientific understanding of WM processes and ultimately develop interventions that foster positive cognitive outcomes for individuals suffering from WM deficits.

Working Memory Capacity Development

Survey of Empirical Results

Working memory is typically divided into separate verbal and visuo-spatial subsystems, each with a limited capacity (see Cowan, 2001, and commentaries for in-depth discussion). Within each domain, some researchers draw a distinction between tasks measuring short-term versus WM, with the former requiring only storage and the latter requiring additional manipulation of information (for discussion, see Cowan, 2008). However, this boundary is not uniform, with some theorists drawing no distinction between the two (e.g., Heyes et al., 2012). The application of the terms “short-term” and “working” memory have been inconsistent within the WM literature and, indeed, may not be separable processes. Here, we focus on comparing tasks according to similar methodology, but consider all tasks to fall under the umbrella term “WM” (see Appendix for methodological details and exclusions). More specifically, Table 1 shows a broad sample of studies using “simple span” tasks (i.e., requiring only storage) to assess verbal WM capacity. Table 2 shows the same for visuo-spatial WM capacity. Table 3 shows results from one common type of “complex span” task, specifically backward span in which participants are required to repeat stimuli in the opposite order from encoding. Within Table 3 we separate results by verbal versus visuo-spatial stimuli for ease of comparison with Tables 1 and 2. Lastly, Table 4 includes results from studies using other types of complex span tasks, such as those requiring further manipulation of information (e.g., counting objects to arrive at the digits to be remembered) or dual-tasks (e.g., serial recall of auditory digits while ignoring visually presented letters). Included in these tables is a survey of nearly 200 tasks and conditions from multiple studies across 50 papers that stimulated our rethinking of WM capacity and its development.

TABLE 1

Table 1. Estimates of verbal working memory capacity using simple span tasks.

TABLE 2

Table 2. Estimates of visuo-spatial working memory capacity using simple span tasks.

TABLE 3

Table 3. Estimates of working memory capacity using backward span tasks.

TABLE 4

Table 4. Estimates of working memory capacity using complex and/or dual-tasks.

One striking similarity across studies and domains is the almost universal increase in capacity estimates over development. Within nearly every study, capacity estimates are higher for older children than younger children. This strongly suggests a pervasive developmental change that manifests in a variety of tasks that estimate capacity in different ways. In contrast to the reliable increases in capacity estimates described above, however, these studies also illustrate that capacity estimates vary substantially across tasks and domains within a single age group. For example, consider 7-year-olds performing simple verbal tasks (Table 1): capacity estimates across 18 studies range from as low as 1.2 (Hulme et al., 1984) to as high as 5.4 items (Cowan et al., 1999). Even when comparing memory for only one type of stimuli – digits – the range still spans from 3.0 (Huttenlocher and Burke, 1976) to 5.4 (Cowan et al., 1999). Comparing 7-year-olds’ performance in simple visuo-spatial tasks (Table 2) shows another 27 studies or conditions with a range of 2.1 (Palmer, 2000) to 8.1 items (Wilson et al., 1987). This variability indicates substantial influence of task details on capacity estimates even for paradigms designed to assess storage alone.

When considering tasks designed to assess manipulation of information, results from 7-year-olds’ performance in backward span tasks (Table 3) are entirely within the range from forward tasks (albeit at the low end): for letters, 2.5 (Zuber et al., 2009) to 3.1 (Morra, 1994); for digits, 3.1 (Morra, 1994) to 3.2 (Isaacs and Vargha-Khadem, 1989); and for blocks in the Corsi task, 2.9 (Zuber et al., 2009) to 3.9 (Isaacs and Vargha-Khadem, 1989). One might argue that backward spans are more informative when compared to forward span within a single group of participants. Isaacs and Vargha-Khadem did such a comparison, testing the same children in both forward and backward versions of both digit and Corsi block tasks (see Tables 1– 3). They found that performance differed significantly based on order for digits only, not for blocks in the Corsi task. This suggests that the influence of manipulating order in memory also depends on the details of the task.

Potential sources of developmental change in WM capacity are no less puzzling when examining performance in complex tasks. Consider children’s performance in counting span tasks, the most common complex task variant in Table 4. For 6- and 7-year-olds, capacity estimates range from 1.7 (Case et al., 1982) to 4.4 (Russell et al., 1996) digits in this task, which is slightly lower than the range in Table 1. Interestingly, multiple studies showed that small differences in how stimuli were presented (e.g., in a canonical versus random spatial organization; Russell et al., 1996) influenced capacity estimates in this paradigm. Looking later in development, capacity estimates for 11- to 12-year-olds (still considering digits in the counting span task) range from 3.2 (Case et al., 1982) to 4.6 (Hutton and Towse, 2001), again showing variation based on stimulus presentation. What can we conclude from this range of findings? If we assume that complex tasks require processing or manipulation above and beyond the storage required in simple tasks, then we would (presumably) expect lower estimates from complex tasks than simple tasks. This is the case if we consider means across studies, but not necessarily the case if we consider the full range of estimates.

The results summarized in Tables 1– 4 raise a critical question: what do we strive to explain with our theories? Do we want to explain differences in means between age groups or differences across tasks within age groups? Is it more important to understand why some simple tasks yield lower estimates than some complex tasks? As we will discuss below, one limitation of current theories is they do not yet address such cross-task and -age variation. This is a critical limitation because complex tasks such as these have long been held as measuring individual differences in adults’ cognitive ability and as robust predictors of reading comprehension (Daneman and Carpenter, 1980), performance on standardized tests (Turner and Engle, 1989), and problem solving (Engle et al., 1999). Furthermore, Cowan et al. (2005) showed that both simple and complex task measures correlate with scholastic performance during middle childhood.

In summary, the goal of the research presented in these tables has been to map the developmental trajectory of WM capacity. However, achieving this goal has proven a daunting task. In fact, the vast literature on WM capacity development has created an unprecedented theoretical challenge: to explain the WM processes that underlie the universal increase in capacity estimates and how the very same WM processes produce variation in capacity estimates across tasks and domains. In the next section, we describe current and historical theoretical perspectives on WM capacity development.

Theories of WM Capacity Development

Providing a concise synthesis of theories of WM capacity development is not easy. One key obstacle is that theories in this domain have been closely tied to specific periods of development, specific domains, and/or the specific tasks they address. For example, there is currently a heated debate between “slots” and “resources” theories in research on visual WM capacity (e.g., Alvarez and Cavanagh, 2004; Bays and Husain, 2008; Zhang and Luck, 2008). However, these theories have thus far only been applied to memory for continuously varying visual stimuli (e.g., color, orientation) in simple tasks (recall or recognition of small arrays) with adults (but, see Heyes et al., 2012, for one developmental study in this debate), making it unclear how such theories would address the range of tasks and age groups shown in Table 2. Theories rarely, if ever, make contact with a broad range of tasks, domains, or developmental periods. In our synthesis, we discuss a few select theories based, in part, on their prevalence, historical significance, and their scope. Our aim was to present a review of the most comprehensive and compelling theories of development that have been put forth, rather than an extensive review of many competing theories.

The majority of theories of capacity development have focused on the universal increase in capacity estimates, rather than the variation in performance across tasks and domains. Many theorists have proposed that developmental increases in WM capacity arise from improvements in cognitive processes other than WM capacity such as processing speed, attention, encoding, response selection and generation, and/or retrieval (see, e.g., Towse and Hitch, 2007, for further discussion). For example, Case (1995) proposed that developmental improvements in performance reflected changes in processing speed and efficiency, which eventually lead to more information being stored with the same capacity/mental resources. He considered children’s experience to be a driving force behind improvements in processing. This feature of Case’s perspective could potentially account for differences across tasks, stimuli, and individuals. For example, as a child gains experience with counting, she/he may become more efficient at processing digits and therefore have an increased digit span, but not improve in spans of non-digit words. Indeed, Fry and Hale (1996) showed that measures of processing speed in non-span tasks account for much of the variance in capacity estimates (derived from storage plus processing tasks) both within and between age groups. The developmental hypothesis from this perspective, then, is that improvements in other cognitive processes enable the WM system to exhibit an increased capacity in laboratory tasks without any change in mental resources.

Dempster (1981) also proposed that increases in WM capacity estimates arise from sources other than capacity. Dempster classified 10 factors that lead to individual and developmental differences in capacity into two categories: four were strategic variables, and six were non-strategic variables. Although many researchers propose that strategic variables (e.g., rehearsal, chunking) are a primary source of developmental improvement in WM tasks, there is little direct evidence regarding how or why strategies might change over development, and Dempster concluded that the evidence at that time was not strong enough to support these types of explanations. For non-strategic variables (e.g., speed of processing, resistance to interference), on the other hand, theorists have put forth some mechanisms that may produce developmental change. In particular, Pascual-Leone (1970) and Case (1995) propose that mental resources, referred to as M-space or M-power, are changing over development. Some researchers suggest that M-space is increasing over development, whereas others believe that the efficiency with which M-space is used is the source of developmental improvement. In either case, the primary mechanism proposed to cause this developmental change is an increase in myelination. According to Case, this is a likely source because myelination not only increases the speed of neural conductivity, but also reduces interference through insulation. Although it seems intuitive that these changes would produce either an increase in resources available or improve the efficiency of the use of these resources, no specific process has been proposed by which increased neural conductivity would improve performance in span tasks.

Most early theoretical explanations of WM capacity development attribute increases in capacity estimates to a single cause (e.g., those reviewed by Dempster, 1981). One more recent example of this type of approach was illustrated by Oberauer and Kliegl (2001). These authors were interested in potential sources of aging-related declines in WM capacity. They discussed five potential sources of WM capacity limitations (limited resources, fixed slot-like capacity, speed of decay versus rehearsal, similarity-based interference, and inter-item confusion) and developed computational models to test whether these sources of WM capacity limitations could account for data comparing young and old adults’ performance on a complex WM task. They tested seven formal models that implemented these sources (three types of resource models and one model for each additional proposed source) and found that only two models – one with capacity limits arising through decay, the other arising through interference – provided a good fit to their data. This approach assumes that there is only one source of capacity limits. Can we assume that WM capacity would be limited by either decay or interference, but not both? Similarly, should we infer that only one of these would change over development?

As the aforementioned examples illustrate, single-cause explanations have fallen short in accounting for the type of variation across tasks and domains shown in Tables 1– 4. For example, although Fry and Hale (1996) showed that processing speed accounts for much of the variance they found in WM tasks, there was still a significant proportion of variance from age that was not accounted for by speed. Thus, WM performance increased over development due to some process(es) in addition to processing speed. The limitations of such single-cause explanations has led theorists to argue for explanations that incorporate multiple causes (e.g., Towse and Hitch, 2007; Cowan, 2010), suggesting the need for studies that control and compare multiple processes as well as storage constraints. Cowan et al. (2010), for example, highlights the distinction between processing-related and storage-specific limitations addressed through previous research. He argues that processing strategies (such as rehearsal, chunking, or imagery) are likely to vary across task contexts whereas storage limits stay constant across a wider variety of circumstances. He proposes that future research should aim to control or prevent processing strategies to gain a more precise measure of storage.

Similarly, Towse and Hitch (2007) outlined evidence for four sources of variation in WM performance: increases in processing speed, improved ability to maintain information while engaged in a separate processing task, increased storage capacity, and improved response-timing processes (e.g., latency to respond, pauses between items). They further proposed that developmental changes in performance arise from the combination of which processes are accessed and how well they function. We agree that theories must move beyond single-source models of WM capacity limits and development. However, our perspective makes the further claim that multiple sources of developmental differences cannot be considered in isolation. Below, we propose a new view of WM capacity that embraces the complex manner in which neural, cognitive, and behavioral systems interact over real and developmental time scales. We believe this view can provide new insights into the sources of both variation and stability across development. We provide evidence from two case studies that have tested novel predictions derived from this perspective.

A New View on Working Memory Capacity Development

Our proposal is that WM capacity and its development can best be understood as a dynamical system, from the perspective of a systems theory. A key feature of systems theories is their emphasis on process over representation (Thelen and Smith, 1994). Capacity has long been studied as a concrete property of the WM system: a given individual is considered to “have” a capacity in each domain, which is then brought to bear on laboratory tasks. Indeed, the fact that Tables 1– 4 present numerical estimates of how many items are held in memory points to this emphasis on representation. From a systems perspective, by contrast, the processes of remembering cannot be separated from what is being remembered (i.e., representation; Spencer and Schöner, 2003). We contend that WM capacity does not exist in the way it has historically been discussed: from our perspective, there is no such thing as capacity “in the head.” Rather, capacity estimates are emergent products of cognitive and behavioral systems interacting in real-time in a task context. Emergence – a central concept in systems theory – is the notion that a given property of a system exists only through the meeting of components of that system. To draw on an analogy from physics, consider the concept of friction, which exists solely as the product of the meeting of two surfaces: a ramp alone has no friction, as a ball alone has no friction, but rolling a ball down a ramp produces friction. Extending this to WM, capacity does not exist alone, but rather capacity estimates are an emergent product of various components of cognitive and behavioral systems organizing themselves over time within a specific stimulus and task context.

Additionally, capacity estimates arise from the nature of the coupling between cognitive and behavioral systems. Coupling – another central concept in systems theory – is the notion that the dynamics of one system depend on and are inseparable from the dynamics of another system. Consider the classic example of two pendulum clocks (Rosenblum and Pikovsky, 2003). In the seventeenth century, Huygens (1986) observed the spontaneous synchronization of the motion of two clocks. How did this happen? Both clocks were connected to the same wooden beam. The motion of the clocks generated vibration that was shared across the two clocks. The common source of energy – coupling – led the behavior of the two clocks to become perfectly synchronized without the need for any outside source of intervention. That is, the behavior of the two clocks was the product of a self-organizing dynamical system that emerged from the coupling between the two systems.

In our view, capacity in the classic sense (e.g., the “slots” metaphor) does not work. In the laboratory, we derive capacity estimates that are the emergent product of multiple, highly complex, coupled cognitive and behavioral systems operating within the task context. If we want to understand why capacity estimates appear limited and why they differ across individuals, development, and task contexts we must understand the dynamics of these systems (i.e., how the components of a system interact through time). We illustrate this claim below by reviewing two case studies from our own work. Our proposal stands in contrast to the historical approach to understanding capacity and its development. For instance, Cowan (2010) emphasized the role of processing (e.g., strategy) in explaining cross-task performance differences, while contending that storage remains relatively constant across tasks. Though we agree that both processing and storage must be considered to understand performance across tasks, we disagree with both the characterization of storage as a separable component of the system as well as the notion that storage is constant across tasks. In our view, storage capacity cannot be “tapped.” Storage is a process in and of itself that cannot be considered in isolation from the processes that contribute to (e.g., encoding, chunking) and operate upon (e.g., rehearsal, retrieval) stored information.

Below, we present two case studies illustrating how a systems approach can be applied to WM capacity development. These studies have tested specific predictions derived from the implementation of visual WM into a computational model, which allows for direct testing of how changes in a given set of processes may simulate developmental improvements in performance. These examples demonstrate how the specific details of the behavioral tasks designed to measure WM capacity influence the processes by which WM representations are formed and used in service of the tasks, and reveal that capacity may vary within the same participants depending on the manner in which information is presented and capacity is measured. Importantly, we do not consider these differences across tasks to be “noise” in our estimates, but rather believe this cross-task variation informs our understanding of how this dynamic cognitive and behavioral system operates and develops.

Case Study 1: Infant Visual Working Memory

Our first case study centers on a series of neural network simulations reported by Perone et al. (2011). Perone et al. showed that a single, complex system can produce remarkable variation in performance across contexts. More specifically, they tested the prediction that a single neuro-dynamical systems model of infant looking and memory could produce variation in infants’ capacity estimates across task conditions. They simulated infants’ performance in a change preference task designed by Ross-Sheehy et al. (2003) to estimate visual WM capacity. Figure 1A shows this task, in which infants viewed two displays of colored squares blinking on and off in synchrony. On a “no-change” display, all of the colors remained the same with each blink/delay. On a “change” display, one randomly selected color changed to a new color. Infants’ looking time to the two displays was compared, and a robust preference for the change display was interpreted as memory for the number of items per display (i.e., set size). Across set sizes, Ross-Sheehy et al. found that 6-month-olds showed a robust change preference only at set size one, whereas 10-month-olds showed change preferences up to set size four. They concluded that infants’ visual WM capacity increases from one to four items between 6 and 10 months.

FIGURE 1

Figure 1. Schematic illustrations of tasks used to assess visual working memory in (A) infants versus (B) children and adults; both present examples of set size three.

Perone et al. (2011) simulated infants’ performance in this task using a model of infant looking and memory. The model consists of a neurocognitive system that encodes object details (e.g., color) and a fixation system that is biased to sustain looking during encoding. Encoding leads to WM formation of the colors in the displays; once a robust WM is formed, inhibition biases the system to look away from remembered items and explore items that may be novel. The model exhibited a change preference through recognition of the items on the no-change display and detection of novelty on the change display. This preference emerged through real-time interactions between looking, encoding, and WM formation. Critically, Perone et al. found that a preference for the change display did not require memory for all items in the display, that is, the model exhibited a higher capacity estimate (measured through looking time) than the number of items maintained in WM.

This example highlights how multiple processes working together give rise to behavioral estimates of capacity. Critically, the challenge remains to understand how such processes give rise to variation in performance like that shown in Tables 1– 4. Within systems approaches, such variation is viewed as a signature of a system that organizes in real-time in response to the current task context. Perone et al. (2011) illustrated this concept by simulating a second experiment by Ross-Sheehy et al. (2003) in which they removed the delay to insure that young infants’ performance reflected a limitation in memory, not perception or attention. Indeed, young infants exhibited change preferences for set sizes up to three in this condition. This manipulation changed the task in two important ways. First, “blinks” on the change and no-change displays were no longer present, that is, there were no transient onsets within each presentation of the items. Second, it introduced a “flicker” associated only with the changing item on the change display. Perone et al. showed that these minor manipulations dramatically influenced looking behavior. In the DNF model, looking and memory are reciprocally coupled components of a larger cognitive and behavioral system. Manipulations of looking influenced memory formation. In particular, the “flicker” associated with the change display biased the model to preferentially look to the display. This had two consequences. First, it led the model to encode the changing item on the display, which, in turn, biased the model to look at the change display more. Second, it enabled the model to encode more items into WM on the change display, boosting the number of items that the model maintained. Performance in memory tasks reflects the real-time organization of a system in context rather than a property of the WM system.

Critically, this no-delay version of the change preference task does not depend on memory in the classic sense (i.e., maintaining information across a delay). Nevertheless, Perone et al. (2011) simulations showed that the same cognitive and behavioral system that captured infants’ performance in task contexts that involved a delay could also capture infants’ performance in a task context with no-delay. This illustrates the emergence of capacity in our perspective: in the no-delay condition, support from the task input allowed for more items to be encoded and maintained in memory. This, in turn, facilitated recognition of sameness on the no-change display and detection of novelty on the change display. The result was a robust preference for the change display. In contrast, the standard condition is less supportive and places a different burden on the memory system. The repeated delay slows memory formation for non-changing items and promotes the decay of items maintained in memory. This, in turn, led to a different pattern of looking by the model. This example highlights how a task context impacts the process of remembering, the number of items remembered, and capacity estimates.

In light of the data in Tables 1– 4, a critical question is whether such a system is able to produce both behavioral variability across contexts as well as stable developmental increases in capacity estimates. As illustrated by our simulations of the no-delay condition, in a dynamical system real-time behavior emerges via the interactions of its components in context. The tendency of a dynamical system to produce similar behaviors in the same context (i.e., stability) depends on the nature of interactions among its components (for an illustration and discussion, see Perone and Spencer, 2012). Thus, developmental change in stability requires modification in the interactions among components. Perone et al. (2011) proposed that older infants’ change preferences at higher set sizes emerged from changes in the strength of interaction among the excitatory and inhibitory neural components governing encoding and memory formation in their model. Stronger interactions have a variety of emergent consequences, including faster and more robust WM formation, stronger suppression of encoding via WM, and increased responsiveness to novelty. This impacts how cognitive and behavioral processes interact in real-time. Perone et al. showed that the same model with stronger interactions performed like older infants (i.e., exhibited higher capacity estimates), maintaining multiple items on the no-change display while encoding novelty on, and looking longer to, the change display.

Case Study 2: Visual Working Memory through Childhood Into Adulthood

Simmering (2008), under review) used a similar model architecture to address inconsistent estimates of visual WM capacity over development. As described above, Ross-Sheehy et al. (2003) estimated capacity at one item for 6-month-olds, increasing rapidly to an adult-like three to four items by 10 months of age. By contrast, studies with older children using the change detection task revealed estimates of only one to two items during the preschool years, not reaching adult levels of performance until adolescence (see Table 2; Cowan et al., 2005; Riggs et al., 2006; Simmering, 2012). In this task, a small number of colored squares are presented briefly (e.g., 500 ms), followed by a short delay (e.g., 1 s) and presentation of a test array that includes either all of the same colors, or one new color (see Figure 1B). Participants are instructed to respond different if any items changed, or same if all items were unchanged. One obvious source of the inconsistent estimates between infancy and later childhood is the different tasks used to assess capacity. Without a full understanding of the processes involved in these tasks, however, it was not possible to relate performance across the two tasks and development (see Riggs et al., 2006, for discussion).

Simmering (2008) addressed this inconsistency by implementing both tasks within the same neural network model, and then testing one group of participants (3-, 4-, 5-year-olds, and adults) in both tasks to compare performance directly. Model simulations showed how the same memory processes could operate in both tasks but yield strikingly different capacity estimates. In particular, the repeated presentation of items on both displays in the infant task allowed WM representations to build incrementally, which was particularly important for younger children, whose memory builds more slowly and is less stable. By contrast, successful performance in the change detection task requires robust WM representations to form quickly and be maintained stably enough to support generation of a same/different decision.

These simulations led Simmering (2008) to predict that capacity estimates should be higher in the change preference versus change detection task when measured in the same participants, but, critically, that performance should still be correlated across tasks due to the shared processes of WM formation and maintenance. Behavioral results confirmed these predictions, showing capacity estimates from the change preference task of at least six items for all age groups, but change detection capacity estimates of only two to three items for children and four to five items for adults. Moreover, change detection capacity was predicted by a behavioral signature of robust memory formation in looking tasks: the number of times participants looked back and forth (Kovack-Lesh et al., 2008; Perone and Spencer, 2012). In particular, the number of times participants looked back and forth between displays on set size four trials was positively correlated with capacity, even after controlling for age-related improvements (see Simmering, 2008, for details).

Following these behavioral results, Simmering (2008) quantitatively fit the model’s performance to children’s and adults’ results from both tasks, in order to understand how the same memory system could produce such different estimates across tasks. The model was indeed able to maintain more items in WM during the change preference versus change detection task. As such, Simmering concluded that the infant task did not over-estimate capacity, but actually produced higher capacity than the change detection task. The same model showing different capacity – not just estimates, but the actual number of items held in WM – across tasks illustrates the benefit of conceptualizing capacity as a process. Returning to the issues we raised at the beginning of this paper, alternative interpretations of these results could be that the looking task reveals a competence that is masked by the change detection task, or that the tasks tap different underlying memory systems. Combining the empirical and computational results, however, support our position that the system organizes in the task context: storage cannot be separated from the processes that form and act upon representations, and storage limits are not static across task contexts.

Challenges for Theories of Working Memory Capacity Development

The empirical results in Tables 1– 4 illustrate the complex pattern of WM capacity development: although capacity increases universally, different stimuli, paradigms, and domains have revealed high variability in estimates that remains to be explained. Until recently, the dominant theories of WM capacity development focused on single-cause explanations for capacity limits and their developmental increase, which we contend are unable to capture the full range of behavior shown in Tables 1– 4. There has already been a move to theories that consider multiple causal factors (e.g., Towse and Hitch, 2007; Cowan et al., 2010), but such theories characterize processes and storage as separable cognitive components. As an alternative, we believe that successful theories of WM capacity and development will re-conceptualize capacity as part of the process involved in performing various tasks rather than a separable, static storage component. Our case studies illustrate how this new approach can shed light on previously unclear behavioral results. We are not arguing that existing theories are simply underspecified. We are arguing for a re-conceptualization of what capacity estimates truly measure.

We have identified three challenges that we believe theories must confront in order to provide a comprehensive account of WM capacity and its development. Our position is that meeting these challenges will allow our theories to overcome previous limitations and extend our theories beyond the tasks we use in the lab to realize positive impacts on real world behaviors that relate to WM capacity.

Challenge 1: System Components Must be Specified

A central challenge for any theory of WM capacity is to postulate the components of the cognitive and behavioral systems under study, and to specify the coupling among those components. This is necessary to understand the real-time organization of the system in the behavioral tasks we use in the laboratory. Tables 1– 4 give estimates of capacity, which researchers derived through some calculation based on behavior. Across different tasks, the specific behaviors we measure can differ substantially (e.g., serial reproduction of verbal lists, judging each item in a list as old/new, yes/no comparisons of multi-item visual arrays, pointing to remembered locations), but we expect these diverse measures to reveal some common underlying cognitive mechanisms. Consider, for example, our second case study in which looking behavior in the change preference task was compared to same/different judgments in the change detection task. Without a formal proposal regarding the contribution of the items being held in memory versus the response processes, there would be no foundation for the predicted relationship between participants’ looking behavior and their capacity estimates.

By incorporating the specific details of the task designs and behaviors we measure in the laboratory into our theories, we can begin to understand how common cognitive and behavioral processes could produce the wide variety of estimates shown in Tables 1– 4. For example, it is generally accepted that capacity estimates are influenced by attention, processing speed, and inhibition (e.g., Johnson et al., 2003). How multiple, interacting processes organize themselves in service of behavior, however, is strongly influenced by the context. High attentional demands may lead to lower capacity estimates in one study while simpler stimuli lead to higher capacity estimates in another. To understand WM capacity and its development, we must simultaneously consider multiple contributions to behavior and the capacity estimated from it.

Our case studies above illustrate how such specification may be achieved through the use of computational models. Implementing multiple behavioral tasks within a single model architecture requires specifying what the components are, how they are coupled, and how they interact. This can lead to novel behavioral predictions. For example, in addition to the case studies we described above, Simmering and Patterson (2012) showed how the developmental mechanism proposed by Simmering (2008) to capture visual WM capacity development also predicts improvements in color discrimination in a single item memory task. Note that we are not proposing that computational models are necessary for a rigorous approach to understanding the link between cognition and behavior; rather, we emphasize keeping a tight link between theories and experiments with particular attention on real-time behaviors.

Challenge 2: Theories Must be Linked Closely to Real-Time Behavior

Theories of WM capacity and its development cannot evolve without a tight connection between theory and experiment. The work of Karmiloff-Smith and colleagues provides several elegant examples (from other domains of cognitive development) of how this may be accomplished without the use of computational models. For example, Paterson et al. (2006) assessed numerical skills across populations by testing multiple age groups with multiple behavioral tasks. Their results showed that assumptions about infant processing based on adult dissociations were not supported. Rather, understanding the cognitive and behavioral processes involved in each task at each point in development revealed how processing styles diverged between groups over development. This example illustrates how non-computational approaches can still achieve the necessary level of specificity to explain behavior across multiple tasks and populations, and that new insights can be gained by considering cognitive and behavioral processes in context.

Although there will always be a place in our science for the types of measures reported in Tables 1– 4, we believe new methods of analysis hold great promise in providing new insights into developmental changes into cognitive processes. One example of this type of approach across domains is to consider “micro-behaviors” – that is, behavioral indices of underlying cognitive processes that can be found in the fine details of real-time behavior – rather than macro-level measures like percent correct. In the context of our own case studies, we have demonstrated how looking dynamics (i.e., switching between displays) may be a more sensitive measure of capacity than looking preference alone. But even this measure is relatively coarse, and does not tell us about the temporal unfolding of switches and duration of each fixation across the course of a trial. These measures might be informative about the link between online cognitive processing and behavior.

Taking advantage of the richness of behavioral data promises to drive the study of WM capacity forward. This approach has led to new insights into infant habituation. For example, Perone and Spencer (2012) showed that the duration of looks at a stimulus early in learning structured the time course of memory formation. This, in turn, influenced the duration of looks later in learning. The details of fixation, then, provide meaningful information about the underlying process of forming a memory and how memory formation impacts fixation dynamics, above and beyond classic measures associated with habituation. In work with adults, Spivey (2007) has demonstrated how the real-time dynamics of decision making (measured through movements of the eyes and/or computer mouse in lab tasks) reveal probabilistic representations when overt behavioral measures (e.g., which item on the screen the participant clicked) could be interpreted as deterministic. This emphasis on the fine details of behavior presents a challenge to researchers because it will require new ways of collecting and analyzing data.

Challenge 3: Integration Across Types of Stimuli – Verbal and Visuo-Spatial – and Tasks – Simple and Complex

Historically, the separation of verbal and visuo-spatial memory systems has been widely accepted. At the very least, visual WM tasks typically require looking at images or events (e.g., colored squares, hand movements to blocks) and verbal WM tasks typically require reading or hearing stimuli (e.g., words, letters, digits). Critically, however, Tables 1– 3 show that visual and verbal WM can both be characterized by a universal increase in capacity estimates and cross-task variation in capacity estimates. Should we interpret this as parallel changes in unrelated systems, or evidence for shared processes across different types of stimuli? Can we reunite verbal and visual subsystems?

From a systems view, much of this work will be done by specifying the relevant components of the system (as described in Challenge 1). For example, many theories posit separate storage for verbal versus visuo-spatial information, but shared attentional mechanisms that influence encoding into these stores (e.g., Cowan et al., 2005). However, as illustrated by our example above of Isaacs and Vargha-Khadem’s (1989) study of backward digit and Corsi spans, it seems that this relationship will not be straight-forward: their results suggested that the process of manipulating item order was not comparable for verbal versus visuo-spatial stimuli. As we argue above, the processes that act upon stored information cannot be considered separately from the information being stored.

A related concern is where (and whether) to draw the line separating simple and complex tasks. The typical argument for this distinction emphasizes the need for processes other than storage in complex tasks. However, across the range of studies in Tables 1 and 2, we found that participants were required to encode and retrieve information in a variety of ways across tasks. As such, even “simple” tasks depend on more than just storage. Although these differences across tasks are relatively small compared to the processes involved in, for example, a counting span task, we argue that it is still informative to consider these tasks as arising from the same unified system: demands on some processes may be minimized in simple tasks, but these processes still exist and contribute to performance.

One compelling argument to consider simple and complex tasks separately is raised by Engle and colleagues (e.g., Shipstead et al., 2010). They emphasize that simple tasks do not correlate consistently with performance outside the lab, and suggests that simple tasks are therefore less informative for understanding individual differences and interventions. If we consider simple and complex tasks as part of the same self-organizing system, however, then we could use the differences between the task types as an indication of the processes that are most critical to measures of general intelligence. For example, Unsworth and Engle (2007) showed that performance on supraspan trials of simple tasks is maximally predictive of general fluid intelligence, whereas performance on complex tasks is maximally predictive at small spans. To us, this suggests that both types of tasks are dependent upon the same underlying system which organizes differently according to context. How this organization occurs in response to the task context may be just as important to understanding individual differences and/or interventions as understanding how the system operates in each context separately.

Rising to the Challenge: What is the Goal of Our Theories?

What do we want, or need, to achieve with theories of WM capacity? To date, our theories have often been limited to specific tasks, domains, and developmental periods. Might there be an advantage to develop theories with a broader goal in mind? We think the answer is yes. We contend that developing theories with broader goals will ultimately provide us with a deeper scientific understanding of the systems we study. Moreover, we contend that developing theories with broader goals will have practical implications in the real world. Consider WM training programs as one example. These involve intensive exercises designed to improve WM capacity. The target outcome of such programs is to improve cognitive functioning in the real world by reducing memory lapses, improve scholastic achievement, limit symptoms for those suffering from ADHD, and even increase intelligence (for review of the claimed benefits of one particular training program, see Shipstead et al., 2012).

Within the past few years, numerous reviews of WM training programs have emerged, highlighting this as a central goal of research on WM capacity (e.g., Shipstead et al., 2010; Melby-Lervåg and Hulme, 2012; Wass et al., 2012). Many of these reviews have reached the pessimistic conclusion that the evidence for WM training that achieves the intended broad, positive impact on cognitive function is mixed at best. What is the source of this pessimism? Our survey of Shipstead et al. (2012) review of the Cogmed WM training program and the related commentaries points to the central assumption underlying the development of such programs: capacity is a property of the WM system (for discussion, see Logie, 2012; see also Klingberg, 2012). If WM capacity can be enhanced, then individuals should exhibit benefits across the wide array of contexts that this property operates within. However, the evidence indicates that WM training has limited transfer, with the primary benefit on tasks that closely resemble the training tasks. Indeed, some have argued that improved performance on such tasks may arise from nothing more than familiarity with the task or even performing tasks on computers, and is not an effect of WM capacity at all (Hulme and Melby-Lervåg, 2012).

Even more worrisome is the fact that our current theories do not reliably predict which tasks will show transfer of WM training. Our view echoes that of Gibson et al. (2012), that our theories and WM training studies must inform each other. Moreover, we must have theoretically motivated research that specifies how WM training impacts the WM system and how the WM system is integrated with other cognitive and behavioral systems across contexts (Hulme and Melby-Lervåg, 2012; Shipstead et al., 2012). Finally, we must move our theories beyond a single task, domain, and developmental period in order to explain – and predict – how WM changes within an individual over time (Gibson et al., 2012). If we do not understand how the tasks we use in the lab relate to one another, we will not be able to predict when training in lab tasks will produce lasting improvements in behaviors outside of the lab.

Conclusion

Working memory capacity plays an important role in numerous cognitive functions and continuously serves behavior in the real world. Limitations in WM capacity impact cognition and development in both typical and atypical populations. The vast literature on WM capacity development has revealed a nearly universal increase in capacity estimates over development and substantial variation in capacity estimates across task contexts within age groups. We contend that theoretical explanations focusing on single or separable mechanistic sources of children’s performance in WM tasks or development changes in capacity limits cannot provide an adequate account of the full pattern of performance shown in Tables 1– 4.

We began by presenting five questions that arose from our survey of the literature on estimates of capacity development, which raised possible explanations for the wide variation in results in Tables 1– 4. As we have described above, we reject these potential explanations and propose a new approach to studying WM capacity development. In summary, we do not believe that different tasks are tapping capacity in different ways, that difficult tasks are masking underlying competencies, or that capacity can only be understood by developing a single “best” paradigm to use for comparisons. Furthermore, we do not think that existing theories could account for the breadth of data by incorporating more causal mechanisms and/or being more specific. Rather, we argue that capacity does not exist in the way it has been traditionally conceptualized, but is an emergent process within a dynamically coupled, self-organizing cognitive, and behavioral system.

We propose that WM capacity should be construed as a dynamic process rather than a property of WM, and that we will gain better understanding of developmental change by first considering how the relevant cognitive and behavioral systems organize in service of tasks within and outside the laboratory, and second, by examining how changes in real-time dynamics create the developmental change we observe. We illustrated how Perone et al. (2011) used such an approach to provide insights into variation in capacity estimates across contexts and development improvement in performance during infancy. Furthermore, Simmering (2008) (2008, under review) showed how differences in the method of presentation and behavioral measures across paradigms could result in higher or lower capacity within the same individuals. Critically, performance was correlated across these tasks, supporting the claim that these tasks depend on the same underlying cognitive system.

Finally, we considered three significant challenges that arise when conceptualizing WM capacity as a dynamic process within an integrated system. To develop an effective systems theory of any cognitive process, not just WM capacity, we must specify the system components and how they operate across behavioral tasks while keeping a tight link between theories of how those processes work and the paradigms we use to test such processes. Within WM research more specifically, we urge theorists to integrate studies across verbal and visuo-spatial domains as well as simple and complex tasks, to arrive at a more complete understanding of the WM system’s self-organization. By confronting these challenges we believe that theories of WM capacity development will be able to expand beyond laboratory tasks and understand the role WM plays in the great variety of real world behaviors of interest across both typical and atypical populations.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

Thanks to Larissa K. Samuelson and John P. Spencer for suggesting this topic and for fruitful discussion. Preparation of this manuscript was partially funded by the National Institutes of Health (R03-HD067481, awarded to Vanessa R. Simmering).

Footnotes

^Note that we use the phrase “capacity estimate” to refer to data derived from behavioral studies, and “WM capacity” to refer to the psychological construct throughout this paper.

References

Alloway, T. P. (2007). Working memory, reading, and mathematical skills in children with developmental coordination disorder. J. Exp. Child Psychol. 96, 20–36.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Alloway, T. P., Gathercole, S. E., Kirkwood, H., and Elliott, J. (2009). The cognitive and behavioral characteristics of children with low working memory. Child Dev. 80, 606–621.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Alvarez, G. A., and Cavanagh, P. (2004). The capacity of visual short-term memory is set both by visual information load and by number of objects. Psychol. Sci. 15, 106–111.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

^†Bayliss, D. M., Jarrold, C., Baddeley, A. D., and Gunn, D. M. (2005a). The relationship between short-term memory and working memory: complex span made simple? Memory 13, 414–421.