- Neuroscience of Imagery, Cognition, and Emotion Research Lab, Carleton University, Ottawa, ON, Canada
The relationship between vivid visual mental images and unexpected recall (incidental recall) was replicated, refined, and extended. In Experiment 1, participants were asked to generate mental images from imagery-evoking verbal cues (controlled on several verbal properties) and then, on a trial-by-trial basis, rate the vividness of their images; 30 min later, participants were surprised with a task requiring free recall of the cues. Higher vividness ratings predicted better incidental recall of the cues than individual differences (whose effect was modest). Distributional analysis of image latencies through ex-Gaussian modeling showed an inverse relation between vividness and latency. However, recall was unrelated to image latency. The follow-up Experiment 2 showed that the processes underlying trial-by-trial vividness ratings are unrelated to the Vividness of Visual Imagery Questionnaire (VVIQ), as further supported by a meta-analysis of a randomly selected sample of relevant literature. The present findings suggest that vividness may act as an index of availability of long-term sensory traces, playing a non-epiphenomenal role in facilitating the access of those memories.
Introduction
People often report they experience vivid spontaneous visual mental images in situations in which they have to recall something they did not expect to recall (incidental recall). Early imagery studies revealed that the spontaneous and involuntary appearance of a vivid visual mental image consistently occurred in response to certain memory conditions and tasks involving incidental recall. For example, upon asking subjects to remember the type of breakfast one had in the morning (Galton, 1880), the number of windows in one’s house (Shepard, 1966) or to verify a property of an experienced event with no aid of a current percept (Goldenberg et al., 1992) individuals often report vivid images. In such context, vividness is traditionally defined as a construct expressing the self-rated degree of richness, amount of detail (resolution), and clarity of a mental image, as compared to the experience of actual seeing (D’Angiulli and Reeves, 2007). Although vividness correlates with performance on certain memory tasks (Baddeley and Andrade, 2000), with arousal level (Barrowcliff et al., 2004; Bywaters et al., 2004), with positive emotional valence toward a stimulus (Alter and Balcetis, 2010), and with increased visual cortex activity (Farah and Peronnet, 1989; Farah et al., 1989; Sparing et al., 2002; Cui et al., 2007; Cattaneo et al., 2011, 2012), any attempt to clarify its function and its relationship to underlying processes still presents numerous challenges.
Manipulating vividness directly is difficult, and the lack of converging analyses has generally led to the use of correlational approaches that examine vividness predominantly as an index of individual differences in the ability to generate mental images. Furthermore, many preceding studies either confounded vividness with other variables, or did not appropriately interpret the validity criteria by anchoring the vividness construct to models of memory and verbal report underlying processes. This is a situation analogous to the one denounced years ago by Ericsson and Simon (1980) in the context of models of verbal reports, instruments such as vividness ratings/scale/questionnaires seem to be used in a brute empirical fashion, without considering a satisfactory a priori theory of the processes involved in the measurement instruments themselves. For the latter reason, it has been argued that there has also been confusion between issues of validity (e.g., discriminant or construct) and issues of reliability (e.g., specificity and precision). In the context of these challenges, the measurement of vividness has been hotly debated. As Pearson (1995) points out, vividness is usually measured using the Vividness of Visual Imagery Questionnaire (VVIQ) or its updated version, the VVIQ2 (Marks, 1995). However, these are not ideal measures for the experimental study of vividness per se, as they only measure the overall individual’s ability to generate vivid mental images (“trait vividness”), not differences between single experiences of mental imagery (“state vividness”). To study specific processes behind the phenomenon of vividness itself, it is more appropriate to use trial-by-trial self-reports in which the vividness of each individual mental image is rated immediately after its generation by the subject (Begg, 1988; Hertzog and Dunlosky, 2006; D’Angiulli, 2009; Pearson et al., 2011). The self reports were successfully employed in several previous studies, where the findings were consistent with both VVIQ research and new results outside the VVIQ’s realm of individual differences, which demonstrates that it is a reasonably robust measure (D’Angiulli, 2002, 2008, 2009; D’Angiulli, 2002; D’Angiulli and Reeves 2007; Alter and Balcetis, 2010; Rabin et al., 2010; Pearson et al., 2011). Despite these successes, so far there has been no clear empirical evidence showing exactly why trial-by-trial vividness reports should be considered more informative and reliable than the VVIQ. Do these sets of verbal reports reflect different or overlapping processes?
Many of the mentioned challenges could be mitigated by developing a model of processes underlying trial-by-trial vividness self-reports in visual mental image generation tasks, as opposed to just VVIQ measurement. One of the goals of the model should be to clarify the non-epiphenomenal role of the subjective vividness experience, a fundamental and difficult issue that continues to elude research efforts. An opportunity to gain some upper hand may be offered by conditions in which vivid imagery influences incidental recall in example situations such as the one mentioned earlier. The link between vividness and incidental recall was first suggested long ago (Richardson, 1969; Paivio, 1971) but the best evidence comes from studies showing that self-reported vividness is related with incidental recall of imagery-evoking verbal cues (Sheehan and Neisser, 1969; Sheehan, 1971, 1972b, 1973). In a typical paradigm devised by Sheehan (1972a), “vivid imagers” and “non-vivd imagers,” as defined by the VVIQ, were either intentionally or incidentally instructed to recall concrete (high imagery-evoking) and abstract (low imagery-evoking) words. Results showed that vivid imagers recalled concrete words significantly better in the incidental than in the intentional recall condition; whereas recall of abstract words was similarly poor in both conditions.
In another line of research, Neisser and Kerr used objective methods of mnemonic effectiveness and response time to study the spatial properties of visual imagery (Neisser and Kerr, 1973; Keenan and Moore, 1979; Kerr and Neisser, 1983). They asked the subjects to construct images in three different conditions according to presented sentences describing two objects in a given reciprocal spatial relation (concealed, next to/“pictorial,” far from/“separate”) and measured incidental recall rates of target verbal cues. Visual images acted as mnemonics in the concealed condition as well as the “pictorial” condition. If the procedure changed subtly and intentional learning was used instead, the objects in the concealed condition were recalled no better than the separate condition. The data from these experiments also showed that concealed images were less vivid than pictorial images, and response time was longer for less vivid images. Although instruction for imagery/recall had an effect on imagery vividness, incidental recall was invariably found to predict vividness even in studies that attempted to falsify Neisser and Kerr’s findings (Keenan, 1983).
The association between vividness and incidental recall is a relatively consistent finding across several different conditions and manipulations, and suggests that incidental recall could be used as the benchmark variable against which alternative hypotheses on the nature of imagery vividness and its function could be compared. Because older research had several shortcomings, Experiment 1 was designed to replicate, generalize, and extend said relationship. Most of those studies used global or delayed self-report of vividness. In addition, image generation time was confounded with vividness, and most paradigms did not clearly show whether the observed effects were discriminatively and specifically linked to recall processes (refer to Sheehan, 1973, for one exception). Furthermore, individual differences were often globally defined by the VVIQ, such that “good” versus “poor” imagers determined “high” versus “low” vividness, respectively. Finally, the lack of control for factors relating to the cued words themselves was a consistent problem in previous research. In the present research, a direct imagery and incidental recall paradigm were used, and several verbal properties were controlled for (age of word acquisition, word familiarity/frequency, imageability, and concreteness).
We compared two hypothetical cognitive components of mental image generation from verbal descriptions, which possibly could account for the outcomes of Experiment 1. If the relationship between vividness and unexpected recall were contingent upon shared processing while encoding the cues in the study phase (image generation), a possible relationship may be explained by depth of elaboration (Craik and Lockhart, 1972; Eysenck and Eysenck, 1980). The more time spent elaborating the imagined material, the more subjectively vivid the material should be. Subsequently, this should lead to better retention and recall in the test phase (free incidental recall). The main predictions derived from this hypothesis were that: (1) a direct relationship between image latency and incidental recall should exist, as should a relationship between incidental recall and self-rated vividness; (2) however, the correlation between vividness and incidental recall should be accounted for by image latency. Therefore, the correlation between vividness and incidental recall should be non-significant and/or correspond to a small effect size when image latency would be controlled for.
A possible alternative based on neurocognitive considerations is that vividness ratings rely on an index of the availability of multiple sensory traces in long-term memory (Hintzman and Block, 1971). Thus, because the strength of vividness would reflect the magnitude of the networks of sensory traces consolidated from episodic memory (Morris and Hampson, 1983; Rabbitt and Winthorpe, 1988), higher vividness ratings should be associated with better incidental recall performance (higher likelihood of accessing long-term traces). This model would also predict that the relationship between vividness and incidental recall can be partly explained by individual differences in participants’ ability to access long-term memory sensory information based on the prior estimate of availability supported by vividness judgments. The latter aspect could be conceived as a “meta-imagery” contribution, where the vividness judgment may reflect “a judgment of the richness of the current image combined with an estimate of the additional sensory information that could be incorporated, should the task requirements change.” (Baddeley and Andrade, 2000; p. 141). Consequently, individuals with greater metacognitive ability should experience more vivid images, be more efficient and faster in generating images, and yield higher incidental recall than the individuals who possess a reduced metacognitive ability. If greater vividness were related to greater incidental recall accuracy, and the relationship was not simply due to longer image latencies, then this would support the hypothesis that vividness acts as an index of stored memory trace availability, and plays a non-epiphenomenal role in determining the likelihood of accessing such memories in long-term memory.
In all the following experiments, explicit instructions to generate mental images was adopted as this manipulation has proven to be perhaps the most reliable and most direct way to ensure that participants are actually generating mental images, as shown by converging evidence from hundreds of studies showing that the report of having an image at request is associated with behavioral, neural, or clinical neuropsychological indices. In addition, while direct interference of imagery on low-level perception is an established phenomenon (Craver-Lemley and Reeves, 1987), the opposite effect, direct interference of low-level perception on imagery, is either weak and ubiquitous (see D’Angiulli, 2002) or is based again on introspective reports (as in Baddeley and Andrade, 2000). Therefore, the latter manipulations are no better or different than the ones we used for verifying the employment of imagery.
Experiment 1
Method and Materials
Participants
Participants were 26 first-year university students age range = 17–25; 14 female and 12 male). None had participated in an imagery study before (Campos et al., 2007). Participants signed up through a subject pool within 3 weeks of beginning introductory psychology courses, with 2% credit toward their final grade used as incentive. No significance was found for gender or age against any factors, so these variables were dropped from further consideration.
Stimuli
A body of 60 verbal description-cues from previous research (D’Angiulli, 2002:1]; available in D’Angiulli, 2001a) were matched with regards to noun or compound word frequency, imageability, concreteness, and reading time. These cues included single-noun and double-noun descriptions comprising both animate (e.g., dog, cat) and inanimate objects (e.g., car, bottle). The present data showed no significant differences between the two subsets of stimuli in terms of vividness or latency of elicited imagery. Secondary analyses indicated that these descriptions were rated as emotionally neutral, with negligible inter-item variability along a simple emotional rating scale (D’Angiulli, 2001b). In addition, the 10 noun-cues were selected from earlier research (Paivio et al., 1968) to use as buffer items during the incidental recall phase of the experiment (i.e., to filter out recency and primacy effects during recall). The 60 cues were presented in random order, preceded by five buffer noun-cues and followed by five other buffer noun-cues (which were presented in a fixed order).
Stimuli properties previously shown to intercorrelate were controlled for. Verbal cues with higher concreteness levels were shown to be recalled at significantly higher rates (Paivio, 1971), as were high frequency words (e.g., Miller and Roodenrys, 2009). Imageability, which refers to how easily a mental image can be generated from a word, has been correlated with concreteness (Tse and Altarriba, 2007). Age of acquisition, which refers to the average age a word enters a subject’s lexicon was indirectly controlled for, as it is highly correlated with both imageability (Ma et al., 2009) and concreteness (Barry and Gerhand, 2003). The well-validated MRC Psycholinguistic Database (Clark, 1997) was used to ensure the words used for cuing had approximately the same scores on these factors. Because it was assumed that vividness is an image-specific process, and it could not be rated if an image does not reach to conscious awareness, all cases rated “no image” were eliminated from our analysis.
Procedure
The protocol for Experiment 1 was approved by the Carleton University Research Ethics Board.
Image generation phase
Participants were seated facing a computer monitor and pressed the right mouse button to begin each trial. Upon clicking the mouse, an alerting beep was sounded, followed 250 ms later by the display of a noun-cue at the center of the screen. Participants were instructed to read the cue silently and as quickly as possible. They were immediately asked to generate an image that corresponded to the noun-cue. Participants were required to press the right mouse button again when they considered their image to be complete, and at its most vivid.
Upon pressing the button, another alerting beep was sounded, followed 250 ms later by a horizontal array of seven choices appearing near the bottom of the screen. From left to right, each button was labeled with one of seven vividness level descriptions in a seven-point scale format [(1), “no image”; (2), “very vague/dim”; (3), “vague/dim”; (4), “not vivid”; (5), “moderately vivid”; (6), “very vivid”; and (7), “perfectly vivid”], as in Marks (1995. Time was taken to familiarize participants with the rating system during pre-test practice sessions. Participants used the mouse to click on one of these seven buttons, and were instructed to rate any failure to generate an image as a “no image.” There was no deadline for their response.
Following the vividness response, the array of buttons disappeared and the display reverted back to a screen instructing the participant to click the mouse when they were ready to begin the next trial. In an effort to minimize imagery persistence between trials, stimuli were presented in random order with a minimum inter-trial interval of 5 s (Craver-Lemley and Reeves, 1987). Participants were not informed that latency times were covertly measured. Button presses were justified as a means to signal a complete image, which was ready to be rated, and prompt the appearance of the vividness scale buttons.
Free incidental recall phase
After completing the image generation phase, participants took a 20 min break. Afterward, they were asked to return to the lab to fill out additional paperwork, to receive course credit, and complete the debriefing process. Prior to the image generation phase, participants had not been informed that they would be required to recall any of the stimuli. Upon their return, precisely 30 min from the end of the image generation phase, they were asked to complete the incidental recall task, wherein they were required to recall and record as many of the previously read descriptions as possible.
Each phase of the experiment was exclusively conducted by one of two paid undergraduate research assistants. Both research assistants received training in their module, yet remained naïve to the purposes and hypotheses of the study. Final debriefing was conducted through an exit interview with the principal investigator.
Results and Discussion
Preliminary analyses were conducted on the empirical distributions of raw response times (RTs) for each level of vividness (except level 1 = “no image”). A total of 1490 valid observations were available after all cases with a rating of “no image” (5% of total trials) were removed. Data were binned using the smallest increment that did not make the histograms appear too irregular. From the initial binning it became apparent that our RT data could be fitted by an ex-Gaussian – that is, the convolution of an exponential with a Gaussian. This ex-Gaussian model has been used successfully in several experimental paradigms (for reviews, see Ratcliff, 1979, 1993; McNicol and Stewart, 1980; Luce, 1986) to fit explicit theoretical distribution functions and to give convenient summary of empirical RT distributions. The assumption of the ex-Gaussian model is that RT is the sum of two other random variables, one distributed as a Gaussian and one distributed as an exponential (Luce, 1986). Previous work (D’Angiulli and Reeves, 2002) has supported the hypothesis that the ex-Gaussian model reflects the time to retrieve images from memory so that “image generation” can be essentially reduced to “retrieving images from memory.” Therefore, variations in each of ex-Gaussian parameters across vividness levels could be assumed to describe the core underlying generative processes common to both imagery and incidental recall. The ex-Gaussian model was fitted using a robust regression method due to Hoaglin et al. (1983).
To ensure the ex-Gaussian reflected the shape of the group data, and the shape of the individual data, the model was first vincentized for individual data, and then averaged over vividness levels. Histograms were constructed by pooling the raw RTs from each vividness level over subjects, irrespective of the individual source of the RTs. This method has been used in situations where there are too few trials for single subjects (see Ratcliff, 1979). We verified whether the related observations were serially independent and not autocorrelated for each subject, if so we could assume independence of collective observations (see Neter et al., 1996). In our case, the Durbin–Watson autocorrelation test statistic D clearly exceeded the upper bound in the assessment of each subject [du > 1.62; α = 0.05; n = 60; lag = 1] as well as for each vividness level submitted to fitting, thereby showing no autocorrelation.
Table 1 shows the ex-Gaussian fit to the distribution histograms of RTs obtained for each vividness level. For each distribution, the ex-Gaussian fit explained at least 68% of the variance associated with RTs. The general distribution of the vividness data showed the median rating was a value of 4 (“non-vivid”). Examination of each vividness level regressed onto RTs showed both distributions were best summarized by piecewise linear regressions of opposite slope. These data supported a clear split between vivid (rating values 5–7) and non-vivid (2–4) observations.
Table 1. Results of the ex-Gaussian fit to empirical image latency distributions in unconstrained image generation phase of Experiment 1 (see text for details).
The Gaussian of both vivid images (levels 5–7) and less vivid images (levels < 4) are reported in Figure 1. Both distributions have comparable standard deviation, as evidenced by the left tail of the distributions. However, the distribution of less vivid images is delayed >500 ms, as evidenced by the shift on the time axis. Consistent with previous findings (D’Angiulli and Reeves, 2002), more vivid images were typically associated with shorter Gaussian latency components than were less vivid images. It is important to point out the enormous variability in the response latencies, and that the relationship between vividness could not be easily guessed by naïve participants. Therefore, it is rather implausible that the observed pattern might be due to response-bias based on an explicit or conscious criterion-shift, or set of decisions, since this would have required the participants to first tacitly simulate the ex-Gaussian model, and then retrofit their responses coherently to the model to produce the observed pattern. Because this would have to be done uniformly by all participants, the variability should have been much more contained than what we observed.
Figure 1. Ex-Gaussian model-fit of RT distributions for images rated with vividness 2–4 and 5–7 in Experiment 1.
The key analysis examined the predictability of recall and RTs from vividness rating category (non-vivid versus vivid). In an effort to meet assumptions for parametric procedures and augment robustness to violations, the distribution of RTs was normalized with a logarithmic transformation, after which no multivariate outliers were detected. Figure 2 shows the within-subject mean proportion of incidentally recalled imagery-evoking verbal cues presented during the image generation phase against the rated vividness level. Figure 3 shows the within-subjects mean RTs of image generation against the rated vividness level (for presentation, RT data are expressed as seconds, derived from antilog transformation). The proportion of recalled cues corresponding to vivid images was 0.77 (SE = 0.05), whereas the proportion of recalled cues corresponding to non-vivid images was 0.19 (SE = 0.04). A paired samples test showed the difference to be significant [t(25) = 6.69; p < 0.0001], explaining 74% of the variance. In contrast, the mean RTs for vivid (14.8 s, SE = 2.71) and non-vivid (13.33, SE = 2.14) cues did not differ [t(25) < 1, p = 0.34; R2 < 0.01].
Mean proportion of incidental recall for verbal cues corresponding to vivid (rating: 5–7) and non-vivid (rating: 2–4) images in Experiment 1.
Mean image generation time for verbal cues corresponding to vivid (rating: 5–7) and non-vivid (rating: 2–4) images in Experiment 1.
A linear regression analysis examining the effect of individual differences on the total number of images recalled showed that 14% of the variance in incidental recall accuracy was explained by participants’ average vividness rating [F(1, 25) = 4.05, MSe = 0.38, p = 0.05]. Therefore, the role of individual differences was modest and its effect size (r) was significantly smaller than that of vividness described earlier (0.86 versus 0.37, z = 3.07, p = 0.002).
A two-predictor model (stimulus and vividness) was fitted to the data to test the hypothesis regarding relationship between vividness and recall. Stimulus was plotted as a nominal factor, in which each category was a noun-cue. It was included as a predictor to ensure vividness effects were not due to the tendency for some words to produce more vivid images than others. The resulting model [Predicted logit of (Recall) = 0.664 + β1*Vividness + β2*Stimuli] was statistically reliable, χ2(62, 1441) = 340.969, p < 0.001 (see Appendix A for analysis details). According to the model, greater vividness ratings for noun-cues predicted recall with an overall success rate of 72.2%. The model correctly classified 83.7% of unrecalled cues and 54.7% of recalled cues. Stimulus and vividness generate a statistically significant predictive model for recall (see Appendix) that accounted for 28.3% of the variance in incidental recall. No change was observed if the model was fit to predict recall when response time was added as a predictor [χ2(65, 1490) = 389.437, p < 0.001]. RT did not exert an influence on the model (B = −0.002, p = 0.587), which further confirmed the null effect of RT on recall. Therefore, vividness could not account for recall accuracy simply because participants spent more time imagining the items corresponding to the verbal cues.
A linear mixed model was fit to the data to assess the contribution of stimulus and RTs to linear change in vividness of imagery. The variables in the model were evaluated by a Type III test. Since the sample size was not large, Restricted Iterative Generalized Least -squares (RIGLS) was used (Goldstein, 1986). Stimuli and RT had a significant effect on vividness [F(59, 1000) = 1.59, η2 = 0.086, p < 0.05], and F(1, 1103) = 5.17,η2 = 0.005, p < 0.05, respectively. Therefore, because the effects were small, RTs and stimuli influenced vividness only minimally. There was no interaction between stimuli and RTs (F < 1).
To determine if recall and vividness ratings were affected by the verbal properties of the word stimuli that were not kept constant during stimulus selection, correlation analyses were conducted on age of acquisition, and familiarity versus recall. No significant relationship was found between the percentage of participants that recalled a cue, and either age of acquisition (r = 0.213, p = 0.317) or familiarity scores (r = 0.118, p = 0.445). In addition, effects of stimuli regressed onto vividness, recall and RTs all explained less than 0.5% of the variance.
The results of Experiment 1 implicate vividness ratings as a predictor of incidental recall for imagery-evoking cues. The effect of individual differences in imaging ability on incidental recall was much smaller than the effect of vividness. Because image latency was unrelated to incidental recall, and inversely related to vividness, these data were incompatible with the depth of elaboration account. Because the effect of vividness on incidental recall for verbal cues was tested, the influence of expectancy and demand characteristics were minimized. These results support the validity of vividness as a measurable construct, and as an entity which may represent real underlying memory processes. Vividness ratings likely reflect a process which provides a natural mnemonic for unexpected retrieval of implicitly coded information (see Kosslyn et al., 2006).
Experiment 2
Although Experiment 1 did not include a measure of VVIQ, incidental and intentional recall has traditionally shown a modest correlation with the VVIQ and VVIQ2, with average effect sizes generally of about r = 0.13 (see McKelvie, 1995; Dean and Morris, 2003). More recent evidence suggests the relationship between VVIQ2 and trial-by-trial vividness ratings is weak to moderate (r < 0.20) (D’Angiulli, 2001a; D’Angiulli and Reeves, 2007). Also, the patterns of results from Sheehan (1971, 1972b) suggest the quality of imagery is contingent upon properties of the stimuli within the setting of each trial, and predicts incidental free recall and recognition performance. Lastly, other studies found the modest correlation between trial-by-trial ratings and VVIQ holds only for female participants (Sheehan, 1971, 1973).
In contrast with these findings, Pearson et al. (2011) reported large predictive effects of both trial-by-trial vividness ratings and VVIQ2 scores when related to bias in reporting a dominant pattern during a binocular rivalry task. The underlying assumption was that similar metacognitive processes (i.e., knowing how and what the observer knows about his/her own processes of visual mental imagery) would be used in trial-by-trial vividness ratings and in VVIQ2. If this assumption is correct, the overlapping processes could shed some light on the results of our Experiment 1. One interpretation of the results of Experiment 1 is that trial-by-trial vividness ratings may be accounted for by the same metacognitive judgment processes involved in responding to the VVIQ2. Experiment 2 was designed to examine the putative relationship between vividness ratings and VVIQ2. If the association between the VVIQ2 and vividness ratings were confirmed in Experiment 2, then one may also explain the basis through which vividness ratings could predict incidental recall in terms of the overlapping metacognitive processes involved in the VVIQ2.
The design of Experiment 2 was a variation of the paradigm used by Baddeley and Andrade 2000). Upon completing the VVIQ2, female participants were asked to read a short description of a static or dynamic scene, and press a key upon generating complete visual mental image. Participants then rated the vividness and the subjectively perceived latency of the image on a trial-by-trial basis. If, as the results of Experiment 1 would suggest, vividness ratings are based on an index of multiple sensory traces available in long-term memory, this account would predict: (1) higher trial-by-trial vividness ratings for dynamic scenes than static scenes, and (2) a negative (i.e., inverse) relationship between trial-by-trial vividness and perceived imagery latency. The VVIQ2 should correlate with trial-by-trial vividness ratings from both dynamic and static scenes, but should not relate to perceived imagery latency when the effects of vividness are removed.
Conversely, if the VVIQ2 accounts for most of the relationship between trial-by-trial vividness ratings and perceived imagery latency, then vividness judgments could be attributed to similar individual metacognitive skill differences involved in the two types of vividness measures (Baddeley and Andrade, 2000; Pearson et al., 2011). However, because dynamic mental imagery capacitates working memory more than static mental imagery, fewer resources are available for concurrent metacognitive processes. Then, under such circumstances one would expect less vivid images for dynamic scenes than static ones.
Methods and Materials
Participants
Participants were 44 female undergraduate students (age range: 18–25). Participants signed up through a subject pool, with 2% credit toward their final grade used as incentive. All participants had normal or corrected-to-normal vision, and no reported or documented learning disabilities. Participation required the attendance of two appointments. The first appointment was a preliminary screening session, where participants filled out the VVIQ2 and individual data. The second appointment was the experimental session. Five potential participants were excluded from the experiment, as they were unable to evoke the images as required.
Materials
An adaptation of 17 static and 17 dynamic scene descriptions were used (Baddeley and Andrade, 2000; Experiment 4, see Appendix A, p. 144). The scenes were adapted such that words including British content (e.g., Big Ben) were substituted with equally long words describing North American content (e.g., CNN Tower) which were validated through pilot experiments. During the screening phase, a question from the visual portion of the procedure for assessing expectations on the vividness of imagery was asked (Baddeley and Andrade, 2000; see Appendix C, Q2, Question 2, p. 145). After the experimental phase, a tacit knowledge assessment procedure was administered.
Procedure
The protocol for Experiment 2 was approved by the Carleton University Research Ethics Board.
Participants were given instructions, and 10 min of practice with five dynamic and five static imagery scenes. Between each practice trial, participants were required to report how well they could control each image. Only participant ratings with vividness greater than “extremely slow” (1) for 80% of the practice trials qualified for the entire experiment. One participant was eliminated from the initial pool under such criteria. Upon completing the practice session, participants verbally repeated the instructions to the experimenter to ensure the instructions were understood.
Participants were instructed to silently read a description of a dynamic or static scene displayed on a computer screen, which occurred 250 ms after an alerting beep. The experiment consisted of 17 dynamic, and 17 static descriptions. Participants were tested individually, and the procedure lasted approximately 40 min. Upon reading each description, participants were required to press a key to indicate the description was understood. Participants were instructed to imagine the description with their eyes open, and as seen from the front. Outline drawings were shown as examples before the experiment began. Each description was presented in random order with an inter-trial interval of 5 s (Craver-Lemley and Reeves, 1987). Upon forming a complete mental image, participants were required to press a button on a mouse. Four seconds after the button press, participants were shown buttons to rate perceived vividness, and perceived latency of the images. Participants were asked to rate their image as “complete” or “finished” when the image was maximally clear and detailed (see Cocude and Denis, 1988). Participants were required to rate their mental image as they had experienced it at the time of the key press. There was no deadline for the rating responses.
The presentation order of the scales was randomized, such that vividness could follow or precede perceived imagery rating. The second rating task followed immediately after the first rating response. The vividness scale consisted of a horizontal array of seven buttons appearing at the center of the screen. From left to right, each button was labeled with a short description corresponding to one of seven levels of the vividness scale used in Experiment 1. The imagery latency (speed) scale consisted of a horizontal array of seven buttons appearing at the center of the screen. From left to right, each button was labeled with a short description corresponding to one of the seven levels: from “extremely fast” (7), to “extremely slow” (1). Valid trials were defined by vividness greater than 1. Subjects were instructed to give a “1” response if they were unable to form a mental image. Upon completing the experiment, participants underwent a post-experimental interview, wherein they quickly described what they had imagined for seven randomly probed descriptions from both dynamic and static condition. Post-experimental interviews were concluded with the tacit knowledge assessment procedure (Baddeley and Andrade, 2000), and included the following question:
“We are interested in knowing if you think that there wasa relationship between how vivid your images were and other factors. Please just tell us what you expect or think, please do not use images to answer the question, we are just interested in what you predict or think about things that may be related or may determine the vividness of your images.”
Results and Discussion
To eliminate effects of discrepant scales, total scores for the VVIQ2 were converted to mean vividness values through a simple linear transformation. The transformation resulted in a seven-point scale; henceforth, referred to as mean vviq2. As in Experiment 1, we considered only valid responses. The rate of excluded invalid trials was approximately 3% (level 1 = “no image”), a proportion similar to Experiment 1. On average, images were reported as moderately vivid, and were produced at a relatively fast perceived latency in both static (Mviv. = 5.31, SDviv. = 0.55; Mspeed = 5.40, SDspeed = 0.39) and dynamic (Mviv. = 5.29, SD = 0.77; Mspeed = 5.59, SDspeed = 0.62) conditions. Paired samples contrasts showed dynamic imagery was perceived as significantly faster than static imagery [t(38) = 2.52, p < 0.025]. However, mean vividness ratings did not differ between the two conditions [t(38) < 1, p = 0.797]. The latter result differed from Baddeley and Andrade’s findings (Experiment 4). (They found dynamic imagery was significantly less vivid than static imagery). Images produced for the VVIQ2 were significantly more vivid (Mvviq2 = 5.68, SDvviq2 = 0.62) than vividness for static images [t(38) = 3.88, p < 0.0001], and dynamic images [t(38) = 2.83, p < 0.01].These data may be interpreted as evidence that participants were generally much more confident in their imagery abilities than what they were capable of demonstrating during the experimental procedure. The discrepancy between trial-by-trial vividness level and VVIQ2 imply a lack of agreement between metacognitive judgment as measured through the VVIQ2, and verbal reports specific to the actual imagery task.
Table 2 shows correlations among all measures. VVIQ2 was significantly correlated with vividness of static imagery, but was not related to vividness of dynamic imagery, nor perceived latency in both static and dynamic imagery conditions. A very strong inverse relationship between trial-by-trial vividness ratings and perceived imagery latency was observed in both static and dynamic imagery conditions, with strong to marginal evidence of the same trends in crossed conditions.
Table 2. Correlation matrix among VVIQ2 and self-reported image vividness ratings and perceived generation speed in dynamic and static imagery conditions of Experiment 2.
Whereas vividness ratings correlated with perceived latency, the VVIQ2 did not. These data provide very weak evidence validating the VVIQ2, when the criterion is a self-report, subjective third variable. Logically, one would not expect any predictive success of VVIQ2 in relation to a behavioral variable such as incidental recall. The observed patterns were analyzed to determine if they could be predicted by expectations or tacit knowledge (Pylyshyn, 2003). There was no significant difference in the number of participants expecting vivid imagery to be less or more vivid than static imagery (χ2 < 1). Figure 4 describes participant responses concerning self-rated predictions about the type of relationship they expect to exist between perceived vividness and perceived imagery latency, as documented during the preliminary screening session. Most participants predicted a positive relationship, or no relationship between vividness and imagery latency. One participant correctly predicted the inverse relationship. Upon removing the data of this participant from the analysis, there were no significant differences between results.
Figure 4. Percentages of participants predicting what type of relationship they tacitly think there should be between vividness ratings and speed of imagery in Experiment 2.
In conclusion, the association between the VVIQ2 and vividness ratings was not observed consistently in both the conditions of Experiment 2, and if collapsed across conditions (static and dynamic) the effect becomes modest and not significant. VVIQ2 also failed to validate against a third self-report criterion variable (perceived image latency). If the VVIQ2 assesses individual differences in metacognitive ability, it seems implausible that such abilities would predict incidental recall. Because trial-by-trial vividness predicted incidental recall, the metacognitive aspects assumed to be reflected by VVIQ2 do not appear to influence vividness and the mental imagery process to a significant degree.
General Discussion
Despite controlling for imageability, concreteness, age of acquisition, and verbal frequency/familiarity, the results from Experiment 1 showed a positive relationship between vividness ratings and incidental recall of imagery-evoking cues. These results are not consistent with depth of elaboration, as faster image generation latencies accompanied higher vividness ratings, a pattern opposite to what depth of elaboration would predict. Furthermore, because depth of elaboration predicts a positive correlation between incidental recall and image generation time, it again fails to account for the data from Experiment 1.
Our findings are compatible with an alternative model of vividness processes based on multi-trace memory theory (MMT; Moscovitch et al., 2005). This model proposes that vividness ratings are based on an index of the availability of multiple sensory traces in long-term memory, the strength of vividness reflecting the magnitude of the networks of sensory traces that have been consolidated from episodic memory. This is described by the inverse relationship between vividness ratings and image latency (the “vivid-is-fast” relation). Thus, higher vividness ratings are associated with higher likelihood of incidental recall, as shown by the data of Experiment 1.
The follow-up results observed in Experiment 2 showed that individual differences, as measured by the VVIQ2, are not a viable account for the relationship in Experiment 1 between vividness and incidental recall. Most important, the results of Experiment 2 also suggest that if there were metacognitive aspects involved in trial-by-trial vividness ratings, they would not likely be the same ones underlying VVIQ measures. Taken together the results of Experiment 1 and Experiment 2 are consistent with those observed in a meta-analysis we conducted, representing 5% of the literature pertaining to “vividness” and “VVIQ” (reported in Appendix B). The proportion of significant and non-significant experimental outcomes for trial-by-trial vividness ratings and VVIQ factor effects were calculated. For behavioral, cognitive, and neural measures, a greater number of significant experimental outcomes accompanied trial-by-trial vividness ratings than the VVIQ. Furthermore, the correlation between VVIQ scores and trial-by-trial vividness ratings for 21 entries showed an average correlation of 0.15, and variability in these values ranged from r = −0.27, to r = 0.64. Consistent with the results of experiment 2, these additional results support the contention that trial-by-trial vividness self-reports and VVIQ scores share some descriptive properties of visual imagery. However, trial-by-trial vividness ratings seem to resolve the construct of mental imagery with much greater reliability. Although metacognitive processes may be occurring in single trial judgment, it is perhaps more parsimonious to assume that vividness ratings are mostly a form of Level 2 retrospective verbal reports (Ericsson and Simon, 1993).
Considered as retrospective verbal reports, vividness ratings may be based on a direct translation of residual top-down sensory traces available in long-term memory (D’Angiulli and Reeves, 2002), wherein vividness intensity is proportional to the magnitude of sensory traces available. This statement agrees with a number of neurocognitive considerations borne out of MMT research. According to that theoretical framework, each sensory trace is distributed across the cortex, such that various distributive patterns are unique to a specific sensory input, and is distinct from all other distributive patterns (Hintzman, 1976). Sensory traces are thought to be indexed by the hippocampus (Ryan et al., 2001), and integrated into a mental image by the cuneus, precuneus, and occipital lobes (Svoboda et al., 2006; Svoboda and Levine, 2009; Cabeza and St. Jacques, 2007). However, hippocampal indexing becomes less influential as each individual sensory trace is integrated into cortical networks through successive (re)presentations (Takashima et al., 2009). Mental images are consolidated neural patterns that correspond to these “synthetic” sensory long-term traces, whose levels of interconnectedness are correlated to their perceived reportable vividness (Rabin et al., 2010).
Our study also indicates that although the VVIQ or VVIQ2 may very well measure an individual’s ability to generate vivid mental images (“trait vividness”), it likely lacks the resolution to measure an individual’s ability to experience vivid mental images in specific situational contexts (“state vividness”). To study specific processes behind the phenomenon of vividness itself (rather than “trait vividness”), it is perhaps more appropriate to use trial-by-trial self-reports, wherein vividness is rated immediately after its generation (Begg, 1988; Hertzog and Dunlosky, 2006; D’Angiulli, 2009; Pearson et al., 2011). Such self-reports have met with compounding success progressing beyond the VVIQ’s realm of individual differences, while remaining generally consistent with it. Vividness ratings demonstrate the reasonably robust nature of self-reports as a measure of “state” and “trait” vividness (D’Angiulli, 2002; D’Angiulli and Reeves, 2002, 2007; D’Angiulli, 2009; Alter and Balcetis, 2010; Rabin et al., 2010; Pearson et al., 2011). This particular issue is critical given the recent resurgence of use of the VVIQ in cognitive neuroscience – especially in the realm of neuroimaging (Amedi et al., 2005; Palmiero et al., 2010).
In summary, we found that trial-by-trial vividness ratings predict incidental recall, and the relationship cannot be attributed to depth of elaboration or metacognitive processes related to self-appraisal of individual imagery ability, as measured by the VVIQ2. Our results suggest that vividness of imagery makes implicit information available to consciousness, and to some extent, is linked with the associative processes through which phenomenal availability translates into access of incidental episodic memories. Therefore, we conclude, in certain conditions conscious phenomenological experience associated with imagery does not have a trivial role as it can have a critical influence on recall performance.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We thank Adam Reeves and the late Ulric Neisser for comments on earlier ideas and bits of manuscripts before the complete draft even existed. Many thanks also go to John M. Kennedy, Robert Kunzendorf, J. T. E. Richardson, Stuart McKelvie for comments and pre-reviews of the ideas and data presented here, Judy Hall for meta-analytic advice. Portions of this research were presented at ICOM (International Congress on Memory), University of York, UK, 2012.
References
Alter, A. L., and Balcetis, E. (2010). Fondness makes the distance grow shorter: desired locations seem closer because they seem more vivid. J. Exp. Soc. Psychol. 47, 16–21.
Amedi, A., Malach, R., and Pascual-Leone, A. (2005). Negative BOLD differentiates visual imagery and perception. Neuron 48, 859–872.
Baddeley, A. D., and Andrade, J. (2000). Working memory and the vividness of imagery. J. Exp. Psychol. Gen. 129, 126–145.
Barrowcliff, A. L., Gray, N. S., Freeman, T. C. A., and MacCulloch, M. J. (2004). Eye-movements reduce the vividness, emotional valence and electrodermal arousal associated with negative autobiographical memories. J. Forens. Psychiatry Psychol. 15, 325–345.
Barry, C., and Gerhand, S. (2003). Both concreteness and age-of-acquisition affect reading accuracy but only concreteness affects comprehension in a deep dyslexic patient. Brain Lang. 84, 84–104.
Begg, I. (1988). What does the vividness of an image tell us about the value of imagery? J. Ment. Imagery 12, 45–56.
Bywaters, M., Andrade, J., and Turpin, G. (2004). Determinants of the vividness of visual imagery: the effects of delayed recall, stimulus affect and individual differences. Memory 12, 479–488.
Cabeza, R., and St. Jacques, P. (2007). Functional neuroimaging of autobiographical memory. Trends Cogn. Sci. (Regul. Ed.) 11, 219–227.
Campos, A., Gómez-Juncal, R., and Pérez-Fabello, M. J. (2007). Experience in imagery and imagery vividness. Imagin. Cogn. Pers. 27, 337–348.
Cattaneo, Z., Bona, S., and Silvanto, J. (2012). Cross-adaptation combined with TMS reveals a functional overlap between vision and imagery in the early visual cortex. Neuroimage 59, 3015–3020.
Cattaneo, Z., Pisoni, A., Papagno, C., and Silvanto, J. (2011). Modulation of visual cortical excitability by working memory: effect of luminance contrast of mental imagery. Front. Psychol. 2:29. doi:10.3389/fpsyg.2011.00029
Clark, C. (1997). MRC Psycholinguistic Database. The University of Western Australia School of Psychology. Available at: http://websites.psychology.uwa.edu.au/school/MRCDatabase/mrc2.html
Cocude, M., and Denis, M. (1988). Measuring the temporal characteristics of visual images. J. Ment. Imagery 12, 89–101.
Craik, F., and Lockhart, R. (1972). Levels of processing: a framework for memory research. J. Verb. Learn. Verb. Behav. 11, 671–684.
Craver-Lemley, C., and Reeves, A. (1987). Visual imagery selectively reduces vernier acuity. Perception 16, 599–614.
Cui, X., Jeter, C. B., Yang, D., Montague, P. R., and Eagleman, D. M. (2007). Vividness of mental imagery: individual variability can be measured objectively. Vision Res. 47, 474–478.
D’Angiulli, A. (2001a). Phenomenal and temporal aspects of visual mental image generation: validating verbal reports on vividness through latency analysis. Diss. Abstr. Int. 61, 5015.
D’Angiulli, A. (2001b). Vividness vs. imageability: effects on word frequency and image latency. Northwest Cognition and Memory, 3rd Annual Meeting, May 26, Vancouver, BC.
D’Angiulli, A. (2002). Mental image generation and the contrast sensitivity function. Cognition 85, B11–B19.
D’Angiulli, A. (2008). Is the spotlight an obsolete metaphor of “seeing with the mind’s eye”? A constructive naturalistic approach to the inspection of visual mental images. Imagin. Cogn. Pers. 28, 117–135.
D’Angiulli, A. (2009). “Vividness and behavioral-specificity in imagery: not what you’d expect,” in Proceedings of the 31st Annual Meeting of The Cognitive Science Society: Symmetry & Vividness in Vision, eds N. Taatgen and H. van Rijn (Amsterdam, NL: The Cognitive Science Society), 49.
D’Angiulli, A., and Reeves, A. (2002). Generating visual mental images: latency and vividness are inversely related. Mem. Cognit. 30, 1179–1188.
D’Angiulli, A., and Reeves, A. (2007). The relationship between self-reported vividness and latency during mental size scaling of everyday items: phenomenological evidence of different types of imagery. Am. J. Psychol. 120, 521–551.
Dean, G. M., and Morris, P. E. (2003). The relationship between self-reports of imagery and spatial ability. Br. J. Psychol. 94, 245–273.
Ericsson, K. A., and Simon, H. A. (1993). Protocol Analysis: Verbal Reports as Data. Cambridge, MA: The MIT Press.
Eysenck, M. W., and Eysenck, M. C. (1980). Effects of processing depth, distinctiveness, and word frequency on retention. Br. J. Psychol. 71, 263–274.
Farah, M. J., and Peronnet, F. (1989). Event-related potentials in the study of mental imagery. J. Psychophysiol. 3, 99–109.
Farah, M. J., Weisberg, L. L., Monheit, M. A., and Peronnet, F. (1989). Brain activity underlying mental imagery: event-related potentials during mental image generation. J. Cogn. Neurosci. 1, 302–316.
Goldenberg, G., Steiner, M., Podreka, I., and Deecke, L. (1992). Regional cerebral blood flow patterns related to verification of high and low imagery sentences. Neuropsychologia 30, 1081–1092.
Goldstein, H. (1986). Multilevel mixed linear model analysis using iterative generalized least square. Biometrika 73, 43–56.
Hertzog, C., and Dunlosky, J. (2006). “Using visual imagery as a mnemonic for verbal associative learning,” in Imagery and Spatial Cognition: Methods, Models, and Cognitive Assessment, eds T. Vecchi and G. Bottini (Philadelphia, PA: John Benjamins), 259–278.
Hintzman, D. (1976). “Repetition and memory,” in The Psychology of Learning and Motivation, ed. G. Bower (New York, NY: Academic Press), 47–91.
Hintzman, D. L., and Block, R. A. (1971). Repetition and memory: evidence for a multi-trace hypothesis. J. Exp. Psychol. 88, 297–306.
Hoaglin, D. C., Mosteller, F., and Tukey, J. W. (1983). Understanding Robust and Exploratory Data Analysis. New York: Wiley.
Keenan, J. M. (1983). Qualifications and clarifications of images of concealed objects: a reply to Kerr and Neisser. J. Exp. Psychol. Learn. Mem. Cogn. 9, 222–230.
Keenan, J. M., and Moore, R. E. (1979). Memory for images of concealed objects: a reexamination of Neisser and Kerr. J. Exp. Psychol. Learn. Mem. Cogn. 5, 374–385.
Kerr, N. H., and Neisser, U. (1983). Mental images of concealed objects-new evidence. J. Exp. Psychol. Learn. Mem. Cogn. 9. 212–221.
Kosslyn, S. M., Thompson, W., and Ganis, G. (2006). The Case for Mental Imagery. New York, NY: Oxford University Press.
Luce, R. D. (1986). Response Times: Their Role in Inferring Elementary Mental Organization. New York: Oxford University Press.
Ma, W., Golinkoff, R., Hirsh-Pasek, K., McDonough, C., and Tardif, T. (2009). Imageability predicts the age of acquisition of verbs in Chinese children. J. Child Lang. 36, 405–423.
McKelvie, S. J. (1995). Vividness of Visual Imagery: Measurement, Nature, Function & Dynamics. New York, NY: Brandon House.
McNicol, D., and Stewart, G. W. (1980). “Reaction time and the study of memory,” in Reaction Times, ed. A. T. Welford (London: Academic Press), 253–307.
Miller, L., and Roodenrys, S. (2009). The interaction of word frequency and concreteness in immediate serial recall. Mem. Cognit. 37, 850–865.
Moscovitch, M., Rosenbaum, R. S., Gilboa, A., Addis, D. R., Westmacott, R., Grady, C., et al. (2005). Functional neuroanatomy of remote episodic, semantic and spatial memory: a unified account based on multiple trace theory. J. Anat. 207, 35–66.
Mosteller, F., and Bush, R. R. (1954). “Selected quantitative techniques,” in Handbook of Social Psychology, ed. G. Lindzey (Cambridge, MA: Addison-Wesley), 289–334.
Neisser, U., and Kerr, N. (1973). Spatial and mnemonic properties of visual images. Cogn. Psychol. 5, 138–150.
Neter, J., Kutner, M. H., Nachtsheim, C. J., and Wasserman, W. (1996). Applied Linear Statistical Models, 4th Edn. New York: McGraw-Hill/Irwin Series: Operations and Decision Sciences.
Paivio, A., Yuille, J. C., and Madigan, S. (1968). Concreteness, imagery, and meaningfulness values for 925 nouns. J. Exp. Psychol. 76, 1–25.
Palmiero, M., Nakatani, C., Raver, D., Belardinelli, M., and Leeuwen, C. (2010). Abilities within and across visual and verbal domains: how specific is their influence on creativity? Creat. Res. J. 22, 369–377.
Pearson, D. G. (1995). The VVIQ and cognitive models of imagery: future directions for research. J. Ment. Imagery 19, 167–170.
Pearson, J., Rademaker, R. L., and Tong, F. (2011). Evaluating the mind’s eye: the metacognition of visual imagery. Psychol. Sci. 22, 1535–1542.
Rabbitt, P., and Winthorpe, C. (1988). “What do old people remember? The Galton paradigm reconsidered,” in Practical Aspects of Memory: Current Research and Issues, eds M. M. Gruneberg, P. E. Morris, and R. N. Sykes (New York: Wiley), 301–307.
Rabin, J. S., Gilboa, A., Stuss, D. T., Mar, R. A., and Rosenbaum, R. S. (2010). Common and unique neural correlates of autobiographical memory and theory of mind. J. Cogn. Neurosci. 22, 1095–1111.
Ratcliff, R. (1979). Group reaction time distributions and an analysis of distribution statistics. Psychol. Bull. 86, 446–461.
Rosenthal, R. (1991). Meta-Analytic Procedures for Social Research (rev. ed.). Newbury Park, CA: Sage.
Ryan, L., Nadel, L., Keil, K., Putnam, K., Schnyer, D., Trouard, T., et al. (2001). Hippocampal complex and retrieval of recent and very remote autobiographical memories: evidence from functional magnetic resonance imaging in neurologically intact people. Hippocampus 11, 707–714.
Sheehan, P. W. (1972b). Role of imagery in incidental learning: replication and extension of an effect. J. Exp. Psychol. 95, 226–228.
Sheehan, P. W. (1973). Stimulus imagery effect and the role of imagery in incidental learning. Aust. J. Psychol. 25, 93–102.
Sheehan, P. W., and Neisser, U. (1969). Some variables affecting the vividness of imagery in recall. Br. J. Psychol. 60, 71–80.
Shepard, R. N. (1966). Learning and recall as organization and search. J. Verb. Learn. Verb. Behav. 5, 201–204.
Sparing, R., Mottaghy, F. M., Ganis, G., Thompson, W. L., Topper, R., Kosslyn, S. M., et al. (2002). Visual cortex excitability increases during visual mental imagery – a TMS study in healthy human subjects. Brain Res. 938, 92–97.
Svoboda, E., and Levine, B. (2009). The effects of rehearsal on the functional neuroanatomy of episodic autobiographical and semantic remembering: a functional magnetic resonance imaging study. J. Neurosci. 29, 3073–3082.
Svoboda, E., McKinnon, M., and Levine, B. (2006). The functional neuroanatomy of autobiographical memory: a meta-analysis. Neuropsychologia 44, 2189–2208.
Takashima, A., Nieuwenhuis, I. L. C., Jensen, O., Talamini, L. M., Rijpkema, M., and Fernández, G. (2009). Shift from hippocampal to neocortical centered retrieval network with consolidation. J. Neurosci. 29, 10087–10093.
Tse, C., and Altarriba, J. (2007). Testing the associative-link hypothesis in immediate serial recall: evidence from word frequency and word image ability effects. Memory 15, 675–690.
Appendices
Appendix A
A linear mixed model was fit to the data to assess the contribution of the variables to linear change in vividness of imagery. The analysis was carried out using SPSS17. The variables in the model were evaluated by a Type III test. Since the sample size was not large, Restricted Iterative Generalized Least Square (RIGLS) estimation method was used. Table A1 shows the type III tests of fixed effects. (Cases with rating of “no image” (vividness rating value 1) were excluded from all analysis).
Table A1. Type III tests of fixed effects in linear mixed model analysis testing the influences of stimuli and image generation time (RTs) on vividness ratings.
To test how well the factors in the dataset predict recall, a logistic regression analysis was performed. The response for recall was recorded as 1 for recalled verbal descriptions and 0 for not recalled verbal descriptions. Explanatory variables included vividness, stimuli, and reaction time (RT). Table A2 displays model specifications, including the specified distribution and link function. Table A3 summarizes the results. Table A4 shows the full model results by predictor.
Table A3. Evaluation result for logistic regression predictive model for incidental recall using stimuli and vividness as predictor.
All 1490 valid observations were entered in the logistic regression model as the preliminary linear mixed modeling fitting analysis indicated that residual errors were only modestly correlated within each subject and were independent across subjects. Robustness to the violation of the assumption of independence was demonstrated by replicating the results with the following confirmatory repeated measure logistic regression model.
The dichotomous outcome for recall was further modeled with a repeated measure logistic regression analysis. The model was based on the probability of the largest value of response variable, which was 1. Two stimuli, which caused singularity of Hessian matrix, were removed from the dataset, resulting in 1441 observations (and no difference in the results). Models specifications, including the specified distribution and link function were same as the initial logistic regression model.
Type III test evaluated the effect of explanatory variables on recall accuracy in the purposed model. The test result showed that Vividness and stimuli were significant predictors for recall, χ2 = 14.77 and χ2 = 4276, p < 0.05 respectively (see Table A5). No other significance was found. Table A6 shows estimation for parameters in the model. Table A7 shows validity of predicted probabilities. The prediction for descriptions which were not recalled was more accurate than that for the verbal descriptions which were, 50.5% of non-recalled descriptions and 22% of recalled descriptions were correctly predicted. This confirmed that the model had overall 72.5% accuracy.
Appendix B
A corpus of 66 peer-reviewed experimental journal articles representing 4.32% of the literature available through PsycINFO, and containing the keyword “vividness” was randomly compiled by a research assistant naive to the purposes of the study (see Appendix references). Random selection of a relevant representative sample can be defended as a sound, reasonable meta-analytic tactic, provided the selected sources are analyzed according to a set of predefined, a priori criteria (Rosenthal, 1991). As a prerequisite for inclusivity, any statistical outcome directly pertaining to the measures VVIQ and trial-by-trial vividness ratings were to be utilized in the analysis, except those pertaining to post hoc comparisons.
The analysis consisted of two phases, a preliminary non-parametric analysis, and a secondary parametric analysis. Data for the preliminary analysis was obtained by partitioning individual experimental outcomes into two 2 × 2 contingency tables. Upon partitioning each experimental outcome as either a significant or non-significant experimental outcome, and as either a VVIQ or trial-by-trial vividness subjective report, each datum was further categorized as either a neural or behavioral/cognitive objective measure.
The same dataset from the preliminary non-parametric analysis was utilized in the secondary parametric analysis. However, each binomial outcome was transformed into an exact probability value. Analytic accuracy was maintained by calculating probabilities from reported test statistics and degrees of freedom. If required, raw data was statistically analyzed anew from means and variance. This rule was strictly adhered to unless otherwise unavoidable, in which case probability signifiers were rounded to the reported cut-off (i.e., p < 0.05 was approximated as 0.05); however, it should be noted that rounding was required six times over the course of 863 entries. The resultant entries were then categorized as either VVIQ or trial-by-trial vividness subjective report, and as either a neural or behavioral/cognitive objective measure. All values within each category were summated, and divided by the square root of the number of entries within each category.
A non-parametric analysis examining experimental outcome between VVIQ and trial-by-trial vividness ratings is presented in Figures A1A,B. The data in Figure A1A represent the number of significant versus non-significant experimental outcomes for VVIQ and trial-by-trial vividness ratings for behavioral/cognitive objective measures. The data in Figure A1B represent the number of significant versus non-significant experimental outcomes for VVIQ and trial-by-trial vividness ratings for neural objective measures. A higher proportion of successes accompany trial-by-trial vividness ratings for both behavioral/cognitive and neural objective measures. This relationship is especially true for studies underlying the neural origin of vividness.
Figure A1. (A) Experimental outcomes for trial-by-trial vividness ratings and VVIQ with respect to behavioral and/or cognitive measures. (B) Experimental outcomes for trial-by-trial vividness ratings and VVIQ with respect to neural measures. The dark line refers to results which reject the null hypothesis, and the light line refers to results which fail to reject the null hypothesis.
The trends observed in the preliminary analysis prompted the use of a more sensitive statistical procedure. Because the directionality of each statistical outcome was not immediately apparent, and degrees of freedom often exceed one for F-tests and Chi-square tests of significance, standard meta-analytic methodology was decidedly insufficient for such purposes (Rosenthal, 1991). Under these circumstances, Stouffer’s method of adding Z’s provides a straightforward and reasonable estimate (Mosteller and Bush, 1954; Rosenthal, 1991). Upon determining exact probability values for each entry introduced, the values were transformed into their standard normal deviates. These values were summated, and divided by the square root of the number of entries within each category. Data for the parametric analysis is shown in Figure A2. These data show the summated Z-scores for VVIQ and trial-by-trial vividness ratings for behavioral/cognitive and neural objective measures.
Figure A2. Summed Z-scores for vividness and VVIQ for neural and behavioral and/or cognitive measures.
As evidenced by Figure A2, two trends remain especially salient. Firstly, trial-by-trial vividness ratings are consistently greater for behavioral/cognitive and neural measures. Secondly, behavioral/cognitive measures yield significantly greater values than those which are neural. These results suggest that trial-by-trial vividness ratings are a more effective means by which to measure the subjective experience of mental imagery. Furthermore, Fisher’s Z-transformation for experimental outcomes concerning the correlation between VVIQ scores and trial-by-trial vividness ratings for 21 entries retrieved from six of the peer-reviewed journal articles showed an average Zr of 0.154, and variability in these values ranged from r = −0.27, to r = 0.64. Consistent with the results of experiment 2, these results support the contention that trial-by-trial vividness self-reports and VVIQ scores share some descriptive properties of visual imagery; however, trial-by-trial vividness ratings seem to be much more resolved.
References
Adeyemo, S. A. (1998). Imagery unvividness and social-psychological implications. J. Ment. Imagery 22, 79–98.
Aleman, A., Böcker, K. B. E., and de Haan, E. H. F. (1999). Disposition towards hallucination and subjective versus objective vividness of imagery in normal subjects. Pers. Individ. Differ. 27, 707–714.
Aleman, A., and de Haan, E. H. F. (2001). Do people with better mental imagery ability make more reality monitoring errors? Curr. Psychol. Lett. 6, 7–15.
Aleman, A., and de Haan, E. H. F. (2004). Fantasy proneness, mental imagery and reality monitoring. Pers. Individ. Differ. 36, 1747–1754.
Aleman, A., Nieuwenstein, M. R., Böcker, K. B. E., and de Haan, E. H. F. (2000). Mental imagery and perception in hallucination-prone individuals. J. Nerv. Ment. Dis. 188, 830–836.
Allbutt, J., Ling, J., Heffernan, T. M., and Shafiullah, M. (2008). Self-report imagery questionnaire scores and subtypes of social-desirable responding: auditory imagery, visual imagery, and thinking style. J. Individ. Differ. 29, 181–188.
Baddeley, A. D., and Andrade, J. (2000). Working memory and the vividness of imagery. J. Exp. Psychol. Gen. 129, 126–145.
Barlett, C. P., and Brannon, L. A. (2007). “If only…”: the role of visual imagery in counterfactual thinking. Imagin. Cogn. Pers. 26, 87–100.
Belardinelli, M. O., Palmiero, M., Sestieri, C., Nardo, D., Di Matteo, R., Londei, A., et al. (2009). An fMRI investigation on image generation in different sensory modalities: the influence of vividness. Acta Psychol. (Amst.) 132, 190–200.
Bent, N. A., and Wick, E. (2006). Beyond vividness: parental filters as moderators in mental imagery and measured anxiety level. J. Ment. Imagery 30, 21–38.
Bird, C. M., and Burgess, N. (2008). The hippocampus and memory: insights from spatial processing. Nat. Rev. Neurosci. 9, 182–194.
Borkovec, T. D., and Sides, J. K. (1979). The contribution of relaxation and expectancy to fear reduction via graded, imaginal exposure to feared stimuli. Behav. Res. Ther. 17, 529–540.
Burton, L. J. (2003). Examining the relation between visual imagery and spatial ability tests. Int. J. Test. 3, 277–291.
Campos, A., Amor, Á., and González, M. Á. (2002). Presentation of keywords by means of interactive drawings. Span. J. Psychol. 5, 102–109.
Campos, A., Gómez-Juncal, R., and Pérez-Fabello, M. J. (2007). Experience in imagery and imagery vividness. Imagin. Cogn. Pers. 27, 337–348.
Campos, A., López, A., and González, M. A. (1999a). Effects of eyes open/closed and order of rating on VVIQ scores. J. Ment. Imagery. 23, 35–43.
Campos, A., Marcos, J. L., and Gonzáles, M. Á. (1999b). Emotionality of words as related to vividness of imagery and concreteness. Percept. Mot. Skills 88, 1135–1140.
Cartwright, D., Jenkins, J. L., Chavez, R., and Peckar, H. (1983). Studies in imagery and identity. J. Pers. Soc. Psychol. 44, 376–384.
Cocude, M., and Denis, M. (1988). Measuring the temporal characteristics of visual images. J. Ment. Imagery 12, 89–101.
Conduit, R., Crewther, S. G., and Coleman, G. (2004). Spontaneous eyelid movements (ELMS) during sleep are related to dream recall on awakening. J. Sleep Res. 13, 137–144.
Cornoldi, C., de Beni, R., Cavedon, A., and Mazzoni, G. (1992). How can a vivid image be described? Characteristics influencing vividness judgments and the relationship between vividness and memory. J. Ment. Imagery 16, 89–107.
Cui, X., Jeter, C. B., Yang, D., Montague, P. R., and Eagleman, D. M. (2007). Vividness of mental imagery: individual variability can be measured objectively. Vision Res. 47, 474–478.
D’Angiulli, A. (2002). Mental image generation and the contrast sensitivity function. Cognition 85, B11–B19.
D’Angiulli, A., and Reeves, A. (2002). Generating visual mental images: latency and vividness are inversely related. Mem. Cognit. 30, 1179–1188.
D’Angiulli, A., and Reeves, A. (2005). “Picture theory, tacit knowledge or vividness-core? Three hypotheses on the mind’s eye and its elusive size,” in Proceedings of the 27th Cognitive Science Society Annual Meeting, eds B. Bara, L. Barsalou, and M. Bucciarelli (Stresa: The Cognitive Science Society), 536–541.
D’Angiulli, A., and Reeves, A. (2007). The relationship between self-reported vividness and latency during mental size scaling of everyday items: phenomenological evidence of different types of imagery. Am. J. Psychol. 120, 521–551.
De Pascalis, V., Marucci, F. S., Penna, P. M., and Pessa, E. (1987). Hemispheric activity of 40 Hz EEG during recall of emotional events: differences between low and high hypnotizables. Int. J. Psychophysiol. 5, 167–180.
Dean, G. M., and Morris, P. E. (2003). The relationship between self-reports of imagery and spatial ability. Br. J. Psychol. 94, 245–273.
Dyckman, J. M., and Cowan, P. A. (1978). Imaging vividness and the outcome of in vivo and imagined scene desensitization. J. Consult. Clin. Psychol. 46, 1155–1156.
Gale, A., Morris, P. E., Lucas, B., and Richardson, A. (1972). Types of imagery and imagery types: an EEG study. Br. J. Psychol. 63, 523–531.
Giusberti, F., Cornoldi, C., De Beni, R., and Massaroni, M. (1992). Differences in vividness ratings of perceived and imagined patterns. Br. J. Psychol. 83, 533–547.
Hale, S. M., and Simpson, H. M. (1971). Effects of eye movements on the rate of discovery and the vividness of visual images. Percept. Psychophys. 9, 242–264.
Holmes, E. A., Lang, T. J., Moulds, M. L., and Steele, A. M. (2008). Prospective and positive mental imagery deficits in dysphoria. Behav. Res. Ther. 46, 976–981.
Hunt, M., Bylsma, L., Brock, J., Fenton, M., Goldberg, A., Miller, R., et al. (2006). The role of imagery in the maintenance and treatment of snake fear. J. Behav. Ther. Exp. Psychiatry 37, 283–298.
Iaccino, J., and Byrne, J. (1989). Mental layouts of concealed objects as a function of imagery type and experimental conditions. Bull. Psychon. Soc. 27, 402–404.
Jenness, A., and Jorgensen, A. P. (1941). Ratings of vividness of imagery in the waking state compared with reports of somnambulism. Am. J. Psychol. 54, 253–259.
Keenan, J. M., and Moore, R. E. (1979). Memory for images of concealed objects: a reexamination of Neisser and Kerr. J. Exp. Psychol. Learn. Mem. Cogn. 5, 374–385.
Keogh, L., and Markham, R. (1998). Judgements of other people’s memory reports: differences in reports as a function of imagery vividness. Appl. Cogn. Psychol. 12, 159–171.
Kerr, N. H., and Neisser, U. (1983). Mental images of concealed objects-new evidence. J. Exp. Psychol. Learn. Mem. Cogn. 9. 212–221.
Kidd, L. K., and Workman, J. E. (1999). Assessment of creativity in apparel design. Cloth. Text. Res. J. 17, 58–64.
Kosslyn, S., and Alper, S. (1977). On the pictorial properties of visual images: effects of image size on memory for words. Can. J. Psychol. 31, 32–40.
Lorey, B., Pilgramm, S., Bischoff, M., Stark, R., Vaitl, D., Kindermann, S., et al. (2011). Activation of the parieto-premotor network is associated with vivid motor imagery–a parametric fMRI Study. PLoS ONE 6:e20368. doi:10.1371/journal.pone.0020368
Mantani, T., Okamoto, Y., Shirao, N., Okada, G., and Yamawaki, S. (2005). Reduced activation of posterior cingulate cortex during imagery in subjects with high degrees of alexithymia: a functional magnetic resonance imaging study. Biol. Psychiatry 57, 982–990.
Markham, R., and Hynes, L. (1993). The effect of vividness of imagery on reality monitoring. J. Ment. Imagery 17, 159–170.
Marks, D. F., and Isaac, A. R. (1995). Topographical distribution of EEG activity accompanying visual and motor imagery in vivid and non-vivid imagers. Br. J. Psychol. 86, 271–282.
McClelland, A., Kemps, E., and Tiggemann, M. (2006). Reduction of vividness and associated craving in personalized food imagery. J. Clin. Psychol. 62, 355–365.
McKelvie, S. J. (1998). Effects of gender on reported vividness of visual imagery for parents. J. Ment. Imagery 22, 99–112.
Morrison, R. G., and Wallace, B. (2001). Imagery vividness, creativity and the visual arts. J. Ment. Imagery 25, 135–152.
Neisser, U., and Kerr, N. (1973). Spatial and mnemonic properties of visual images. Cogn. Psychol. 5, 138–150.
Overton, J. A. G. (2004). Correlation of EEG activity with subjective performance on a guided imagery test: an exploratory study. J. Ment. Imagery 28, 17–60.
Pearson, J., Rademaker, R. L., and Tong, F. (2011). Evaluating the mind’s eye: the metacognition of visual imagery. Psychol. Sci. 22, 1535–1542.
Rauch, S. A. M., Foa, E. B., Furr, J. M., and Filip, J. C. (2004). Imagery vividness and perceived anxious arousal in prolonged exposure treatment for PTSD. J. Trauma. Stress 17, 461–465.
Riske, M. L., Wallace, B., and Allen, P. A. (2000). Imaging ability and eye witness accuracy. J. Ment. Imagery 24, 137–148.
Ritchey, H., and Beal, C. (1980). Image detail and recall: evidence for within-item elaboration. J. Exp. Psychol. 6, 66–76.
Rodway, P., Gillies, K., and Schepman, A. (2006). Vivid imagers are better at detecting salient changes. J. Individ. Differ. 27, 218–228.
Sadler, P., and Woody, E. Z. (2006). Does the more vivid imagery of high hypnotizables depend on greater cognitive effort? A test of dissociation and social-cognitive theories of hypnosis. Int. J. Clin. Exp. Hypn. 54, 372–391.
Slee, J. A. (1980). Individual differences in visual imagery ability and the retrieval of visual appearances. J. Ment. Imagery 4, 93–113.
Sutherland, M. E., Harrell, J. P., and Isaacs, C. (1987). The stability of individual differences in imagery ability. J. Ment. Imagery 11, 97–104.
Tracy, R. J., Roesner, L. S., and Kovac, R. N. (1988). The effect of visual versus auditory imagery on vividness and memory. J. Ment. Imagery 12, 145–161.
Van Diest, I., De Peuter, S., Devriese, S., Wellens, E., Van, d. W., and Van, d. B. (2005). Imagined risk of suffocation as a trigger for hyperventilation. Psychosom. Med. 67, 813–819.
Vergeer, I., and Roberts, J. (2006). Movement and stretching imagery during flexibility training. J. Sports Sci. 24, 197–208.
Walczyk, J. J., and Hall, V. C. (1988). The relationship between imagery vividness ratings and imagery accuracy. J. Ment. Imagery 12, 163–171.
Keywords: episodic memory, incidental recall, multi-trace theory, visual imagery, vividness, VVIQ
Citation: D’Angiulli A, Runge M, Faulkner A, Zakizadeh J, Chan A and Morcos S (2013) Vividness of visual imagery and incidental recall of verbal cues, when phenomenological availability reflects long-term memory accessibility. Front. Psychology 4:1. doi: 10.3389/fpsyg.2013.00001
Received: 02 April 2012; Accepted: 02 January 2013;
Published online: 04 February 2013.
Edited by:
Joel Pearson, The University of New South Wales, AustraliaReviewed by:
Joel Pearson, The University of New South Wales, AustraliaGiorgio Ganis, Plymouth University, UK
Copyright: © 2013 D’Angiulli, Runge, Faulkner, Zakizadeh, Chan and Morcos. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
*Correspondence: Amedeo D’Angiulli, Neuroscience of Imagery, Cognition, and Emotion Research Lab, Carleton University, 1125 Colonel By Drive, Room 1316 Dunton Tower, Ottawa, ON K1S 5B6, Canada. e-mail:YW1lZGVvQGNvbm5lY3QuY2FybGV0b24uY2E=