A Neurobiologically Constrained Cortex Model of Semantic Grounding With Spiking Neurons and Brain-Like Connectivity

Tomasello, Rosario; Garagnani, Max; Wennekers, Thomas; Pulvermüller, Friedemann

doi:10.3389/fncom.2018.00088

ORIGINAL RESEARCH article

Front. Comput. Neurosci., 06 November 2018

Volume 12 - 2018 | https://doi.org/10.3389/fncom.2018.00088

This article is part of the Research Topic: From Neuronal Network to Artificial Neural Network: Structure, Function and Intelligence

Rosario Tomasello¹,²,³*

Max Garagnani¹,⁴

Thomas Wennekers²

Friedemann Pulvermüller¹,³,⁵

¹Brain Language Laboratory, Department of Philosophy and Humanities, WE4, Freie Universität Berlin, Berlin, Germany
²Centre for Robotics and Neural Systems, University of Plymouth, Plymouth, United Kingdom
³Berlin School of Mind and Brain, Humboldt Universität zu Berlin, Berlin, Germany
⁴Department of Computing, Goldsmiths, University of London, London, United Kingdom
⁵Einstein Center for Neurosciences, Berlin, Germany

One of the most controversial debates in cognitive neuroscience concerns the cortical locus of semantic knowledge and processing in the human brain. Experimental data revealed the existence of various cortical regions relevant for meaning processing, ranging from semantic hubs generally involved in semantic processing to modality-preferential sensorimotor areas involved in the processing of specific conceptual categories. Why and how the brain uses such complex organization for conceptualization can be investigated using biologically constrained neurocomputational models. Here, we improve pre-existing neurocomputational models of semantics by incorporating spiking neurons and a rich connectivity structure between the model ‘areas’ to mimic important features of the underlying neural substrate. Semantic learning and symbol grounding in action and perception were simulated by associative learning between co-activated neuron populations in frontal, temporal and occipital areas. As a result of Hebbian learning of the correlation structure of symbol, perception and action information, distributed cell assembly circuits emerged across various cortices of the network. These semantic circuits showed category-specific topographical distributions, reaching into motor and visual areas for action- and visually-related words, respectively. All types of semantic circuits included large numbers of neurons in multimodal connector hub areas, which is explained by cortical connectivity structure and the resultant convergence of phonological and semantic information on these zones. Importantly, these semantic hub areas exhibited some category-specificity, which was less pronounced than that observed in primary and secondary modality-preferential cortices. 
The present neurocomputational model integrates seemingly divergent experimental results about conceptualization and explains both semantic hubs and category-specific areas as an emergent process causally determined by two major factors: neuroanatomical connectivity structure and correlated neuronal activation during language learning.

Introduction

Although the brain mechanisms of meaning processing have been investigated for many years, cognitive neuroscientists have not reached a consensus about the function and the organizational principles of semantic knowledge. A range of neuroimaging and neuropsychological patient studies suggest a contribution of several cortical areas to semantic processing, but the precise role of each of them is still subject to debate. Cognitive scientists and neuroscientists have suggested that the meanings of all words are equally processed and stored in a central “symbolic system” cortically located in a “semantic hub.” However, “semantic hubs” have been proposed in different cortical regions, including the anterior-inferior-temporal lobe (Patterson et al., 2007; Ralph et al., 2017), the anterior-inferior-parietal cortex (Binder et al., 2009; Binder and Desai, 2011) and the posterior-inferior-frontal cortex (Posner and Pavese, 1998; Bookheimer, 2002; Tate et al., 2014; Schomers and Pulvermüller, 2016; Carota et al., 2017). Whereas it is possible, in principle, that several semantic hubs co-exist, some researchers have postulated the need for bringing together all semantic information into one focal area and consequently reject the existence of multiple semantic hubs (Patterson et al., 2007; Ralph et al., 2017). Furthermore, over and above semantic hubs generally contributing to all types of semantics, the phenomenon of category-specific semantic processing has long been in focus (McCarthy and Warrington, 1988; Shallice, 1988): modality-preferential cortices, including visual, auditory, olfactory, gustatory, somatosensory and motor regions, have been shown to activate differentially when specific semantic types are processed, for example animal vs. tool nouns or verbs typically used to speak about different types of actions (Damasio et al., 1996; Chao et al., 1999; Hauk et al., 2004; Kemmerer et al., 2012; Grisoni et al., 2016; Vukovic et al., 2017).
Studies of patients with lesions in modality-specific regions have also revealed category-specific semantic deficits (Warrington and McCarthy, 1983; Damasio et al., 1996; Neininger and Pulvermüller, 2003; Gainotti, 2010; Trumpp et al., 2013; Dreyer et al., 2015), which cannot be explained by symbolic-system accounts presuming category-general semantic hubs. Likewise, these findings challenge proposals that see the semantic processing role of sensorimotor areas as optional, ancillary or epiphenomenal and deny them a genuine semantic conceptual function (Machery, 2007; Mahon and Caramazza, 2008; Caramazza et al., 2014). The evidence for multiple hubs and modality-specific areas for conceptual-semantic knowledge is difficult to reconcile within most current neurobiological models of symbol processing.

To incorporate the diverging semantic theories and data from healthy and patient studies described above, it is necessary to build sophisticated models of relevant cortical areas that are biologically constrained by mimicking relevant features of brain function and connectivity. Ideally, such brain-constrained models may predict and offer mechanistic explanations for semantic processing in the human brain. Potentially, such modeling efforts can confirm a given theoretical framework, for example the existence of distributed semantic circuits spread out across several semantic hubs and modality-preferential areas or, as an alternative, the existence of a single focal “semantic hub.” Based on previous integrative proposals (Damasio, 1989; Pulvermüller, 2013), we hypothesize that semantic category-specific and category-general behaviors of different cortical areas are a direct consequence of the neuroanatomical connectivity between the areas involved and learning experiences that are essential for grounding concepts in knowledge about objects and actions. Here, we attempt to address this theoretical hypothesis with a neurobiologically constrained spiking model of the cortex in order to integrate data from healthy and patient studies described above.

Recent simulations of cortical function and learning incorporating fine microstructural and physiological details of millions of neurons (Izhikevich and Edelman, 2008; Markram et al., 2011) have not yet addressed questions about the neurobiological basis of specific cognitive functions such as semantic processing. Previous connectionist models have made significant progress in explaining language and semantic processing (Dell et al., 1999; Plaut and Gonnerman, 2000; Christiansen and Chater, 2001), but most of them do not attempt to replicate realistic properties of the human brain. Although recent simulation studies included neuroanatomical information to model semantic processing, they used learning mechanisms (i.e., back-propagation; Ueno et al., 2011; Chen et al., 2017) that have been argued to be biologically implausible (Mazzoni et al., 1991; O'Reilly, 1998). Furthermore, these studies incorporated just one semantic hub area in the anterior temporal lobe, leaving the other evidence summarized above unaddressed. A recent modeling effort incorporated neuroanatomical structure and connectivity into models of semantic processing (Garagnani and Pulvermüller, 2016). By meticulously mimicking the general parcellation of cortex into areas, their long-range cortico-cortical connections, features of local connectivity within cortical areas, local and global inhibitory mechanisms regulating cortical activity, and realistic neurobiological learning mechanisms, a stepwise approximation to response properties of real brain-internal networks could be achieved. Still, this previous study fell short of implementing the complexity of cortico-cortical connectivity and the activation dynamics of spiking cortical neurons.

Building upon these previous efforts with graded-response neural-network models (Garagnani and Pulvermüller, 2016), we here set out to model the brain's semantic mechanisms using a mathematically precise model of multiple cortical areas, incorporating spiking neurons, biologically plausible non-supervised learning mechanisms and connectivity structure based on neuroanatomical studies. The network was used to simulate associative word learning by linking word-forms with their semantically-related object and action representations. The present biologically constrained model bridges the gap between neural mechanisms and conceptual brain functions, offering a biological account of how aspects of word meaning are acquired, stored, and processed in the brain.

Methods and Materials

General Features of the Model

We implemented a neurobiologically constrained model replicating cortical areas of the fronto-temporo-occipital lobes and their connectivity to shed light on the mechanisms underlying semantic processing grounded in action and perception. We created a neural architecture with 15,000 representative neurons for simulating activity in twelve cortical areas in the left language-dominant hemisphere (see Figure 1A). These “areas” represented three levels of processing (primary, secondary, and higher-association cortex) in four modality systems: the (motor) frontal superior-lateral hand-motor, (articulatory) inferior face-motor, (auditory) superior-temporal and (visual) inferior-temporo-occipital systems. Two of these, the auditory and articulatory systems (areas highlighted in blue and red, Figure 1A), are in perisylvian language cortex and appear most relevant for language processing (Zatorre et al., 1996; Pulvermüller, 1999; Fadiga et al., 2002; Pulvermüller and Fadiga, 2010). The motor and visual systems (yellow and green highlighted areas) are outside the perisylvian language cortex (called “extrasylvian” in the present work) and are involved in the execution of manual actions (Deiber et al., 1991; Lu et al., 1994; Dum and Strick, 2002, 2005) and in visual object processing (Ungerleider and Haxby, 1994), respectively.

FIGURE 1

Figure 1. (A) Structure and connectivity of 12 frontal, temporal and occipital cortical areas relevant for learning the meaning of words related to actions. Perisylvian cortex comprises an inferior-frontal articulatory-phonological system (red colors), including primary motor cortex (M1_i), premotor (PM_i) and inferior-prefrontal (PF_i), and a superior-temporal acoustic-phonological system (areas in blue), including auditory parabelt (PB), auditory belt (AB) and primary auditory cortex (A1). Extrasylvian areas comprise a lateral dorsal hand-motor system (yellow to brown), including lateral prefrontal (PF_L), premotor (PM_L) and primary motor cortex (M1_L), and a visual “what” stream of object processing (green), including anterior-temporal (AT), temporo-occipital (TO), and early visual areas (V1). When words are learned in the context of perceived objects or actions, both peri- and extrasylvian systems are involved. Numbers indicate Brodmann Areas (BAs) and the arrows (black, purple, and blue) represent long-distance cortico-cortical connections as documented by neuroanatomical studies. (B) Schematic global area and connectivity structure of the implemented model. The colors indicate correspondence between cortical and model areas. (C) Micro-connectivity structure of one of the 7,500 single excitatory neural elements modeled (labeled “e”). Within-area excitatory links (in gray) to and from cell e are limited to a local (19 × 19) neighborhood of neural elements (light-gray area). Lateral inhibition between e and neighboring excitatory elements is realized as follows: the underlying cell i inhibits e in proportion to the total excitatory input it receives from the 5 × 5 neighborhood (dark-purple shaded area); by means of analogous connections (not depicted), e inhibits all of its neighbors. Adapted from Garagnani and Pulvermüller (2013).

The model replicates a range of important anatomical and physiological features of the human brain (e.g., Garagnani et al., 2008, 2017; Tomasello et al., 2017). The six neurobiological principles incorporated in the neural network model are summarized as follows:

(i) Neurophysiological dynamics of spiking pyramidal cells including temporal summation of inputs, threshold-based spiking, nonlinear transformation of membrane potentials into neuronal outputs, and adaptation (Connors et al., 1982; Matthews, 2001);

(ii) Synaptic modification by way of Hebbian-type learning, including the two biological mechanisms of long-term potentiation (LTP) and long-term depression (LTD) (Artola and Singer, 1993);

(iii) Area-specific global regulation mechanisms and local lateral inhibition (global and local inhibition) (Braitenberg, 1978; Yuille and Geiger, 2003);

(iv) Within-area connectivity: a sparse, random and initially weak connectivity was implemented locally, along with a neighborhood bias toward close-by links (Kaas, 1997; Braitenberg and Schüz, 1998);

(v) Between-area connectivity based on neurophysiological principles and motivated by neuroanatomical evidence; and

(vi) Uncorrelated white noise, constantly present in all neurons during all stages of learning and retrieval, with additional noise added to the stimulus patterns to mimic uncorrelated input conditions (Rolls and Deco, 2010).

Note that the connectivity structure implemented in the network reflects existing anatomical pathways between corresponding cortical areas of the cortex revealed by neuroanatomical studies using diffusion tensor and diffusion-weighted imaging (DTI/DWI) in humans and non-human primates (Table 2) (Rilling et al., 2011; Thiebaut de Schotten et al., 2012). A detailed description of the single-neuron properties, synaptic plasticity rule, and single-area model structure is provided next, followed by details of the network anatomy and connectivity structure.

Structure and Function of the Spiking Model

Each of the 12 model areas consists of two layers of artificial neuron-like elements (“cells”), 625 excitatory and 625 inhibitory (e- and i-cells), thus resulting in 15,000 cells in total (see Figure 1C). Each e-cell models a single representative pyramidal spiking neuron situated in a local patch of the cortex and the underlying i-cell represents the cluster of inhibitory interneurons located within the same cortical column (Wilson and Cowan, 1972; Eggert and van Hemmen, 2000). The state of each cell x at time t is uniquely defined by its membrane potential V(x,t), specified by the following equation:

$$\tau \cdot \frac{dV(x,t)}{dt} = -V(x,t) + k_{1} \left( V_{In}(x,t) + k_{2}\, \eta(x,t) \right) \tag{B1}$$

where V_In (x,t) is the net input acting upon cell x at time t (sum of all inhibitory and excitatory postsynaptic potentials—I/EPSPs; inhibitory synapses are given a negative sign), τ is the membrane's time constant, k₁, k₂ are scaling values (see Table 1 for the specific parameter values used in the simulations) and η(·,t) is a white noise process with uniform distribution over [−0.5, 0.5]. Note that noise is an inherent property of each model cell, intended to mimic the spontaneous activity (baseline firing) of real neurons. Therefore, noise was constantly present in all areas, in equal amounts (inhibitory cells have k₂ = 0, i.e., the noise is generated by the excitatory cells). The output (or transformation function) φ of an excitatory cell e is defined as follows:

$$\phi(e,t) = \begin{cases} 1 & \text{if } \left( V(e,t) - \alpha\, \omega(e,t) \right) > thresh \\ 0 & \text{otherwise} \end{cases} \tag{B2}$$

Thus, an excitatory cell e spikes (=1) whenever its membrane potential V(e,t) exceeds the fixed threshold thresh by more than the quantity αω(e,t) (where α is a constant and ω is defined below). For simplicity, inhibitory cells are graded-response neurons, as they are intended to represent the average impact of a cluster of local interneurons; the output φ(i,t) of an inhibitory neuron i is 0 if V(i,t) < 0 and V(i,t) otherwise.
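The cell dynamics of Equations (B1) and (B2) can be illustrated with a minimal Python sketch. It assumes a forward-Euler discretization with step `dt`, which is our choice for illustration and not necessarily the integration scheme used in the simulations. The function names and all numerical values are hypothetical placeholders, not the Table 1 parameters.

```python
import random

def euler_step(v, v_in, tau, k1, k2, dt=1.0, rng=random):
    """One forward-Euler step of Equation (B1) for a single cell (assumed scheme)."""
    eta = rng.uniform(-0.5, 0.5)  # white noise, uniform over [-0.5, 0.5]
    dv = (-v + k1 * (v_in + k2 * eta)) / tau
    return v + dt * dv

def excitatory_output(v, omega, alpha, thresh):
    """Equation (B2): an e-cell spikes (1) if V - alpha * omega exceeds thresh."""
    return 1 if (v - alpha * omega) > thresh else 0

def inhibitory_output(v):
    """Graded i-cell output: 0 if V < 0, and V otherwise."""
    return v if v > 0 else 0.0

# Illustrative values only (NOT the paper's Table 1 parameters).
# With k2 = 0 (no noise) and constant input, V relaxes toward k1 * v_in.
v = 0.0
for _ in range(50):
    v = euler_step(v, v_in=1.0, tau=2.5, k1=0.01, k2=0.0)
```

In this noise-free example the membrane potential settles at k₁·V_In, which makes the role of τ as a relaxation time constant easy to check by hand.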

TABLE 1

Table 1. Parameter values used in the simulation.

To simulate neuronal adaptation (Kandel et al., 2000), the function ω(·,t) is defined so as to track the cell's most recent firing-rate activity. More precisely, the amount of adaptation ω(e,t) of cell e at time t is defined by:

$$\tau_{ADAPT} \cdot \frac{d\omega(e,t)}{dt} = -\omega(e,t) + \phi(e,t) \tag{B3.1}$$

where τ_ADAPT is the “adaptation” time constant. The solution ω(e,t) of Equation (B3.1) is the low-pass-filtered output φ of cell e, which provides an estimate of the cell's most recent firing-rate history. A cell's average firing activity is also used to specify the network's Hebbian plasticity rule [see Equation (B4) below]; in this context, the (estimated) instantaneous mean firing rate ω_E(e,t) of an excitatory neuron e is defined as:

$$\tau_{Favg} \cdot \frac{d\omega_{E}(e,t)}{dt} = -\omega_{E}(e,t) + \phi(e,t) \tag{B3.2}$$

To regulate and control activity in the network, local and area-specific inhibition is implemented (Palm, 1982; Bibbig et al., 1995; Wennekers et al., 2006), realizing, respectively, local and global competition mechanisms (Duncan, 1996, 2006). More precisely, in Equation (B1) the input V_In(e,t) to each excitatory cell of the same area includes an area-specific (“global”) inhibition term k_Gω_G(e,t) [with k_G a constant and ω_G(e,t) defined below], which is subtracted from the total postsynaptic potentials (I/EPSPs) in input to the cell; this regulatory mechanism ensures that area (and network) activity is maintained within physiological levels (Braitenberg and Schüz, 1998):

$$\tau_{GLOB} \cdot \frac{d\omega_{G}(e,t)}{dt} = -\omega_{G}(e,t) + \sum_{e \in area} \phi(e,t) \tag{B3.3}$$
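Equations (B3.1) to (B3.3) share the same form: each is a first-order low-pass filter that differs only in its time constant and driving term. The sketch below makes this explicit, again assuming a forward-Euler step; the helper name `low_pass_step` and the time constants are hypothetical and chosen only for illustration.

```python
def low_pass_step(omega, drive, tau, dt=1.0):
    """Forward-Euler step of  tau * d(omega)/dt = -omega + drive  (assumed scheme)."""
    return omega + dt * (-omega + drive) / tau

# The three filters differ only in time constant and drive:
#   adaptation      (B3.1): drive = phi(e, t),             tau = tau_ADAPT
#   rate estimate   (B3.2): drive = phi(e, t),             tau = tau_Favg
#   global inhib.   (B3.3): drive = sum of phi over area,  tau = tau_GLOB

# Illustrative use (placeholder time constants, not Table 1 values):
spikes_in_area = [1, 0, 1, 1]  # e-cell outputs phi(e, t) at one time step
omega_g = low_pass_step(0.0, sum(spikes_in_area), tau=12.0)

# A cell that spikes at every step drives its rate estimate toward 1.
omega_e = 0.0
for _ in range(200):
    omega_e = low_pass_step(omega_e, 1, tau=10.0)
```

Because the filter output approaches the mean of its drive, ω_E can be read as a running firing-rate estimate bounded by 1, whereas ω_G scales with the number of cells spiking in the area.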

Excitatory links within and between (possibly non-adjacent) model areas are established at random and limited to a local (topographic) neighborhood; weights are initialized at random, in the range [0, 0.1]. The probability of a synapse being created between any two cells falls off with their distance (Braitenberg and Schüz, 1998) according to a Gaussian function clipped to 0 outside the chosen neighborhood (a square of size n = 19 for excitatory and n = 5 for inhibitory cell projections). This produces sparse, patchy and topographic connectivity, as typically found in the mammalian cortex (Amir et al., 1993; Kaas, 1997; Braitenberg and Schüz, 1998; Douglas and Martin, 2004).
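The connectivity scheme just described can be sketched as follows. The Gaussian width `sigma`, the peak probability `p_max`, and the handling of area borders (the neighborhood is simply truncated, with no wrap-around) are assumptions made for this illustration, as the text above does not specify them; the function names are likewise hypothetical.

```python
import math
import random

def connection_probability(dx, dy, n, sigma, p_max=1.0):
    """Gaussian fall-off with distance, clipped to 0 outside an n x n square."""
    half = n // 2
    if abs(dx) > half or abs(dy) > half:
        return 0.0
    return p_max * math.exp(-(dx * dx + dy * dy) / (2.0 * sigma ** 2))

def sample_links(src, size, n, sigma, rng, w_max=0.1):
    """Sample links from the cell at `src` into a size x size target area.

    Returns {(x, y): weight} with weights drawn uniformly from [0, w_max].
    The cell at `src` itself is not treated specially in this sketch.
    """
    sx, sy = src
    half = n // 2
    links = {}
    for x in range(max(0, sx - half), min(size, sx + half + 1)):
        for y in range(max(0, sy - half), min(size, sy + half + 1)):
            if rng.random() < connection_probability(x - sx, y - sy, n, sigma):
                links[(x, y)] = rng.uniform(0.0, w_max)
    return links

rng = random.Random(1)
# Excitatory projection (n = 19) from the central cell of a 25 x 25 area.
links = sample_links((12, 12), size=25, n=19, sigma=4.5, rng=rng)
```

Sampling each candidate link independently yields the sparse, patchy pattern described in the text, with link density highest near the projecting cell.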

The Hebbian learning mechanism implemented simulates the well-documented synaptic plasticity phenomena of long-term potentiation (LTP) and depression (LTD), as described by Artola, Bröcher and Singer (Artola et al., 1990; Artola and Singer, 1993). This rule provides a realistic approximation of known experience-dependent neuronal plasticity and learning (Musso et al., 1999; Rioult-Pedotti et al., 2000; Malenka and Bear, 2004; Finnie and Nader, 2012), and includes both (homo- and hetero-synaptic, or associative) LTP, as well as homo- and hetero-synaptic LTD. In the model, we discretized the continuous range of possible synaptic efficacy changes into two possible levels, +Δ and –Δ (with Δ ≪ 1 and fixed). Following Artola et al., we defined as “active” any (axonal) projection of excitatory cell e such that the estimated firing rate ω_E(e,t) of cell e at time t [see Equation (B3.2)] is above ϑ_pre, where ϑ_pre ∈ ]0,1] is an arbitrary threshold representing the minimum level of presynaptic activity required for LTP (or homosynaptic LTD) to occur. Thus, given a pre-synaptic cell i making contact onto a post-synaptic cell j, the change Δw(i,j) in efficacy of the (excitatory-to-excitatory) link from i to j is calculated as follows:

$$\Delta w(i,j) = \begin{cases} +\Delta & \text{if } \omega_{E}(i,t) \geq \vartheta_{pre} \text{ and } V(j,t) \geq \vartheta_{+} \quad \text{(LTP)} \\ -\Delta & \text{if } \omega_{E}(i,t) \geq \vartheta_{pre} \text{ and } \vartheta_{-} \leq V(j,t) < \vartheta_{+} \quad \text{(homosynaptic LTD)} \\ -\Delta & \text{if } \omega_{E}(i,t) < \vartheta_{pre} \text{ and } V(j,t) \geq \vartheta_{+} \quad \text{(heterosynaptic LTD)} \\ 0 & \text{otherwise} \end{cases} \tag{B4}$$
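The four cases of Equation (B4) translate directly into a small decision function. The sketch below is illustrative only: the function name `delta_w` and the threshold and step values in `PARAMS` are placeholders assumed for the example, not the values of Table 1.

```python
def delta_w(omega_pre, v_post, theta_pre, theta_minus, theta_plus, delta):
    """Weight change for a link i -> j following the cases of Equation (B4).

    omega_pre : estimated firing rate omega_E(i, t) of the presynaptic cell
    v_post    : membrane potential V(j, t) of the postsynaptic cell
    """
    pre_active = omega_pre >= theta_pre
    if pre_active and v_post >= theta_plus:
        return +delta          # LTP
    if pre_active and theta_minus <= v_post < theta_plus:
        return -delta          # homosynaptic LTD
    if not pre_active and v_post >= theta_plus:
        return -delta          # heterosynaptic LTD
    return 0.0                 # no change

# Placeholder parameters for illustration (assumed, not taken from Table 1).
PARAMS = dict(theta_pre=0.05, theta_minus=0.14, theta_plus=0.15, delta=0.0008)

ltp = delta_w(0.10, 0.20, **PARAMS)       # active pre, strongly depolarized post
homo_ltd = delta_w(0.10, 0.145, **PARAMS) # active pre, weakly depolarized post
hetero_ltd = delta_w(0.01, 0.20, **PARAMS) # inactive pre, strongly depolarized post
```

Note that a link is left unchanged whenever the postsynaptic potential stays below ϑ₋, regardless of presynaptic activity.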

Table 1 lists the parameter values used during the word learning simulations in the network, which were chosen on the basis of previous simulations (e.g., Garagnani et al., 2007, 2009; Garagnani and Pulvermüller, 2011; Schomers et al., 2017; Tomasello et al., 2017).

Simulated Brain Areas and Their Connectivity Structure

The spiking model mimics 12 different cortical areas with area-intrinsic connections and mutual connections between them. Six areas were modeled for the left-perisylvian language cortex including the primary auditory cortex (A1), auditory belt (AB), and modality-general parabelt areas (PB) constituting the auditory system, and the inferior part of primary motor cortex (M1_i), inferior premotor (PM_i) and multimodal prefrontal motor cortex (PF_i) representing the articulatory system (i.e., inferior face-motor areas). Additionally, six extrasylvian areas were modeled including the primary visual cortex (V1), temporo-occipital (TO) and anterior-temporal areas (AT) for the ventral visual system and the dorsolateral fronto-central motor (M1_L), premotor (PM_L), and prefrontal cortices (PF_L) for the motor system.

The network's connectivity structure reflects relevant features of cortical connectivity between corresponding areas of the cortex. Connections were modeled between neighboring cortical areas within each of the 4 “streams” (see black arrows in Figures 1A,B) and between all pairs of multimodal areas (PB, PF_i, AT, and PF_L) through the long-distance cortico-cortical connections (purple arrows). Additionally, non-adjacent “jumping” links were included within the superior or inferior temporal and superior or inferior frontal cortices (blue arrows). The neuroanatomical evidence for these connections, drawn from studies using diffusion tensor and diffusion-weighted imaging (DTI/DWI) in humans and non-human primates, is reported in Table 2 and described in a previous study (Garagnani et al., 2017).

TABLE 2

Table 2. Connectivity structure of the modeled cortical areas.

Simulating Word Acquisition

Prior to network training, all synaptic links (between- and within-areas) connecting single cells were established at random (see Methods section under “Structure and function of the spiking model”). Based on Hebbian (Hebb, 1949) learning principles, word-meaning acquisition was simulated under the impact of repeated sensorimotor pattern presentations (Fuster, 2003; D'Esposito, 2007) to the primary areas of the network (see Figure 2), as follows: Each network instance used twelve distinct sets of sensorimotor neural patterns representing six action- and six object-related words. Each pattern consisted of a fixed set of 19 cells chosen at random within the 25 × 25 cells of an area (ca. 3% of the cells) and simultaneously activated in one of the primary areas of the network. The learning of object- and action-related words was grounded in sensorimotor information presented to the primary cortices of the model: besides perisylvian auditory A1 and articulatory M1_i activity, object-related words received concordant visual (V1) grounding activity and, similarly, action-related words received lateral motor area (M1_L) grounding activity. Note that white (so-called “contextual”) noise was continuously presented to all primary areas of the network, and thus superimposed on all learning patterns. This partly accounted for the variability of perceptions and actions of the same type. To sum up, the network was set up to learn correlations between word and referential semantic information in action and perception and to investigate which type of representations (i.e., cell assemblies) would develop in the model as a result of learning and cortical structure.
Note that similar approaches to simulating spontaneous emergence of associations between articulatory and acoustic-phonetic neural patterns have been used in other computational studies (e.g., Westermann and Reck Miranda, 2004; Guenther et al., 2006), although these previous works did not attempt to model semantic processes (i.e., word meaning acquisition).
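The construction of the training patterns described above can be sketched in a few lines. The sketch assumes that cells are indexed 0 to 624 within each 25 × 25 area and that patterns are drawn independently per area; the function names and the fixed random seed are hypothetical and serve only to make the example reproducible.

```python
import random

AREA_SIDE = 25            # each model area holds 25 x 25 excitatory cells
CELLS_PER_PATTERN = 19    # ca. 3% of the 625 cells

def make_pattern(rng):
    """Pick 19 distinct cell indices at random within one area."""
    return frozenset(rng.sample(range(AREA_SIDE * AREA_SIDE), CELLS_PER_PATTERN))

def make_word(kind, rng):
    """Build the 3 x 19-cell sensorimotor pattern set for one word.

    Every word has a word-form part (A1 and M1_i); object words add a
    visual pattern (V1) and action words add a hand-motor pattern (M1_L).
    """
    semantic_area = "V1" if kind == "object" else "M1_L"
    return {area: make_pattern(rng) for area in ("A1", "M1_i", semantic_area)}

rng = random.Random(0)    # fixed seed, for reproducibility of the sketch only
words = ([make_word("object", rng) for _ in range(6)]
         + [make_word("action", rng) for _ in range(6)])
fraction_active = CELLS_PER_PATTERN / (AREA_SIDE * AREA_SIDE)  # about 0.03
```

The contextual noise superimposed on all primary areas during learning is not part of this sketch.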

FIGURE 2

Figure 2. Distributions of cell-assemblies (CAs) emerging in the 12 area network during simulation of word learning in the semantic context of visual perception (A) and action execution (B). Results of one typical instantiation of the model in Figure 1B are shown, using the same area labels. Each set of 12 squares (in black) illustrates one specific network area, with white dots indexing the distribution of CA neurons across the 12 network areas as a result of sensorimotor pattern presentation in 3 of the 4 primary areas. The perisylvian cortex was always stimulated, which mimics the learning of a spoken word form characterized by articulatory-acoustic features, while object words (A) received concordant stimulation to visual area (V1) and action words (B) to motor area (M1_L). Note that a random pattern simulating realistic noise input, changing in every learning phase, was presented to the non-relevant system (see Methods section). As a consequence of learning, CA circuits emerged in the network which extend into higher and primary visual cortex (V1, TO, but not M1_L) for object words. In contrast, network correlates of action-related words extend into lateral motor cortex (M1_L, PM_L, but not V1), thus semantically grounding words in information about actions. For convenience, the area structure of the network is repeated at the top.

Sensorimotor neural patterns, each comprising 3 × 19 cells, were presented 3,000 times to the relevant primary regions (this number was chosen on the basis of previous simulations with a six-area model, which revealed no substantial change between 1,000 and 2,000 learning steps; Garagnani et al., 2009; Schomers et al., 2017). A word pattern was presented for 16 simulation time steps, followed by an interstimulus interval (ISI) during which no input was given. The next learning step (pattern presentation) occurred only when the global inhibition of the PF_i and PB areas had fallen below a specific fixed threshold, allowing the activity to return to a baseline value so that one trial did not affect the next. Only the inherent baseline noise (simulating spontaneous neuronal firing) and “contextual” noise were present in the neural network during each ISI.

After learning, following a procedure which has become standard in our simulation studies (Garagnani et al., 2008; Garagnani and Pulvermüller, 2016; Schomers et al., 2017; Tomasello et al., 2017), we identified and quantified the neurons forming the 12 distributed CA circuits that emerged across the network areas during object and action word production. To simulate “word production” in the network, the motor and auditory neurons of each word form in areas M1_i and A1 were activated together for 15 time-steps. Separate analyses were performed for object recognition and action execution, which were simulated by activating the corresponding stimulation pattern in visual or motor cortex (V1 or M1_L), thought to represent the object-related or action-related schemas semantically linked to the word forms. During this period, we computed and displayed the average firing rate of each excitatory cell (7,500 e-cells).

As an estimate of a cell's average firing-rate here we used the value ω_E(e,t) from Equation (B3.2), integrated with time-constant τ_Favg = 5. An e-cell was then taken to be a member of a given CA circuit only if its time-averaged rate (output value or “firing rate”) reached a threshold θ which was area- and cell-assembly specific, and defined as a fraction γ of the maximal single-cell's time-averaged response in that area to pattern w. More formally,

$$\theta = \theta_{A}(w) = \gamma \max_{x \in A} \overline{O(x,t)}_{w}$$

where $\overline{O(x,t)}_{w}$ is the estimated time-averaged response of cell x to word pattern w (see Methods section under “Structure and function of the spiking model”) and γ ∈ [0, 1] is a constant [we used γ = 0.5 on the basis of previous simulation results (see Garagnani et al., 2008, 2009; Tomasello et al., 2017)]. This was computed for each of the 12 trained network instances, averaging the number of CA cells per area over the 6 object- and 6 action-related words.
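The membership criterion can be illustrated with a short sketch operating on the time-averaged responses of the cells of one area to one word pattern. The function name is hypothetical, and the treatment of a completely silent area (no members) is an assumption added here to avoid a zero threshold admitting every cell.

```python
def cell_assembly_members(avg_rates, gamma=0.5):
    """Indices of cells whose time-averaged response reaches the threshold
    theta = gamma * (maximal single-cell response in the area)."""
    peak = max(avg_rates)
    if peak <= 0:
        return []          # assumption: a silent area contributes no CA cells
    theta = gamma * peak
    return [i for i, rate in enumerate(avg_rates) if rate >= theta]

# Toy example: four cells of one area responding to one word pattern.
members = cell_assembly_members([0.0, 0.2, 0.5, 1.0])   # -> [2, 3]
```

Because the threshold is relative to the strongest response in each area, the criterion adapts to areas with overall weaker activity rather than applying one absolute cut-off across the network.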

To statistically test for the presence of significant differences in the topographical CA distribution across the twelve network areas, for each network instance we performed a repeated-measures Analysis of Variance (ANOVA). A 4-way ANOVA was run with factors WordType (two levels: Object vs. Action), PeriExtra (two levels: Perisylvian cortex = {A1, AB, PB, M1_i, PM_i, PF_i}, Extrasylvian cortex = {V1, TO, AT, M1_L, PM_L, PF_L}), TemporalFrontal (TempFront; two levels: temporal areas = {A1, AB, PB, V1, TO, AT}, frontal areas = {M1_L, PM_L, PF_L, M1_i, PM_i, PF_i}) and Areas (three levels: Primary = {A1, V1, M1_L, M1_i}, Secondary = {TO, AB, PM_L, PM_i} and Central = {PB, AT, PF_L, PF_i} areas). Finally, we ran a second statistical analysis on the data of the 6 perisylvian and 6 extrasylvian areas separately, with factors WordType, TempFront, and Areas as described above.
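The three area-related factors form a fully crossed 2 × 2 × 3 design in which each of the 12 model areas occupies exactly one cell. The following sketch encodes the level sets given above and derives the factor coding for each area; the variable names are hypothetical, and the ANOVA itself is not reproduced here.

```python
PERISYLVIAN = {"A1", "AB", "PB", "M1_i", "PM_i", "PF_i"}
TEMPORAL = {"A1", "AB", "PB", "V1", "TO", "AT"}
AREA_LEVEL = {
    "Primary": {"A1", "V1", "M1_L", "M1_i"},
    "Secondary": {"TO", "AB", "PM_L", "PM_i"},
    "Central": {"PB", "AT", "PF_L", "PF_i"},
}

ALL_AREAS = sorted(set().union(*AREA_LEVEL.values()))

def factor_levels(area):
    """Return the (PeriExtra, TempFront, Areas) coding of one model area."""
    peri_extra = "Perisylvian" if area in PERISYLVIAN else "Extrasylvian"
    temp_front = "Temporal" if area in TEMPORAL else "Frontal"
    level = next(name for name, members in AREA_LEVEL.items() if area in members)
    return (peri_extra, temp_front, level)

design = {area: factor_levels(area) for area in ALL_AREAS}
# e.g. design["AT"] == ("Extrasylvian", "Temporal", "Central")
```

Crossing this coding with WordType (Object vs. Action) gives the 24 within-instance cells entered into the 4-way analysis.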

Results

Word Learning Results

Twelve different instances of the spiking network, all with the architecture described above (Figure 1B), were initialized at random, providing analogs of 12 human subjects in a word learning experiment. Word-meaning acquisition was then simulated under the impact of repeated sensorimotor pattern presentations in 3 of the 4 sub-systems (see Figure 2), by co-activating specific neurons in their respective primary cortices. The cells activated in M1_i and A1 represented articulatory and acoustic-phonetic features by which spoken words are typically characterized, while those presented to V1 and M1_L simulated visually-related and action-related semantic features. This simulates associative learning of object- and action-related words, whereby the word is uttered while the referent object is present (Vouloumanos and Werker, 2009) or the related action is being performed (Tomasello and Kruger, 1992). While each learning pattern directly activated three primary areas, the fourth, unrelated area (M1_L for object- and V1 for action-related words) received further uncorrelated noise pattern input that changed across learning episodes. This aimed at ensuring that the correlation between word-form activity in perisylvian cortex and semantic information was high in one modality (the visual system for object words and the motor system for action words) but low in the non-relevant one.

Cell assemblies gradually emerged as a consequence of learning with different assemblies responding to different input patterns. These neural circuits spanned different areas, linking up word-forms in the auditory and articulatory sub-systems with referential-semantic information in the visual and motor sub-systems. Figure 2 illustrates 6 of the 12 CA-distributions emerging across the novel spiking network along with the sensorimotor pattern presented as input during learning. Each set of 12 squares is a snapshot of a distributed word-related CA circuit across the network areas; 3 for object-related words (A) and 3 for action-related (B) words of one network instance (the other simulated networks exhibited similar results). Each white pixel in the squares represents an active cell of the CA.

The CA circuits in Figure 2 show roughly the same spread across the perisylvian areas for object and action-related words. By contrast, the visual and motor sub-systems of the extrasylvian cortex appear to show a different pattern of CA cell distribution, namely a double dissociation, i.e., object-related words seemed to extend more to the visual areas (V1, TO) and less to the motor areas (PM_L, M1_L) and vice versa for action-related words.

Figure 3 illustrates examples of CA circuit activation after training (each pixel represents a spike). The network was presented with the acoustic component (an input pattern in the primary auditory area) representing the auditory word-forms of learned object- (A) and action-related (B) words, which in turn caused the “ignition” of the whole CA circuit for that specific word pattern. The snapshot numbers indicate simulation time-steps of the network activity. Similar to the distribution of the emerging CA circuits illustrated in Figure 2, action- and object-related word recognition exhibited a category-specific spreading of activity in the modality-preferential areas; this activation was near-simultaneous (i.e., synchronous spikes), binding phonological (articulatory-acoustic) and semantic information. Interestingly, the re-activation of the word-related cell assemblies across the cortical areas exhibited distinct consecutive neuronal and cognitive processes: the stimulation phase (time steps 1–2), which corresponds to word perception (orange pixels); the full activation or “ignition” phase (time steps 5–8), the correlate of word comprehension (magenta pixels); and the reverberant maintenance of activity (time steps 12–14), which underpins verbal working memory (blue pixels).

Figure 3. Activation spreading in the 12-area network showing examples of the simulated recognition processes for object- and action-related words (on the left and right, respectively; see CA #6 and CA #10 in Figure 2, respectively). Network responses to stimulation of A1 with the “auditory” patterns of two of the learned words; similar to Figure 2, the 12 network areas are represented as 12 squares, but, in this case, selected snapshots of network activity are shown. The re-activation process unfolds in consecutive neuronal and cognitive phases: the stimulation phase, which corresponds to word perception (orange pixels); the full activation or “ignition” phase, the correlate of word comprehension (magenta pixels); and the reverberant maintenance of activity, which underpins verbal working memory (blue pixels). Each colored pixel indicates one spike of one neuron included in the CA circuit at a given time step. At the top, the 12 model areas and their connectivity structure are shown and their location in the cortex indicated.

The bar graph in Figure 4 reports the topographical distribution of the CA circuits across the network areas averaged over 12 networks. Different panels show results from the word production (A) and object and action recognition (B) “experiments.” In each panel, average numbers of cell assembly neurons (plus standard errors) are shown for each area, with extrasylvian areas displayed at the top and perisylvian ones at the bottom. Intriguingly, the extrasylvian areas show a different CA distribution between the two word-type circuits, while the perisylvian language areas seem not to show any word-category differences.

Figure 4. Mean numbers of cell assembly neurons in different model areas after simulating the learning of action- (light gray) and object-related words (dark gray) during word production (A) and object and action recognition (B); error bars show standard errors over networks. (A) Simulated word production (simultaneous presentation of articulatory-auditory patterns in areas A1 and M1_i) after word meaning acquisition. The extrasylvian areas (upper part), whose cells can be seen as circuit correlates of word meaning, show a double dissociation, with relatively more strongly developed CAs for object- than for action-related words in primary and secondary visual areas (V1, TO), but stronger CAs for action-related than for object-related words in dorsolateral primary motor and premotor cortices (M1_L, PM_L). Also, the semantic hub areas (PF_L, AT) showed a degree of dissociation between the two word types. Data from the perisylvian cortex (lower part), namely articulatory and auditory areas, whose cells can be seen as circuit correlates of spoken word-forms, do not show category-specific effects. Brain areas and their connectivity structure are also illustrated. The shaded areas, but not the colored boxes, indicate location in the cortex. (B) Simulated object and action recognition [alternated presentation of sensorimotor patterns in visual (for object words) and in motor areas (for action words)]. This simulation yields results similar to those of the word production simulation. The small horizontal segment indicates the stimulus input presentation. Asterisks indicate that, within a given area, the number of CA cells differed significantly between the circuits of action and object words (Bonferroni-corrected planned comparison tests).

Furthermore, independently of whether an object- or action-related word was represented, the word learning results showed a higher density of CA cells in the connector hubs (PB, PF_i, AT, and PF_L) than in the secondary (AB, PM_i, TO, PM_L) and primary areas (A1, M1_i, V1, M1_L). Similar results were obtained for both word production and action and object recognition, in line with the differential CA topographies already noted above and in Figure 2. However, there were minor differences in the estimated cell assembly topographies: a relatively larger number of CA cells was obtained in the primary areas of the extrasylvian system for object and action recognition than for word production, which was (trivially) due to the stimulus presentation there.

The 4-way repeated-measures ANOVA (with factors WordType, PeriExtra, TempFront, and Areas) performed on the word production data from all 12 network areas fully confirmed the observations described above. A highly significant interaction emerged between the factors WordType, PeriExtra, TempFront, and Areas (F_{2, 22} = 14.012, p < 0.0002), revealing different CA circuits across the 12-area network for object- and action-related words. There was also a main effect of Areas (F_{2, 22} = 265.721, p < 0.0001), reflecting the different CA cell densities across the network noted above, namely more CA cells in hubs than in secondary regions (p < 0.0001), and more in secondary than in primary cortices (p < 0.0001). Because of the significant interaction involving peri- and extrasylvian areas, we ran separate 3-way ANOVAs on the data from the two systems. As expected, the extrasylvian system revealed a highly significant interaction of all 3 factors WordType, TempFront, and Areas (F_{2, 22} = 53.11, p < 0.0001), confirming the word-category dissociation in the CA topographies and local cell-density distributions across the extrasylvian regions as suggested by Figures 2, 3. No significant differences between the CA distributions of the 2 word types were found in the perisylvian areas (F_{2, 22} = 0.067, p = 0.93).

We further ran Bonferroni-corrected planned comparison tests (12 comparisons, corrected critical p < 0.0042) to investigate the differences between CA types that emerged after learning. Differences in CA-cell densities between word types and pairs of areas in the semantic systems were all significant (p < 0.0001), confirming the presence of a higher neuron density in visual (V1, TO, and AT) than in motor (M1_L, PM_L, and PF_L) areas for object-related words (p < 0.0001), and the opposite for action-related words (p < 0.0001). Analysis of the connector hubs (AT, PF_L) also showed a significant difference between the 2 word types there, i.e., stronger action-related word CA cell densities in PF_L compared to AT (p < 0.0001), and the opposite for object-related words (p < 0.0001). As observed above, no significant differences emerged in the perisylvian areas (p = 0.029) between the word types. We further ran the same statistical analysis on the object and action recognition data, which revealed results similar to those of the word production simulation, i.e., a double dissociation between action- and object-related words in the extrasylvian system (F_{2, 22} = 467.321, p < 0.0001) with no significant difference in perisylvian cortex (F_{2, 22} = 0.060, p < 0.91).
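
As a quick check of the corrected critical value quoted above, the Bonferroni threshold can be recomputed directly. The family-wise level of 0.05 is an assumption here (it is the conventional choice and is not stated explicitly in the text); the function names are illustrative.

```python
ALPHA = 0.05         # assumed family-wise error rate (conventional value)
N_COMPARISONS = 12   # number of planned comparisons reported in the text


def bonferroni_threshold(alpha, n_comparisons):
    """Per-comparison critical p-value under Bonferroni correction."""
    return alpha / n_comparisons


def is_significant(p_value, alpha=ALPHA, n_comparisons=N_COMPARISONS):
    """True if p_value survives the Bonferroni-corrected threshold."""
    return p_value < bonferroni_threshold(alpha, n_comparisons)


threshold = bonferroni_threshold(ALPHA, N_COMPARISONS)
print(round(threshold, 4))  # 0.0042
```

Under this threshold, the perisylvian comparison (p = 0.029) does not count as significant, whereas the extrasylvian comparisons (p < 0.0001) do.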

Discussion

We investigated the neural mechanisms underlying word learning in a biologically constrained spiking model replicating connectivity and cortical features of the frontal, temporal and occipital areas to simulate aspects of semantic grounding in action and perception. The present neural network showed:

• Emergence of neuron circuits distributed across primary, secondary, and multimodal areas, as a result of simulating the grounding of word-forms in their semantically-related objects and actions (Figure 2). We call these “semantic circuits,” because they interlink articulatory-acoustic word-form information with referential semantic representations coded in motor and visual areas;

• Re-activation of the word-related circuits during word recognition, which exhibited the distinct consecutive neuronal and cognitive processes of word perception, word understanding and working memory (Figure 3);

• Higher neuron densities of the semantic circuits and prolonged activity in the multimodal areas, where all semantic and phonological information first converges;

• Pronounced semantic category-specificity primarily in the modality-preferential areas and moderate specificity also in multimodal areas for both word production and object and action recognition (Figures 4A,B).

The present simulations offer a neurobiological explanation of a wide range of recent experimental results about word meaning processing and make critical predictions about the functional role of multimodal-association hubs, secondary and primary cortical regions in language and semantic processing. Below, we provide a detailed discussion of the model and its results in light of previous empirical evidence and current semantic brain theories, along with its novel critical predictions.

Semantic Brain Processes: Data and Models

Accumulating evidence emphasizes the relevance of several cortical regions for semantic processing, including inferior-frontal, superior- and anterior-temporal multimodal areas (Patterson et al., 2007; Binder et al., 2009; Pulvermüller, 2013), which are apparently relevant for all types of semantic processing, and modality-preferential areas, which seemingly take a category-specific role in semantics (Barsalou, 2008; Binder and Desai, 2011; Pulvermüller, 2013). Of great relevance in the current discussion about semantic grounding and “embodiment” is the contribution of modality-preferential areas including primary and secondary cortices, for example the motor and premotor cortex, or the primary and other “early” visual areas, in semantic processing. These areas, which had classically been seen as “perceptual” or “motor” in their function, seem to partake in and contribute to semantic processing, as a range of previous experimental studies showed. The present results fit the postulate of semantic grounding (Harnad, 1990) that, in order to know the meaning of a symbol, it is necessary to relate it to real world entities, for example, the word “grasp” to grasping actions and the word “house” to the typical visual shape of houses. Grounding in this sense needs to be implemented in semantic representations that reach into motor and sensory systems. Our simulations applying brain constrained modeling at different levels demonstrate grounding in this very sense, hence fitting (and explaining) the experimental results mentioned above.

Some attempts have been made to integrate both category-general and category-specific semantic mechanisms into one theoretical framework. The “hub-and-spoke” model postulates one single semantic hub in the anterior-inferior-temporal lobe with category-specific spokes mainly in posterior brain areas (Ralph et al., 2017). This model explains crucial features of semantic dementia, but is inconsistent with hub-like properties of other multimodal areas (see Introduction) and, in addition, does not address the motor system's role in category-specific processing (Vukovic et al., 2017), nor some fine-grained differences in the ability to process specific semantic categories which result from different types of dementia (Shebani et al., 2017). Neurocomputational studies (Ueno et al., 2011; Chen et al., 2017) have investigated aspects of the hub-and-spoke model. However, as mentioned in the Introduction, Chen et al. did not include all the brain areas for which experimental studies show a critical role in general semantic processing, and these studies used a learning mechanism (i.e., back-propagation; Ueno et al., 2011; Chen et al., 2017) which has been criticized as implausible for cortical networks (Mazzoni et al., 1991; O'Reilly, 1998).

A claim about multiple semantic hubs has been made, in association with that about category-specific areas (Binder and Desai, 2011; Pulvermüller, 2013). However, formal neural networks that could act as a foundation of a theory of semantic brain mechanisms have so far not reached the level of sophisticated neurobiologically constrained modeling with spiking neurons, realistic connectivity and learning. Earlier attempts were made using a preliminary version of the present architecture with non-spiking neurons (Garagnani and Pulvermüller, 2016; Tomasello et al., 2017). These previous models already suggested an explanation of category-general and category-specific semantic processing, but their conclusions were limited by their less accurate modeling of neurophysiological and neuroanatomical features of the cortex.

Novel Contribution: Increased Brain-Constraints

Here, we added important neurobiological constraints, introducing leaky integrate-and-fire neurons that transform their summed input non-linearly into discrete output in the form of spikes. Similarly to biological neurons, functional interaction within the present model was based on discrete spikes, whereas previous mean-field networks used continuous activity functions (i.e., graded-response neurons), a less realistic implementation. Using graded-response neurons makes it easier to build distributed neural circuits across multiple areas as a result of action-perception learning since this type of neuron retains an increased firing rate for more extended periods. It was, therefore, crucial to investigate the possibility of distributed circuit formation with spiking neurons, which show an activation (action potential) for a short moment and then go silent again.
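
The difference between a spiking and a graded-response unit can be made concrete with a generic discrete-time leaky integrate-and-fire neuron. The sketch below is a textbook-style simplification, not the model's actual equations: the time constant `tau`, the threshold, the reset to zero and the Euler update are all assumptions chosen for illustration.

```python
def simulate_lif(inputs, tau=2.5, threshold=1.0, dt=1.0):
    """Generic leaky integrate-and-fire neuron; returns one 0/1 spike value per step."""
    potential = 0.0
    spikes = []
    for current in inputs:
        # Leaky integration: the potential decays toward the present input.
        potential += dt / tau * (-potential + current)
        if potential >= threshold:
            spikes.append(1)   # the spike lasts a single time step
            potential = 0.0    # assumed reset after firing
        else:
            spikes.append(0)
    return spikes


strong = simulate_lif([2.0] * 6)   # [0, 1, 0, 1, 0, 1]
weak = simulate_lif([0.5] * 6)     # [0, 0, 0, 0, 0, 0]
```

Even with constant supra-threshold input, the output is a train of brief events separated by silent steps, whereas a graded-response unit would emit a sustained value; sub-threshold input produces no output at all.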

Compared with earlier studies, the present network included a more realistic set of cortico-cortical fiber tracts, adding second-next area connections or “jumping links” (blue arrows Figures 1A,B) indicated by DTI/DWI studies. A recent neurocomputational study (Schomers et al., 2017) showed that these jumping links are instrumental for building verbal short-term memory, a capacity crucial for human language learning. Furthermore, previous exploratory implementation of “jumping links” in an extended semantic network of mean-field (non-spiking/gradually active) neuronal elements suggested a degree of over-activation in case of implementation of the rich set of cortico-cortical connections, thus preventing precise simulation of more realistic connectivity. The use of spiking neuronal cells, whose action potentials only last for 1 simulation time-step and therefore produced less activity overall compared with the graded-neuron network, opened the possibility to include additional connection pathways documented by recent research without running into over-activation problems. On the other hand, spiking-neuron networks with just next neighbor connections between areas (thus omitting the “jumping” links) ran into an under-activation problem, precisely because of the same feature (i.e., that spiking neurons lose their activity immediately). Thus, only the combined improvement of neuroanatomical (jumping connections) and neurophysiological (spiking) realism led to a functional network, which largely confirms conclusions formerly proposed on the basis of less realistic architectures. Incorporating significant biological detail into networks may be essential for obtaining a better understanding of the complex cortical mechanisms underlying semantic processing. 
Indeed, recent modeling results suggest that large-scale synchronous spiking within cell assembly circuits, also observed here, may be important for the binding of form to meaning during word learning and comprehension (Garagnani et al., 2017).

In summary, the comparison of less and more biologically constrained networks showed that improving the degree of realism does not always help. Moving from graded-response to spiking neurons alone yields an underactive network with little prospect of modeling semantic cognition, whereas adding a more detailed, elaborate and realistic connectivity structure on its own produces an overactive and thus, once again, dysfunctional network. Only the parallel improvement on structural (anatomical) and functional (physiological) dimensions, that is, adding jumping links and spiking neurons, led to a functional network, which could confirm results from the earlier simulations obtained with the next-neighbor-connectivity, mean-field network, while providing a simulation at a more brain-constrained and therefore more realistic level.

Emergence of Distributed Symbolic Circuits

The present model imitates elementary processes of semantic learning, where word-forms are presented in the context of object (Vouloumanos and Werker, 2009) or action information (Tomasello and Kruger, 1992). In our model, the co-occurrence of objects or actions with word-forms was implemented as correlated neuronal activation patterns in the model's primary articulatory (M1_i) and auditory (A1) areas along with either dorsolateral motor (M1_L) or visual cortex (V1). The first significant finding of this study is that such information about the semantic grounding of symbols can be mapped reliably onto biologically constrained associative networks. Each pattern representing the pairing of one specific symbol and one specific action or object led to the formation of a distributed circuit of spiking neurons spread out across several areas of the architecture. Each of these distributed circuits acted as a coherent functional unit, with its interlinked neurons in sensory, motor and multimodal areas activating together. The formation of each circuit required the spreading of activity across the network and the selective strengthening of links between a significant number of participating neurons. Such strengthening was substantial enough that, after learning, “auditory input” was sufficient to revive the entire circuit, including its articulatory and semantic components. Comparing the mean-field next-neighbor model with the jumping-links spiking model revealed massive differences in the dynamics of cell assembly activation during auditory word recognition (Figure 3).
Whereas the mean-field model showed cascaded activation dynamics (with serial onset of activations and only partly overlapping activity of the hub areas AT, PF_L), the full-fledged three-phase dynamics with perception (activation of auditory areas), ignition (near-simultaneous activation of cell assembly neurons dispersed across wide cortical areas), and working memory (reverberation of activity in part of the cell assembly) was only present in the spiking and fully connected model. Intriguingly, after ignition, activity retreats from modality-preferential areas (time step 12, Figure 3) to hub areas (time step 14), which predicts an “anterior shift” from visual and motor areas to adjacent-anterior connector hub regions in temporal and prefrontal cortex during working memory (see also Fuster, 2009; Pulvermüller and Garagnani, 2014; Pulvermüller, 2018).
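
Why correlated input builds a circuit while noise input does not can be illustrated with a deliberately simplified Hebbian rule. This is a sketch under assumptions, not the learning rule used in the simulations: the learning rate, the saturating update and the deterministic stand-in for noise are all hypothetical.

```python
def hebbian_update(weight, pre_active, post_active, rate=0.1):
    """Simplified Hebbian rule: strengthen on co-activation, weaken on mismatch."""
    if pre_active and post_active:
        return weight + rate * (1.0 - weight)   # potentiation, saturating at 1
    if pre_active != post_active:
        return weight - rate * weight           # depression when only one cell fires
    return weight                               # no change when both are silent


def train(post_activity, weight=0.1):
    """Apply the rule over episodes in which the presynaptic cell always fires."""
    for post_active in post_activity:
        weight = hebbian_update(weight, True, post_active)
    return weight


EPISODES = 20
correlated = train([True] * EPISODES)
# Stand-in for noise input: the postsynaptic cell fires only on every fifth episode.
uncorrelated = train([episode % 5 == 0 for episode in range(EPISODES)])
```

After 20 episodes the consistently co-activated link approaches its maximum, while the link to the inconsistently active cell stays weak, which mirrors the contrast between the pattern-driven and the noise-driven primary areas.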

Although the formation of each circuit was driven by correlated information in sensory and motor areas, widely distributed circuits with many neurons in multimodal convergence zones became active. The involvement of neurons in multimodal areas is explained by the long-distance connectivity structure, in particular by the absence of direct long-distance connections between sensory and motor areas; to bind information across modalities, activity must travel through connector hub areas (also called convergence zones, Damasio, 1989) bridging between sensorimotor cortices. It is important to emphasize, however, that while the presence of connector hubs in the model is a (neuroanatomically motivated) structural feature, the result that the learned action and object word circuits reach both extrasylvian connector hubs AT and PF_L, hence forming semantic hubs, is not trivial, and could not be predicted a priori¹. In other words, while the presence of connector hubs is a structural feature of the model, the formation of semantic hubs is not, and constitutes one of its crucial emergent properties.

The spontaneous formation of internal semantic circuits spanning the entire spiking neural network is a direct consequence of neurobiological principles modeled in the architecture that are known to govern the human brain. As discussed below, the activation of the learned distributed circuits explains relevant “semantic area activations” seen in neuroimaging experiments (for further discussion, see Garagnani and Pulvermüller, 2013; Tomasello et al., 2017).

Explaining Multiple Semantic Hubs

Not only did our model firmly bind neurons in multimodal areas to sensorimotor neurons involved in semantic processing, but, within each circuit, the proportion of these multimodal-area neurons was even greater than the proportion of circuit neurons in primary and secondary areas. At first glance, this appears surprising, because, during pattern presentation, sensory and motor neurons were directly stimulated together, whereas multimodal areas were activated only indirectly, by activity spreading from primary areas. However, the multimodal areas occupy a central location in the network topology because they bridge between sensory and motor areas, and therefore receive near-simultaneous convergent input from different (here, three) systems during learning. Such convergence also takes advantage of the higher “degree” of connectivity characterizing multimodal areas and of their resultant role as “connector hubs,” for which a special role in cognition has previously been proposed (van den Heuvel and Sporns, 2013). The cumulative effect of correlated inputs through several pathways converging on multimodal hubs accounts for their higher neuron densities and their resultant major contribution to semantic circuit function. Thus, given that large fractions of the neurons of all semantic circuits were located in connector hubs, the model explains the prominent role of these connector regions in general semantic processing, which is due to both the well-known pre-existing neuroanatomical connectivity and the correlated neuronal activity during word learning.
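
The notion of connectivity “degree” can be illustrated on a toy graph. The wiring below is an assumption made purely for illustration and is not a transcription of Figure 1: it contains four primary-secondary-hub chains plus links among the four hubs, and it omits the jumping links.

```python
from collections import defaultdict

# Toy wiring, assumed for illustration only (not the connectivity of Figure 1).
CHAINS = [
    ("A1", "AB", "PB"),
    ("M1_i", "PM_i", "PF_i"),
    ("V1", "TO", "AT"),
    ("M1_L", "PM_L", "PF_L"),
]
HUBS = [chain[-1] for chain in CHAINS]


def build_links():
    """Return the set of undirected between-area links of the toy network."""
    links = set()
    for primary, secondary, hub in CHAINS:
        links.add(frozenset((primary, secondary)))
        links.add(frozenset((secondary, hub)))
    for i, hub_a in enumerate(HUBS):
        for hub_b in HUBS[i + 1:]:
            links.add(frozenset((hub_a, hub_b)))
    return links


def degrees(links):
    """Count how many between-area links touch each area."""
    degree = defaultdict(int)
    for link in links:
        for area in link:
            degree[area] += 1
    return dict(degree)


area_degree = degrees(build_links())
```

In this toy wiring each primary area has degree 1, each secondary area degree 2 and each hub degree 4, so activity from several systems necessarily converges on the hubs.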

Crucially, the model implicates and explains not only one, but at least four experimentally observed “semantic hub” areas. One of these is in anterior-temporal lobe, providing a theoretical foundation for the critical postulate of the hub-and-spoke model (Patterson et al., 2007). Other semantic hubs are in superior-temporal-parabelt and in inferior- and dorsolateral-prefrontal cortex, where other models postulate sites of general semantic processing (Posner and Pavese, 1998; Bookheimer, 2002; Tate et al., 2014; Schomers and Pulvermüller, 2016; Carota et al., 2017). Our model, therefore, fits (and explains) data indicating the presence of frontal and temporal semantic hub areas, thus reconciling extant experimental evidence for a range of regions generally involved in conceptual processing (for reviews, see Kiefer and Pulvermüller, 2012; Pulvermüller, 2013).

Explaining Category-Specificity

We modeled the learning and processing of two different semantic categories: object- and action-related words. The formation of semantic circuits was driven by sensorimotor pattern information, involving visual cortex activity for object words and hand-motor cortex activity for action words. The respective other input system was activated with random noise to model the variable action output (visual input) in the context of specific visual objects (actions). Such uncorrelated noisy activity counters the spontaneous extension of neuron circuits toward inactive areas (Doursat and Bienenstock, 2006). Notably, as a consequence of the differential sensorimotor activation patterns, different circuit topographies developed across the areas for both word production and action or object recognition: circuits storing action-related information reached into the motor cortices (M1_L-PM_L) but not or less into visual areas (V1-TO), and vice versa for object words. Semantic circuits with different cortical topographies, which are a result of correlated neuronal activity in different sensorimotor areas during language learning, can therefore explain the emergence of category-specific semantic contributions of different cortical areas.

We take this observation as a proof of concept that the present type of spiking and jumping network is capable of spontaneously developing semantic-category specificity, replicating a number of studies that revealed neuroimaging and neuropsychological dissociations between action verbs and object nouns or between noun sub-categories related to animals and tools (Damasio and Tranel, 1993; Martin et al., 1996; Martin, 2007; Moseley and Pulvermüller, 2014; Kemmerer, 2015). Interestingly, some category specificity was revealed in the semantic hubs, although it was less pronounced than in primary and secondary areas. The area-specific category effect predicted by the model (Figure 4) appears to be graded, with a stronger category effect in the primary than in the secondary areas and a stronger one in the secondary than in the hub areas; this prediction awaits experimental validation. The moderate category specificity predicted in the semantic hub areas is in line with recent evidence that patients with semantic dementia due to anterior-temporal lesions show category-specific semantic impairments (Pulvermüller et al., 2010; Gainotti, 2012; Shebani et al., 2017), which sits less well with the suggested general-semantic function across all semantic types (Patterson et al., 2007).

It needs to be emphasized that most previous studies on semantics have investigated action and object words taken from natural languages, focusing mostly on the noun-verb distinction, which makes it difficult to control for all psycholinguistic properties and, especially, for when these words were acquired (e.g., Moseley and Pulvermüller, 2014). If we take our present simulations as models of concrete action verb vs. object noun processing, there is a good fit with the data, as these semantically and lexically different word types tend to differentially activate motor regions or ventral visual areas respectively (Damasio et al., 1996; Martin et al., 1996; Pulvermüller et al., 1999, 2014; Vigliocco et al., 2004; Martin, 2007; Moseley et al., 2013). However, note that the “action” and “object words” simulated here capture the differential action- and object-relatedness of many verbs and nouns, but not the lack of such semantic differences seen between abstract verbs/nouns and certainly not the combinatorial or distributional differences between word categories, which result from their differential placements in specific grammatical contexts. Hence, for directly comparing the predictions of the present simulations to empirical data, it will be advantageous to perform analogous learning experiments and brain imaging studies to investigate where in the brain the neural signatures of novel object and action words first emerge. Nevertheless, the present simulations demonstrate the validity of a neurobiological theory of language processing (see Introduction, and Damasio, 1989; Pulvermüller, 2013), in which the mutual interaction of a set of neurobiological principles at work within anatomically realistic structures, together with Hebbian learning, is sufficient for explaining the emergence of semantic hubs and category specificity in the human brain.

It may be worthwhile to point to additional limitations of the present work along with possible extensions in the future. When an infant learns a new action word (e.g., “grasp”) by hearing a novel word form while performing the related action toward an object, concurrent activity might be present not just in the perisylvian language areas and motor cortices, but also in the visual occipital-parietal “where” stream (Mishkin and Ungerleider, 1982; Mishkin et al., 1983), which was not implemented here. Therefore, an important extension of the present model would be to include parietal areas and the dorsal visual “where” stream. Inclusion of left parietal areas would also be strongly motivated experimentally, as they are well known to play a role in general language processing (Pulvermüller and Fadiga, 2010) and also in category-specific processing of prepositions, number and tool words (Dehaene, 1995; Binder and Desai, 2011; Tschentscher et al., 2012; Shebani et al., 2017). Further model extensions should address other forms of language learning. Here we investigated but one aspect of word meaning acquisition, namely associative learning between a word and its referents, which represents only a very basic step of semantic learning. To capture other types of semantic learning, the emergence of semantic knowledge from variable contexts needs to be covered along with the semantic grounding of words learned from texts, where semantic links may be explained by co-activation of linguistic representations. Future work may use realistic neuronal networks to address how, based on a kernel of early acquired words semantically grounded in referent object and action contexts, the co-occurrence of words in texts can lead to the formation of novel semantic circuits and semantic representations (Harnad, 2011; Stramandinoli et al., 2012).
Furthermore, future simulations should extend the present work by investigating how combinatorial grammatical binding between pre-learnt and whole-form-stored lexical units emerges from correlated activity in co-activated neuronal circuits (see Pulvermüller, 2010).

Still, already in its current form, the present computational model makes critical predictions (some of which we spelled out in detail in the discussion above) about how meaning is acquired, processed and stored in the human brain. Compared with earlier similar work, the spiking-and-jumping neural network developed in this work is based on a wider range of biological principles and features of the human brain, such as the neurophysiological dynamics of spiking pyramidal cells, synaptic modification by way of Hebbian learning, local lateral inhibition and area-specific global regulation mechanisms, uncorrelated white noise present in all neurons during learning, and a brain-like connectivity structure based on neuroanatomical evidence. Therefore, the present model provides a sophisticated mechanistic explanation of the differential involvement of semantic cortical regions.

Conclusion

We used a biologically constrained neurocomputational model mimicking cortical features and connectivity of frontal, temporal and occipital cortices to simulate the brain mechanisms of word meaning acquisition. Extending our earlier work (Garagnani and Pulvermüller, 2016; Tomasello et al., 2017) by introducing, for the first time, spiking neuronal cells in a neuroanatomically constrained model with brain-like connectivity, we show that Hebbian associative learning and connectivity together are sufficient to account for the emergence of general semantic areas (“semantic hubs”), as well as the specific contributions of other, modality-preferential areas to the processing of specific semantic categories. The present simulation results show that neurobiologically constrained networks can fruitfully contribute to bridging the gap between cellular-level mechanisms, behavior and cognition by integrating brain theory with experimental data.

Author Contributions

RT conceived the study, conducted the experiments, analyzed the data, and wrote the paper. MG, TW, and FP supervised the study and contributed to paper writing.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This work was supported by the Freie Universität Berlin, Deutsche Forschungsgemeinschaft (Pu 97/22-1), Engineering and Physical Sciences Research Council (EPSRC) and Biotechnology and Biological Sciences Research Council (BBSRC) U.K. (project grant no. EP/J004561/1: BABEL—Brain-inspired architecture for brain embodied language) and the Berlin School of Mind and Brain, Humboldt Universität (Ph.D. fellowship to RT). The authors would also like to thank the HPC Service of ZEDAT, Freie Universität Berlin, for support and computing time.

Footnotes

1. Note that the linkage of a perisylvian word circuit with semantic information coming from the visual (or motor) system does not necessarily have to go through connector hub PF_L (or AT).

References

Amir, Y., Harel, M., and Malach, R. (1993). Cortical hierarchy reflected in the organization of intrinsic connections in macaque monkey visual cortex. J. Comp. Neurol. 334, 19–46. doi: 10.1002/cne.903340103

Arikuni, T., Watanabe, K., and Kubota, K. (1988). Connections of area 8 with area 6 in the brain of the macaque monkey. J. Comp. Neurol. 277, 21–40. doi: 10.1002/cne.902770103

Artola, A., Bröcher, S., and Singer, W. (1990). Different voltage-dependent thresholds for inducing long-term depression and long-term potentiation in slices of rat visual cortex. Nature 347, 69–72. doi: 10.1038/347069a0

Artola, A., and Singer, W. (1993). Long-term depression of excitatory synaptic transmission and its relationship to long-term potentiation. Trends Neurosci. 16, 480–487. doi: 10.1016/0166-2236(93)90081-V

Barsalou, L. W. (2008). Grounded cognition. Annu. Rev. Psychol. 59, 617–645. doi: 10.1146/annurev.psych.59.103006.093639

Bauer, R. H., and Fuster, J. M. (1978). The effect of ambient illumination on delayed-matching and delayed-response deficits from cooling dorsolateral prefrontal cortex. Behav. Biol. 22, 60–66. doi: 10.1016/S0091-6773(78)92019-9

Bauer, R. H., and Jones, C. N. (1976). Feedback training of 36-45 Hz EEG activity in the visual cortex and hippocampus of cats: evidence for sensory and motor involvement. Physiol. Behav. 17, 885–890. doi: 10.1016/0031-9384(76)90003-2

Bibbig, A., Wennekers, T., and Palm, G. (1995). A neural network model of the cortico-hippocampal interplay and the representation of contexts. Behav. Brain Res. 66, 169–175. doi: 10.1016/0166-4328(94)00137-5

Binder, J. R., and Desai, R. H. (2011). The neurobiology of semantic memory. Trends Cogn. Sci. 15, 527–536. doi: 10.1016/j.tics.2011.10.001

Binder, J. R., Desai, R. H., Graves, W. W., and Conant, L. L. (2009). Where is the semantic system? A critical review and meta-analysis of 120 functional neuroimaging studies. Cereb. Cortex 19, 2767–2796. doi: 10.1093/cercor/bhp055

Bookheimer, S. (2002). Functional MRI of language: new approaches to understanding the cortical organization of semantic processing. Annu. Rev. Neurosci. 25, 151–188. doi: 10.1146/annurev.neuro.25.112701.142946

Braitenberg, V. (1978). “Cell assemblies in the cerebral cortex,” in Theoretical Approaches to Complex Systems, eds R. Heim and G. Palm (Berlin: Springer), 171–188. doi: 10.1007/978-3-642-93083-6_9

Braitenberg, V., and Schüz, A. (1998). Cortex: Statistics and Geometry of Neuronal Connectivity. Berlin: Springer.

Bressler, S. L., Coppola, R., and Nakamura, R. (1993). Episodic multiregional cortical coherence at multiple frequencies during visual task performance. Nature 366, 153–156. doi: 10.1038/366153a0

Caramazza, A., Anzellotti, S., Strnad, L., and Lingnau, A. (2014). Embodied cognition and mirror neurons: a critical assessment. Annu. Rev. Neurosci. 37, 1–15. doi: 10.1146/annurev-neuro-071013-013950

Carota, F., Kriegeskorte, N., Nili, H., and Pulvermüller, F. (2017). Representational similarity mapping of distributional semantics in left inferior frontal, middle temporal, and motor cortex. Cereb. Cortex. 27, 294–309. doi: 10.1093/cercor/bhw379

Catani, M., Jones, D. K., Donato, R., and Ffytche, D. H. (2003). Occipito-temporal connections in the human brain. Brain 126, 2093–2107. doi: 10.1093/brain/awg203

Catani, M., Jones, D. K., and Ffytche, D. H. (2005). Perisylvian language networks of the human brain. Ann. Neurol. 57, 8–16. doi: 10.1002/ana.20319

Chafee, M. V., and Goldman-Rakic, P. S. (2000). Inactivation of parietal and prefrontal cortex reveals interdependence of neural activity during memory-guided saccades. J. Neurophysiol. 83, 1550–1566. doi: 10.1152/jn.2000.83.3.1550

Chao, L. L., Haxby, J. V., and Martin, A. (1999). Attribute-based neural substrates in temporal cortex for perceiving and knowing about objects. Nat. Neurosci. 2, 913–919. doi: 10.1038/13217

Chen, L., Lambon Ralph, M. A., and Rogers, T. T. (2017). A unified model of human semantic knowledge and its disorders. Nat. Hum. Behav. 1:39. doi: 10.1038/s41562-016-0039

Christiansen, M. H., and Chater, N. (2001). Connectionist psycholinguistics: capturing the empirical data. Trends Cogn. Sci. 5, 82–88. doi: 10.1016/S1364-6613(00)01600-4

Connors, B. W., Gutnick, M. J., and Prince, D. A. (1982). Electrophysiological properties of neocortical neurons in vitro. J. Neurophysiol. 48, 1302–1320. doi: 10.1152/jn.1982.48.6.1302

Damasio, A. R. (1989). Time-locked multiregional retroactivation: a systems-level proposal for the neural substrates of recall and recognition. Cognition 33, 25–62. doi: 10.1016/0010-0277(89)90005-X

Damasio, A. R., and Tranel, D. (1993). Nouns and verbs are retrieved with differently distributed neural systems. Proc. Natl. Acad. Sci. U.S.A. 90, 4957–4960. doi: 10.1073/pnas.90.11.4957

Damasio, H., Grabowski, T. J., Tranel, D., Hichwa, R. D., and Damasio, A. R. (1996). A neural basis for lexical retrieval. Nature 380, 499–505. doi: 10.1038/380499a0

Deacon, T. W. (1992). Cortical connections of the inferior arcuate sulcus cortex in the macaque brain. Brain Res. 573, 8–26. doi: 10.1016/0006-8993(92)90109-M

Dehaene, S. (1995). Electrophysiological evidence for category-specific word processing in the normal human brain. Neuroreport 6, 2153–2157. doi: 10.1097/00001756-199511000-00014

Deiber, M. P., Passingham, R. E., Colebatch, J. G., Friston, K. J., Nixon, P. D., and Frackowiak, R. S. (1991). Cortical areas and the selection of movement: a study with positron emission tomography. Exp. Brain Res. 84, 393–402. doi: 10.1007/BF00231461

Dell, G. S., Chang, F., and Griffiths, Z. M. (1999). Connectionist models of language production: lexical access and grammatical encoding. Cogn. Sci. 23, 517–542. doi: 10.1207/s15516709cog2304_6

D'Esposito, M. (2007). From cognitive to neural models of working memory. Philos. Trans. R. Soc. Lond. B Biol. Sci. 362, 761–772. doi: 10.1098/rstb.2007.2086

Distler, C., Boussaoud, D., Desimone, R., and Ungerleider, L. G. (1993). Cortical connections of inferior temporal area TEO in macaque monkeys. J. Comp. Neurol. 334, 125–150. doi: 10.1002/cne.903340111

Douglas, R. J., and Martin, K. A. (2004). Neuronal circuits of the neocortex. Annu. Rev. Neurosci. 27, 419–451. doi: 10.1146/annurev.neuro.27.070203.144152

Doursat, R., and Bienenstock, E. (2006). “Neocortical self-structuration as a basis for learning,” in 5th International Conference on Development and Learning (ICDL) 2006 (Bloomington, IN: Indiana University).

Dreyer, F. R., Frey, D., Arana, S., von Saldern, S., Picht, T., Vajkoczy, P., et al. (2015). Is the motor system necessary for processing action and abstract emotion words? Evidence from focal brain lesions. Front. Psychol. 6:1661. doi: 10.3389/fpsyg.2015.01661

Dum, R. P., and Strick, P. L. (2002). Motor areas in the frontal lobe of the primate. Physiol. Behav. 77, 677–682. doi: 10.1016/S0031-9384(02)00929-0

Dum, R. P., and Strick, P. L. (2005). Frontal lobe inputs to the digit representations of the motor areas on the lateral surface of the hemisphere. J. Neurosci. 25, 1375–1386. doi: 10.1523/JNEUROSCI.3902-04.2005

Duncan, J. (1996). Competitive brain systems in selective attention. Int. J. Psychol. 31:3343.

Duncan, J. (2006). EPS mid-career award 2004: brain mechanisms of attention. Q. J. Exp. Psychol. 59, 2–27. doi: 10.1080/17470210500260674

Eacott, M. J., and Gaffan, D. (1992). Inferotemporal-frontal disconnection: the uncinate fascicle and visual associative learning in monkeys. Eur. J. Neurosci. 4, 1320–1332. doi: 10.1111/j.1460-9568.1992.tb00157.x

Eggert, J., and van Hemmen, J. L. (2000). Unifying framework for neuronal assembly dynamics. Phys. Rev. E. Stat. Phys. Plasmas. Fluids. Relat. Interdiscip. Topics 61, 1855–1874. doi: 10.1103/PhysRevE.61.1855

Fadiga, L., Craighero, L., Buccino, G., and Rizzolatti, G. (2002). Speech listening specifically modulates the excitability of tongue muscles: a TMS study. Eur. J. Neurosci. 15, 399–402. doi: 10.1046/j.0953-816x.2001.01874.x

Finnie, P. S., and Nader, K. (2012). The role of metaplasticity mechanisms in regulating memory destabilization and reconsolidation. Neurosci. Biobehav. Rev. 36, 1667–1707. doi: 10.1016/j.neubiorev.2012.03.008

Fuster, J. M. (2003). Cortex and Mind: Unifying Cognition. New York, NY: Oxford University Press.

Fuster, J. M. (2009). Cortex and memory: emergence of a new paradigm. J. Cogn. Neurosci. 21, 2047–2072. doi: 10.1162/jocn.2009.21280

Fuster, J. M., Bauer, R. H., and Jervey, J. P. (1985). Functional interactions between inferotemporal and prefrontal cortex in a cognitive task. Brain Res. 330, 299–307. doi: 10.1016/0006-8993(85)90689-4

Fuster, J. M., and Jervey, J. P. (1981). Inferotemporal neurons distinguish and retain behaviorally relevant features of visual stimuli. Science 212, 952–955. doi: 10.1126/science.7233192

Gainotti, G. (2010). The influence of anatomical locus of lesion and of gender-related familiarity factors in category-specific semantic disorders for animals, fruits and vegetables: a review of single-case studies. Cortex 46, 1072–1087. doi: 10.1016/j.cortex.2010.04.002

Gainotti, G. (2012). The format of conceptual representations disrupted in semantic dementia: a position paper. Cortex 48, 521–529. doi: 10.1016/j.cortex.2011.06.019

Garagnani, M., Lucchese, G., Tomasello, R., Wennekers, T., and Pulvermüller, F. (2017). A spiking neurocomputational model of high-frequency oscillatory brain responses to words and pseudowords. Front. Comput. Neurosci. 10:145. doi: 10.3389/fncom.2016.00145

Garagnani, M., and Pulvermüller, F. (2011). From sounds to words: a neurocomputational model of adaptation, inhibition and memory processes in auditory change detection. Neuroimage 54, 170–181. doi: 10.1016/j.neuroimage.2010.08.031

Garagnani, M., and Pulvermüller, F. (2013). Neuronal correlates of decisions to speak and act: Spontaneous emergence and dynamic topographies in a computational model of frontal and temporal areas. Brain Lang. 127, 75–85. doi: 10.1016/j.bandl.2013.02.001

Garagnani, M., and Pulvermüller, F. (2016). Conceptual grounding of language in action and perception: a neurocomputational model of the emergence of category specificity and semantic hubs. Eur. J. Neurosci. 43, 721–737. doi: 10.1111/ejn.13145

Garagnani, M., Wennekers, T., and Pulvermüller, F. (2007). A neuronal model of the language cortex. Neurocomputing 70, 1914–1919. doi: 10.1016/j.neucom.2006.10.076

Garagnani, M., Wennekers, T., and Pulvermüller, F. (2008). A neuroanatomically grounded Hebbian-learning model of attention-language interactions in the human brain. Eur. J. Neurosci. 27, 492–513. doi: 10.1111/j.1460-9568.2008.06015.x

Garagnani, M., Wennekers, T., and Pulvermüller, F. (2009). Recruitment and consolidation of cell assemblies for words by way of hebbian learning and competition in a multi-layer neural network. Cognit. Comput. 1, 160–176. doi: 10.1007/s12559-009-9011-1

Gierhan, S. M. (2013). Connections for auditory language in the human brain. Brain Lang. 127, 205–221. doi: 10.1016/j.bandl.2012.11.002

Grisoni, L., Dreyer, F. R., and Pulvermüller, F. (2016). Somatotopic semantic priming and prediction in the motor system. Cereb. Cortex 26, 2353–2366. doi: 10.1093/cercor/bhw026

Guenther, F. H., Ghosh, S. S., and Tourville, J. A. (2006). Neural modeling and imaging of the cortical interactions underlying syllable production. Brain Lang. 96, 280–301. doi: 10.1016/j.bandl.2005.06.001

Guye, M., Parker, G. J., Symms, M., Boulby, P., Wheeler-Kingshott, C. A., Salek-Haddadi, A., et al. (2003). Combined functional MRI and tractography to demonstrate the connectivity of the human primary motor cortex in vivo. Neuroimage 19, 1349–1360. doi: 10.1016/S1053-8119(03)00165-4

Harnad, S. (1990). The symbol grounding problem. Phys. D 42, 335–346. doi: 10.1016/0167-2789(90)90087-6

Harnad, S. (2011). “From sensorimotor categories and pantomime to grounded symbols and propositions,” in Handbook of Language Evolution, eds M. Tallerman and K. Gibson (Oxford, UK: Oxford University Press), 387–392.

Hauk, O., Johnsrude, I., and Pulvermüller, F. (2004). Somatotopic representations of action words in human motor and premotor cortex. Neuron 41, 301–307. doi: 10.1016/S0896-6273(03)00838-9

Hebb, D. O. (1949). The Organization of Behavior. New York, NY: John Wiley.

Izhikevich, E. M., and Edelman, G. (2008). Large-scale model of mammalian thalamocortical systems. Proc. Natl. Acad. Sci. U.S.A. 105, 3593–3598. doi: 10.1073/pnas.0712231105

Kaas, J. H. (1997). Topographic maps are fundamental to sensory processing. Brain Res. Bull. 44, 107–112. doi: 10.1016/S0361-9230(97)00094-4

Kaas, J. H., and Hackett, T. A. (2000). Subdivisions of auditory cortex and processing streams in primates. Proc. Natl. Acad. Sci. U.S.A. 97, 11793–11799. doi: 10.1073/pnas.97.22.11793

Kandel, E. R., Schwartz, J. H., and Jessell, T. M. (2000). Principles of Neural Science. New York, NY: McGraw-Hill.

Kemmerer, D. (2015). Are the motor features of verb meanings represented in the precentral motor cortices? Yes, but within the context of a flexible, multilevel architecture for conceptual knowledge. Psychon. Bull. Rev. 22, 1068–1075. doi: 10.3758/s13423-014-0784-1

Kemmerer, D., Rudrauf, D., Manzel, K., and Tranel, D. (2012). Behavioral patterns and lesion sites associated with impaired processing of lexical and conceptual knowledge of actions. Cortex 48, 826–848. doi: 10.1016/j.cortex.2010.11.001

Kiefer, M., and Pulvermüller, F. (2012). Conceptual representations in mind and brain: theoretical developments, current evidence and future directions. Cortex 48, 805–825. doi: 10.1016/j.cortex.2011.04.006

Lu, M. T., Preston, J. B., and Strick, P. L. (1994). Interconnections between the prefrontal cortex and the premotor areas in the frontal lobe. J. Comp. Neurol. 341, 375–392. doi: 10.1002/cne.903410308

Machery, E. (2007). Concept empiricism: a methodological critique. Cognition 104, 19–46. doi: 10.1016/j.cognition.2006.05.002

Mahon, B. Z., and Caramazza, A. (2008). A critical look at the embodied cognition hypothesis and a new proposal for grounding conceptual content. J. Physiol. Paris 102, 59–70. doi: 10.1016/j.jphysparis.2008.03.004

Makris, N., and Pandya, D. N. (2009). The extreme capsule in humans and rethinking of the language circuitry. Brain Struct. Funct. 213, 343–358. doi: 10.1007/s00429-008-0199-8

Malenka, R. C., and Bear, M. F. (2004). LTP and LTD: an embarrassment of riches. Neuron 44, 5–21. doi: 10.1016/j.neuron.2004.09.012

Markram, H., Meier, K., Lippert, T., Grillner, S., Frackowiak, R., Dehaene, S., et al. (2011). Introducing the human brain project. Proc. Comput. Sci. 7, 39–42. doi: 10.1016/j.procs.2011.12.015

Martin, A. (2007). The representation of object concepts in the brain. Annu. Rev. Psychol. 58, 25–45. doi: 10.1146/annurev.psych.57.102904.190143

Martin, A., Wiggs, C. L., Ungerleider, L. G., and Haxby, J. V. (1996). Neural correlates of category-specific knowledge. Nature 379, 649–652. doi: 10.1038/379649a0

Matthews, G. G. (2001). Neurobiology: Molecules, Cells, and Systems. Oxford: Blackwell Science.

Mazzoni, P., Andersen, R. A., and Jordan, M. I. (1991). A more biologically plausible learning rule for neural networks. Proc. Natl. Acad. Sci. U.S.A. 88, 4433–4437. doi: 10.1073/pnas.88.10.4433

McCarthy, R. A., and Warrington, E. K. (1988). Evidence for modality-specific meaning systems in the brain. Nature 334, 428–430. doi: 10.1038/334428a0

Meyer, J. W., Makris, N., Bates, J. F., Caviness, V. S., and Kennedy, D. N. (1999). MRI-Based topographic parcellation of human cerebral white matter. Neuroimage 9, 1–17. doi: 10.1006/nimg.1998.0383

Mishkin, M., and Ungerleider, L. G. (1982). Contribution of striate input to the visuospatial functions of parieto-preoccipital cortex in monkeys. Behav. Brain Res. 6, 57–77. doi: 10.1016/0166-4328(82)90081-X

Mishkin, M., Ungerleider, L. G., and Macko, K. A. (1983). Object vision and spatial vision: two central pathways. Trends Neurosci. 6, 414–417. doi: 10.1016/0166-2236(83)90190-X

Moseley, R. L., and Pulvermüller, F. (2014). Nouns, verbs, objects, actions, and abstractions: local fMRI activity indexes semantics, not lexical categories. Brain Lang. 132, 28–42. doi: 10.1016/j.bandl.2014.03.001

Moseley, R. L., Pulvermüller, F., and Shtyrov, Y. (2013). Sensorimotor semantics on the spot: brain activity dissociates between conceptual categories within 150 ms. Sci. Rep. 3:1928. doi: 10.1038/srep01928

Musso, M., Weiller, C., Kiebel, S., Müller, S. P., Bülau, P., and Rijntjes, M. (1999). Training-induced brain plasticity in aphasia. Brain 122, 1781–1790. doi: 10.1093/brain/122.9.1781

Neininger, B., and Pulvermüller, F. (2003). Word-category specific deficits after lesions in the right hemisphere. Neuropsychologia 41, 53–70. doi: 10.1016/S0028-3932(02)00126-4

O'Reilly, R. C. (1998). Six principles for biologically based computational models of cortical cognition. Trends Cogn. Sci. 2, 455–462.

Palm, G. (1982). Neural Assemblies. An Alternative Approach to Artificial Intelligence. Secaucus, NJ: Springer-Verlag New York, Inc.

Pandya, D. N. (1995). Anatomy of the auditory cortex. Rev. Neurol. 151, 486–494.

Pandya, D. N., and Barnes, C. L. (1987). “Architecture and connections of the frontal lobe,” in The Frontal Lobes Revisited, ed E. Perecman (New York, NY: The IRBN Press), 41–72.

Pandya, D. N., and Yeterian, E. H. (1985). “Architecture and connections of cortical association areas,” in Association and Auditory Cortices, Cerebral Cortex, eds A. Peters and E. Jones (Boston, MA: Springer), 3–61. doi: 10.1007/978-1-4757-9619-3_1

Parker, A. (1998). Interaction of frontal and perirhinal cortices in visual object recognition memory in monkeys. Eur. J. Neurosci. 10, 3044–3057. doi: 10.1046/j.1460-9568.1998.00306.x

Parker, G. J., Luzzi, S., Alexander, D. C., Wheeler-Kingshott, C. A., Ciccarelli, O., and Lambon Ralph, M. A. (2005). Lateralization of ventral and dorsal auditory-language pathways in the human brain. Neuroimage 24, 656–666. doi: 10.1016/j.neuroimage.2004.08.047

Patterson, K., Nestor, P. J., and Rogers, T. T. (2007). Where do you know what you know? The representation of semantic knowledge in the human brain. Nat. Rev. Neurosci. 8, 976–987. doi: 10.1038/nrn2277

Paus, T., Castro-Alamancos, M. A., and Petrides, M. (2001). Cortico-cortical connectivity of the human mid-dorsolateral frontal cortex and its modulation by repetitive transcranial magnetic stimulation. Eur. J. Neurosci. 14, 1405–1411. doi: 10.1046/j.0953-816x.2001.01757.x

Petrides, M., and Pandya, D. N. (2009). Distinct parietal and temporal pathways to the homologues of Broca's area in the monkey. PLoS Biol. 7:e1000170. doi: 10.1371/journal.pbio.1000170

Plaut, D. C., and Gonnerman, L. M. (2000). Are non-semantic morphological effects incompatible with a distributed connectionist approach to lexical processing? Lang. Cogn. Process. 15, 445–485. doi: 10.1080/01690960050119661

Posner, M. I., and Pavese, A. (1998). Anatomy of word and sentence meaning. Proc. Natl. Acad. Sci. U.S.A. 95, 899–905. doi: 10.1073/pnas.95.3.899

Pulvermüller, F. (1999). Words in the brain's language. Behav. Brain Sci. 22, 253–336.

Pulvermüller, F. (2010). Brain embodiment of syntax and grammar: discrete combinatorial mechanisms spelt out in neuronal circuits. Brain Lang. 112, 167–179. doi: 10.1016/j.bandl.2009.08.002

Pulvermüller, F. (2013). How neurons make meaning: brain mechanisms for embodied and abstract-symbolic semantics. Trends Cogn. Sci. 17, 458–470. doi: 10.1016/j.tics.2013.06.004

Pulvermüller, F. (2018). Neural reuse of action perception circuits for language, concepts and communication. Prog. Neurobiol. 160, 1–44. doi: 10.1016/j.pneurobio.2017.07.001

Pulvermüller, F., Cooper-Pye, E., Dine, C., Hauk, O., Nestor, P. J., and Patterson, K. (2010). The word processing deficit in semantic dementia: all categories are equal, but some categories are more equal than others. J. Cogn. Neurosci. 22, 2027–2041. doi: 10.1162/jocn.2009.21339

Pulvermüller, F., and Fadiga, L. (2010). Active perception: sensorimotor circuits as a cortical basis for language. Nat. Rev. Neurosci. 11, 351–360. doi: 10.1038/nrn2811

Pulvermüller, F., and Garagnani, M. (2014). From sensorimotor learning to memory cells in prefrontal and temporal association cortex: a neurocomputational study of disembodiment. Cortex 57, 1–21. doi: 10.1016/j.cortex.2014.02.015

Pulvermüller, F., Lutzenberger, W., and Preissl, H. (1999). Nouns and verbs in the intact brain: evidence from event-related potentials and high-frequency cortical responses. Cereb. Cortex 9, 497–506. doi: 10.1093/cercor/9.5.497

Pulvermüller, F., Moseley, R. L., Egorova, N., Shebani, Z., and Boulenger, V. (2014). Motor cognition-motor semantics: action perception theory of cognition and communication. Neuropsychologia 55, 71–84. doi: 10.1016/j.neuropsychologia.2013.12.002

Ralph, M. A., Jefferies, E., Patterson, K., and Rogers, T. T. (2017). The neural and computational bases of semantic cognition. Nat. Rev. Neurosci. 18, 42–55. doi: 10.1038/nrn.2016.150

Rauschecker, J. P., and Scott, S. K. (2009). Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat. Neurosci. 12, 718–724. doi: 10.1038/nn.2331

Rauschecker, J. P., and Tian, B. (2000). Mechanisms and streams for processing of “what” and “where” in auditory cortex. Proc. Natl. Acad. Sci. U.S.A. 97, 11800–11806. doi: 10.1073/pnas.97.22.11800

Rilling, J. K. (2014). Comparative primate neuroimaging: insights into human brain evolution. Trends Cogn. Sci. 18, 46–55. doi: 10.1016/j.tics.2013.09.013

Rilling, J. K., Glasser, M. F., Jbabdi, S., Andersson, J., and Preuss, T. M. (2011). Continuity, divergence, and the evolution of brain language pathways. Front. Evol. Neurosci. 3:11. doi: 10.3389/fnevo.2011.00011

Rilling, J. K., Glasser, M. F., Preuss, T. M., Ma, X., Zhao, T., Hu, X., et al. (2008). The evolution of the arcuate fasciculus revealed with comparative DTI. Nat. Neurosci. 11, 426–428. doi: 10.1038/nn2072

Rilling, J. K., and van den Heuvel, M. P. (2018). Comparative primate connectomics. Brain. Behav. Evol. 91, 170–179. doi: 10.1159/000488886

Rioult-Pedotti, M. S., Friedman, D., and Donoghue, J. P. (2000). Learning-Induced LTP in Neocortex. Science 290, 533–536. doi: 10.1126/science.290.5491.533

Rizzolatti, G., and Luppino, G. (2001). The cortical motor system. Neuron 31, 889–901. doi: 10.1016/S0896-6273(01)00423-8

Rolls, E. T., and Deco, G. (2010). The Noisy Brain: Stochastic Dynamics as a Principle of Brain Function. Oxford: Oxford University Press.

Romanski, L. M. (2007). Representation and integration of auditory and visual stimuli in the primate ventral lateral prefrontal cortex. Cereb. Cortex 17, i61–i69. doi: 10.1093/cercor/bhm099

Romanski, L. M., Bates, J. F., and Goldman-Rakic, P. S. (1999a). Auditory belt and parabelt projections to the prefrontal cortex in the rhesus monkey. J. Comp. Neurol. 403, 141–157. doi: 10.1002/(SICI)1096-9861(19990111)403:2<141::AID-CNE1>3.0.CO;2-V

Romanski, L. M., Tian, B., Fritz, J., Mishkin, M., Goldman-Rakic, P. S., and Rauschecker, J. P. (1999b). Dual streams of auditory afferents target multiple domains in the primate prefrontal cortex. Nat. Neurosci. 2, 1131–1136. doi: 10.1038/16056

Saur, D., Kreher, B. W., Schnell, S., Kümmerer, D., Kellmeyer, P., Vry, M. S., et al. (2008). Ventral and dorsal pathways for language. Proc. Natl. Acad. Sci. U.S.A. 105, 18035–18040. doi: 10.1073/pnas.0805234105

Schomers, M. R., Garagnani, M., and Pulvermüller, F. (2017). Neurocomputational consequences of evolutionary connectivity changes in Perisylvian language cortex. J. Neurosci. 37, 3045–3055. doi: 10.1523/JNEUROSCI.2693-16.2017

Schomers, M. R., and Pulvermüller, F. (2016). Is the sensorimotor cortex relevant for speech perception and understanding? An integrative review. Front. Hum. Neurosci. 10:435. doi: 10.3389/fnhum.2016.00435

Seltzer, B., and Pandya, D. N. (1989). Intrinsic connections and architectonics of the superior temporal sulcus in the rhesus monkey. J. Comp. Neurol. 290, 451–471. doi: 10.1002/cne.902900402

Shallice, T. (1988). From Neuropsychology to Mental Structure. New York, NY: Cambridge University Press. doi: 10.1017/CBO9780511526817

Shebani, Z., Patterson, K., Nestor, P. J., Diaz-de-Grenu, L. Z., Dawson, K., and Pulvermüller, F. (2017). Semantic word category processing in semantic dementia and posterior cortical atrophy. Cortex 93, 92–106. doi: 10.1016/j.cortex.2017.04.016

Stramandinoli, F., Marocco, D., and Cangelosi, A. (2012). The grounding of higher order concepts in action and language: a cognitive robotics model. Neural Netw. 32, 165–173. doi: 10.1016/j.neunet.2012.02.012

Tate, M. C., Herbet, G., Moritz-Gasser, S., Tate, J. E., and Duffau, H. (2014). Probabilistic map of critical functional regions of the human cerebral cortex: Broca's area revisited. Brain 137, 2773–2782. doi: 10.1093/brain/awu168

Thiebaut de Schotten, M., Dell'Acqua, F., Valabregue, R., and Catani, M. (2012). Monkey to human comparative anatomy of the frontal lobe association tracts. Cortex 48, 82–96. doi: 10.1016/j.cortex.2011.10.001

Tomasello, M., and Kruger, A. C. (1992). Joint attention on actions: acquiring verbs in ostensive and non-ostensive contexts. J. Child Lang. 19, 311–333. doi: 10.1017/S0305000900011430

Tomasello, R., Garagnani, M., Wennekers, T., and Pulvermüller, F. (2017). Brain connections of words, perceptions and actions: a neurobiological model of spatio-temporal semantic activation in the human cortex. Neuropsychologia 98, 111–129. doi: 10.1016/j.neuropsychologia.2016.07.004

Trumpp, N. M., Kliese, D., Hoenig, K., Haarmeier, T., and Kiefer, M. (2013). Losing the sound of concepts: damage to auditory association cortex impairs the processing of sound-related concepts. Cortex 49, 474–486. doi: 10.1016/j.cortex.2012.02.002

Tschentscher, N., Hauk, O., Fischer, M. H., and Pulvermüller, F. (2012). You can count on the motor cortex: finger counting habits modulate motor cortex activation evoked by numbers. Neuroimage 59, 3139–3148. doi: 10.1016/j.neuroimage.2011.11.037

Ueno, T., Saito, S., Rogers, T. T., and Lambon Ralph, M. A. (2011). Lichtheim 2: synthesizing aphasia and the neural basis of language in a neurocomputational model of the dual dorsal-ventral language pathways. Neuron 72, 385–396. doi: 10.1016/j.neuron.2011.09.013

Ungerleider, L. G., Gaffan, D., and Pelak, V. S. (1989). Projections from inferior temporal cortex to prefrontal cortex via the uncinate fascicle in rhesus monkeys. Exp. Brain Res. 76, 473–484. doi: 10.1007/BF00248903

Ungerleider, L. G., and Haxby, J. V. (1994). “What” and “where” in the human brain. Curr. Opin. Neurobiol. 4, 157–165.

van den Heuvel, M. P., and Sporns, O. (2013). Network hubs in the human brain. Trends Cogn. Sci. 17, 683–696. doi: 10.1016/j.tics.2013.09.012

Vigliocco, G., Vinson, D. P., Lewis, W., and Garrett, M. F. (2004). Representing the meanings of object and action words: the featural and unitary semantic space hypothesis. Cogn. Psychol. 48, 422–488. doi: 10.1016/j.cogpsych.2003.09.001

Vouloumanos, A., and Werker, J. F. (2009). Infants' learning of novel words in a stochastic environment. Dev. Psychol. 45, 1611–1617. doi: 10.1037/a0016134

Vukovic, N., Feurra, M., Shpektor, A., Myachykov, A., and Shtyrov, Y. (2017). Primary motor cortex functionally contributes to language comprehension: an online rTMS study. Neuropsychologia 96, 222–229. doi: 10.1016/j.neuropsychologia.2017.01.025

Wakana, S., Jiang, H., Nagae-Poetscher, L. M., van Zijl, P. C. M., and Mori, S. (2004). Fiber tract-based atlas of human white matter anatomy. Radiology 230, 77–87. doi: 10.1148/radiol.2301021640

Warrington, E. K., and McCarthy, R. (1983). Category specific access dysphasia. Brain 106, 859–878. doi: 10.1093/brain/106.4.859

Webster, M. J., Bachevalier, J., and Ungerleider, L. G. (1994). Connections of inferior temporal areas TEO and TE with parietal and frontal cortex in macaque monkeys. Cereb. Cortex 4, 470–483. doi: 10.1093/cercor/4.5.470

Wennekers, T., Garagnani, M., and Pulvermüller, F. (2006). Language models based on Hebbian cell assemblies. J. Physiol. Paris 100, 16–30. doi: 10.1016/j.jphysparis.2006.09.007

Westermann, G., and Reck Miranda, E. (2004). A new model of sensorimotor coupling in the development of speech. Brain Lang. 89, 393–400. doi: 10.1016/S0093-934X(03)00345-6

Wilson, H. R., and Cowan, J. D. (1972). Excitatory and inhibitory interactions in localized populations of model neurons. Biophys. J. 12, 1–24. doi: 10.1016/S0006-3495(72)86068-5

Yeterian, E. H., Pandya, D. N., Tomaiuolo, F., and Petrides, M. (2012). The cortical connectivity of the prefrontal cortex in the monkey brain. Cortex 48, 68–81. doi: 10.1016/j.cortex.2011.03.004

Young, M. P., Scannell, J. W., and Burns, G. (1995). The Analysis of Cortical Connectivity. Heidelberg: Springer.

Young, M. P., Scannell, J. W., Burns, G. A., and Blakemore, C. (1994). Analysis of connectivity: neural systems in the cerebral cortex. Rev. Neurosci. 5, 227–250. doi: 10.1515/REVNEURO.1994.5.3.227

Yuille, A. L., and Geiger, D. (2003). “Winner-take-all mechanisms,” in The Handbook of Brain Theory and Neural Networks, ed M. Arbib (Cambridge, MA: MIT Press), 1056–1060.

Zatorre, R. J., Meyer, E., Gjedde, A., and Evans, A. C. (1996). PET studies of phonetic processing of speech: review, replication, and reanalysis. Cereb. Cortex 6, 21–30. doi: 10.1093/cercor/6.1.21

Keywords: word acquisition, semantic grounding, Hebbian learning, distributed neural assemblies, spiking neural network, brain-like connectivity

Citation: Tomasello R, Garagnani M, Wennekers T and Pulvermüller F (2018) A Neurobiologically Constrained Cortex Model of Semantic Grounding With Spiking Neurons and Brain-Like Connectivity. Front. Comput. Neurosci. 12:88. doi: 10.3389/fncom.2018.00088

Received: 06 July 2018; Accepted: 15 October 2018; Published: 06 November 2018.

Edited by:

Yilei Zhang, Nanyang Technological University, Singapore

Reviewed by:

Vit Novacek, National University of Ireland Galway, Ireland
Srinivasa Chakravarthy, Indian Institute of Technology Madras, India

Copyright © 2018 Tomasello, Garagnani, Wennekers and Pulvermüller. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rosario Tomasello, tomasello.r@fu-berlin.de
