
95% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
ORIGINAL RESEARCH article
Front. Psychol. , 17 June 2015
Sec. Cognition
Volume 6 - 2015 | https://doi.org/10.3389/fpsyg.2015.00735
This article is part of the Research Topic Quantum Structures in Cognitive and Social Science View all 15 articles
A correction has been applied to this article in:
Corrigendum: Noncontextuality with marginal selectivity in reconstructing mental architectures
We present a general theory of series-parallel mental architectures with selectively influenced stochastically non-independent components. A mental architecture is a hypothetical network of processes aimed at performing a task, of which we only observe the overall time it takes under variable parameters of the task. It is usually assumed that the network contains several processes selectively influenced by different experimental factors, and then the question is asked as to how these processes are arranged within the network, e.g., whether they are concurrent or sequential. One way of doing this is to consider the distribution functions for the overall processing time and compute certain linear combinations thereof (interaction contrasts). The theory of selective influences in psychology can be viewed as a special application of the interdisciplinary theory of (non)contextuality having its origins and main applications in quantum theory. In particular, lack of contextuality is equivalent to the existence of a “hidden” random entity of which all the random variables in play are functions. Consequently, for any given value of this common random entity, the processing times and their compositions (minima, maxima, or sums) become deterministic quantities. These quantities, in turn, can be treated as random variables with (shifted) Heaviside distribution functions, for which one can easily compute various linear combinations across different treatments, including interaction contrasts. This mathematical fact leads to a simple method, more general than the previously used ones, to investigate and characterize the interaction contrast for different types of series-parallel architectures.
The notion of a network of mental processes with components selectively influenced by different experimental factors was introduced to psychology in Saul Sternberg's (1969) influential paper. Sternberg considered networks of processes a, b, c, … involved in performing a mental task. Denoting their respective durations by A, B, C …, the hypothesis he considered was that the observed response time T is A + B + C + …. One cannot test this hypothesis, Sternberg wrote, without assuming that there are some factors, α, β, γ, …, that selectively influence the durations A, B, C …, respectively. Sternberg's analysis was confined to stochastically independent A, B, C, …, and the consequences of the assumptions of seriality and selective influences were tested on the level of the mean response times only.
Subsequent development of these ideas was aimed at the entire distributions of the response times and at a greater diversity and complexity of mental architectures than just series of “stages.” This development prominently includes Townsend (1984, 1990a,b); Schweickert and Townsend (1989); Townsend and Schweickert (1989); Roberts and Sternberg (1993); Townsend and Nozawa (1995); Schweickert et al. (2000), and several other publications, primarily by James Townsend and Richard Schweickert with colleagues. For an overview of these developments see Dzhafarov (2003) and Schweickert et al. (2012). In the present context we should separately mention the development of the ideas of stochastic ordering of processing times in Townsend (1984, 1990a) and Townsend and Schweickert (1989); as well as the idea of marginal selectivity (Townsend and Schweickert, 1989).
The notion of selective influences also underwent a significant development, having been generalized from stochastically independent random variables to arbitrarily interdependent ones (Dzhafarov, 2003; Dzhafarov and Gluhovsky, 2006; Kujala and Dzhafarov, 2008; Dzhafarov and Kujala, 2010, in press). The essence of the development is easy to understand using two random variables (e.g., process durations) A, B selectively influenced by two respective factors α, β. In Dzhafarov's (2003) notation, this is written (A, B) ↫ (α, β). According to the definition given in Dzhafarov (2003), this means that there are functions f and g and a random variable R (a common source of randomness) such that f(α, R) = A and g(β, R) = B. If such a choice of (f, g, R) exists, it is not unique. For instance, R can always be chosen to have any distribution that is absolutely continuous with respect to the usual Borel measure on the real line (e.g., a standard uniform, or standard normal distribution, see Dzhafarov and Gluhovsky, 2006). However, a triple (f, g, R) need not exist. It does not exist, e.g., if marginal selectivity (Townsend and Schweickert, 1989) is violated, i.e., if the distribution of, say, A at a given value of α changes in response to changing β. But marginal selectivity is not sufficient for the existence of a triple (f, g, R). Let, e.g., α and β be binary factors, with values 1, 2 each, and let the correlation ρ between A and B for a treatment (α, β) be denoted ραβ. Then the triple in question does not exist if the correlations violated the “cosphericity test” (Kujala and Dzhafarov, 2008), also known in quantum mechanics as Landau's inequality (Landau, 1988):
where . There are many other known conditions that must be satisfied for the existence of a triple (f, g, R) when marginal selectivity is satisfied (Dzhafarov and Kujala, 2010, 2012a,b, 2013, 2014a).
The allusion to quantum mechanics is not accidental: as shown in Dzhafarov and Kujala (2012a,b), the theory of selective influences in psychology can be viewed as a special application of the theory of (non)contextuality. This theory is interdisciplinary (Khrennikov, 2009; Dzhafarov and Kujala, 2014b,c,d), but its origins are in quantum theory, dating from Kochen and Specker (1967) and Bell's (1964, 1966) celebrated work. For the modern state of the theory see Dzhafarov et al. (in press). A simplified account of the (non)contextuality analysis of the example given above is as follows. One labels each random variable in play contextually, i.e., by what property is being measured/recorded under what treatment (context):
The notation here is, of course, redundant, because the context and property identifiers overlap, but we need now to emphasize the logic rather than achieve notational convenience. Once the labeling is done, one looks at all possible joint distributions imposable on all these random variables, for all properties and all treatments. A system is noncontextual if there exists such a joint distributions in which any two random variables that represent the same property (“what is measured”) are equal with probability 1. The latter is possible only if the random variables representing the same property always have the same distribution: in our case
for any values α, β, α′, β′ of the two factors. This is called consistent connectedness (Dzhafarov et al., in press), and in physics is known under a variety of names, including (in certain paradigms) “no-signaling condition” (Popescu and Rohrlich, 1994; Cereceda, 2000; Masanes et al., 2006). In psychology, this is marginal selectivity. The definition of noncontextuality just given is not the most general one, as the notion of contextuality can be extended to inconsistently connected (violating marginal selectivity) systems (Dzhafarov et al., in press), but we do not need this generality in this paper. What is important for us here is that the existence of a joint distribution mentioned in our definition is equivalent to the existence of a random variable R and the functions f, g mentioned in the introductory paragraph.
It is easy to show (Dzhafarov, 2003) that the existence of a triple (f, g, R) for given joint distributions of (A, B) under different treatments (α, β) is equivalent to the existence of a quintuple (f′, g′, S, SA, SB), where S, SA, SB are random variables, such that f′(α, S, SA) = A and g′(β, S, SB) = B. In such a representation, one can speak of a common source of randomness S and specific sources of randomness SA, SB. In Dzhafarov et al. (2004) this representation was used to investigate different series-parallel arrangements of the hypothetical durations A and B. The reason this representation has been considered convenient is that if one fixes the value S = s, then f′(α, s, SA) = Ac and g′(β, s, SB) = Bc are stochastically independent random variables. One can therefore use theorems proved for stochastically independent selectively influenced components (Schweickert et al., 2000) to obtain a general result by averaging across possible values of s. For instance, let α, β be binary factors (with values 1, 2 each), and let us assume that the observed duration Tαβ is min(Aα, Bβ) for every treatment (α, β). Then Tαβs = min(Aαs, Bβs) for every value S = s, and it is known that, for the independent Aαs, Bβs (satisfying a prolongation condition, as explained below),
Since this should be true for every value S = s, then it should also be true that
This follows from the fact that
where m(s) is the probability measure for S, and the integration is over the space of all possible s. The linear combination C(t) in (5) is called the interaction contrast of distributions functions.
The Prolongation Assumption used in Dzhafarov et al. (2004), and derived from Townsend (1984, 1990a) and Townsend and Schweickert (1989), is that, for every S = s,
For this particular architecture, T = min(A, B), this is the only assumption needed. To prove analogous results for more complex mental architectures, however, one needs additional assumptions, such as the existence of density functions for Aαs, Bβs at every s, and even certain ordering of these density functions in some vicinity [0, τ].
The same results, however, can be obtained without these additional assumptions, if one adopts the other, equivalent definition of selective influences: f(α, R) = A and g(β, R) = B, for some triple (f, g, R). If such a representation exists, then
are deterministic quantities (real numbers), for every value R = r. Any real number x in turn can be viewed as a random variable whose distribution function is a shifted Heaviside function
In particular, the quantity tαβr = min(aαr, bβr) for the simple architecture T = min(A, B) considered above is distributed according to
Let us see how inequality (5) can be derived using these observations.
We first formulate the (conditional) Prolongation Assumption, a deterministic version of (7): the assumption is that f, g, R can be so chosen that for every R = r,
Without loss of generality, we can also assume, for any given r,
(if not, rename a into b and vice versa).
Remark 1.1. The Prolongation Assumption clearly implies (7). Conversely, if (7) holds, one can always find functions f, g, R for which the Prolongation Assumption holds in the form above. For instance, one can choose R = (S, SA, SB), take SA and SB to be uniformly distributed between 0 and 1, and choose f(α, …), g(β, …) to be the quantile functions for the hypothetical distributions of A and B at the corresponding factor levels.
We next form the conditional (i.e., conditioned on R = r) interaction contrast
Notation Convention. When r is fixed throughout a discussion, we omit this argument and write aα, bβ, tαβ, hαβ(t), c(t) in place of aαr, bβr, tαβr, hαβr(t), cr(t). (For binary factors α, β, we also conveniently replace α, β in indexation with i, j.)
Following this convention, there are three different arrangements of a1, a2, b1, b2 (for a given R = r) satisfying (11)–(12):
In all three cases,
For arrangement (i) we have
This diagram shows the values of hijr(t) and the resulting values of cr(t) as t changes with respect to the fixed positions of tijr (with index r dropped everywhere). Analogously, for arrangements (ii) and (iii), we have, respectively
and
In all three cases, c(t) is obviously zero for t < t11 and t ≥ t22. We see that c(t) = cr(t) ≤ 0 for all t and every R = r. It follows that C(t) ≤ 0, because
for i, j ∈ {1, 2}, and
where μ is the probability measure associated with R and the integration is over all possible r. We obtain the same result as in (5), but in a very different way.
In this paper we extend this approach to other mental architectures belonging to the class of series-parallel networks, those involving other composition operations and possibly more than just two selectively influenced processes. In doing so we follow a long trail of work mentioned earlier. When dealing with multiple processes we follow Yang et al. (2013) in using high-order interaction contrasts. All our results are replications or straightforward generalizations of the results already known: the primary value of our work therefore is not in characterizing mental architectures, but rather in demonstrating a new theoretical approach and a new proof technique.
Since we deal with the durations of processes rather than the processes themselves, we use the term composition to describe a function that relates the durations of the components of a network to the overall (observed) duration. Formally, a composition is a real-valued function t = t(a, b, …, z) of an arbitrary number of real-valued arguments. The arguments a, b, …, z are referred to as durations or components. In this article, we will use X ∧ Y ∧ … ∧ Z to denote min(X, Y, …, Z), and X ∨ Y ∨ … ∨ Z to denote max(X, Y …, Z).
A series-parallel composition (SP) is defined as follows.
Definition 1.2. (1) A single duration is an SP composition. (2) If X and Y are SP compositions with disjoint sets of arguments, then X ∧ Y, X ∨ Y, and X + Y are SP compositions. (3) There are no other SP compositions than those construable by Rules 1 and 2.
Remark 1.3. The requirement that X and Y in Rule 2 have disjoint sets of arguments prevents expressions like X ∧ X or X + X ∨ Y. But if the second X in X ∧ X is renamed into X′, or X ∨ Y in X + X ∨ Y is renamed into Z, then the resulting X ∧ X′ and X + Z are legitimate SP compositions. This follows from the generality of our treatment, in which different components of an SP composition may have arbitrary joint distributions: e.g., X and X′ in X ∧ X′ may very well be jointly distributed so that Pr[X = X′] = 1. One should, however, always keep in minds the pattern of selective influences: thus, if X is influenced by α, then Z is also influenced by α in X + Z above.
Any SP composition is obtained by a successive application of Rules 1 and 2 (the sequence being generally non-unique), and at any intermediate stage of such a sequence we also have an SP composition that we can term a subcomposition.
Definition 1.4. Two durations X, Y in an SP composition are said to be parallel or concurrent if there is a subcomposition of this SP composition of the form SP1(X, X′, …) ∧ SP2(Y, Y′, …) (in which case X, Y are said to be min-parallel) or SP1(X, X′, …) ∨ SP2(Y, Y′, …) (X, Y are max-parallel). X, Y in an SP composition are said to be sequential or serial if there is a subcomposition of this SP composition of the form SP1(X, X′, …) + SP2(Y, Y′, …).
Definition 1.5. An SP composition is called homogeneous if it does not contain both ∧ and ∨ in it; if it does not contain ∧, it is denoted SP∨; if it does not contain ∨, it is denoted SP∧.
The only SP composition that is both SP∧ and SP∨ is a purely serial one: a + b + … + z. Most of the results previously obtained for mental networks are confined to homogeneous compositions. We will not need this constraint for the most part.
Since we will be dealing with compositions of more than just two components, we need to extend the definition of selective influences mentioned above. In the formulation below, ~ stands for “has the same distribution as.” A treatment ϕ =(λ1i1, …, λnin) is a vector of values of the factors λ1, …, λn, the values of λk (k = 1, …, n) being indicated by subscripts, λkik.
Definition 1.6. Random variables (X1, …, Xn) are selectively influenced by factors (λ1, …, λn), respectively,
if for some random variable R, whose distribution does not depend on (λ1, … λn), and for some functions g1, …, gn,
for any treatment .
In the subsequent discussion we assume that all non-dummy factors involved are binary in a completely crossed design (i.e., the overall time T is recorded for all 2n vectors of values for ϕ). When we have random variables not influenced by any of these factors, we will say they selectively influenced by an empty set of factors (we could also, equivalently, introduce for them dummy factors, with one value each).
Consider two processes, with durations A and B in an SP composition. The overall duration of this SP composition can be written as a function of A, B and other components: T = T(A, B, …). We assume that A, B, and all other components are selectively influenced by α, β, and empty set, respectively: (A, B, …) ↫ (α, β, ∅). Let each factor has two levels: α = 1, 2 and β = 1, 2, with four allowable treatments (1, 1), (1, 2), (2, 1), and (2, 2). The corresponding overall durations (random variables) are written as T11, T12, T21, and T22.
By Definition 1.6 of selective influences, each process duration (a random variable) is a function of some random variable R and the corresponding factor: A = a(α, R), B = b(β, R). For any given value R = r, the component durations are fixed numbers,
where x is the value of any duration X in the composition other than A and B. We assume that R is chosen so that the Prolongation Assumption (11) holds, with the convention (12).
The overall duration T at R = r is also a fixed number, written as (recall that we replace α, β in indexation with i, j)
The distribution function for tijr is the shifted Heaviside function hijr (t) = h(t − tijr),
The conditional interaction contrast cr(t) is defined by (13). Denoting by Hij(t) the distribution function of Tij, we have
with denoting the set of possible values of R. For the observable (i.e., estimable from data) interaction contrast
we have then
Note that it follows from our Prolongation Assumption that
We also define two conditional cumulative interaction contrasts (conditioned on R = r):
The corresponding observable cumulative interaction contrasts are
In these formulas we could switch the order of integration by Fubini's theorem, because, for any interval of reals I,
Recall the definition of cr(t) in (13). We follow our Notation Convention and drop the index r in cr(t) and all other expressions for a fixed r.
Lemma 2.1. In any SP architecture, for any r,
Proof. Follows from the (nonstrict) monotonicity of the SP composition in all arguments. □
Lemma 2.2. In any SP architecture, for any r, c(t) equals 0 for all values of t except for two cases:
and
Proof. By direct computation. □
Lemma 2.3. In any SP architecture, for any r, c (t) ≤ 0 for all values of t if and only if t11 = t12 ∧ t21; c(t) ≥ 0 for all values of t if and only if t12 ∨ t21 = t22.
Proof. Immediately follows from Lemma 2.2. □
Lemma 2.4. In any SP architecture, for any r,
(i) for any t if and only if −t11 + t12 + t21 − t22 ≥ 0,
(ii) for any t if and only if −t11 + t12 + t21 − t22 ≤ 0,
(iii) limt→∞c(0, t) = 0 if and only if −t11+t12+t21−t22 = 0.
(iv) limt→0c(t, ∞) = 0 if and only if −t11+t12+t21−t22 = 0.
Proof. Without loss of generality, put t12 ≤ t21. We have
The expressions for the first three cases are obviously nonnegative. If −t11 + t12 + t21 − t22 ≥ 0, then c(0, t) ≥ 0 for all t in the last case (t ≥ t22). With −t11 + t12 + t21 − t22 ≥ 0, we have −t11 + t12 + t21 − t ≥ t22 − t ≥ 0 for the fourth case (t21 ≤ t < t22). Hence c(0, t) ≥ 0 for all t if −t11 + t12 + t21 − t22 ≥ 0. Conversely, if c(0, t) ≥ 0 for all t, then it is also true for t ≥ t22, whence −t11 + t12 + t21 − t22 ≥ 0.
The proof for requires replacing it first with for some u > t22. We have
The expressions for the last three cases are obviously nonpositive. If −t11 + t12 + t21 − t22 ≤ 0, then for all t in the first case (t < t11). With −t11 + t12 + t21 − t22 ≤ 0, we have −t + t12 + t21 − t22 ≤ t11 − t < 0 for the second case (t11 ≤ t < t12). Hence for all t if −t11 + t12 + t21 − t22 ≤ 0. Since in all expressions u is algebraically eliminated, they remain unchanged as u → ∞. Conversely, if c(t, ∞) ≤ 0 for all t, then it is also true for t < t11, whence −t11 + t12 + t21 − t22 ≤ 0.
The statements (iii) and (iv) follow trivially. □
A simple parallel architecture corresponds to one of the two compositions: T = A ∧ B or T = A ∨ B, with (A, B) ↫ (α, β). Recall the definition of C(t) in (24).
Theorem 2.5. For T = A ∧ B, we have c(t) ≤ 0 for any r, t; consequently, C(t) ≤ 0 for any t. For T = A ∨ B, we have c(t) ≥ 0 for any r, t; consequently, C(t) ≥ 0 for any t.
Proof. For T = A ∧ B with the Prolongation Assumption (11)–(12), we have
It follows that
By Lemma 2.3, c(t) ≤ 0. As C(t) in (25) preserves the sign of c(t), we have C(t) ≤ 0. For T = A ∧ B, we have
It follows that
whence, by Lemma 2.3, c(t) ≥ 0 and therefore C(t) ≥ 0. □
Consider now a composition SP(A, B, …) with (A, B, …) ↫ (α, β, ∅).
Lemma 2.6. If A, B in SP(A, B, …) are parallel, then SP(A, B, …) can be presented as A′ ∧ B′ if they are min-parallel, or as A′ ∨ B′ if they are max-parallel, so that (A′, B′) ↫ (α, β) and, for any fixed R = r, the Prolongation Assumption holds.
Proof. By Definitions 1.2 and 1.4, if A, B are min-parallel, then SP∧(A, B, …) can be presented either as
or
or else
where B does not enter in SP1 and A does not enter in SP2. On renaming
and
we have, obviously, (A′, B′) ↫ (α, β). Fixing R = r, by the (nonstrict) monotonicity of SP compositions,
and
We can also put a′1 = SP1(a1, …) ≤ SP2(b1, …) = b′1 (otherwise we can rename the variables). The proof for the max-parallel case is analogous.
Theorem 2.7. If A, B in SP(A, B, …) are min-parallel, then c(t) ≤ 0 for any r, t; consequently, C(t) ≤ 0 for any t. If A, B in SP(A, B, …) are max-parallel, then c(t) ≥ 0 for any r, t; consequently, C(t) ≥ 0 for any t.
Proof. Immediately follows from Lemma 2.6 and Theorem 2.5. □
Simple serial architectures of size 2 corresponds to the SP composition T = A + B, with (A, B) ↫ (α, β). Recall the definitions of the two cumulative interaction contrasts: (27)–(30).
Theorem 2.8. If T = A + B, then c(0, t) ≥ 0 and c(t, ∞) ≤ 0 for any r, t; moreover,
for any r, t. Consequently, C(0, t) ≥ 0, C(t, ∞) ≤ 0 for any t, and
Proof. Follows immediately from Lemma 2.4, since
□
Consider now a composition SP(A, B, …) with (A, B, …) ↫ (α, β, ∅).
Theorem 2.9. If A and B are sequential in an SP(A, B, …) composition, then one or both of the following statements hold:
(i) c(0, t) ≥ 0 for any r, t, and C(0, t) ≥ 0 for any t,
(ii) c(t, ∞) ≤ 0 for any r, t, and C(t, ∞) ≤ 0 for any t.
Proof. In accordance with Definitions 1.2 and 1.4, SP(A, B, …) with sequential A, B can be presented as either
or
(note that any Z in SP1(A, …) + SP2(B, …) + Z can be absorbed by either of the first two summands). For both cases, by the monotonicity of SP compositions, for any R = r, SP1(a1, …) ≤ SP1(a2, …), SP2(b1, …) ≤ SP2(b2, …), and we can always assume SP1(a1, …) ≤ SP2(b1, …). Denoting the durations of SP1(ai, …) + SP2(bj, …) by t′ij, we have therefore, by Theorem 2.8, −t′11 + t′12 + t′21 − t′22 = 0. Denoting the durations of X and Y by t′ and t″, respectively, in the case (32) we have
By Lemma 2.4, all we have to show is that −t11+t12+t21−t22 ≥ 0. It is easy to see that t″ does not affect this linear combination, and its value is (assuming t′12 ≤ t′21, without loss of generality)
The nonnegativity of the first three expressions is obvious, the fifth one is zero, and the forth expression is larger than the fifth because t′ < t′22.
The proof for the case (33) is analogous. □
If the SP composition with sequential A, B is homogeneous (Definition 1.5), the statement of theorem can be made more specific.
Theorem 2.10. If A and B are sequential in an SP∧(A, B, …) composition, then c(0, t) ≥ 0 for any r, t, and C(0, t) ≥ 0 for any t; if the composition is SP∨(A, B, …), then c(t, ∞) ≤ 0 for any r, t, and C(t, ∞) ≤ 0 for any t.
We now turn to networks containing n ≥ 2 processes with durations (X1, …, Xn), selectively influenced by factors (λ1, …, λn). In other words, we deal with compositions SP(X1, …, Xn, …) such that (X1, …, Xn, …) ↫ (λ1, …, λn, ∅), where each λk is binary, with values 1,2. There are 2n allowable treatments and 2n corresponding overall durations, T11…1, T11…2, …, T22…2. According to Definition 1.6 of selective influences, each process duration here is a function of some random variable R and of the corresponding factor, Xk = xk(R, λk). For any fixed value R = r, these durations are fixed numbers for any given treatment, and so is the overall, observed value of the SP composition. We denote them
and
where i1, i2, …, in ∈ {1, 2}. The distribution function for ti1i2…inr is a shifted Heaviside function
Denoting by Hi1i2…in(t) the distribution function of Ti1i2…in, we have
Conditioned on R = r, the n-th order interaction contrast is defined in terms of mixed finite differences as
which, with some algebra can be shown to be equal to
Thus,
etc. The observable distribution function interaction contrast of order n is defined as
By straightforward calculus this can be written in extenso as
or, in terms of finite differences,
This is essentially the high-order interaction contrast used by Yang et al. (2013), the only difference being that they use survivor functions 1 − H(t) rather than the distribution functions H(t). We see that cr(t) and C(t) in the preceding analysis correspond to c(2)r(t) and C(2)(t), respectively.
We also introduce n-th order cumulative contrasts. Conditioned on R = r, we define
etc. Generalizing,
The corresponding unconditional cumulative contrasts of the n-th order are, as always, defined by integration of the conditional ones:
In the proofs below we will make use of the recursive representation of the conditional cumulative contrasts c[n]r. It is verified by straightforward calculus. Denoting
where w ∈ {1, …, n} and iw is fixed at 1 or 2, we have:
and generally, for n > 1,
Also we have, by substitution of variables under integral,
The Prolongation Assumption generalizing (11)–(12) is formulated as follows.
Prolongation Assumption. R and functions x1, …, xn in (34) can be chosen so that xk1r ≤ xk2r for all R = r and for all k = 1, …, n. Without loss of generality, we can also assume x11r ≤ x21r ≤ … ≤ xn1r (if not, rearrange x11r, …, xn1r).
Notation Convention. As we did before for n = 2, when r is fixed throughout a discussion, we omit this argument and write x1i1, …, xnin, ti1i2…in, hi1i2…in(t), c(n)(t) in place of x1i1r, …, xninr, ti1i2…inr, hi1i2…inr(t), c(n)r(t).
Theorem 3.1. If T = X1 ∧ … ∧ Xn, then for any r, t, c(n)(t) ≤ 0 if n is even and c(n)(t) ≥ 0 if n is odd. Consequently, for any t, C(n)(t) ≤ 0 if n is even and C(n)(t) ≥ 0 if n is odd.
Proof. By induction on n, the case n = 1 being true by the Prolongation Assumption:
Let the statement of the theorem be true for c(n−1)(t), with n − 1 ≥ 1. By the Prolongation Assumption,
for any i2 … in, whence
Therefore c(n−1)i1=1(t) = 0, and, applying the induction hypothesis to c(n−1)i1=2(t),
That C(n)(t) ≤ 0 if n is even and C(n)(t) ≥ 0 if n is odd follows by the standard argument. □
Theorem 3.2. If T = X1 ∨ … ∨ Xn, then for any r, t, c(n)(t) ≥ 0. Consequently, for any t, C(n)(t) ≥ 0.
Proof. By induction on n, the case n = 1 being true by the Prolongation Assumption:
Let the theorem be true for c(n−1)(t), where n − 1 ≥ 1. Let
where 1 ≤ m ≤ n. We have then
and
for all i1…im−1, im + 1 … in. Then c(n−1)im=2(t) = 0, and
Consequently, C(n)(t) ≥ 0, for any t. □
In a composition SP(X1, …, Xn, …), the components X1, …, Xn are considered parallel if any two of them are parallel. We assume selective influences (X1, …, Xn, …) ↫ (λ1, …, λn, ∅). We do not consider the complex situation when some of the selectively influenced processes X1, …, Xn are min-parallel and some are max-parallel. However, if they are all (pairwise) min-parallel or all max-parallel, we have essentially the same situation as with a simple parallel arrangement of n durations.
Lemma 3.3. If X1, …, Xn are all min-parallel or max-parallel in an SP composition, this composition can be presented as T = A1 ∧ … ∧ An or T = A1 ∨ … ∨ An, respectively. In either case, (A1, …, An) ↫ (λ1, …, λn) and the Prolongation Assumption holds for any R = r.
Proof. For the min-parallel case, by a minor modification of the proof of Lemma 2.6 we present the SP composition as
or
or else
Then we analogously decompose SP2(X2, …, Xn, …) achieving A1 ∧ A2 ∧ SP3(X3, …, Xn, …), and proceed in this fashion until we reach the required A1 ∧ … ∧ An. The pattern of selective influences is seen immediately, and the Prolongation Assumption follows by the monotonicity of the SP compositions. The proof for the max-parallel case is analogous. □
Theorem 3.4. If X1, …, Xn are min-parallel in an SP composition, then for any r, t, c(n)(t) ≤ 0 if n is even and c(n)(t) ≥ 0 if n is odd. Consequently, for any t, C(n)(t) ≤ 0 if n is even and C(n)(t) ≥ 0 if n is odd. If X1, …, Xn are max-parallel, then for any r, t, c(n)(t) ≥ 0, and for any t, C(n)(t) ≥ 0.
Proof. Follows from Lemma 3.3 and Theorems 3.1 and 3.2. □
Theorem 3.5. If T = X1 + … + Xn, then for any r, t, c[n](0, t) ≥ 0, while c[n](t, ∞) ≤ 0 if n is even and c[n](t, ∞) ≥ 0 if n is odd; moreover, c[n](0, ∞) = 0 for any r. Consequently, for any t, C[n](0, t) ≥ 0, while C[n](t, ∞) ≤ 0 if n is even and C[n](t, ∞) ≥ 0 if n is odd; moreover, C[n](0, ∞) = 0.
Proof. By induction on n, the case n = 1 being true by the Prolongation Assumption:
and
Let the statement of the theorem hold for all natural numbers up to and including n − 1 ≥ 1. Using the recursive representations (59)–(60),
which is ≥ 0 since c[n−1](0, τ) ≥ 0 and t − xw2 ≤ t − xw1. Analogously,
which is ≤ 0 if n is even and ≥ 0 if n is odd. Applying the mean value theorem to the results of (63) and (64), we get, for some t − xw2 < t′, t″ < t − xw1
and, as c[n−1](0, ∞) = 0, both expressions tend to zero as, respectively, t → ∞ (implying t′ → ∞) and t → 0 (implying t″ → 0). □
In a composition SP(X1, …, Xn, …), the components X1, …, Xn are considered sequential if any two of them are sequential. By analogy with Theorem 2.9 for two sequential processes and with Theorem 3.4 for parallel X1, …, Xn, one might expect that the result for the simple sequential arrangement X1 + … + Xn will also extend to n sequential components of more complex compositions SP(X1, …, Xn, …). However, this is not the case, as one can see from the following counterexample.
Consider the composition
with (X1, X2, X3) selectively influenced by binary factors, so that
It follows that
This is clearly a sequential arrangement of the three durations X1, X2, X3, but one can easily check that c[3](0, t) here is not nonnegative for all t. For instance, at t = 3 we have, by direct computation, c[3](0, t) = −1. We conclude that there is no straightforward generalization of Theorems 3.5 to arbitrary SP compositions.
The work presented in this paper is summarized in the abstract. By proving and generalizing most of the known results on the interaction contrast of distribution functions, we have demonstrated a new way of dealing with SP mental architectures. It is based on conditioning all hypothetical components of a network on a fixed value of a common source of randomness R (the “hidden variable” of the contextuality analysis in quantum theory), which renders these components deterministic quantities, and then treating these deterministic quantities as random variables with shifted Heaviside distribution functions.
The potential advantage of this method can be seen in the fact that the shifted Heaviside functions have the simplest possible arithmetic among distribution functions: for every time moment it only involves 0's and 1's. As a result, the complexity of this arithmetic does not increase with nonlinearity of the relations involved. Thus, Dzhafarov and Schweickert (1995); Cortese and Dzhafarov (1996), and Dzhafarov and Cortese (1996) argued that composition rules for mental architectures need not be confined to +, ∧, ∨. They analyzed architectures involving other associative and commutative operations, such as multiplication. Due to mathematical complexity, however, this work was confined to networks consisting of two components that are either stochastically independent or monotone functions of each other. It remains to be seen whether the approach presented here, mutatis mutandis, will lead to significant generalizations in this line of work.
The limitations of the approach, however, are already apparent. Thus, we were not able to achieve any progress over known results in applying it to Wheatstone bridges (Schweickert and Giorgini, 1999; Dzhafarov et al., 2004). The possibility that the “architecture” (composition rule) itself changes as one changes experimental factors makes the perspective of a general theory based on our approach even more problematic (e.g., Townsend and Fific, 2004). It seems, however, that these problems are not specific for just our approach.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This work is supported by NSF grant SES-1155956 and AFOSR grant FA9550-14-1-0318.
Bell, J. (1966). On the problem of hidden variables in quantum mechanics. Rev. Mod. Phys. 38, 447–453. doi: 10.1103/revmodphys.38.447
Cereceda, J. (2000). Quantum mechanical probabilities and general probabilistic constraints for einstein–podolsky–rosen–bohm experiments. Found. Phys. Lett. 13, 427–442. doi: 10.1023/A:1007828731477
Cortese, J. M., and Dzhafarov, E. N. (1996). Empirical recovery of response time decomposition rules II: discriminability of serial and parallel architectures. J. Math. Psychol. 40, 203–218. doi: 10.1006/jmps.1996.0021
Dzhafarov, E. N. (2003). Selective influence through conditional independence. Psychometrika 68, 7–26. doi: 10.1007/bf02296650
Dzhafarov, E. N., and Cortese, J. M. (1996). Empirical recovery of response time decomposition rules I: sample-level Decomposition tests. J. Math. Psychol. 40, 185–202. doi: 10.1006/jmps.1996.0020
Dzhafarov, E. N., and Gluhovsky, I. (2006). Notes on selective influence, probabilistic causality, and probabilistic dimensionality. J. Math. Psychol. 50, 390–401. doi: 10.1016/j.jmp.2006.03.003
Dzhafarov, E. N., and Kujala, J. V. (2010). The joint distribution criterion and the distance Tests for selective probabilistic causality. Front. Psychol. 1:151. doi: 10.3389/fpsyg.2010.00151
Dzhafarov, E. N., and Kujala, J. V. (2012a). Selectivity in probabilistic causality: where psychology runs into quantum physics. J. Math. Psychol. 56, 54–63. doi: 10.1016/j.jmp.2011.12.003
Dzhafarov, E. N., and Kujala, J. V. (2012b). Quantum entanglement and the issue of selective influences in psychology: an overview. Lect. Notes Comput. Sci. 7620, 184–195. doi: 10.1007/978-3-642-35659-9/17
Dzhafarov, E. N., and Kujala, J. V. (2013). Order-distance and other metric-like functions on jointly distributed random variables. Proc. Am. Math. Soc. 141, 3291–3301. doi: 10.1090/s0002-9939-2013-11575-3
Dzhafarov, E. N., and Kujala, J. V. (2014a). On selective influences, marginal selectivity, and Bell/CHSH inequalities. Top. Cogn. Sci. 6, 121–128. doi: 10.1111/tops.12060
Dzhafarov, E. N., and Kujala, J. V. (2014b). Embedding quantum into classical: contextualization vs conditionalization. PLoS ONE 9:e92818. doi: 10.1371/journal.pone.0092818
Dzhafarov, E. N., and Kujala, J. V. (2014c). A qualified kolmogorovian account of probabilistic contextuality. Lect. Notes Comput. Sci. 8369, 201–212. doi: 10.1007/978-3-642-54943-4/18
Dzhafarov, E. N., and Kujala, J. V. (2014d). Contextuality is about identity of random variables. Phys. Scr. T163:014009. doi: 10.1088/0031-8949/2014/T163/014009
Dzhafarov, E. N., and Kujala, J. V. (in press). “Probability, random variables, and selectivity,” in The New Handbook of Mathematical Psychology, eds W. Batchelder, H. Colonius, E. N. Dzhafarov, and J. Myung (Cambridge, UK: Cambridge University Press).
Dzhafarov, E. N., Kujala, J. V., and Larsson, J.-Å. (in press). Contextuality in three types of quantum-mechanical systems. Found. Phys. doi: 10.1007/s10701-015-9882-9
Dzhafarov, E. N., and Schweickert, R. (1995). Decompositions of response times: an almost general theory. J. Math. Psychol. 39, 285–314. doi: 10.1006/jmps.1995.1029
Dzhafarov, E. N., Schweickert, R., and Sung, K. (2004). Mental architectures with selectively influenced but stochastically interdependent components. J. Math. Psychol. 48, 51–64. doi: 10.1016/j.jmp.2003.12.003
Kochen, S., and Specker, E. P. (1967). The problem of hidden variables in quantum mechanics. J. Math. Mech. 17, 59–87. doi: 10.1512/iumj.1968.17.17004
Kujala, J. V., and Dzhafarov, E. N. (2008). Testing for selectivity in the dependence of random variables on external factors. J. Math. Psychol. 52, 128–144. doi: 10.1016/j.jmp.2008.01.008
Landau, L. J. (1988). Empirical two-point correlation functions. Found. Phys. 18, 449–460. doi: 10.1007/BF00732549
Masanes, Ll., Acin, A., and Gisin, N. (2006). General properties of nonsignaling theories. Phys. Rev. A 73:012112. doi: 10.1103/PhysRevA.73.012112
Popescu, S., and Rohrlich, D. (1994). Quantum nonlocality as an axiom. Found. Phys. 24, 379–385. doi: 10.1007/BF02058098
Roberts, S., and Sternberg, S. (1993). “The meaning of additive reaction-time effects: tests of three alternatives,” in Attention and Performance XIV: Synergies in Experimental Psychology, Artificial Intelligence, and Cognitive Neuroscience, eds D. E. Meyer and S. Kornblum (Cambridge, MA: MIT Press), 611–654.
Schweickert, R., Fisher, D.L., and Sung, K. (2012). Discovering Cognitive Architecture by Selectively Influencing Mental Processes. New Jersey, NJ: World Scientific.
Schweickert, R., and Giorgini, M. (1999). Response time distributions: some simple effects of factors selectively influencing mental processes. Psychon. Bull. Rev. 6, 269–288. doi: 10.3758/BF03212330
Schweickert, R., Giorgini, M., and Dzhafarov, E. N. (2000). Selective influence and response time cumulative distribution functions in serial-parallel task networks. J. Math. Psychol. 44, 504–535. doi: 10.1006/jmps.1999.1268
Schweickert, R., and Townsend, J. T. (1989). A trichotomy: interactions of factors prolonging sequential and concurrent mental processes in stochastic discrete mental (PERT) networks. J. Math. Psychol. 33, 328–347. doi: 10.1016/0022-2496(89)90013-8
Sternberg, S. (1969). The discovery of processing stages: Extensions of donders' method. Acta. Psychologica. 30, 276–315. doi: 10.1016/0001-6918(69)90055-9
Townsend, J. T. (1984). Uncovering mental processes with factorial experiments. J. Math. Psychol. 28, 363–400. doi: 10.1016/0022-2496(84)90007-5
Townsend, J. T. (1990a). The truth and consequences of ordinal differences in statistical distributions: Toward a theory of hierarchical inference. Psychol. Bull. 108, 551–567. doi: 10.1037/0033-2909.108.3.551
Townsend, J. T. (1990b). Serial vs. parallel processing: sometimes they look like tweedledum and tweedledee but they can (and should) be distinguished. Psychol. Sci. 1, 46–54. doi: 10.1111/j.1467-9280.1990.tb00067.x
Townsend, J. T., and Fific, M. (2004). Parallel & serial processing and individual differences in high-speed scanning in human memory. Percept. Psychophys. 66, 953–962. doi: 10.3758/BF03194987
Townsend, J. T., and Nozawa, G. (1995). Spatio-temporal properties of elementary perception: an investigation of parallel, serial, and coactive theories. J. Math. Psychol. 39, 321–359. doi: 10.1006/jmps.1995.1033
Townsend, J. T., and Schweickert, R. (1989). Toward the trichotomy method of reaction times: laying the foundation of stochastic mental networks. J. Math. Psychol. 33, 309–327. doi: 10.1016/0022-2496(89)90012-6
Keywords: interaction contrast, mental architectures, noncontextuality, response time, selective influences, series-parallel network
Citation: Zhang R and Dzhafarov EN (2015) Noncontextuality with marginal selectivity in reconstructing mental architectures. Front. Psychol. 6:735. doi: 10.3389/fpsyg.2015.00735
Received: 31 March 2015; Accepted: 17 May 2015;
Published: 17 June 2015.
Edited by:
Sandro Sozzo, University of Leicester, UKReviewed by:
James T. Townsend, Indiana University, USACopyright © 2015 Zhang and Dzhafarov. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ehtibar N. Dzhafarov, Department of Psychological Sciences, Purdue University, 703 Third Street, West Lafayette, IN 47907-2081, USA,ZWh0aWJhckBwdXJkdWUuZWR1
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.