- 1Institute of Applied Mathematics and Mechanics, Peter the Great St. Petersburg Polytechnic University, St. Petersburg, Russia
- 2Zilkha Neurogenetic Institute, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States
- 3Department of Molecular Genetics & Microbiology, Genetics Institute, University of Florida, Gainesville, FL, United States
- 4Department of Cell Biology, SUNY Downstate Medical Center, Brooklyn, NY, United States
- 5Molecular and Computation Biology, University of Southern California, Los Angeles, CA, United States
Schizophrenia (SCZ) is a psychiatric disorder of unknown etiology. There is evidence suggesting that aberrations in neurodevelopment are a significant attribute of schizophrenia pathogenesis and progression. To identify biologically relevant molecular abnormalities affecting neurodevelopment in SCZ we used cultured neural progenitor cells derived from olfactory neuroepithelium (CNON cells). Here, we tested the hypothesis that variance in gene expression differs between individuals from SCZ and control groups. In CNON cells, variance in gene expression was significantly higher in SCZ samples in comparison with control samples. Variance in gene expression was enriched in five molecular pathways: serine biosynthesis, PI3K-Akt, MAPK, neurotrophin and focal adhesion. More than 14% of variance in disease status was explained within the logistic regression model (C-value = 0.70) by predictors accounting for gene expression in 69 genes from these five pathways. Structural equation modeling (SEM) was applied to explore how the structure of these five pathways was altered between SCZ patients and controls. Four out of five pathways showed differences in the estimated relationships among genes: between KRAS and NF1, and KRAS and SOS1 in the MAPK pathway; between PSPH and SHMT2 in serine biosynthesis; between AKT3 and TSC2 in the PI3K-Akt signaling pathway; and between CRK and RAPGEF1 in the focal adhesion pathway. Our analysis provides evidence that variance in gene expression is an important characteristic of SCZ, and SEM is a promising method for uncovering altered relationships between specific genes thus suggesting affected gene regulation associated with the disease. We identified altered gene-gene interactions in pathways enriched for genes with increased variance in expression in SCZ. These pathways and loci were previously implicated in SCZ, providing further support for the hypothesis that gene expression variance plays important role in the etiology of SCZ.
Introduction
Schizophrenia (SCZ) is a psychiatric disorder with high heritability; estimates from twin and family-based studies suggest that the heritability of SCZ may be as high as 81% (Cardno and Gottesman, 2000; Sullivan et al., 2003; Lichtenstein et al., 2009; Wray and Gottesman, 2012). However, despite progress in identifying the genetic basis of the disorder, the molecular basis of SCZ remains elusive (Arslan, 2018). A recent GWAS of SCZ (the largest published), involving 36,989 SCZ cases and 113,075 controls, revealed 108 loci associated with the disease (Schizophrenia Working Group of the Psychiatric Genomics Consortium et al., 2014), and there are indications that the total number of loci associated with SCZ is much greater. However, while the estimated heritability of SCZ is high, the proportion explained by SNPs is smaller, with the genetic liability explained by SNPs estimated to be between 20%–35% when considering SNPs genome-wide (Lee et al., 2012; Cross-Disorder Group of the Psychiatric Genomics Consortium et al., 2013; Speed et al., 2017), or 3.4% when considering only SNPs with genome-wide significance in SCZ GWAS (Schizophrenia Working Group of the Psychiatric Genomics Consortium et al., 2014). The genetic component of SCZ is complex; rather than being explained by relatively few variants of large effect acting as fulcra of SCZ-associated molecular pathogenesis, the genetic susceptibility underlying SCZ may be a function of many variants contributing small effects which together dysregulate pathways and lead to SCZ. If SCZ is a disorder of pathways (Sullivan, 2012) GWAS alone is unlikely to offer immediate insight into the molecular basis of SCZ.
Studying gene expression profiles is a complimentary approach to GWAS and facilitates understanding the molecular etiology of SCZ. It allows the investigation of pathways and molecular networks affected by SCZ. The difficulties in studying gene expression in human patients with neurological disease has led to the adoption of relevant cellular models (Evgrafov et al., 2006) to facilitate understanding of cellular phenotypes. While post-mortem brain tissue samples are used to study gene expression changes in neurological diseases like SCZ (e.g., Chen et al., 2013; Lanz et al., 2015; Hu et al., 2016), these are comprised of terminally differentiated neurons and glial cells of an adult brain which has been subjected to many different environmental conditions. Because SCZ can be considered a neurodevelopmental disorder (Weinberger, 1987; Raedler et al., 1998; Lewis and Levitt, 2002; Alexander-Bloch et al., 2014), the use of port-mortem samples may not necessarily accurately capture the alterations in neurodevelopmental processes important in SCZ, but rather the consequences of these changes in terminally-differentiated cells. An alternative is to use cultured patient-derived neural progenitor cells, such as those derived from olfactory neuroepithelium (CNON; Wrobel et al., 2013), whereby neurodevelopmental changes can be modeled; environmental effects that are a component of post-mortem tissue samples can be reduced; and conditions can be standardized across samples. We analyzed the gene expression profiles in CNON cells from SCZ patients and control individuals to study alterations in gene expression reflective of the potential neurodevelopmental aspects of the disease. Recent analysis of this data identified genes differentially expressed in SCZ that are involved in Wnt and Notch signaling, and Serine biosynthesis pathways (Evgrafov et al., 2017).
Many of the studies on gene expression in diseases have focused on differences in mean expression between disease and non-disease groups. Analysis of the variance of gene expression both within genetically identical populations of cells or organisms and in genetically diverse populations is an emerging topic of discussion. It has provided insights in studies of various biological mechanisms, from evolution to embryonic development (Rönnegård and Valdar, 2011; Brown et al., 2014; Hoffman et al., 2014; Wang et al., 2014). There is evidence that suggests biological variance plays an important part in determining phenotypes (Ozbudak et al., 2002; Colman-Lerner et al., 2005; Raser and O’Shea, 2005; Cai et al., 2006; Manolio et al., 2009). Such variability, particularly in the context of gene expression, may be indicative of genomic or epigenomic influences on the function of a given gene or protein (Alemu et al., 2014). Genes with more constrained expression have been reported to be more likely to encode products with “housekeeping” functions, whereas genes with more variable expression tended to be those involved in developmental and environmental responses and more often associated with disease (Alemu et al., 2014). Interestingly, many of the SCZ-associated loci are also enriched for sequences that remained constant throughout primary evolution but evolved rapidly after the divergence of humans from chimpanzees (Pollard et al., 2006a). While these are predominantly noncoding sequences, they are thought to contain developmental gene regulatory elements and noncoding genes important in neurocognitive development (Pollard et al., 2006a,b; Hubisz and Pollard, 2014).
Although it has not been extensively studied, there is evidence that variance in biological processes is an important aspect of SCZ. For example, variation in cortical metabolic activity is elevated in SCZ patients compared to neurologically-healthy controls and bipolar disorder patients (Yang et al., 2014). In the context of gene expression, there have been two studies to date that have directly examined expression variance in SCZ (Mar et al., 2011; Zhang et al., 2016). In a study of patient-derived human olfactory neurosphere-derived (hONS) cells, several signaling pathways were found to have significantly altered mRNA expression variance in SCZ patient-derived hONS compared to control and Parkinson’s disease hONS (Mar et al., 2011). While high and low variance genes were observed in SCZ CNONs, the overall trend was a reduction in expression variance genome-wide (Mar et al., 2011). In a separate study, the variance in mRNA expression in peripheral blood mononuclear cells was found to be higher in early-onset SCZ patients compared to controls, and this variance was reduced after a 12-week treatment with oral antipsychotics (Zhang et al., 2016). These experiments highlight the apparent importance of biological variance in SCZ, however they examine different aspects of the disease, and the findings are somewhat difficult to reconcile. We therefore decided to examine gene expression variance in CNON cells between SCZ and controls in greater detail. We hypothesize that differences in expression variance between SCZ and control samples will enable us to detect SCZ-associated genetic perturbations (Mar et al., 2011; Mason et al., 2014; Zhang et al., 2016).
A variety of methods have been used to analyze the variance/covariance structure of gene expression profiles, e.g., differentially co-expression analysis (Watson, 2006; Lui et al., 2015), differential analysis of eigengene networks (Langfelder and Horvath, 2007), differential variability analysis (Ho et al., 2008; Jayaswal et al., 2013), PANA (Ponzoni et al., 2014), factor analysis (Coffman et al., 2005) and structural equation modeling (SEM). SEM is a multivariate statistical analysis technique based on Sewell Wright’s path analysis (Wright, 1918, 1921) and widely used in the fields of economics, psychology and sociology. SEM models a multiple-gene pathway structure by taking into account the direction of relationships among genes, allowing for complex interactions among genes where both mean and covariance structure of the data are modeled. In contrast, the coexpression analysis describes only relationships within a pair of genes. Network reconstruction methods which are based on partial correlations, describe chains of genes with related expression but do not account for covariances. SEM models can be built in an exploratory mode, but unlike other approaches, can also be deployed using an existing structure. As there are many already described gene networks (Kanehisa and Goto, 2000), SEMs provide a robust framework for modeling changes between environments in complex gene–gene interactions.
Recently, SEM was applied to analyze gene expression data and infer relationships between genes in gene regulatory networks (Li et al., 2006; Remington, 2009; Mi et al., 2010; Nock and Zhang, 2011; Fear et al., 2015). The attraction of SEM in this area resides in its ability to compare the path strengths between several nodes. The method was employed to predict perturbed gene interaction in neurological diseases, namely frontotemporal lobar degeneration with ubiquitinated inclusions, multiple sclerosis (Pepe and Grassi, 2014) and Parkinson’s disease (Pepe and Do, 2015). Moreover, SEM can be used to identify potential new gene interactions, as demonstrated in Fear et al. (2015). We use an SEM analysis of gene expression data in SCZ and controls to identify specific gene-gene relationships that differ in the context of a given pathway. Thus, in addition to identification of specific genes and implying involvement of corresponding pathways in SCZ, we anticipate identifying edges (gene-gene interactions) that are affected by SCZ in these pathways.
Materials and Methods
CNON Dataset
RNA-Seq transcriptome expression profiles of CNON cells from 144 SCZ patients (DSM-IV criteria) and 111 control individuals (no psychiatric disorders and no family history of SCZ) were used in this study. The details of biopsy and cell cultivation were described previously (Evgrafov et al., 2006; Wrobel et al., 2013). RNA libraries extracted from cells were prepared and sequenced on HiSeq2000 (Illumina) as previously described (Evgrafov et al., 2017). Reads were mapped to the combination of human genome, mtDNA and transcriptome, and those that were assigned to genes, were quantified by a custom RNA-Seq alignment pipeline, GT-FAR1. All the experimental and bioinformatics analysis steps were accompanied by quality control procedures; more detailed information is described in the previous study (Evgrafov et al., 2017). The data is available through dbGaP (Study ID: 26138). We carried out the analysis on 23,920 expressed genes (out of 59,902 total), normalized and filtered as in the previous study (Evgrafov et al., 2017).
Selection of Candidate Pathways
In addition to pathways enriched in differentially expressed genes (Wnt, Notch, Serine biosynthesis; Evgrafov et al., 2017), we found that PI3K-Akt signaling has been identified as important in multiple studies of SCZ (Mao et al., 2009; Panaccione et al., 2013; Singh et al., 2013; Topol et al., 2015; Mulligan and Cheyette, 2016; Howell et al., 2017; Wang et al., 2017). Pathway analysis of GWAS data, performed by The Network and Pathway Analysis Subgroup of the Psychiatric Genomics Consortium (2015) showed enrichment of several GO terms, associated with neuron structure and histone H3-K4 methylation. Analysis of GWAS data (Chang et al., 2015) resulted in a large list of enriched pathways (insulin signaling pathway, neurotrophin signaling pathway, focal adhesion, VEGF signaling pathway, GnRH signaling pathway, tight junction and regulation of actin cytoskeleton) and likely candidate pathways directly connected to those enriched (Wnt signaling pathway, adherens junction, apoptosis, calcium signaling pathway, PI3K-Akt signaling pathway, leukocyte transendothelial migration, long-term potentiation, cell cycle and MAPK signaling pathway). Analysis of GWAS together with mutation and CNV data (Kotlar et al., 2015) revealed potential involvement of the ARC signaling complex, NMDAR complex, VGCC and FMRP target pathways, which play an important role in long-term potentiation and long-term depression pathway. These 19 KEGG pathways for which there is support for involvement with SCZ were the focus of our analysis.
Variance in Gene Expression Between SCZ Patients and Control Individuals
We calculated the variances of gene expression for SCZ and control samples separately. Gene expression was estimated as in Evgrafov et al. (2017). We tested the null hypothesis that the number of genes with increased variance in the SCZ samples was equal or less than half of the total number of analyzed genes, using the sign test (Sheskin, 2006).
The 19 pathways were tested for enrichment for genes with increased variance in SCZ samples. We considered only genes annotated in the KEGG database. Using the variances calculated above, with a simple indicator for whether the variance was greater in SCZ, we applied an overrepresentation analysis (ORA) using the EASE score, a modified one-tail Fisher exact p-value implemented in DAVID (Hosack et al., 2003). We also applied a node-based ORA method (Gu et al., 2012). The node-based enrichment score is a sum of nodes containing at least one gene with increased variance. The null distribution of the enrichment score values was approximated by 1,000,000 simulation trials and the p-value of a pathway’s enrichment was calculated as the proportion of the trials having more extreme enrichments scores.
We also examined the residual variance after fitting age, sex, two principal components (PCs) and batch effect (Evgrafov et al., 2017). Residuals were calculated from DESeq2 tools (Love et al., 2014), and the null hypothesis that variance of the residuals in the SCZ group was equal to or less than that in the control group was tested using Levene’s test for each gene (Levene, 1960). We ranked the list of genes according to the estimated p-values, and applied the fast gene set enrichment analysis (FGSEA) method (available as the Bioconductor package “fgsea”; Sergushichev, 2016), for each pathway. We also conducted another Gene Set Analysis (GSA)–node-based enrichment analysis which aggregates p-values of Levene’s test by Fisher’s method (Leno-Colorado et al., 2017).
FDR-corrected p-values were calculated (Benjamini and Hochberg, 1995) and pathways significantly enriched for differential variance (FDR p-values < 0.01) were considered for further analysis.
Logistic Regression Analysis
To determine whether the genes in pathways enriched for increased variance were also associated with disease status, we estimated the overall correlational structure among these genes. Residuals from the DEseq2 model (described above) for all the genes in these enriched pathways were used in a principle component analysis. PC’s which explained at least 1% of the variance among the genes were used as dependent variables in a logistic regression with SCZ status as the outcome. To assess the model fit we use the C-index (i.e., area under receiver operating characteristic curve, ROC) as a measure of goodness-of-fit for binary outcomes in the model. A value of 0.5 means that the model is no better in predicting an outcome than random chance (Freedman, 2009).
SEM Analysis
SEM is a statistical technique that is first proposed by S. Wright as path analysis (Wright, 1918, 1921). Modern SEM models contain two distinct parts: structural and measurement models (Bollen, 1989; Kline, 2011). Structural models reflect the causal dependencies between endogenous and exogenous variables as the following system of linear equations:
where x and y denote vectors of observed variables (exogenous and endogenous, respectively), ξ and η are vectors of latent variables (exogenous and endogenous, respectively), B is a matrix of coefficients linking endogenous variables, Г is a matrix of coefficients relating exogenous variables to endogenous, and ζ is a vector of structural errors. The measurement model describes the relations between latent variables and their indicators:
where z denotes a vector of observed indicators, Λ is a matrix of factor loadings of the indicators z of a latent variables η and ξ, and δ is a vector of measurement errors. SEM makes the following assumptions: the variable ξ in uncorrelated with the error ζ, the variable is uncorrelated with the error δ, the error ζ is uncorrelated with the error δ, matrix I–B is nonsingular (where I is the identity matrix). In addition, the covariance matrixes for ξ, δ and ζ are known. Considering the structural model and the assumptions, the model-implied covariance matrix between observed variables, Σ is expressed as a function of parameter matrices (Bollen, 1989). Parameter estimation is performed by minimizing the difference between the model-implied covariance matrix Σ and the observed sample covariance matrix S by the Maximum Likelihood method (Kline, 2011). Here we considered SEM without exogenous latent variables. We also set covariance between each pair of exogenous variables to zero. In order to fix the scale of latent variables we put their variance to 1 and mean to 0 (Reference-Group Method; Little et al., 2006). Potential non-linear interactions and quadratic effects were not included in the modeling.
To compare SEM estimates between different subgroups of the data (SCZ and control samples, here) the multiple group (multigroup) modeling is applied (Pugesek et al., 2003). This technique simultaneously assesses parameters of multiple models and can examine the null hypothesis whether the two parameters are equal (Rosseel, 2012). Here we examined the differences between all structural parameters in SCZ and control groups. The multigroup modeling also allows us to compare parameters between groups. We considered five covariates—age, sex, PCs and batch effects. For each observed variable we constructed a linear equation where the covariates played the role of explanatory variables. We added these equations to the model and constrained equation parameters to be equal between groups. The multigroup SEM analysis was performed with lavaan tools (Rosseel, 2012). Two scores—Root mean square error of approximation (RMSEA) and comparative fit index (CFI)–were used to select the model (Hermida et al., 2015). RMSEA is a measure of the difference between the fit model and the data. Low values indicate models that describe the data well. CFI varies from zero (the proposed model fits no better than the baseline model) to one—values closer to one indicating a better model fit. The best model was defined as the one with the highest CFI value within models with RMSEA < 0.2. We also calculated the Akaike information criterion (AIC) and Bayesian information criterion (BIC) values for each model in the multigroup analysis to assess which group of samples (SCZ or control) is better explained by the model.
Visualization of path diagrams for SEM models was performed with DiagrammeR, an R package2.
Construction of SEM Models
Gene networks of interest were constructed based on the KEGG database and literature data (Supplementary Figure S1). These networks contained several nodes with more than one gene. Such a situation may arise when the node genes encode protein complex subunits or they are members of a multi-gene family. Moreover, it may happen that a gene encodes several isoforms of one protein, and either the exact isoform involved is unknown, or the isoform varies depending on cellular environment (Luo and Brouwer, 2013; Luo, 2017).
To address the precise configuration of a network, we generated a collection of structural equation models corresponding to all possible alternative networks where each complex node was substituted with one of the constituent genes. Parameter estimation was performed for each model by multi-group SEM analysis (Rosseel, 2012). If during the estimation a model became empirically non-identifiable, it was discarded. Within the identifiable model set we selected the models that had a low number of non-significant path coefficients (less or equal to 3) on the control set of samples and low RMSEA value (less than 0.2). A model with the highest value of CFI within the remaining models was defined as the best one and reflected the precise configuration of the network. The runtime of the pipeline to select the best model linearly depends on the number of alternative models.
Results
Analysis of Heterogeneity of Gene Expression Variance Between SCZ and Control Samples
We analyzed variance in gene expression profiles of CNON cells of 144 SCZ patients and 111 control individuals with no psychiatric disorders or family history of SCZ. All the experimental and bioinformatics pre-processing steps are previously described (Evgrafov et al., 2017). We analyzed 23,920 expressed genes and tested the null hypothesis that variance of the SCZ group was less than or equal to the control groups. We found significantly more genes with increased variance in the SCZ group (sign test, p-value < 0.01). In total, 16,434 genes had an increased variance amongst the SCZ group, and 7486 genes had a higher variance amongst controls (S1 Supporting Information).
Two Over Representation Analyses (ORA) and two Gene Set Analyses (GSA) were applied to 19 pathways (Table 1). Five pathways showed a higher than expected (FDR p-value < 0.01) number of genes with increased variance in the SCZ group according to the node-based ORA (Table 1). In addition, two pathways were significantly enriched for genes with increased expression variance using node-based GSA method (based on the Levene’s test and Fisher’s method) on the residuals (FDR p-values < 0.01; Table 1). These two sets were merged into one set of five unique pathways for further analysis—serine biosynthesis, PI3K-Akt, MAPK, neurotrophin signaling and focal adhesion (Table 1).
Table 1. The enrichment of signaling pathways with genes that have increased variances in Schizophrenia (SCZ) samples.
The Logistic Regression Fit for Disease Status
Serine biosynthesis, PI3K-Akt, MAPK, neurotrophin and focal adhesion signaling pathways include 69 unique genes; several of these genes are shared between these five pathways. PC analysis on residuals for 69 genes identified 15 PCs which individually explained at least 14% of variance. The logistic regression fit for disease status (SCZ vs. control) with these 15 PCs as factors revealed three PCs with p-value < 0.05 and six PCs with p-value < 0.1. The C-index of the fit was equal to 0.70 indicating that variance among genes in these five pathways is associated with disease status in individuals (coefficient of determination, R2 = 0.14).
Identification of Important Regulatory Relationships Using SEM
Five pathways enriched for genes with increased expression variance in SCZ samples (serine biosynthesis, PI3K-Akt, MAPK, neurotrophin signaling and focal adhesion pathways) were modeled with SEM. Under model identification constraints (Supplementary Text S1) the number of nodes in a network for modeling was limited to 14 nodes, and therefore we reduced the five KEGG pathways to only the key interactions in these pathways (Table 2). The reduced networks of Serine biosynthesis, PI3K-Akt, MAPK, neurotrophin signaling and focal adhesion pathways were denoted as initial, consisted of 5, 11, 11, 9 and 11 nodes and included 5, 17, 20, 11 and 27 genes, respectively (see Table 2 and Supplementary Figure S1). The number of genes was higher that the number of nodes as several nodes were complex (i.e., contained more than one-member gene).
We generated alternative structural equation models substituting complex nodes within initial networks with each member gene. One, 12, 192, 3 and 1296 alternative models for serine biosynthesis, PI3K-Akt, MAPK, neurotrophin signaling and focal adhesion signaling pathways were generated, respectively. The multigroup SEM fit was then performed for each alternative model, and the best model for each pathway was identified using three characteristics: number of non-significant path coefficients, RMSEA and CFI. Models that were empirically non-identifiable or contained more than three non-significant path coefficients in the estimation on control set or showed RMSEA values more than 0.2 were discarded. The best model for each pathway was selected by the highest CFI index value among the models with RMSEA 0.2 or less. For each of the five pathways the values of both AIC and BIC criteria were higher in model fits for SCZ samples compared to controls indicating that the gene expression data in control group of samples were better explained by the structures of pathways.
In four out of the five pathways, we found statistically significant differences in path coefficients (p-value < 0.05; Table 3). These coefficients reflect the strength of causal interactions between NF1, SOS2 and RASA2 with KRAS in MAPK pathway; between PSPH and SHMT2 in serine biosynthesis; between AKT3 and TSC2 in PI3K-Akt signaling pathway; and between CRK and RAPGEF1 in the focal adhesion pathways (Figure 1). SEM fit of neuroptrophin signaling pathway is shown in Supplementary Figure S2. The path coefficient represents the relationship between two genes, in the context of the entire pathway. A significant difference in a path coefficient between the two environments indicates that the relationship between the two genes is altered in the disease compared to the non-disease. All calculations were implemented in R. The link to the GitHub repository containing R scripts is provided in the Supplementary Material.
Table 3. The gene interactions with statistically significant differences in path coefficients between SCZ and control samples.
Figure 1. Structural equation modeling (SEM) fits for four gene networks representing serine biosynthesis, PI3K-Akt, MAPK and focal adhesion signaling. Each arrow contains three-line text information: the first line is the estimation of a path coefficient on control set of samples (and the standard error); the second line is the estimation of a path coefficient on Schizophrenia (SCZ) set; the third line shows the significance of difference between the estimates. p-values higher than 0.05 are marked by “ns” and blue color (non-significant), less than 0.05–by (*) and yellow color less than 0.01–by (**) and red color less than 0.001–by (***) and dark red color.
Discussion
SCZ is highly heritable severe mental disorder with clinically heterogeneous symptoms (Picardi et al., 2012; Takahashi, 2013) and different risk factors (Mäki et al., 2005; Serafini et al., 2012). Several lines of evidence point to neurodevelopment as playing a primary role in etiology of the disease (Weinberger, 1987; Raedler et al., 1998; Lewis and Levitt, 2002). To elucidate neurodevelopmental aspects of SCZ mechanisms we studied gene expression profiles in CNON cells derived from SCZ patients and healthy individuals, a suitable experimental model to analyze the neurodevelopment processes (Evgrafov et al., 2006, 2017; Wrobel et al., 2013). SCZ-related genes that differ between SCZ and control groups in the mean expression levels have been identified from these data (Evgrafov et al., 2017). Here, we hypothesized that variance in gene expression is altered in SCZ CNON cells and found that SCZ CNON cells showed higher variance in expression than the control cells.
Of the 19 pathways related to neurodevelopment and associated with SCZ-associated tested, five pathways (serine biosynthesis, PI3K-Akt, MAPK, neurotrophin and focal adhesion signaling pathways) were enriched for genes with increased variance among the SCZ samples. There were 69 genes in these five pathways. Residuals from individual models that account for covariates were used to estimate the covariance among these genes in a PCs model. PC’s explained 14% of the variance in disease status, suggesting that the co-variance in expression among these genes and pathways are potential predictors of SCZ. Several edges (gene-gene interactions) are altered: NF1, SOS2 and RASA2 with KRAS in MAPK pathway; between PSPH and SHMT2 in serine biosynthesis; between AKT3 and TSC2 in PI3K-Akt signaling pathway; and between CRK and RAPGEF1 in the focal adhesion pathways.
Previous examination of gene expression variance in SCZ has been somewhat contradictory. In an earlier study involving patient-derived hONS cells (which resemble CNON cells), variance in gene expression was found to be more constrained genome-wide in SCZ cases than in controls or Parkinson’s disease (Mar et al., 2011), whereas in blood, expression variance tended to be higher in SCZ than in controls (Zhang et al., 2016). It should be noted that both of these studies had relatively small sample sizes, and therefore may not have captured the entire breadth of expression variance in SCZ. Similarly, in the earlier CNON study, although the authors did not analyze variance in expression of all genes, it was noted that there were genes with high expression variance in SCZs in some of the same pathways that we examined in this study (Evgrafov et al., 2017).
There is support in the literature for the interactions highlighted in our study in SCZ and, more broadly, other neurodevelopmental disorders. Interactions of both NF1 and SOS1 genes with NRAS (homolog of KRAS) gene in MAPK pathway have been previously associated with Noonan syndrome (Baralle et al., 2003; Longoni et al., 2010), a multisystem developmental disorder that includes perturbed neurodevelopment (Noonan, 2005; Roelofs et al., 2016), and RASA2 mutations have also been reported in individuals with Noonan syndrome (Chen et al., 2014). While Noonan syndrome has a large array of physiological abnormalities, and that we did not explicitly examine interactions with NRAS (having collapsed the corresponding, RAS family node of the MAPK pathway to KRAS), this suggests that changes in how NF1 and SOS1 interact with the RAS genes contribute towards neurodevelopmental disorders (like SCZ and Noonan syndrome), possibly through altered coordination of signaling to the RAS/RAF component of the MAPK pathway. Under normal conditions, SOS1 activates RAS proteins (including KRAS) by removal of bound GDP from RAS, thus freeing RAS to bind GTP and activate downstream components of MAPK signaling (Boriack-Sjodin et al., 1998). In contrast, NF1 inhibits RAS activity by hydrolyzing bound GTP (Bollag et al., 1996; Corral et al., 2003), and RASA2 also suppresses RAS activation by enhancing RAS’s GTPase activity (Gaul et al., 1992; Maekawa et al., 1994; Arafeh et al., 2015). In our analysis, we found that the path coefficients the RASA2–KRAS and NF1–KRAS interactions were smaller in SZ than in controls, potentially indicating that the relationship between these genes in terms of gene expression is weaker in SCZ, particularly for the NF1–KRAS which shows a large difference between SCZ and controls. However, the most pronounced change is in the SOS1-KRAS interaction, where the path coefficient changes direction indicating a dramatic shift in the gene expression relationship between these two genes. The alterations in SOS1–KRAS, NF1–KRAS and RASA2–KRAS interactions reflect transcriptional dysregulation of MAPK signaling, centered on a specific mechanism (i.e., RAS activation/inhibition). RAS activation is a tightly-regulated process, and its overactivation leads to oncogenesis and various developmental syndromes (Bos, 1989; Rajalingam et al., 2007; Tidyman and Rauen, 2009). It is possible that the changes in how the expression of these genes relate to one another might explain the differences we observed in our analysis of the MAPK pathway, by perturbing the regulation of RAS activation/inhibition. We also note that paths downstream of KRAS are not significantly different between SCZ and controls, indicating that the downstream components of the MAPK pathway are unperturbed in SCZ. This taken with regulation of RAS activation by SOS1, NR1 and RASAS2 suggests that dysregulation RAS activation may contribute towards SCZ risk. This could also potentially explain abnormal MAPK activity previously reported in SCZ. SCZ-associated abnormalities in regulation of RAS/MEK/ERK pathway have been reported upstream of RAS family proteins, as the product of the SCZ risk gene, Disrupted in SCZ 1 (DISC1), interacts with RASSF7 in the brain (Wang et al., 2016), consistent with our findings. Other components of the MAPK signaling pathway such as RAS-related guanine exchange factors are also implicated in SCZ risk (Xu et al., 2008, 2009; Levy et al., 2015). These guanine exchange factors activate RAS by exchanging bound GDP for free GTP (Rebhun et al., 2000; Feig, 2011) and are important in the development of cortical neurons and regulation of neuronal function (Maeta et al., 2016). Taken all together and given that SOS1, NR1 and RASA2 regulate RAS activation by controlling its GTPase activity, this highlights the dysregulation of RAS activation as a possible mechanism in SCZ development.
In serine biosynthesis we found one interaction (between PSPH and SHMT2) altered between SCZ and controls. In the previous DE analysis, PSPH gene showed tendency to be more highly expressed in SCZ compared to controls, while another member of the same pathway, PSAT1, was a differentially expressed gene after FDR correction (Evgrafov et al., 2017). Moreover, samples with the lowest expression level of these genes were also from SCZ group. The observation that the highest and lowest expression levels of these genes were observed among SCZ samples in this study supports our hypothesis of increased gene expression variance in SCZ. Mutations in PSAT1, which regulates PSPH, affect neurodevelopment (Tabatabaie et al., 2010). There is also evidence that the gene is implicated in SCZ based on a study of gene expression in a family with a chromosomal translocation near PSAT1 associated with SCZ and schizotypal personality disorder (Ozekia et al., 2012). The second gene in the interaction, SHMT2, demonstrated the overexpression and regulatory function (suppression and promotion) in migration and proliferation in carcinoma cells (Wu et al., 2016).
The altered interaction in the PI3K-Akt signaling pathways identified in the current study involves AKT3, an excellent SCZ candidate gene. GWAS studies identified several SNPs with genome-wide significance inside or in the vicinity of this gene, suggesting that AKT3 is a good positional candidate. AKT3 encodes for serine/threonine kinase, a regulator of cell signaling in response to insulin and growth factors. Such a central position in an interrelated network of intracellular signaling implies a role in many cellular and organismal processes, including development. Multiple functional and genetic studies (Howell et al., 2017) clearly indicate the important role of this gene in brain development and suggest involvement in SCZ, and deletion of the gene in mice results in a phenotype reminiscent of a psychiatric manifestation (Bergeron et al., 2017). In addition, the RP11–370K11.1 gene, one of the differentially-expressed genes identified in the same dataset (Evgrafov et al., 2017), is located within AKT3 gene, and one of the SNPs strongly associated with SCZ resides within RP11–370K11.1. Another component of the interaction found to be altered between SCZ and controls is TSC2, and genetic variation in the TSC2 locus is associated with SCZ risk (The International Schizophrenia Consortium et al., 2009).
Finally, the focal adhesion pathways contained one interaction (between CRK and RAPGEF1 genes) that has a significantly different path coefficient between SCZ and control SEM models. Both CRK and RAPGEF1 genes are included in the SZGR2.0 database (Jia et al., 2017) as related to SCZ risk. Remarkably, dynamics of focal adhesion is altered in neural stem/progenitor cell cultures derived from olfactory neuroepithelium of SCZ patients (Féron et al., 1999; Fan et al., 2013), a cellular model similar to CNON cells. Focal adhesion controls motility (Mitra et al., 2005) is also affected in neural stem/progenitor cells derived from SCZ patients (Tee et al., 2017). Involvement of focal adhesion in etiology of SCZ is plausible if not expected as it mechanistically explains aberrations in brain development through affected control of proper migration of neural progenitor cells.
As expected, our variance-based analysis results are overall consistent with the differential gene expression (DEX) analysis of mean expression (Evgrafov et al., 2017), and identification of serine biosynthesis pathways in both analyses is a clear example. However, there are also substantial differences. First, in the current study, we identified five pathways disturbed in SCZ, while simple overrepresentation test of DEX results does not return any significant KEGG pathways. Second, SEM method is specifically designed to identify disturbed relationships between genes, and, in wider terms, affected pathways and networks. Thus, SEM could be especially helpful in deciphering molecular etiology of the disease. As we found, both approaches can identify the same genes, when, for example, the increased variance is explained mostly by a difference in mean between groups of samples. However, there are certainly cases when mean is not changed significantly, while SCZ samples are characterized by higher gene expression. We consider these approaches as mostly complementary and suggest that a combination of both methods should be used to find differences in cell regulation associated with SCZ.
In summary, our analysis provides evidence that variance and covariance in gene expression is an important characteristic of SCZ etiology. The application of SEM identified five interactions between genes in serine biosynthesis, PI3K-Akt, MAPK and focal adhesion pathways that were altered in SCZ neural progenitor cells, suggesting that these interactions are important in the molecular etiology of SCZ. The association of these interactions with SCZ is well supported by literature data that allows us to consider them as good candidates for further downstream analysis.
Author Contributions
AI, LM, SN and MS: conceptualization and methodology. OE and CA: data curation. AI: formal analysis and visualization. AI, JN, OE, LM, SN and MS: investigation. SN and MS: project administration. OE: resources. MS and LM: supervision. AI and MS: writing–original draft. AI, CA, JN, OE, LM, SN and MS: writing–review and editing.
Funding
This study is supported by the Ministry of Education and Science of the Russian Federation, assignment No 1.8697.2017/8.9 and by the following grants: National Institute of Mental Health 974 (NIMH) grant MH086874 and National Institute of Health 975 (NIH) grant GMS102227.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Footnotes
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnmol.2018. 00192/full#supplementary-material
References
Alemu, E. Y. Jr., Carl, J. W., Corrada Bravo, H., and Hannenhalli, S. (2014). Determinants of expression variability. Nucleic Acids Res. 42, 3503–3514. doi: 10.1093/nar/gkt1364
Alexander-Bloch, A. F., Reiss, P. T., Rapoport, J., McAdams, H., Giedd, J. N., Bullmore, E. T., et al. (2014). Abnormal cortical growth in schizophrenia targets normative modules of synchronized development. Biol. Psychiatry 76, 438–446. doi: 10.1016/j.biopsych.2014.02.010
Arafeh, R., Qutob, N., Emmanuel, R., Keren-Paz, A., Madore, J., Elkahloun, A., et al. (2015). Recurrent inactivating RASA2 mutations in melanoma. Nat. Genet. 47, 1408–1410. doi: 10.1038/ng.3427
Arslan, A. (2018). Imaging genetics of schizophrenia in the post-GWAS era. Prog. Neuropsychopharmacol. Biol. Psychiatry 80, 155–165. doi: 10.1016/j.pnpbp.2017.06.018
Baralle, D., Mattocks, C., Kalidas, K., Elmslie, F., Whittaker, J., Lees, M., et al. (2003). Different mutations in theNF1 gene are associated with Neurofibromatosis-Noonan syndrome (NFNS). Am. J. Med. Genet. 119A, 1–8. doi: 10.1002/ajmg.a.20023
Benjamini, Y., and Hochberg, Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289–300.
Bergeron, Y., Bureau, G., Laurier-Laurin, M.-É., Asselin, E., Massicotte, G., and Cyr, M. (2017). Genetic deletion of Akt3 induces an endophenotype reminiscent of psychiatric manifestations in mice. Front. Mol. Neurosci. 10:102. doi: 10.3389/fnmol.2017.00102
Bollag, G., Clapp, D. W., Shih, S., Adler, F., Zhang, Y. Y., Thompson, P., et al. (1996). Loss of NF1 results in activation of the Ras signaling pathway and leads to aberrant growth in haematopoietic cells. Nat. Genet. 12, 144–148. doi: 10.1038/ng0296-144
Bollen, K. A. (1989). Structural Equations with Latent Variables. Hoboken, NJ: John Wiley and Sons, Inc.
Boriack-Sjodin, P. A., Margarit, S. M., Bar-Sagi, D., and Kuriyan, J. (1998). The structural basis of the activation of Ras by Sos. Nature 394, 337–343. doi: 10.1038/28548
Brown, A. A., Buil, A., Viñuela, A., Lappalainen, T., Zheng, H.-F., Richards, J. B., et al. (2014). Genetic interactions affecting human gene expression identified by variance association mapping. Elife 3:e01381. doi: 10.7554/eLife.01381
Cai, L., Friedman, N., and Xie, X. S. (2006). Stochastic protein expression in individual cells at the single molecule level. Nature 440, 358–362. doi: 10.1038/nature04599
Cardno, A. G., and Gottesman, I. I. (2000). Twin studies of schizophrenia: from bow-and-arrow concordances to Star Wars Mx and functional genomics. Am. J. Med. Genet. 97, 12–17. doi: 10.1002/(sici)1096-8628(200021)97:1<12::aid-ajmg3>3.0.co;2-u
Chang, S., Fang, K., Zhang, K., and Wang, J. (2015). Network-based analysis of schizophrenia genome-wide association data to detect the joint functional association signals. PLoS One 10:e0133404. doi: 10.1371/journal.pone.0133404
Chen, C., Cheng, L., Grennan, K., Pibiri, F., Zhang, C., Badner, J. A., et al. (2013). Two gene co-expression modules differentiate psychotics and controls. Mol. Psychiatry 18, 1308–1314. doi: 10.1038/mp.2012.146
Chen, P.-C., Yin, J., Yu, H.-W., Yuan, T., Fernandez, M., Yung, C. K., et al. (2014). Next-generation sequencing identifies rare variants associated with Noonan syndrome. Proc. Natl. Acad. Sci. U S A 111, 11473–11478. doi: 10.1073/pnas.1324128111
Coffman, C. J., Wayne, M. L., Nuzhdin, S. V., Higgins, L. A., and McIntyre, L. M. (2005). Identification of co-regulated transcripts affecting male body size in Drosophila. Genome Biol. 6:R53. doi: 10.1186/gb-2005-6-6-r53
Colman-Lerner, A., Gordon, A., Serra, E., Chin, T., Resnekov, O., Endy, D., et al. (2005). Regulated cell-to-cell variation in a cell-fate decision system. Nature 437, 699–706. doi: 10.1038/nature03998
Corral, T., Jiménez, M., Hernández-Muñoz, I., Pérez de Castro, I., and Pellicer, A. (2003). NF1 modulates the effects of ras oncogenes: evidence of other NF1 function besides its GAP activity. J. Cell. Physiol. 197, 214–224. doi: 10.1002/jcp.10349
Cross-Disorder Group of the Psychiatric Genomics Consortium, Lee, S. H., Ripke, S., Neale, B. M., Faraone, S. V., Purcell, S. M., et al. (2013). Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs. Nat. Genet. 45, 984–994. doi: 10.1038/ng.2711
Evgrafov, O. V., Armoskus, C., Wrobel, B. B., Spitsyna, V. N., Souaiaia, T., Herstein, J. S., et al. (2017). Gene expression in patient-derived neural progenitors provide insights into neurodevelopmental aspects of schizophrenia [in review]. bioRxiv:2017/209197 doi: 10.1101/209197
Evgrafov, O. V., Wrobel, B. B., Kang, X., Simpson, G., Malaspina, D., and Knowles, J. A. (2006). Olfactory neuroepithelium-derived neural progenitor cells as a model system for investigating the molecular mechanisms of neuropsychiatric disorders. Psychiatr. Genet. 21, 217–228. doi: 10.1097/YPG.0b013e328341a2f0
Fan, Y., Abrahamsen, G., Mills, R., Caldero, C. C., Tee, J. Y., Leyton, L., et al. (2013). Focal adhesion dynamics are altered in schizophrenia. Biol. Psychiatry 74, 418–426. doi: 10.1016/j.biopsych.2013.01.020
Fear, J. M., Arbeitman, M. N., Salomon, M. P., Dalton, J. E., Tower, J., Nuzhdin, S. V., et al. (2015). The wright stuff: reimagining path analysis reveals novel components of the sex determination hierarchy in drosophila melanogaster. BMC Syst. Biol. 9:53. doi: 10.1186/s12918-015-0200-0
Feig, L. A. (2011). Regulation of neuronal function by Ras-GRF exchange factors. Genes Cancer 2, 306–319. doi: 10.1177/1947601911408077
Féron, F., Perry, C., Hirning, M. H., McGrath, J., and Mackay-Sim, A. (1999). Altered adhesion, proliferation and death in neural cultures from adults with schizophrenia. Schizophr. Res. 40, 211–218. doi: 10.1016/s0920-9964(99)00055-9
Freedman, D. A. (2009). Statistical Models: Theory and Practice. New York, NY: Cambridge University Press.
Gaul, U., Mardon, G., and Rubin, G. M. (1992). A putative Ras GTPase activating protein acts as a negative regulator of signaling by the Sevenless receptor tyrosine kinase. Cell 68, 1007–1019. doi: 10.1016/0092-8674(92)90073-l
Gu, Z., Liu, J., Cao, K., Zhang, J., and Wang, J. (2012). Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes. BMC Syst. Biol. 6:56. doi: 10.1186/1752-0509-6-56
Hermida, R., Luchman, J. N., Nicolaides, V., and Wilcox, C. (2015). The issue of statistical power for overall model fit in evaluating structural equation models. Comput. Methods Soc. Sci. 3, 25–42.
Ho, J. W. K., Stefani, M., dos Remedios, C. G., and Charleston, M. A. (2008). Differential variability analysis of gene expression and its application to human diseases. Bioinformatics 24, i390–i398. doi: 10.1093/bioinformatics/btn142
Hoffman, P. L., Saba, L. M., Flink, S., Grahame, N. J., Kechris, K., and Tabakoff, B. (2014). Genetics of gene expression characterizes response to selective breeding for alcohol preference. Genes Brain Behav. 13, 743–757. doi: 10.1111/gbb.12175
Hosack, D. A., Dennis, G. Jr., Sherman, B. T., Lane, H., and Lempicki, R. A. (2003). Identifying biological themes within lists of genes with EASE. Genome Biol. 4:R70. doi: 10.1186/gb-2003-4-10-r70
Howell, K. R., Floyd, K., and Law, A. J. (2017). PKB γ/AKT3 loss-of-function causes learning and memory deficits and deregulation of AKT/mTORC2 signaling: relevance for schizophrenia. PLoS One 12:e0175993. doi: 10.1371/journal.pone.0175993
Hu, J., Xu, J., Pang, L., Zhao, H., Li, F., Deng, Y., et al. (2016). Systematically characterizing dysfunctional long intergenic non-coding RNAs in multiple brain regions of major psychosis. Oncotarget 7, 71087–71098. doi: 10.18632/oncotarget.12122
Hubisz, M. J., and Pollard, K. S. (2014). Exploring the genesis and functions of Human Accelerated Regions sheds light on their role in human evolution. Curr. Opin. Genet. Dev. 29, 15–21. doi: 10.1016/j.gde.2014.07.005
Jayaswal, V., Schramm, S., Mann, G. J., Wilkins, M. R., and Yang, Y. H. (2013). VAN: an R package for identifying biologically perturbed networks via differential variability analysis. BMC Res. Notes 6:430. doi: 10.1186/1756-0500-6-430
Jia, P., Han, G., Zhao, J., Lu, P., and Zhao, Z. (2017). SZGR 2.0: a one-stop shop of schizophrenia candidate genes. Nucleic Acids Res. 45, D915–D924. doi: 10.1093/nar/gkw902
Kanehisa, M., and Goto, S. (2000). KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30. doi: 10.1093/nar/28.1.27
Kline, R. B. (2011). Pronciples and Practice of Structural Equation Modeling. 3rd Edn. New York, NY: The Gulford Press.
Kotlar, A. V., Mercer, K. B., Zwick, M. E., and Mulle, J. G. (2015). New discoveries in schizophrenia genetics reveal neurobiological pathways: a review of recent findings. Eur. J. Med. Genet. 58, 704–714. doi: 10.1016/j.ejmg.2015.10.008
Langfelder, P., and Horvath, S. (2007). Eigengene networks for studying the relationships between co-expression modules. BMC Syst. Biol. 1:54. doi: 10.1186/1752-0509-1-54
Lanz, T. A., Joshi, J. J., Reinhart, V., Johnson, K., Grantham, L. E. II., and Volfson, D. (2015). STEP levels are unchanged in pre-frontal cortex and associative striatum in post-mortem human brain samples from subjects with schizophrenia, bipolar disorder and major depressive disorder. PLOS One 10:e0121744. doi: 10.1371/journal.pone.0121744
Lee, S. H., DeCandia, T. R., Ripke, S., Yang, J., Sullivan, P. F., Goddard, M. E., et al. (2012). Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs. Nat. Genet. 44, 247–250. doi: 10.1038/ng.1108
Leno-Colorado, J., Hudson, N. J., Reverter, A., and Pérez-Enciso, M. (2017). A pathway-centered analysis of pig domestication and breeding in eurasia. G3 7, 2171–2184. doi: 10.1534/g3.117.042671
Levene, H. (1960). “Robust tests for equality of variances,” in Contributions to Probability and Statistics: Essays in Honor of Harold Hotelling, eds I. Olkin and H. Hotelling (Palo Alto, CA: Stanford University Press), 278–292.
Levy, R. J., Kvajo, M., Li, Y., Tsvetkov, E., Dong, W., Yoshikawa, Y., et al. (2015). Deletion of Rapgef6, a candidate schizophrenia susceptibility gene, disrupts amygdala function in mice. Transl. Psychiatry 5:e577. doi: 10.1038/tp.2015.75
Lewis, D. A., and Levitt, P. (2002). Schizophrenia as a disorder of neurodevelopment. Annu. Rev. Neurosci. 25, 409–432. doi: 10.1146/annurev.neuro.25.112701.142754
Li, R., Tsaih, S., Shockley, K., Stylianou, I. M., Wergedal, J., Paigen, B., et al. (2006). Structural model analysis of multiple quantitative traits. PLoS Genet. 2:e114. doi: 10.1371/journal.pgen.0020114
Lichtenstein, P., Yip, B. H., Björk, C., Pawitan, Y., Cannon, T. D., Sullivan, P. F., et al. (2009). Common genetic determinants of schizophrenia and bipolar disorder in Swedish families: a population-based study. Lancet 373, 234–239. doi: 10.1016/S0140-6736(09)60072-6.
Little, T. D., Slegers, D. W., and Card, N. A. (2006). A non-arbitrary method of identifying and scaling latent variables in SEM and MACS models. Struct. Equ. Modeling 13, 59–72. doi: 10.1207/s15328007sem1301_3
Longoni, M., Moncini, S., Cisternino, M., Morella, I. M., Ferraiuolo, S., Russo, S., et al. (2010). Noonan syndrome associated with both a new Jnk-activating familial SOS1 and a de novo RAF1 mutations. Am. J. Med. Genet. A 152A, 2176–2184. doi: 10.1002/ajmg.a.33564
Love, M. I., Huber, W., and Anders, S. (2014). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15:550. doi: 10.1101/002832
Lui, T. W., Tsui, N. B., Chan, L. W., Wong, C. S., Siu, P. M., and Yung, B. Y. (2015). DECODE: an integrated differential co-expression and differential expression analysis of gene expression data. BMC Bioinformatics 16:182. doi: 10.1186/s12859-015-0582-4
Luo, W. (2017). Pathview: pathway based data integration and visualization. Bioconductor Documentation p-8.
Luo, W., and Brouwer, C. (2013). Pathview: an R/Bioconductor package for pathway-based data integration and visualization. Bioinformatics 29, 1830–1831. doi: 10.1093/bioinformatics/btt285
Maekawa, M., Li, S., Iwamatsu, A., Morishita, T., Yokota, K., Imai, Y., et al. (1994). A novel mammalian Ras GTPase-activating protein which has phospholipid-binding and Btk homology regions. Mol. Cell. Biol. 14, 6879–6885. doi: 10.1128/mcb.14.10.6879
Maeta, K., Edamatsu, H., Nishihara, K., Ikutomo, J., Bilasy, S. E., and Kataoka, T. (2016). Crucial role of Rapgef2 and Rapgef6, a family of guanine nucleotide exchange factors for rap1 small GTPase, in formation of apical surface adherens junctions and neural progenitor development in the mouse cerebral cortex. eNeuro 3:ENEURO.0142-16.2016. doi: 10.1523/ENEURO.0142-16.2016
Mäki, P., Veijola, J., Jones, P. B., Murray, G. K., Koponen, H., Tienari, P. (2005). Predictors of schizophrenia—a review. Br. Med. Bull. 73–74, 1–15. doi: 10.1093/bmb/ldh046
Manolio, T. A., Collins, F. S., Cox, N. J., Goldstein, D. B., Hindorff, L. A., Hunter, D. J., et al. (2009). Finding the missing heritability of complex diseases. Nature 461, 747–753. doi: 10.1038/nature08494
Mao, Y., Ge, X., Frank, C. L., Madison, J. M., Koehler, A. N., Doud, M. K., et al. (2009). Disrupted in schizophrenia 1 regulates neuronal progenitor proliferation via modulation of GSK3 b/b-catenin signaling. Cell 136, 1017–1031. doi: 10.1016/j.cell.2008.12.044
Mar, J. C., Matigian, N. A., Mackay-Sim, A., Mellick, G. D., Sue, C. M., Silburn, P. A., et al. (2011). Variance of gene expression identifies altered network constraints in neurological disease. PLoS Genet. 7:e1002207. doi: 10.1371/journal.pgen.1002207
Mason, E. A., Mar, J. C., Laslett, A. L., Pera, M. F., Quackenbush, J., Wolvetang, E., et al. (2014). Gene expression variability as a unifying element of the pluripotency network. Stem Cell Rep. 3, 365–377. doi: 10.1016/j.stemcr.2014.06.008
Mi, X., Eskridge, K., Wang, D., Baenziger, P. S., Campbell, B. T., Gill, K. S., et al. (2010). Regression-based multi-trait QTL mapping using a structural equation model. Stat. Appl. Genet. Mol. Biol. 9:38. doi: 10.2202/1544-6115.1552
Mitra, S. K., Hanson, D. A., and Schlaepfer, D. D. (2005). Focal adhesion kinase: in command control of cell motility. Nature 6, 56–68. doi: 10.1038/nrm1549
Mulligan, K. A., and Cheyette, B. N. R. (2016). Neurodevelopmental perspectives on wnt signaling in psychiatry. Mol. Neuropsychiatry 2, 219–246. doi: 10.1159/000453266
Nock, N. L., and Zhang, L. X. (2011). Evaluating aggregate effects of rare and common variants in the 1000 Genomes Project exon sequencing data using latent variable structural equation modeling. BMC Proc. 5:S47. doi: 10.1186/1753-6561-5-s9-s47
Noonan, J. (2005). “Noonan syndrome,” in Handbook of Neurodevelopmental and Genetic Disorders in Adults, eds S. Goldstein and C. Reynolds (New York, NY: The Guilford Press), 308–319.
Ozbudak, E. M., Thattai, M., Kurtser, I., Grossman, A. D., and van Oudenaarden, A. (2002). Regulation of noise in the expression of a single gene. Nat. Genet. 31, 69–73. doi: 10.1038/ng869
Ozekia, Y., Pickardb, B. S., Kanoc, S., Malloyb, M. P., Zeledonc, M., Sunc, D. Q., et al. (2012). A novel balanced chromosomal translocation found in subjects with schizophrenia and schizotypal personality disorder: altered L-serine level associated with disruption of PSAT1 gene expression. Neurosci. Res. 69, 154–160. doi: 10.1016/j.neures.2010.10.003
Panaccione, I., Napoletano, F., Forte, A. M., Kotzalidis, G. D., Del Casale, A., Rapinesi, C., et al. (2013). Neurodevelopment in schizophrenia: the role of the wnt pathways. Curr. Neuropharmacol. 11, 535–558. doi: 10.2174/1570159X113119990037
Pepe, D., and Do, J. H. (2015). Comparison of perturbed pathways in two different cell models for Parkinson’s disease with structural equation model. J. Comp. Biol. doi: 10.1089/cmb.2015.0156 [Epub ahead of print].
Pepe, D., and Grassi, M. (2014). Investigating perturbed pathway modules from gene expression data via structural equation models. BMC Bioinformatics 15:132. doi: 10.1186/1471-2105-15-132
Picardi, A., Viroli, C., Tarsitani, L., Miglio, R., De Girolamo, G., Dell, G., et al. (2012). Heterogeneity and symptom structure of schizophrenia. Psychiatry Res. 198, 386–394. doi: 10.1016/j.psychres.2011.12.051
Pollard, K. S., Salama, S. R., King, B., Kern, A. D., Dreszer, T., Katzman, S., et al. (2006a). Forces shaping the fastest evolving regions in the human genome. PLoS Genet. 2:e168. doi: 10.1371/journal.pgen.0020168
Pollard, K. S., Salama, S. R., Lambert, N., Lambot, M.-A., Coppens, S., Pedersen, J. S., et al. (2006b). An RNA gene expressed during cortical development evolved rapidly in humans. Nature 443, 167–172. doi: 10.1038/nature05113
Ponzoni, I., Nueda, M., Tarazona, S., Götz, S., Montaner, D., Dussaut, J., et al. (2014). Pathway network inference from gene expression data. BMC Syst. Biol. 8:S7. doi: 10.1186/1752-0509-8-S2-S7
Pugesek, B. H., Tomer, A., and von Eye, A. (2003). Structural Equation Modeling: Application in Ecological and Evolutionary Biology. New York, NY: Cambridge University Press.
Raedler, T. J., Knable, M. B., and Weinberger, D. R. (1998). Schizophrenia as a developmental disorder of the cerebral cortex. Curr. Opin. Neurobiol. 8, 157–161. doi: 10.1016/s0959-4388(98)80019-6
Rajalingam, K., Schreck, R., Rapp, U. R., and Albert, Š. (2007). Ras oncogenes and their downstream targets. Biochim. Biophys. Acta 1773, 1177–1195. doi: 10.1016/j.bbamcr.2007.01.012
Raser, J. M., and O’Shea, E. K. (2005). Noise in gene expression: origins, consequences, and control. Science 309, 2010–2013. doi: 10.1126/science.1105891
Rebhun, J. F., Castro, A. F., and Quilliam, L. A. (2000). Identification of guanine nucleotide exchange factors (GEFs) for the Rap1 GTPase. Regulation of MR-GEF by M-Ras-GTP interaction. J. Biol. Chem. 275, 34901–34908. doi: 10.1074/jbc.m005327200
Remington, D. L. (2009). Effects of genetic and environmental factors on trait network predictions from quantitative trait locus data. Genetics 181, 1087–1099. doi: 10.1534/genetics.108.092668
Roelofs, R. L., Janssen, N., Wingbermühle, E., Kessels, R. P. C., and Egger, J. I. M. (2016). Intellectual development in Noonan syndrome: a longitudinal study. Brain Behav. 6:e00479. doi: 10.1002/brb3.479
Rönnegård, L., and Valdar, W. (2011). Detecting major genetic loci controlling phenotypic variability in experimental crosses. Genetics 188, 435–447. doi: 10.1534/genetics.111.127068
Rosseel, Y. (2012). lavaan: an R package for structural equation modeling. J. Stat. Softw. 48:2. doi: 10.18637/jss.v048.i02
Schizophrenia Working Group of the Psychiatric Genomics Consortium, Ripke, S., Neale, B. M., Corvin, A., Walters, J. T. R., Farh, K.-H., et al. (2014). Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427. doi: 10.1038/nature13595
Serafini, G., Pompili, M., Innamorati, M., Rihmer, Z., Sher, L., and Girardi, P. (2012). Can cannabis increase the suicide risk in psychosis? A critical review. Curr. Pharm. Des. 18, 5165–5187. doi: 10.2174/138161212802884663
Sergushichev, A. (2016). An algorithm for fast preranked gene set enrichment analysis using cumulative statistic calculation. bioRxiv 60012.
Sheskin, D. J. (2006). Handbook of Parametric and Nonparametric Statistical Procedures. Boca Raton: CRC Press.
Singh, A., Yamamoto, M., Kam, S. H. Y., Ruan, J., Gauvreau, G. M., Byrne, P. M. O., et al. (2013). Gene-metabolite expression in blood can discriminate allergen-induced isolated early from dual asthmatic responses. PLoS One 8:e67907. doi: 10.1371/journal.pone.0067907
Speed, D., Cai, N., UCLEB Consortium, Johnson, M. R., Nejentsev, S., and Balding, D. J. (2017). Reevaluation of SNP heritability in complex human traits. Nat. Genet. 49, 986–992. doi: 10.1038/ng.3865
Sullivan, P. F. (2012). Schizophrenia as a pathway disease. Nat. Med. 18, 210–211. doi: 10.1038/nm.2670
Sullivan, P. F., Kendler, K. S., and Neale, M. C. (2003). Schizophrenia as a complex trait: evidence from a meta-analysis of twin studies. Arch. Gen. Psychiatry 60, 1187–1192. doi: 10.1001/archpsyc.60.12.1187
Tabatabaie, L., Klomp, L. W., Berger, R., and de Koning, T. J. (2010). L-Serine synthesis in the central nervous system: a review on serine deficiency disorders. Mol. Genet. Metab. 99, 256–262. doi: 10.1016/j.ymgme.2009.10.012
Takahashi, S. (2013). Heterogeneity of schizophrenia: genetic and symptomatic factors. Am. J. Med. Genet. B Neuropsychiatr. Genet. 162, 648–652. doi: 10.1002/ajmg.b.32161
Tee, J. Y., Sutharsan, R., Fan, Y., and Mackay-Sim, A. (2017). Cell migration in schizophrenia: patient-derived cells do not regulate motility in response to extracellular matrix. Mol. Cell. Neurosci. 80, 111–122. doi: 10.1016/j.mcn.2017.03.005
The International Schizophrenia Consortium, Purcell, S. M., Wray, N. R., Stone, J. L., Visscher, P. M., O’Donovan, M. C. (2009). Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460, 748–752. doi: 10.1038/nature08185
The Network and Pathway Analysis Subgroup of the Psychiatric Genomics Consortium. (2015). Corrigendum: psychiatric genome-wide association study analyses implicate neuronal, immune and histone pathways. Nat. Neurosci. 18:1861. doi: 10.1038/nn0615-926c
Tidyman, W. E., and Rauen, K. A. (2009). The RASopathies: developmental syndromes of Ras/MAPK pathway dysregulation. Curr. Opin. Genet. Dev. 19, 230–236. doi: 10.1016/j.gde.2009.04.001
Topol, A., Zhu, S., Tran, N., Simone, A., Fang, G., and Brennand, K. (2015). Altered WNT signaling in human induced pluripotent stem cell neural progenitor cells derived from four schizophrenia patients. Biol. Psychiatry 78, e29–e34. doi: 10.1016/j.biopsych.2014.12.028
Wang, S., Liang, Q., Qiao, H., Li, H., Shen, T., and Ji, F. (2016). DISC1 regulates astrogenesis in the embryonic brain via modulation of RAS/MEK/ERK signaling through RASSF7. Development 143, 2732–2740. doi: 10.1242/dev.133066
Wang, G., Yang, E., Brinkmeyer-Langford, C. L., and Cai, J. J. (2014). Additive, epistatic, and environmental effects through the lens of expression variability QTL in a twin cohort. Genetics 196, 413–425. doi: 10.1534/genetics.113.157503
Wang, L., Zhou, K., Fu, Z., Yu, D., Huang, H., Zang, X., et al. (2017). Brain development and akt signaling: the crossroads of signaling pathway and neurodevelopmental diseases. J. Mol. Neurosci. 61, 379–384. doi: 10.1007/s12031-016-0872-y
Watson, M. (2006). CoXpress: differential co-expression in gene expression data. BMC Bioinformatics 7:509. doi: 10.1186/1471-2105-7-509
Weinberger, D. R. (1987). Implications of normal brain development for the pathogenesis of schizophrenia. Arch. Gen. Psychiatry 44, 660–669. doi: 10.1001/archpsyc.1987.01800190080012
Wray, N., and Gottesman, I. I. (2012). Using summary data from the Danish National Registers to estimate heritabilities for schizophrenia, bipolar disorder, and major depressive disorder. Front. Genet. 3:118. doi: 10.3389/fgene.2012.00118
Wrobel, B. B., Mazza, J. M., Evgrafov, O. V., and Knowles, J. A. (2013). Assessing the efficacy of endoscopic office olfactory biopsy sites to produce neural progenitor cell cultures for the study of neuropsychiatric disorders. Int. Forum Allergy Rhinol. 3, 133–138. doi: 10.1002/alr.21080
Wu, X., Deng, L., Tang, D., Ying, G., Yao, X., Liu, F., et al. (2016). miR-615–5p prevents proliferation and migration through negatively regulating serine hydromethyltransferase 2 (SHMT2) in hepatocellular carcinoma. Tumor Biol. 37, 6813–6821. doi: 10.1007/s13277-015-4506-8
Xu, B., Roos, J. L., Levy, S., van Rensburg, E. J., Gogos, J. A., and Karayiorgou, M. (2008). Strong association of de novo copy number mutations with sporadic schizophrenia. Nat. Genet. 40, 880–885. doi: 10.1038/ng.162
Xu, B., Woodroffe, A., Rodriguez-Murillo, L., Roos, J. L., van Rensburg, E. J., Abecasis, G. R., et al. (2009). Elucidating the genetic architecture of familial schizophrenia using rare copy number variant and linkage scans. Proc. Natl. Acad. Sci. U S A 106, 16746–16751. doi: 10.1073/pnas.0908584106
Yang, G. J., Murray, J. D., Repovs, G., Cole, M. W., Savic, A., Glasser, M. F., et al. (2014). Altered global brain signal in schizophrenia. Proc. Natl. Acad. Sci. U S A 111, 7438–7443. doi: 10.1073/pnas.1405289111
Keywords: schizophrenia, neurodevelopmental disorders, structural equation models, signaling pathways, gene network modeling
Citation: Igolkina AA, Armoskus C, Newman JRB, Evgrafov OV, McIntyre LM, Nuzhdin SV and Samsonova MG (2018) Analysis of Gene Expression Variance in Schizophrenia Using Structural Equation Modeling. Front. Mol. Neurosci. 11:192. doi: 10.3389/fnmol.2018.00192
Received: 31 January 2018; Accepted: 15 May 2018;
Published: 11 June 2018.
Edited by:
Ildikó Rácz, Universitätsklinikum Bonn, GermanyReviewed by:
Gianluca Serafini, Dipartimento di Neuroscienze e Organi di Senso, Ospedale San Martino (IRCCS), ItalyPatrick May, University of Luxembourg, Luxembourg
Copyright © 2018 Igolkina, Armoskus, Newman, Evgrafov, McIntyre, Nuzhdin and Samsonova. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Anna A. Igolkina, aWdvbGtpbmFhbm5hMTFAZ21haWwuY29t
Maria G. Samsonova, bS5nLnNhbXNvbm92YUBnbWFpbC5jb20=