- 1Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences (ICG SB RAS), Novosibirsk, Russia
- 2Research Institute of Fundamental and Clinical Immunology (RIFCI SB RAS), Novosibirsk, Russia
Using our previously published Web service SNP_TATA_Comparator, we conducted a genome-wide study of single-nucleotide polymorphisms (SNPs) within core promoters of 68 human rheumatoid arthritis (RA)-related genes. Using 603 SNPs within 25 genes clinically associated with RA-comorbid disorders, we predicted 84 and 70 candidate SNP markers for overexpression and underexpression of these genes, respectively, among which 58 and 96 candidate SNP markers, respectively, can relieve and worsen RA as if there is a neutral drift toward susceptibility to RA. Similarly, we predicted natural selection toward susceptibility to RA for 8 immunostimulatory genes (e.g., IL9R) and 10 genes most often associated with RA (e.g., NPY). On the contrary, using 25 immunosuppressive genes, we predicted 70 and 109 candidate SNP markers aggravating and relieving RA, respectively (e.g., IL1R2 and TGFB2), suggesting that natural selection can simultaneously additionally yield resistance to RA. We concluded that disruptive natural selection of human immunostimulatory and immunosuppressive genes is concurrently elevating and reducing the risk of RA, respectively. So, we hypothesize that RA in human could be a self-domestication syndrome referring to evolution patterns in domestic animals. We tested this hypothesis by means of public RNA-Seq data on 1740 differentially expressed genes (DEGs) of pets vs. wild animals (e.g., dogs vs. wolves). The number of DEGs in the domestic animals corresponding to worsened RA condition in humans was significantly larger than that in the related wild animals (10 vs. 3). Moreover, much less DEGs in the domestic animals were accordant to relieved RA condition in humans than those in the wild animals (1 vs. 8 genes). This indicates that the anthropogenic environment, in contrast to a natural one, affects gene expression across the whole genome (e.g., immunostimulatory and immunosuppressive genes) in a manner that likely contributes to RA. The difference in gene numbers is statistically significant as confirmed by binomial distribution (p < 0.01), Pearson’s χ2 (p < 0.01), and Fisher’s exact test (p < 0.05). This allows us to propose RA as a candidate symptom within a self-domestication syndrome. Such syndrome might be considered as a human’s payment with health for the benefits received during evolution.
Introduction
Rheumatoid arthritis (RA) is an autoimmune disease involving autoantibodies (e.g., anti-citrullinated protein antibodies) and proinflammatory cytokines (e.g., TNF-α and IL-6) that participate in the induction of chronic synovitis and bone erosion, followed by deformity (Smolen and Aletaha, 2015), which are some of the most prevalent causes of disability (Koller and Nobauer-Huhmann, 2009). Currently, it is widely accepted that RA immunopathogenesis is mostly mediated by the mechanisms involving a breakdown of immune tolerance to self antigens that is characterized by an increase in the activity of effector T cells causing RA symptoms (Sarkander et al., 2016). Two subpopulations of helper cells, Th1 and Th17, are mostly responsible for the increase in the activity of effector T cells (Frey et al., 2010). Additionally, memory cells of adaptive immunity in a given individual keep information on all the diseases survived; these cells will increase resistance to these diseases in the future (Sarkander et al., 2016). On the contrary, low activity of regulatory cells [e.g., regulatory T cells (Tregs) and myeloid suppressor cells] is often seen in RA (Alsaed et al., 2018). The immunosuppressive-activity deficit is one of the central features of RA pathogenesis (Lawson et al., 2006). Moreover, in patients with RA, an increase in the resistance of effector T cells to the suppressive action of Tregs is detectable too (Malemud, 2018). There are numerous factors related to disturbances in the mechanisms regulating immunocompetent cells, e.g., primarily, cytokines, and their receptors as well as transcription factors.
Rheumatoid arthritis cannot be completely cured (Smolen et al., 2016) in terms of autoimmunity because memory immune cells in the blood are able to retain information about antigens for a long time after successful treatment (Sarkander et al., 2016). For this reason, clinicians usually talk only about RA remission (Smolen and Aletaha, 2015). Because the immune system is more potent in women, their risk of RA is three times higher than that of men (Krasselt and Baerwald, 2017) and increases with the level of sex hormones after menopause and during pregnancy (Ho and Weinshilboum, 2017) but decreases during lactation (Karlson et al., 2004). A retrospective clinical and pharmacological meta-analysis of 29,880 RA patients in comparison with 73,758 relatively healthy volunteers identified 42 loci and 98 genes as candidate therapeutic targets in RA within the framework of predictive, preventive personalized, and participatory (4P) medicine (Okada et al., 2014).
Conventionally, RA risk is dependent on genetic factors and the lifestyle approximately equally (Nair et al., 2017), namely, the microbiome (reflecting a diet) (Sato et al., 2017), previous illnesses (Scott et al., 2010), environmental pollution (Klareskog et al., 2006), and bad habits (Malm et al., 2016) such as smoking (Erlandsson et al., 2016), physical inactivity, and overeating (Somers et al., 2014). That is why RA fits well the main idea of post-genomic 4P medicine, thus giving a chance to people to reduce their disease risks by correction of the lifestyle in line with their individual sequenced genomes (Trovato, 2014). This is important because at late RA stages, fibroblast-like synoviocytes are capable of hyperproliferation, which can cause leukemia in children with leukopenia (Jones et al., 2006) and, in adulthood, may lead to synovial hyperplasia as an impairment of the shape and mobility of joints (Lim and Bae, 2011).
The keystone of 4P medicine is the top scientific project of the 21st century, “1000 Genomes” (Telenti et al., 2016), due to which hundreds of thousands of individual human genomes are sequenced referenced in the variome data. This Big Data set contains consensus human genome sequences and hundreds of millions of single-nucleotide polymorphisms (SNPs) publicly available within databases Ensembl (Zerbino et al., 2015), the UCSC Genome Browser (Haeussler et al., 2015), and dbSNP (Day, 2010). Additionally, databases ClinVar (Landrum et al., 2014) and OMIM (Amberger et al., 2015) document, systematize, and prioritize only clinically proven and experimentally studied human disease SNP markers, respectively, whose allele frequencies significantly differ between cohorts of patients and conventionally healthy volunteers as a mandatory criterion (Varzari et al., 2019). Finally, the dbWGFP database (Wu et al., 2016) does the same in whole-genome mode for all the 10 billion potential SNPs in humans. If we assume that each SNP can affect at least 1 of known human 55,000 diseases (Pocai, 2019), such genetic load will be too high to survive in evolution. So, both Kimura’s theory (1968) and Haldane’s dilemma (1957) lead to the conclusion about neutrality of the vast majority of human SNPs. These neutral SNPs might be bioinformatically identified and discarded without time-consuming clinical testing. Although the current accuracy of bioinformatic calculations is not above the threshold of clinical applicability yet (Yoo et al., 2015), this accuracy grows each year (Putlyaeva et al., 2018; Zorlu et al., 2019).
Most of the SNPs documented in the OMIM database (Amberger et al., 2015) are within protein-coding regions of human genes and correspond to aberrations in protein structure and therefore function (Mitsuyasu et al., 1998). Indeed, these damages are uniform within any tissue and thus are easily detectable but cannot be corrected either therapeutically or via lifestyle changes. In contrast, SNPs within regulatory gene regions (Deplancke et al., 2016) have pathogenic manifestations correctable both by medication and by lifestyle changes within the framework of 4P medicine because these manifestations are limited to alterations of gene expression levels without any protein damage; the latter is negligibly rare among experimentally studied regulatory SNPs (Amberger et al., 2015). Certainly, many factors can independently modulate expression levels of the majority of genes; this situation complicates interpreting the expression patterns of these genes as partial contributions of such modulators as SNPs, somatic mutations, stressors, silencers, inhibitors, and activators. Actually, exogenous recombinant activated coagulation factor VII (F7), as an adjunctive therapy, can successfully help to urgently stop internal bleeding caused by acquired hemophilia as an autoimmune complication of RA, which is treated with immunosuppressive therapy at the same time (Drobiecki et al., 2013). The best-studied regulatory SNPs are mostly within 70-bp regions upstream of transcription start sites (Bhuiyan and Timmers, 2019) and affect gene expression levels proportionally with effect of these SNPs on the binding affinity of TATA-binding protein (TBP) for TBP-sites in these promoter regions (Mogno et al., 2010) when TATA box is canonical (Ponomarenko et al., 2013). According to the EPD database (Dreos et al., 2017), only ∼15% of eukaryotic gene promoters contain canonical TATA boxes as TBP-sites (Bucher, 1990), whereas genome-wide chromatin immunoprecipitation experiments (ChIP-seq) have detected such sites upstream of all the transcription start sites within eukaryotic genomes (Rhee and Pugh, 2012). Indeed, the binding of TBP to TBP-sites of genes shifts the equilibrium from transcriptionally inactive packing of these genes to pre-initiation complexes necessary to initiate the expression of these genes (Godde et al., 1995) as proven in TBP knockout mice (Martianov et al., 2002).
We developed our Web service SNP_TATA_Comparator1 (Ponomarenko et al., 2015), whose input consists of two promoter DNA sequences representing ancestral and minor alleles of the SNP being examined; the software generates TBP-binding affinity estimates for these promoter alleles (± standard error) and significance α of their difference with Fisher’s Z-test (Waardenberg et al., 2015). We applied it from SNP to SNP to predict their contribution to diseases [e.g., chronopathologies (Ponomarenko et al., 2016)] and selectively verified the obtained results using F1-hybrid mice (Chadaeva I. et al., 2019), real-time polymerase chain reaction (Oshchepkov et al., 2019), RNA-Seq data (Vasiliev et al., 2021), gel retardation assay, stopped-flow spectrometry, biosensors, or bioluminescence, as reviewed (Ponomarenko et al., 2017). SNP_TATA_Comparator (Ponomarenko et al., 2015) is already used in independent clinical studies [e.g., in a pulmonary tuberculosis case-control study (Varzari et al., 2018)]. In the present work, at the stage of comprehensive experimental validation, we tested whole-genome sequence-based predictions for RA-associated candidate biomedical SNP markers in vivo using publicly available RNA-Seq data (Albert et al., 2012; Hekman et al., 2018; Yang et al., 2018). Finally, we discuss the results of the verification of the SNP_TATA_Comparator predictions vis-à-vis the semiquantitative RNA-Seq data for the next step: further comprehensive experimental verification of our biomedical predictions using SNP_TATA_Comparator compared with genome-wide data on quantitative trait loci, QTLs [e.g., in human cardiopathology (Koopmann et al., 2014)].
Materials and Methods
DNA Sequences
We retrieved 1896 SNPs of 68 human genes from the dbSNP database, build No. 151, and DNA sequences from Ensembl (reference genome assembly GRCh38/hg38) using the UCSC Genome Browser.
Analysis of DNA Sequences
We used the web tool SNP_TATA_Comparator (Ponomarenko et al., 2015), the input of which are two proximal promoter sequences carrying either an ancestral (wt) or a minor (min) allele of an SNP being analyzed, as shown within two textboxes, “Basic sequence” and “Editable sequence,” respectively (Figure 1E). The double-headed open arrows (⇔) between Figures 1B,C,E explain how SNP_TATA_Comparator uses the Bioperl toolkit (Stajich et al., 2002) to retrieve the ancestral variant of the DNA sequence of the human MMP12 promoter from database Ensembl (Zerbino et al., 2015) in an automated mode. The solid arrows between Figures 1A,D,E show the input of a minor variant of this sequence into SNP_TATA_Comparator according to its description within the database dbSNP (Day, 2010) visualized using the UCSC Genome Browser (Haeussler et al., 2015).
Figure 1. The result produced by SNP_TATA_Comparator (Ponomarenko et al., 2015) in the case of a clinically proven SNP marker (rs2276109) of an asthma risk reduction due to MMP12 downregulation (Hunninghake et al., 2009), which is associated with a reduced risk of rheumatoid arthritis, RA (Liu et al., 2004). (A) The UCSC Genome Browser (Haeussler et al., 2015) visualizes a 70-bp promoter (double-headed dash-and-dot arrow) of a given human gene (here: MMP12, row 2, left column), where there are a TBP-binding site (TBP-site; framed, □) and SNPs retrieved from dbSNP (Day, 2010); their IDs constitute the left-hand column. (B) The Bioperl toolkit (Stajich et al., 2002). (C) The Ensembl database (Zerbino et al., 2015). (D) The current build (No. 151) of the dbSNP database describes the SNP under study (rs2276109 in this example). (E) Our prediction of MMP12 downregulation (textbox “Result”: row 3) caused by SNP rs2276109 (circled) by means of its both alleles (i.e., ancestral and minor ones as input data within two textboxes “Basic sequence” and “Editable sequence,” respectively). Double-headed open arrows (⇔) depict how SNP_TATA_Comparator retrieved the ancestral DNA sequence of the human MMP12 promoter from the Ensembl database (Zerbino et al., 2015) using the Bioperl toolkit (Stajich et al., 2002). (F) Our bioinformatics model of the TBP-promoter binding via three steps, as follows: (1) TBP sliding along DNA ↔ (2) TBP retarded by the TBP-site ↔ the TBP-promoter complex is fixed due to the bending of the DNA double helix at right angles (Ponomarenko et al., 2008). This model is based on the standard bioinformatical package of R (Waardenberg et al., 2015), as described in Supplementary File 1: Supplementary Methods. Solid arrows: data flows, when one predicts the manifestation of a given SNP using SNP_TATA_Comparator; dashed arrow: the SNP in this example (rs2276109); dotted arrows: two more SNPs, rs572527200 and rs1401366377 (Figure 3A), which correspond to either underexpression or overexpression of MMP12, respectively, as predicted in this work.
Using the “Calculate” option, we first estimated two (−ln(KD) ± δ) pairwise value sets of the highest observed estimate of TBP-promoter affinity ± its standard error according to our three-step approximation of their complex formation on each of these sequences independently, as depicted in Figure 1F and described in depth in Supplementary File 1 entitled Section “Supplementary DNA Sequence Analysis.” To this end, we took into account non-specific TBP-DNA affinity (Hahn et al., 1989), the position-weight matrix of TBP-sites (Bucher, 1990), minor-groove width of B-helical DNA (Karas et al., 1996), DNA melting during its bending, which fixes the TBP-promoter complex (Flatters and Lavery, 1998), and abundance levels of TA-rich dinucleotides (Ponomarenko et al., 1999) in the sequences analyzed. Then, we calculated Fisher’s Z-score and, finally, converted it into its p-value of statistical significance taken from standard software R (Waardenberg et al., 2015), as one can see in Figure 1F. Eventually, the “Result” textbox (Figure 1E) shows all the intermediate and final results, namely, −ln(KD(wt)) ± δ(wt) and −ln(KD(min)) ± δ(min) in lines 1 and 2, respectively; the prediction made using the terms “deficiency,” “excess,” “significant,” and “insignificant” in line 3 as proven experimentally (Mogno et al., 2010); and the Z-score and p-value in line 4.
Thus, we examined the SNPs one by one independently from the others and, as a result, either discarded those with insignificant effects on the TBP-promoter binding affinity according to our predictions (data not shown) or presented the SNPs in Supplementary Tables 1–4 (hereinafter: see Supplementary File 2 section “Supplementary Results”).
Keyword Searches in the PubMed Database
For each SNP that can statistically significantly alter the expression of the studied human genes according to our prediction above, we handmade a standard keyword search in the PubMed database (Lu, 2011) as depicted in Supplementary Figure 1 (hereinafter: see Supplementary File 3 entitled section “Supplementary Keyword Search”).
In vivo Validation of Our Current Predictions Using Public RNA-Seq Data
For in vivo validation of our predictions made in this work (Supplementary Tables 1–4) using public RNA-Seq data, we compiled all the 68 human genes whose effects on RA in humans have been estimated here, as shown in Figure 2 (Step-1), and we present them in Supplementary Table 5 (hereinafter, see Supplementary File 2 entitled section “Supplementary Results”).
Figure 2. An algorithmic flowchart of in vivo validation of our current predictions by means of publicly available RNA-Seq data on differentially expressed genes (DEGs) of domestic vs. wild animals. RA, rheumatoid arthritis; effect on RA, contributes to (↑), relieves (↓); NCA, nearest common ancestor; PADJ, significance [Fisher’s Z-test with those corrections on multiple comparisons that have been published by Albert et al. (2012), Hekman et al. (2018), and Yang et al. (2018)].
We verified our computer-based prediction (that RA can be a self-domestication syndrome) by means of previously published publicly available whole-genome RNA-Seq data, as described in Table 1 and depicted in Figure 2 (Step-2). For minimizing the effects of false–positive errors, we selected only the differentially expressed genes (DEGs) that were statistically significant according to Fisher’s Z-test, with those corrections on multiple comparisons (PADJ < 0.05) that have been published by their authors (Albert et al., 2012; Hekman et al., 2018; Yang et al., 2018). Therefore, these data included DEGs in the frontal cortex of guinea pigs (Cavia porcellus) vs. cavies (C. aperea; 883 DEGs), pigs vs. boars (Sus scrofa; 30 DEGs), domesticated vs. wild rabbits (Oryctolagus cuniculus; 17 DEGs), tame vs. aggressive rats (Rattus norvegicus; 20 DEGs), and dogs (Canis familiaris) vs. wolves (C. lupus; 13 DEGs); these data were retrieved from another study (Albert et al., 2012). Additionally, we used 450 DEGs in the blood of dogs vs. wolves retrieved from Yang et al. (2018). Besides, we analyzed 327 DEGs within anterior pituitary tissues of adult male foxes (Vulpes vulpes) of two unique outbred lines artificially selected for domestication (tameness) or aggressiveness (Hekman et al., 2018). Thus, the total number of DEGs analyzed in this study was 1740.
Table 1. The investigated genome-wide RNA-Seq transcriptomes of domestic animals with their wild congeners publicly available in database PubMed.
For the RNA-Seq data analysis, we employed the conventional concept of “divergence from the nearest common ancestor” (Samet, 1985), which we applied to domestic and wild animals whose DEGs were compared with orthologous human genes studied here, as suggested recently (Vasiliev et al., 2021). This procedure yielded 11 pairs of orthologous genes in humans and animals, as shown in Figure 2 (Step-3 and a Venn diagram). In this context, we compared the effects of codirected changes in the expression of orthologous genes on RA in humans and during the divergence of domestic and wild animals from their NEAREST COMMON ANCESTOR, as depicted in Figure 2 (Step-4). Accordingly, we filled out Fisher’s table 2 × 2 (Figure 2: Step-5) and, finally, tested this binary correlation table for statistical significance using standard package STATISTICA (StatSoftTM, Tulsa, OK, United States), as depicted in Figure 2 (Step-6).
Statistical Analysis
Using options “Statistics” → “Non-parametrics” → “2 × 2 Table” in STATISTICA (StatSoft, Tulsa, OK, United States), we verified our predictions (about the candidate SNP markers of worse RA as a self-domestication syndrome) with the 1740 DEGs of domestic vs. wild animals (Table 1) by three criteria, namely, binomial distribution, Pearson’s χ2, and Fisher’s exact test.
Results and Discussion
To test our Web service SNP_TATA_Comparator for robustness vis-à-vis the annual increase in the SNP count, we compared its predictions between two cases, namely, (1) a previous build (No. 147) of the dbSNP database dated 2016, and after that, (2) its current build (No. 151) dated 2017. As one can see in rows 3 and 4 of Table 2, during the period considered, the progress of biomedicine yielded a 1.5-fold increase (from 18 to 27) in the number of human genes with biomedical SNP markers in known TBP-sites and an almost threefold increase (from 225 to 646) in the number of SNPs within the gene regions in question. That is why here we first analyzed SNPs within only the human genes that were analyzed in our previous study on RA (Chadaeva I. V. et al., 2019), to compare the results obtained using the current build (No. 151) of database dbSNP (Day, 2010) with those obtained by means of the previous build (No. 147) of this database. Below, we present the results on some genes.
Table 2. Candidate SNP markers of rheumatoid arthritis (RA) that are located near TBP-sites of human gene promoters as predicted in this work, in comparison with genome-wide patterns.
Candidate SNP Markers of RA Near Clinical SNP Markers of Diseases at TBP-Sites
The human MMP12 gene encodes macrophage elastase and carries an SNP, rs2276109. Rs2276109 is a clinically proven SNP marker of an asthma risk reduction due to MMP12 deficiency (Hunninghake et al., 2009), as one can see in Figures 1A,D. Figure 1E presents how, using SNP_TATA_Comparator (Ponomarenko et al., 2015), we predicted that this SNP causes MMP12 deficiency (i.e., the “Decision” line of the “Result” textbox) because this SNP damages the TBP-site within the promoter of this gene, as shown by the dashed arrows in Figure 1A. This match between our prediction and the clinical data in question (Hunninghake et al., 2009) indicates suitability of SNP_TATA_Comparator for biomedical studies in silico, as highlighted in bold in the first row of Supplementary Table 1. In the two rightmost columns of this table, just below the citation of these clinical data, readers can see other data on MMP12 downregulation as a clinically proven physiological marker of RA (Liu et al., 2004); we found these data by means of keywords in the PubMed database. This finding allows us to predict that the clinical SNP marker (rs2276109) of the reduced risk of asthma is a candidate SNP marker of a reduced risk of RA too, as we highlighted in italics in Supplementary Table 1.
As one can see in Figure 1A, there are eight more SNPs within the analyzed 70-bp proximal promoter region depicted by a double-headed dash-and-dot arrow; two of them can either decrease (rs572527200) or increase (rs1401366377) MMP12 expression (dotted arrows and circles) according to our predictions, as exemplified by Figure 3A. With this in mind, on the basis of the same clinical data (Liu et al., 2004), we predicted two more candidate SNP markers of either decreased (rs572527200) or increased (rs1401366377) risk of RA, as shown in Supplementary Table 1.
Figure 3. Examples of our predictions for human RA-related genes in this work. (A) MMP12: rs1401366377; (B) ESR2: rs35036378; (C) NPY: rs139801169; (D) IL9R: rs56317732; (E) IL1R2: rs72990754; (F) TGFB2: rs546214861.
Likewise, we one-by-one updated the calculation results for those 13 human genes (ACKR1, APOA1, DHFR, F3, F7, HBB, HBD, IL1B, INS, MBL2, NOS2, SOD1, and TPI1) that we have already examined earlier and described in detail in our previous article on RA (Chadaeva I. V. et al., 2019); the updated results on these genes are found in Supplementary Table 1. Finally, after the publication of our previous article on RA (Chadaeva I. V. et al., 2019), within PubMed (Lu, 2011), we found clinically proven SNP disease markers located within 70-bp proximal promoters of the 11 human genes (ADH7, CETP, COMT, ESR2, FGFR2, HSD17B1, HTR2C, MLH1, PDYN, RET, and TGFBR2), as described below for the first time.
Human gene PDYN (i.e., prodynorphin as a basic building block of endorphins inhibiting pain and causing euphoria as hormones/neurotransmitters of joy) has a known SNP marker (rs886056538) of spinocerebellar ataxia when PDYN is in excess (Supplementary Table 1) according to the ClinVar database. Our keyword search in PubMed pointed to a rat model of human diseases, where the pain sensitivity threshold in RA positively correlates with this hormone’s abundance (Zheng et al., 2014). With this in mind, we predicted that rs886056538 is a candidate SNP marker of relieved RA as presented in Supplementary Table 1. As readers can see in this table, in the vicinity of rs886056538, we selected nine SNPs (e.g., rs1195765727), each of which can reduce the PDYN level and thus may be a candidate SNP marker of relieved RA too (Supplementary Table 1).
The human COMT gene encoding catechol-O-methyltransferase bears two SNP markers, rs370819229 and rs777650793, implicated by the ClinVar database in dilated cardiomyopathy and other cardiovascular diseases, respectively, because of COMT downregulation and upregulation calculated in our study (Supplementary Table 1). Within PubMed, we found a short clinical communication (Finan and Zautra, 2013) on a negative correlation between chronic pain sensitivity in RA and COMT. In accordance with these clinical observations, we proposed two known SNP markers of cardiovascular pathologies (rs370819229 and rs777650793) as candidate SNP markers of worsened and alleviated RA, respectively, as shown in Supplementary Table 1. In total, nearby we predicted two more candidate SNP markers (rs901020754 and rs779542396) worsening RA and 13 more candidate SNP markers relieving RA (e.g., rs748298389), all of which are given in Supplementary Table 1.
Human gene RET codes for the Ret proto-oncogene, whereas database ClinVar shows two SNPs, rs10900297 and rs10900296, located within its 70-bp proximal promoter, because they occur in patients with either pheochromocytoma or renal dysplasia (Supplementary Table 1). According to our predictions (Supplementary Table 1), these SNPs can surprisingly cause over- and under-expression of this gene, respectively, as confirmed by two clinical studies (Bridgewater et al., 2008; Sarin et al., 2014) retrieved by our keyword search in PubMed. Moreover, in this way, we found a clinical case report (Townsend et al., 1994) where both pheochromocytoma and RA were mutually complicating diagnoses. For this reason, we suggest two candidate SNP markers (rs10900297 and rs10900296) of complications in RA diagnosis and the same for four other SNPs, which can either significantly reduce the RET level (i.e., rs551321384 and rs1191017949) or significantly elevate it (i.e., rs1237152255 and rs1372293149), as readers can see in Supplementary Table 1.
Human gene MLH1 encodes DNA mismatch repair protein MLH1; in this gene’s promoter, ClinVar lists two SNP markers (rs63750527 and rs756099600) of colon cancer that elevate the MLH1 level, thus preventing cancer cell apoptosis during cancer chemotherapy and an immune response (Supplementary Table 1). We found a retrospective association (Jeong et al., 2017) (comorbidity) of colon cancer and RA. Accordingly, we predicted two candidate SNP markers (rs63750527 and rs756099600) of high risk of comorbidities in RA (Supplementary Table 1). Additionally, near rs63750527 and rs756099600, we identified three more SNPs also increasing both the MLH1 amount and colon cancer risk (e.g., rs752622244) and hence elevating the risk of an RA comorbidity (Supplementary Table 1). In the table, the reader can see three other SNPs that are located within the MLH1 promoter (e.g., rs864622145) and reduce an expression of this gene, thereby possibly accelerating the progression of RA to cancer because of reduced DNA repair (Kullmann et al., 2000).
Human gene ADH7 encoding alcohol dehydrogenase 7 carries a biomedically identified SNP marker (rs17537595) of esophageal cancer; the minor allele of this SNP can reduce ADH7 expression (Abbas et al., 2006) in agreement with our prediction presented in Supplementary Table 1. Our keyword search in PubMed revealed that both a deficit and an excess of this enzyme occur quite often in digestive-tract cancers (Jelski et al., 2008) that are comorbid with RA (Hemminki et al., 2012). That is why, for increased risk of comorbidities of RA, we predicted candidate SNP marker rs17537595 as well as three more markers nearby (rs372329931, rs755152695, and rs1238877951) having the same effect on this gene’s expression (Supplementary Table 1).
The promoter of human gene HSD17B1 contains SNP rs201739205, which decreases the amount of hydroxysteroid (17-β) dehydrogenase 1 encoded by this gene (Supplementary Table 1), as revealed in patients with hereditary breast cancer (Peltoketo et al., 1994), which is discordant to RA (Chen et al., 2019) according to the results of our keyword search in database PubMed. Therefore, we predicted a candidate SNP marker (rs201739205) of low risk of RA (Supplementary Table 1). Additionally, around it, we chose two more SNPs (rs748743528 and rs779674159) that decrease the HSD17B1 level and therefore are candidate SNP markers of an RA risk reduction too (Supplementary Table 1). Finally, within the promoter under study, we predicted three SNPs able to cause HSD17B1 overexpression (e.g., rs1332869256), each of which can also cause breast cancer (He et al., 2016) according to our PubMed search and therefore may be a candidate SNP marker of decreased risk of RA (Supplementary Table 1).
The human ESR2 gene coding for estrogen receptor 2 (β) carries a clinically proven SNP marker (rs35036378) of an ESR2-deficient primary pT1 tumor of the mammary gland (Philips et al., 2012), for which we also correctly predicted ESR2 underexpression, as shown in both Figure 3B and Supplementary Table 1. Our PubMed keyword search yielded a mouse model of human diseases with ESR2-dependent suppression of inflammation (Armstrong et al., 2013). These data allowed us to predict a candidate SNP marker (rs35036378) of reduced suppression of inflammation in RA (Supplementary Table 1). Nearby, we identified the only SNP (rs766797386) that can decrease the ESR2 level and consequently is a candidate SNP marker of reduced suppression of inflammation in RA (Supplementary Table 1).
The human FGFR2 gene (fibroblast growth factor receptor 2) is annotated in the ClinVar database showing two biomedical SNP markers in its promoter: rs886046768 for craniosynostosis and rs387906677 for bent bone dysplasia, which correspond, respectively, to an excess and deficit of this receptor according to our predictions (Supplementary Table 1). As for RA, using our keyword search in PubMed, we found a physiological in vitro model of human angiogenesis, where neovascularization increases with an FGFR2 concentration increase and vice versa (Brown et al., 1996). This finding permits us to predict that all SNPs in this gene’s promoter, which can significantly elevate (e.g., rs886046768) and reduce (e.g., rs387906677) this receptor’s level, are candidate SNP markers of elevated and reduced neovascularization in RA, respectively (Supplementary Table 1).
The promoter of human gene TGFBR2 has a known SNP marker (rs138010137) of aortic thoracic aneurysm according to database ClinVar. Our prediction for this SNP is underexpression of transforming growth factor beta receptor 2 encoded by this gene (Supplementary Table 1). We learned that a TGFBR2 deficit disrupts regulatory T-cell homeostasis in RA (Wang et al., 2018). This allows us to suggest a candidate SNP marker (rs138010137) of higher risk of RA as well as nearby, one more candidate SNP marker (rs1300366819) of the same RA parameter because of a decrease in the TGFBR2 level, according to our prediction (Supplementary Table 1). Finally, in the same way, we found out that synovial-fibroblast proliferation increases with an increase in the TGFBR2 level (Bira et al., 2005). Consequently, yet another SNP (rs1310294304) located within this promoter can be a candidate SNP marker of worsened RA because of TGFBR2 upregulation (Supplementary Table 1).
The human CETP gene codes for plasma lipid transfer protein and has a widely used SNP marker (rs1427119663) of hyperalphalipoproteinemia, which is an 18-bp deletion including the TBP-site in the promoter region of this gene; this mutation causes CETP underexpression (Plengpanich et al., 2011), as described in Supplementary Table 1. Because there is a known phenomenon of CETP-deficient mortality in RA (Ferraz-Amaro et al., 2013), in line with our keyword search in PubMed, we predicted rs1427119663 as a candidate SNP marker of complicated RA (Supplementary Table 1). In its vicinity, we identified five SNPs (e.g., rs17231520), all of which can increase this gene’s expression as a known risk factor of RA (Kim et al., 2016), hence our suggestion to regard them as candidate SNP markers of higher risk of RA (Supplementary Table 1).
Human gene HTR2C coding for serotonin receptor 2C is mentioned in ClinVar owing to its clinically documented SNP marker (rs3813929) of obesity as a complication of olanzapine-based antipsychotic treatment; this marker is associated with an HTR2C excess according to our calculations (Supplementary Table 1). As for RA, using our keyword search via PubMed, we revealed that HTR2C overexpression elevates the risk of RA with obstructive sleep apnea (Jagannathan et al., 2017) in contrast to HTR2C underexpression, which reduces adipogenesis and thus the obesity-related risk of RA (Priyadarshini et al., 2018). Keeping this in mind, we predicted that rs3813929 and five more SNPs nearby (e.g., rs886838672) are candidate SNP markers of the obesity-related complications in RA, as readers can see in Supplementary Table 1.
Predictions Based Upon dbSNP: Build No. 151 Versus Build No. 147
Row 4 of Table 2 sums up the aforementioned predictions on the basis of dbSNP, build No. 151, about the 603 SNPs within 25 genes’ promoters containing biomedical SNP markers, namely, a total number of candidate SNP markers is 154, that is, ∼fourfold higher than 42 for build No. 147 (row 3). In the three rightmost columns, readers can see the statistically significant predominance (a greater number) of candidate SNP markers contributing to RA (i.e., 32 in row 3 and 96 in row 4) over candidate SNP markers preventing RA (i.e., 10 in row 3 and 58 in row 4). This result supports the robustness of our predictions in both compared cases. This predominance of predisposition over resistance to RA fits the notion that diversity of adaptive-immunity memory cells in an individual grows with an increase in the number of diseases patient survived, thus elevating the disease resistance in the future (Sarkander et al., 2016) and the risk of false activation of these cells according to Ashby’s law (Schuldberg, 2015).
Besides, the three central columns illustrate how we verified the robustness of our predictions with the annual growth of dbSNP from the standpoint of both Kimura’s theory (1968) and Haldane’s dilemma (1957), which highlight neutral drift of most SNPs as the cornerstone of the human genome as a whole. As a bioinformatics criterion of neutral drift within regulatory regions of the genome, some authors (Kasowski et al., 2010) first noted empirically and next proposed heuristically to estimate an excess in the number of SNPs damaging protein-binding sites over those improving them. In row 1, readers can see that the genome-wide pattern of SNPs in humans that was identified using ChIP-seq data (The 1000 Genomes Project Consortium (GPC) et al., 2012) fits this neutral drift criterion at a probability rate (P) of >0.99 and actually follows a binomial distribution, indeed. Row 2 presents the genome-wide pattern of the clinically proven biomedical SNP markers near the TBP-sites within human gene promoters according to our predictions made by means of SNP_TATA_Comparator, and this pattern also matches the same criterion of neutral drift. This finding reflects the essence of Haldane’s dilemma (1957), who hypothesized an under-threshold level of the human genetic load. In row 3, there are relevant results of our previous research (Chadaeva I. V. et al., 2019) on the candidate SNP markers of RA within a previous build of dbSNP No. 147; again, they seem to be in agreement with the neutral drift criterion used. As for the candidate SNP markers of RA predicted using build No. 151 of this database (row 4), contrary to all of the above, there is a lower, not higher, number of SNPs damaging TBP-sites than those improving them; this result is insignificant (P > 0.15) and does not allow to rule out their neutral drift completely. This indicates that almost a fourfold greater number of SNPs within the current build of dbSNP relative to the previous one may allow us to detect other genome-wide patterns of candidate SNP markers of RA besides their neutral drift, which is unquestionable (Haldane, 1957; Kimura, 1968). Therefore, we next examined the human genes that are most often associated with RA according to independent sources, as presented in Supplementary Table 2 and described below.
Candidate SNP Markers of RA Near TBP-Sites in Promoters of the Human Protein-Coding Genes That Are Most Often Independently Associated With RA
The human NPY gene (neuropeptide Y) has four SNPs causing an excess of this protein (e.g., Figure 3C: rs139801169) and four SNPs diminishing its level (e.g., rs1223788416), which correspond to either higher or lower risk of obesity that is comorbid with RA (Stofkova et al., 2009), as found in PubMed. Thus, we predicted eight RA-related candidate SNP markers (Supplementary Table 2).
The human CCR6 promoter contains two SNPs rs1433814180 and rs1047738754 that both can decrease the amount of C–C motif chemokine receptor 6 encoded by this gene, as shown in Supplementary Table 2 presenting our predictions. Our keyword search within the PubMed database revealed a CCR6-deficient mouse model of human RA with reduced autoimmunity (Bonelli et al., 2018). This result allows us to propose two candidate SNP markers of decreased risk of RA (Supplementary Table 2).
Human gene CTLA4 (cytotoxic T-lymphocyte associated protein 4) carries four SNPs causing an overabundance this protein (e.g., rs127192924), which is a molecular biomarker of the overlapping autoimmunity proven by a case-control study (AlFadhli, 2013), as indicated in Supplementary Table 2. In addition, within its proximal promoters, there are four other SNPs whose minor alleles reduce the CTLA4 level according to our predictions (Supplementary Table 2: e.g., rs561368432), while there are CTLA4-inducible knockout mice as a laboratory animal model of human RA with an elevated autoimmune response (Alissafi et al., 2017). Considering all of the above, we predicted eight candidate SNP markers of worsened RA.
In total, within 10 human genes CCR6, CTLA4, HLA-A, IL23R, IRF5, NPY, PADI4, PTPN22, STAT4, and TRAF1, we predicted 42 and 27 candidate SNP markers, respectively, corresponding to a high and low risk of RA (Supplementary Table 2). On the other hand, 46 and 23 of all of them can, respectively, improve and damage TBP-sites within the promoters of the RA-related human genes examined, as shown in row 5 of Table 2. Thus, thanks to a fourfold increased number of SNPs within the current build No. 151 of dbSNP, besides the neutral drift toward predisposition to RA in humans (Table 2: rows 3 and 4), we first detected the natural selection against underexpression of the human genes often associated with RA; this selection could indeed elevate the risk of RA, as explained above. In hopes of further detailing this trend of natural selection, we next investigated human immunostimulatory and immunosuppressive genes independently from one another, as presented in Supplementary Tables 3, 4, respectively, and discussed below.
Candidate SNP Markers of RA Near TBP-Sites in Promoters of Human Protein-Coding Immunostimulatory Genes
Human gene IL9R codes for interleukin 9 receptor and includes two SNPs rs56317732 and rs945044791 corresponding to its overexpression and underexpression, respectively, according to our predictions exemplified by Figure 3D as well as to high and low both inflammation and fibroblast-like synoviocyte proliferation in RA (Raychaudhuri et al., 2018). Therefore, we proposed them as candidate SNP markers for aggravated and alleviated RA (Supplementary Table 3).
Looking through Supplementary Table 3, readers can see a significant number of candidate SNP markers of improved TBP-sites than candidate SNP markers of damaged TBP-sites (71 vs. 43) within promoters of eight human immunostimulatory genes ATF3, CCR7, IL3RA, IL9R, IL25, LCK, NFKB1, and ZBTB38 (p < 0.01, binomial distribution), as summarized in row 6 of Table 2. This corresponds to a significant predominance of 71 candidate SNP markers increasing RA-related risks over 43 such markers reducing these risks (p < 0.01), as presented ibid. Therefore, we can conclude that there is the pressure of natural selection on human immunostimulatory genes, and its direction both prevents their underexpression and elevates the human predisposition to RA, thus fitting the trends in the abovementioned rows 3, 4, and 5.
Candidate SNP Markers of RA Near TBP-Sites in the Promoters of Human Protein-Coding Immunosuppressive Genes
Human IL1R2 gene promoters contain two SNPs rs960068265 and rs946299576 able to increase the level of interleukin 2 receptor subunit β encoded by this gene as well as two other SNPs rs72990754 and rs960678696 capable of decreasing this level, as exemplified in Figure 3E. Using a keyword search in the PubMed database, we found a murine laboratory model of human diseases (Ocsko et al., 2018) where epigenetic silencing of this anti-inflammatory gene supported RA. Accordingly, Supplementary Table 4 presents four RA-related candidate SNP markers within this gene’s promoters, as predicted here.
The human gene TGFB2 promoter has only one SNP, rs546214861, that can lower the production of transforming growth factor beta 2 (synonym: glioblastoma-derived T-cell suppressor factor) encoded by this gene (Figure 3F), thereby improving the healing of bone-related tissues in inflammatory RA (Um et al., 2018) as a candidate SNP marker of RA alleviation (Supplementary Table 4).
If we look through Supplementary Table 4, there are more (104 vs. 75) candidate SNP markers corresponding to enhancement of (than damage to) TBP-sites of 25 human immunosuppressive genes BCL6, CD4, CNMD, EBI3, FGF21, GAS6, GDF5, DUSP1, FGF22, FOXP3, IL1R2, IL2RA, IL2RB, IL4, IL10, IL10RA, IL10RB, IRF2, IRF4, IRF8, PDCD1, PIAS1, SOCS3, TGFB2, and TNFRSF8 (Table 2, row 7: p < 0.025, binomial distribution). This means natural-selection pressure directed against underexpression of the human immunosuppressive genes, consistently with the abovementioned rows 6 and 7 of Table 2; these rows correspond to genes often associated with RA and immunostimulatory genes, respectively. As for predisposition to RA, contrary to all the abovementioned rows 3, 4, 5, and 6 of Table 2, when summarizing Supplementary Table 4, we were surprised by the significant predominance (109 over 70) of candidate SNP markers corresponding to alleviation over aggravation of RA (p < 0.01) binomial distribution rather than vice versa, as presented in row 7 of Table 2. This result indicates that natural-selection pressure on the human immunosuppressive genes is directed toward resistance to RA rather than predisposition to RA presented in rows 3, 4, 5, and 6 of this table. First, within row 7 compared with rows 3 and 4 of Table 2, readers can see two statistically significant genome-wide patterns opposite to each other, namely: (1) neutral drift increasing RA-related risks and (2) natural selection decreasing them; superposition of these patterns can stabilize them and thus establish the normal level of RA-related risks as a human trait. Finally, rows 5, 6, and 7 indicate the only consistently bidirectional natural selection at the whole-genome scale [either (1) for RA resistance in case of immunosuppressive genes or (2) for RA predisposition in case of immunostimulatory and all remaining genes] that can disrupt the RA norm in humans as if self-domestication (Theofanopoulou et al., 2017) has occurred with its disruptive natural selection (Belyaev, 1979). According to the mainstream point of view on Human Origins, there is not enough scientific evidence that this could actually happen. In this work, we verified the bidirectional natural selection on the human genome-wide scale simultaneously for resistance and predisposition to RA during their comparison with publicly available data on DEGs in pets vs. wild animals, as presented below.
In vivo Validation of Our Predictions Using DEGs Within Pets Versus Wild Animals
Here, we compared 68 human genes within whose promoters there are RA-related candidate SNP markers predicted in this work (Supplementary Tables 1–4, the essence of which is Supplementary Table 5) with 1740 DEGs of pets vs. wild animals (Albert et al., 2012; Hekman et al., 2018; Yang et al., 2018). Their description is given in Table 1, as depicted in Figure 2. The obtained results are presented in Table 3.
Table 3. Comparing the effects of the orthologous-gene expression changes on rheumatoid arthritis (RA) in humans and during the divergence of domestic and wild animals from their nearest common ancestor.
First of all, as readers can see in Table 3, underexpression of three human genes ESR2, IL1R2, and F7 corresponds to attenuated ESR2-dependent suppression (Armstrong et al., 2013), increased inflammation (Ocsko et al., 2018), and hemorrhagic forms of RA (Thornorsteinsson et al., 2004); this pattern is consistent with underexpression of their orthologous animal genes during domestication and vice versa. Besides, overexpression of five human genes CCR6, IL9R, NPY, IL1B, and TGFB2 elevates the risk of RA (Jatczak-Pawlik et al., 2020), inflammation (Raychaudhuri et al., 2018), obesity (Stofkova et al., 2009), and circadian pain (Olkkonen et al., 2015) and inhibits bone repair (Um et al., 2018), respectively; these data match the pattern of overexpression of their orthologous genes during animal domestication, and vice versa. Additionally, both underexpression and overexpression of two human genes CETP and HBB contribute to RA, as shown in Table 3. Finally, overexpression of only human gene PDYN in our gene set reduces pain sensitivity (Zheng et al., 2014), thus relieving RA, and in only this case in our study corresponds to the orthologous gene’s overexpression during the guinea pig domestication and vice versa (Table 3).
Table 4 sums up the findings of the comparative analysis of the above orthologous genes from humans and animals, namely, 10 and 1 of these domestic-animal DEGs were found to correspond to human gene-markers of aggravated and relieved RA, and the same is true for three and eight DEGs in the wild animals. Therefore, the DEGs in domestic animals are significantly consistent with their human orthologous genes that contribute to RA, according to Pearson’s χ2 test (p < 0.01), Fisher’s exact test (p < 0.05), and binomial distribution (p < 0.01). Finally, Table 4 indicates that the DEGs of wild animals correspond equally to human orthologous genes that aggravate and relieve RA (p > 0.1, binomial distribution) in agreement with the conventional wild-type norm.
Table 4. Correlations between the effects of codirected changes in the expression of orthologous genes on rheumatoid arthritis (RA) in humans and during the divergence of domestic and wild animals from their nearest common ancestor.
These findings mean that during domestication, the anthropogenic environment, in contrast to a natural environment, may alter gene expression in the animals on the genome-wide scale (e.g., immunostimulatory and immunosuppressive genes) in a manner that more often contributes to RA than not.
Conclusion
Because it is best to study TBP-sites genome-wide, we created SNP_TATA_Comparator and applied it to build No. 147 dbSNP of 2016 to predict candidate SNP markers of RA (Chadaeva I. V. et al., 2019) as a cause of disability (Koller and Nobauer-Huhmann, 2009). The robustness of this tool was verified here with the growth of the SNP number, as seen in dbSNP build No. 151 of 2017. Here we analyzed a fourfold higher SNP number allowing us to detect previously unknown whole-genome SNP patterns besides neutral drift (Haldane, 1957; Kimura, 1968) in relation to RA predisposition because diversity of adaptive-immunity memory cells grows with an increase in the number of diseases experienced, thereby enhancing both disease resistance (Sarkander et al., 2016) and the risk of false activation of these cells according to Ashby’s law of requisite variety (Schuldberg, 2015). That is why we additionally investigated both immunostimulatory genes and genes quite often linked to RA, and we noted natural selection against underexpression of these genes in the same direction, toward predisposition to RA. Likewise, we analyzed immunosuppressive genes and surprisingly observed the same natural selection against underexpression of these genes, which nevertheless acts in the opposite direction, toward resistance to RA. Overall, the natural-selection pressure seems bidirectional, e.g.: (1) on immunosuppressive genes toward RA alleviation and (2) on immunostimulatory gene toward RA aggravation, suggesting that self-domestication events (Theofanopoulou et al., 2017) have happened in humans because of the disruptive natural selection (Belyaev, 1979) found in our study. It is common knowledge that the use of gene promoters alone is not enough for the analysis of modes of evolution as a whole. Belyaev (1979) defined the domestication-related disruptive selection as “… what may be selected for are changes in the regulation of genes — that is, in the timing and the amount of gene expression rather than changes in individual structural genes” (Belyaev, 1979). This is a microevolution event when domestic and wild populations of the same species diverge from their nearest common ancestor, to which it is appropriate to compare our human-promoter-based predictions in silico (Ponomarenko et al., 2015) and the DEGs of domestic vs. wild animals in vivo, as demonstrated recently (Vasiliev et al., 2021).
In accordance with the mainstream opinion that scientific evidence for the human self-domestication is insufficient, we tested our predictions of relief and aggravation in RA using public data on 1740 DEGs in pets vs. wild animals (Albert et al., 2012; Hekman et al., 2018; Yang et al., 2018) as a bioinformatic animal model of human diseases. Among the DEGs examined, this approach yielded 10 and 1 DEGs that correspond to alleviation and aggravation of RA in pets, in contrast to three and 8 DEGs in the case of wild animals. Consequently, during domestication, the anthropogenic habitat conditions in comparison with a natural environment may change gene expression in the animals on the whole-genome scale (e.g., immunostimulatory and immunosuppressive genes) and thus contribute more often to RA according to three independent statistical criteria, such as Pearson’s χ2 test (p < 0.01), Fisher’s exact test (p < 0.05), and the binomial distribution test (p < 0.01). This finding allows us to propose RA as a candidate symptom within a self-domestication syndrome (Theofanopoulou et al., 2017). Such syndrome might be considered as a human’s payment with health for the benefits received during evolution.
Besides, the RA-related candidate SNP markers predicted here, which are expected to survive obligatory clinical “case-control” studies in the future, may become useful for physicians for optimizing the treatment of patients according to their individual sequenced genome reducing RA-related risks.
The presented verification of SNP_TATA_Comparator’s predictions vis-à-vis the semiquantitative RNA-Seq data is statistically significant. Therefore, the next step of further comprehensive experimental verification of SNP_TATA_Comparator’s biomedical predictions by means of genome-wide data on QTLs [e.g., in human cardiopathology (Koopmann et al., 2014)] seems to be justified and timely.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.
Author Contributions
NAK and VK contributed to study conception. MP wrote the manuscript. EO, IC, PP, DO, and DR contributed to the development and optimization of the software. NVK, ES, ID, and LS contributed to the data analysis. All authors contributed to the article and approved the submitted version.
Funding
The concept was supported by the Russian Federal Science & Technology Program for the Development of Genetic Technologies (for NAK and VK). The software development, manuscript writing, and data analysis were supported by Project No. 0259-2021-0009 from the Russian Government Budget (for EO, IC, PP, DO, DR, NVK, ES, ID, and LS). All authors declare that these funding bodies did not play roles in the design of the study; in the collection, analysis, and interpretation of the data; and in the writing of the manuscript.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We are thankful to Shevchuk Editing (http://www.shevchuk-editing.com) (Brooklyn, NY, United States) for English editing.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2021.610774/full#supplementary-material
Supplementary File 1 : Supplementary Methods | An estimate of the affinity of TATA-binding protein (TBP) for a 70 bp proximal promoter in human genes.
Supplementary File 2 : Supplementary Results | Supplementary Tables 1–5. Candidate SNP markers of RA predicted in this work near TBP-binding sites within a promoter of human protein-coding genes.
Supplementary File 3 : Supplementary Keyword Search | Supplementary Figure 1. A decision logic of the keyword search for RA-related studies in the PubMed database (Lu, 2011).
Footnotes
References
Abbas, A., Lechevrel, M., and Sichel, F. (2006). Identification of new single nucleotid polymorphisms (SNP) in alcohol dehydrogenase class IV ADH7 gene within a French population. Arch. Toxicol. 80, 201–205. doi: 10.1007/s00204-005-0031-7
Albert, F. W., Somel, M., Carneiro, M., Aximu-Petri, A., Halbwax, M., Thalmann, O., et al. (2012). A comparison of brain gene expression levels in domesticated and wild animals. PLoS Genet. 8:e1002962. doi: 10.1371/journal.pgen.1002962
AlFadhli, S. (2013). Overexpression and secretion of the soluble CTLA-4 splice variant in various autoimmune diseases and in cases with overlapping autoimmunity. Genet. Test. Mol. Biomarkers 17, 336–341. doi: 10.1089/gtmb.2012.0391
Alissafi, T., Banos, A., Boon, L., Sparwasser, T., Ghigo, A., Wing, K., et al. (2017). Tregs restrain dendritic cell autophagy to ameliorate autoimmunity. J. Clin. Invest. 127, 2789–2804. doi: 10.1172/JCI92079
Alsaed, O., Hadwan, N., Khanjar, I., and Al-Allaf, A.-W. (2018). Seronegative bilateral symmetrical inflammatory polyarthritis: think twice before starting immunosuppression. Eur. J. Case Rep. Intern. Med. 5:000895. doi: 10.12890/2018_000895
Amberger, J. S., Bocchini, C. A., Schiettecatte, F., Scott, A. F., and Hamosh, A. (2015). OMIM.org: online Mendelian Inheritance in Man (OMIM§), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 43, D789–D798. doi: 10.1093/nar/gku1205
Armstrong, C. M., Billimek, A. R., Allred, K. F., Sturino, J. M., Weeks, B. R., and Allred, C. D. (2013). A novel shift in estrogen receptor expression occurs as estradiol suppresses inflammation-associated colon tumor formation. Endocr. Relat. Cancer 20, 515–525. doi: 10.1530/erc-12-0308
Belyaev, D. K. (1979). The Wilhelmine E. Key 1978 invitational lecture. Destabilizing selection as a factor in domestication. J. Hered. 70, 301–308. doi: 10.1093/oxfordjournals.jhered.a109263
Bhuiyan, T., and Timmers, H. T. M. (2019). Promoter recognition: putting TFIID on the spot. Trends Cell Biol. 29, 752–763. doi: 10.1016/j.tcb.2019.06.004
Bira, Y., Tani, K., Nishioka, Y., Miyata, J., Sato, K., Hayashi, A., et al. (2005). Transforming growth factor beta stimulates rheumatoid synovial fibroblasts via the type II receptor. Mod. Rheumatol. 15, 108–113. doi: 10.1007/s10165-004-0378-2
Bisoendial, R. J., Levi, M., Tak, P. P., and Stroes, E. S. (2010). The prothrombotic state in rheumatoid arthritis: an additive risk factor for adverse cardiovascular events. Semin. Thromb. Hemost. 36, 452–457. doi: 10.1055/s-0030-1254054
Bonelli, M., Puchner, A., Goschl, L., Hayer, S., Niederreiter, B., Steiner, G., et al. (2018). CCR6 controls autoimmune but not innate immunity-driven experimental arthritis. J. Cell. Mol. Med. 22, 5278–5285. doi: 10.1111/jcmm.13783
Bridgewater, D., Cox, B., Cain, J., Lau, A., Athaide, V., Gill, P. S., et al. (2008). Canonical WNT/beta-catenin signaling is required for ureteric branching. Dev. Biol. 317, 83–94. doi: 10.1016/j.ydbio.2008.02.010
Brown, K. J., Maynes, S. F., Bezos, A., Maguire, D. J., Ford, M. D., and Parish, C. R. (1996). A novel in vitro assay for human angiogenesis. Lab. Invest. 75, 539–555.
Bucher, P. (1990). Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. J. Mol. Biol. 212, 563–578. doi: 10.1016/0022-2836(90)90223-9
Chadaeva, I., Ponomarenko, P., Rasskazov, D., Sharypova, E., Kashina, E., Kleshchev, M., et al. (2019). Natural selection equally supports the human tendencies in subordination and domination: a genome-wide study with in silico confirmation and in vivo validation in mice. Front. Genet. 10:73. doi: 10.3389/fgene.2019.00073
Chadaeva, I. V., Rasskazov, D. A., Sharypova, E. B., Drachkova, I. A., Oshchepkova, E. A., Savinkova, L. K., et al. (2019). Candidate SNP-markers of rheumatoid arthritis that can significantly alter the affinity of the TATA-binding protein for human gene promoters. Vavilov. Zh. Genet. Selektsii. 23, 1047–1058. doi: 10.18699/vj19.586
Chen, H. H., Lin, C. H., Chen, D. Y., Chao, W. C., Chen, Y. H., Hung, W. T., et al. (2019). Risk of major autoimmune diseases in female breast cancer patients: a nationwide, population-based cohort study. PLoS One 14:e0222860. doi: 10.1371/journal.pone.0222860
Day, I. N. (2010). dbSNP in the detail and copy number complexities. Hum. Mutat. 31, 2–4. doi: 10.1002/humu.21149
Deplancke, B., Alpern, D., and Gardeux, V. (2016). The genetics of transcription factor DNA binding variation. Cell 166, 538–554. doi: 10.1016/j.cell.2016.07.012
Dreos, R., Ambrosini, G., Groux, R., Perier, R. C., and Bucher, P. (2017). The eukaryotic promoter database in its 30th year: focus on non-vertebrate organisms. Nucleic Acids Res. 45, D51–D55. doi: 10.1093/nar/gkw1069
Drobiecki, A., Pasiarski, M., Hus, I., Sokolowska, B., and Wątek, M. (2013). Acquired hemophilia in the patient suffering from rheumatoid arthritis: case report. Blood Coagul. Fibrinolysis 24, 874–880. doi: 10.1097/mbc.0b013e3283646635
Erlandsson, M. C., Doria Medina, R., Toyra Silfversward, S., and Bokarewa, M. I. (2016). Smoking functions as a negative regulator of IGF1 and impairs adipokine network in patients with rheumatoid arthritis. Mediators Inflamm. 2016:3082820. doi: 10.1155/2016/3082820
Ferraz-Amaro, I., Gonzalez-Gay, M. A., Garcia-Dopico, J. A., and Diaz-Gonzalez, F. (2013). Cholesteryl ester transfer protein in patients with rheumatoid arthritis. J. Rheumatol. 40, 1040–1047. doi: 10.3899/jrheum.121507
Finan, P. H., and Zautra, A. J. (2013). Rheumatoid arthritis: stress affects rheumatoid arthritis, but via what mechanisms? Nat. Rev. Rheumatol. 9, 569–570. doi: 10.1038/nrrheum.2013.139
Flatters, D., and Lavery, R. (1998). equence-dependent dynamics of TATA-Box binding sites. Biophys. J. 75, 372–381. doi: 10.1016/S0006-3495(98)77521-6
Frey, O., Meisel, J., Hutloff, A., Bonhagen, K., Bruns, L., Kroczek, R. A., et al. (2010). Inducible costimulator (ICOS) blockade inhibits accumulation of polyfunctional T helper 1/T helper 17 cells and mitigates autoimmune arthritis. Ann. Rheum. Dis. 69, 1495–1501. doi: 10.1136/ard.2009.119164
Gall, T., Petho, D., Nagy, A., Hendrik, Z., Mehes, G., Potor, L., et al. (2018). Heme induces endoplasmic reticulum stress (HIER stress) in human aortic smooth muscle cells. Front. Physiol. 9:1595. doi: 10.3389/fphys.2018.01595
Giakoumi, X., Tsironi, M., Floudas, C., Polymeropoylos, E., Papalambros, E., and Aessopos, A. (2005). Rheumatoid arthritis in thalassemia intermedia: coincidence or association? Isr. Med. Assoc. J. 7, 667–669.
Godde, J. S., Nakatani, Y., and Wolffe, A. P. (1995). The amino-terminal tails of the core histones and the translational position of the TATA box determine TBP/TFIIA association with nucleosomal DNA. Nucleic Acids Res. 23, 4557–4564. doi: 10.1093/nar/23.22.4557
Haeussler, M., Raney, B. J., Hinrichs, A. S., Clawson, H., Zweig, A. S., Karolchik, D., et al. (2015). Navigating protected genomics data with UCSC Genome Browser in a Box. Bioinformatics 31, 764–766. doi: 10.1093/bioinformatics/btu712
Hahn, S., Buratowski, S., Sharp, P., and Guarente, L. (1989). Yeast TATA-binding protein TFIID binds to TATA elements with both consensus and nonconsensus DNA sequences. Proc. Natl. Acad. Sci. U. S. A. 86, 5718–5722. doi: 10.1073/pnas.86.15.5718
Haldane, J. B. S. (1957). The cost of natural selection. J. Genet. 55, 511–524. doi: 10.1007/bf02984069
He, W., Gauri, M., Li, T., Wang, R., and Lin, S. X. (2016). Current knowledge of the multifunctional 17β-hydroxysteroid dehydrogenase type 1 (HSD17B1). Gene 588, 54–61. doi: 10.1016/j.gene.2016.04.031
Hekman, J. P., Johnson, J. L., Edwards, W., Vladimirova, A. V., Gulevich, R. G., Ford, A. L., et al. (2018). Anterior pituitary transcriptome suggests differences in ACTH release in tame and aggressive foxes. G3 (Bethesda) 8, 859–873. doi: 10.1534/g3.117.300508
Hemminki, K., Liu, X., Ji, J., Sundquist, J., and Sundquist, K. (2012). Effect of autoimmune diseases on mortality and survival in subsequent digestive tract cancers. Ann. Oncol. 23, 2179–2184. doi: 10.1093/annonc/mdr590
Ho, M. F., and Weinshilboum, R. M. (2017). Immune mediator pharmacogenomics: TCL1A SNPs and estrogen-dependent regulation of inflammation. J. Nat. Sci. 3:e416.
Hunninghake, G., Cho, M., Tesfaigzi, Y., Soto-Quiros, M., Avila, L., Lasky-Su, J., et al. (2009). MMP12, lung function, and COPD in high-risk populations. N. Engl. J. Med. 361, 2599–2608. doi: 10.1056/nejmoa0904006
Jagannathan, R., Seixas, A., St-Jules, D., Jagannathan, L., Rogers, A., Hu, L., et al. (2017). Systems biology genetic approach identifies serotonin pathway as a possible target for obstructive sleep apnea: results from a literature search review. Sleep Disord. 2017:6768323. doi: 10.1155/2017/6768323
Jatczak-Pawlik, I., Wolinski, P., KsiAZek-Winiarek, D., Pietruczuk, M., and Glabinski, A. (2020). CCR6 blockade on regulatory T cells ameliorates experimental model of multiple sclerosis. Cent. Eur. J. Immunol. 45, 256–266. doi: 10.5114/ceji.2020.101241
Jelski, W., Chrostek, L., Zalewski, B., and Szmitkowski, M. (2008). Alcohol dehydrogenase (ADH) isoenzymes and aldehyde dehydrogenase (ALDH) activity in the sera of patients with gastric cancer. Dig. Dis. Sci. 53, 2101–2105. doi: 10.1007/s10620-007-0135-4
Jeong, H., Baek, S. Y., Kim, S. W., Eun, Y. H., Kim, I. Y., Kim, H., et al. (2017). Comorbidities of rheumatoid arthritis: results from the Korean National Health and Nutrition Examination Survey. PLoS One 12:e0176260. doi: 10.1371/journal.pone.0176260
Jones, O. Y., Spencer, C. H., Bowyer, S. L., Dent, P. B., Gottlieb, B. S., and Rabinovich, C. E. (2006). A multicenter case-control study on predictive factors distinguishing childhood leukemia from juvenile rheumatoid arthritis. Pediatrics 117, e840–4. doi: 10.1542/peds.2005-1515
Karas, H., Knuppel, R., Schulz, W., Sklenar, H., and Wingender, E. (1996). Combining structural analysis of DNA with search routines for the detection of transcription regulatory elements. Comput. Appl. Biosci. 12, 441–446. doi: 10.1093/bioinformatics/12.5.441
Karlson, E. W., Mandl, L. A., Hankinson, S. E., and Grodstein, F. (2004). Do breast-feeding and other reproductive factors influence future risk of rheumatoid arthritis? Results from the Nurses’ Health Study. Arthritis Rheum. 50, 3458–3467. doi: 10.1002/art.20621
Kasowski, M., Grubert, F., Heffelfinger, C., Hariharan, M., Asabere, A., Waszak, S. M., et al. (2010). Variation in transcription factor binding among humans. Science 328, 232–235. doi: 10.1126/science.1183621
Kim, J. Y., Lee, E. Y., Park, J. K., Song, Y. W., Kim, J. R., and Cho, K. H. (2016). Patients with rheumatoid arthritis show altered lipoprotein profiles with dysfunctional high-density lipoproteins that can exacerbate inflammatory and atherogenic process. PLoS One 11:e0164564. doi: 10.1371/journal.pone.0164564
Kimura, M. (1968). Evolutionary rate at the molecular level. Nature 217, 624–626. doi: 10.1038/217624a0
Klareskog, L., Padyukov, L., Lorentzen, J., and Alfredsson, L. (2006). Mechanisms of disease: genetic susceptibility and environmental triggers in the development of rheumatoid arthritis. Nat. Clin. Pract. Rheumatol. 2, 425–433. doi: 10.1038/ncprheum0249
Koller, M., and Nobauer-Huhmann, I. (2009). Early arthritis: action desired - treatment required. Wien. Med. Wochenschr. 159, 66–69. doi: 10.1007/s10354-009-0653-0
Koopmann, T. T., Adriaens, M. E., Moerland, P. D., Marsman, R. F., Westerveld, M. L., Lal, S., et al. (2014). Genome-wide identification of expression quantitative trait loci (eQTLs) in human heart. PLoS One 9:e97380. doi: 10.1371/journal.pone.0097380
Krasselt, M., and Baerwald, C. (2017). Sex, symptom severity, and quality of life in rheumatology. Clin. Rev. Allergy Immunol. 56, 346–361. doi: 10.1007/s12016-017-8631-6
Kullmann, F., Widmann, T., Kirner, A., Justen, H. P., Wessinghage, D., Dietmaier, W., et al. (2000). Microsatellite analysis in rheumatoid arthritis synovial fibroblasts. Ann. Rheum. Dis. 59, 386–389. doi: 10.1136/ard.59.5.386
Landrum, M. J., Lee, J. M., Riley, G. R., Jang, W., Rubinstein, W. S., Church, D. M., et al. (2014). ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 42, D980–D985. doi: 10.1093/nar/gkt1113
Lawson, C. A., Brown, A. K., Bejarano, V., Douglas, S. H., Burgoyne, C. H., Greenstein, A. S., et al. (2006). Early rheumatoid arthritis is associated with a deficit in the CD4+CD25high regulatory T cell population in peripheral blood. Rheumatology 45, 1210–1217. doi: 10.1093/rheumatology/kel089
Lim, D. S., and Bae, Y. S. (2011). Metastatic lymph node 51 and fibroblast-like synoviocyte hyperproliferation in rheumatoid arthritis pathogenesis. Rheumatol. Int. 31, 843–847. doi: 10.1007/s00296-011-1818-x
Liu, M., Sun, H., Wang, X., Koike, T., Mishima, H., Ikeda, K., et al. (2004). Association of increased expression of macrophage elastase (matrix metalloproteinase 12) with rheumatoid arthritis. Arthritis Rheum. 50, 3112–3117. doi: 10.1002/art.20567
Lu, Z. (2011). PubMed and beyond: a survey of web tools for searching biomedical literature. Database 2011:baq036. doi: 10.1093/database/baq036
Malemud, C. J. (2018). Defective T-cell apoptosis and t-regulatory cell dysfunction in rheumatoid arthritis. Cells 7:223. doi: 10.3390/cells7120223
Malm, K., Bremander, A., Arvidsson, B., Andersson, M. L., Bergman, S., and Larsson, I. (2016). The influence of lifestyle habits on quality of life in patients with established rheumatoid arthritis-A constant balancing between ideality and reality. Int. J. Qual. Stud. Health Well-being 11:30534. doi: 10.3402/qhw.v11.30534
Martianov, I., Viville, S., and Davidson, I. (2002). RNA polymerase II transcription in murine cells lacking the TATA binding protein. Science 298, 1036–1039. doi: 10.1126/science.1076327
Mitsuyasu, H., Izuhara, K., Mao, X., Gao, P., Arinobu, Y., Enomoto, T., et al. (1998). Ile50Val variant of IL4R alpha upregulates IgE synthesis and associates with atopic asthma. Nat. Genet. 19, 119–120. doi: 10.1038/472
Mogno, I., Vallania, F., Mitra, R. D., and Cohen, B. A. (2010). TATA is a modular component of synthetic promoters. Genome Res. 20, 1391–1397. doi: 10.1101/gr.106732.110
Nair, N., Wilson, A. G., and Barton, A. (2017). DNA methylation as a marker of response in rheumatoid arthritis. Pharmacogenomics 18, 1323–1332. doi: 10.2217/pgs-2016-0195
Ocsko, T., Toth, D. M., Hoffmann, G., Tubak, V., Glant, T. T., and Rauch, T. A. (2018). Transcription factor Zbtb38 downregulates the expression of anti-inflammatory IL1r2 in mouse model of rheumatoid arthritis. Biochim. Biophys. Acta Gene Regul. Mech. 1861, 1040–1047. doi: 10.1016/j.bbagrm.2018.09.007
Okada, Y., Wu, D., Trynka, G., Raj, T., Terao, C., Ikari, K., et al. (2014). Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381. doi: 10.1038/nature12873
Olkkonen, J., Kouri, V. P., Hynninen, J., Konttinen, Y. T., and Mandelin, J. (2015). Differentially Expressed in Chondrocytes 2 (DEC2) Increases the Expression of IL-1β and is abundantly present in synovial membrane in rheumatoid arthritis. PLoS One 10:e0145279. doi: 10.1371/journal.pone.0145279
Oshchepkov, D., Ponomarenko, M., Klimova, N., Chadaeva, I., Bragin, A., Sharypova, E., et al. (2019). A rat model of human behavior provides evidence of natural selection against underexpression of aggressiveness-related genes in humans. Front. Genet. 10:1267. doi: 10.3389/fgene.2019.01267
Peltoketo, H., Piao, Y., Mannermaa, A., Ponder, B., Isomaa, V., Poutanen, M., et al. (1994). A point mutation in the putative TATA box, detected in nondiseased individuals and patients with hereditary breast cancer, decreases promoter activity of the 17 beta-hydroxysteroid dehydrogenase type 1 gene 2 (EDH17B2) in vitro. Genomics 23, 250–252. doi: 10.1006/geno.1994.1487
Philips, S., Richter, A., Oesterreich, S., Rae, J. M., Flockhart, D. A., Perumal, N. B., et al. (2012). Functional characterization of a genetic polymorphism in the promoter of the ESR2 gene. Horm. Cancer 3, 37–43. doi: 10.1007/s12672-011-0086-2
Plengpanich, W., Le Goff, W., Poolsuk, S., Julia, Z., Guerin, M., and Khovidhunkit, W. (2011). CETP deficiency due to a novel mutation in the CETP gene promoter and its effect on cholesterol efflux and selective uptake into hepatocytes. Atherosclerosis 216, 370–373. doi: 10.1016/j.atherosclerosis.2011.01.051
Pocai, B. (2019). The ICD-11 has been adopted by the World Health Assembly. World Psychiatry 18, 371–372. doi: 10.1002/wps.20689
Ponomarenko, M., Mironova, V., Gunbin, K., and Savinkova, L. (2013). “Hogness Box,” in Brenner’s Encyclopedia of Genetics, 2nd Edn, eds S. Maloy and K. Hughes (Cambridge: Academic Press), 491–494. doi: 10.1016/B978-0-12-374984-0.00720-8
Ponomarenko, M., Ponomarenko, J., Frolov, A., Podkolodny, N., Savinkova, L., Kolchanov, N., et al. (1999). Identification of sequence-dependent features correlating to activity of DNA sites interacting with proteins. Bioinformatics 15, 687–703. doi: 10.1093/bioinformatics/15.7.687
Ponomarenko, M., Rasskazov, D., Arkova, O., Ponomarenko, P., Suslov, V., Savinkova, L., et al. (2015). How to use SNP_TATA_Comparator to find a significant change in gene expression caused by the regulatory SNP of this gene’s promoter via a change in affinity of the TATA-binding protein for this promoter. Biomed. Res. Int. 2015:359835. doi: 10.1155/2015/359835
Ponomarenko, M., Rasskazov, D., Chadaeva, I., Sharypova, E., Ponomarenko, P., Arkova, O., et al. (2017). SNP_TATA_Comparator: genomewide landmarks for preventive personalized medicine. Front. Biosci. 9, 276–306. doi: 10.2741/s488
Ponomarenko, P., Rasskazov, D., Suslov, V., Sharypova, E., Savinkova, L., Podkolodnaya, O., et al. (2016). Candidate SNP markers of chronopathologies are predicted by a significant change in the affinity of TATA-binding protein for human gene promoters. Biomed. Res. Int. 2016:8642703. doi: 10.1155/2016/8642703
Ponomarenko, P. M., Savinkova, L. K., Drachkova, I. A., Lysova, M. V., Arshinova, T. V., Ponomarenko, M. P., et al. (2008). A step-by-step model of TBP/TATA box binding allows predicting human hereditary diseases by single nucleotide polymorphism. Dokl. Biochem. Biophys. 419, 88–92. doi: 10.1134/S1607672908020117
Priyadarshini, S., Pradhan, B., Griebel, P., and Aich, P. (2018). Cortisol regulates immune and metabolic processes in murine adipocytes and macrophages through HTR2c and HTR5a serotonin receptors. Eur. J. Cell Biol. 97, 483–492. doi: 10.1016/j.ejcb.2018.07.004
Putlyaeva, L. V., Demin, D. E., Korneev, K. V., Kasyanov, A. S., Tatosyan, K. A., Kulakovskiy, I. V., et al. (2018). Potential markers of autoimmune diseases, alleles rs115662534(T) and rs548231435(C), disrupt the binding of transcription factors STAT1 and EBF1 to the regulatory elements of human CD40 gene. Biochemistry 83, 1534–1542. doi: 10.1134/S0006297918120118
Raychaudhuri, S. K., Abria, C., Maverakis, E. M., and Raychaudhuri, S. P. (2018). IL-9 receptor: regulatory role on FLS and pannus formation. Cytokine 111, 58–62. doi: 10.1016/j.cyto.2018.08.001
Rhee, H. S., and Pugh, B. F. (2012). Genome-wide structure and organization of eukaryotic pre-initiation complexes. Nature 483, 295–301. doi: 10.1038/nature10799
Rzepecka, J., Pineda, M. A., Al-Riyami, L., Rodgers, D. T., Huggan, J. K., Lumb, F. E., et al. (2015). Prophylactic and therapeutic treatment with a synthetic analogue of a parasitic worm product prevents experimental arthritis and inhibits IL-1β production via NRF2-mediated counter-regulation of the inflammasome. J. Autoimmun. 60, 59–73. doi: 10.1016/j.jaut.2015.04.005
Samet, H. (1985). A top-down quadtree traversal algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 7, 94–98. doi: 10.1109/tpami.1985.4767622
Sarin, S., Boivin, F., Li, A., Lim, J., Svajger, B., Rosenblum, N. D., et al. (2014). β-Catenin overexpression in the metanephric mesenchyme leads to renal dysplasia genesis via cell-autonomous and non-cell-autonomous mechanisms. Am. J. Pathol. 184, 1395–1410. doi: 10.1016/j.ajpath.2014.01.018
Sarkander, J., Hojyo, S., and Tokoyoda, K. (2016). Vaccination to gain humoral immune memory. Clin. Transl. Immunol. 5:e120. doi: 10.1038/cti.2016.81
Sato, K., Takahashi, N., Kato, T., Matsuda, Y., Yokoji, M., Yamada, M., et al. (2017). Aggravation of collagen-induced arthritis by orally administered Porphyromonas gingivalis through modulation of the gut microbiota and gut immune system. Sci. Rep. 7:6955. doi: 10.1038/s41598-017-07196-7
Schuldberg, D. (2015). What is optimum variability? Nonlinear Dynamics Psychol. Life Sci. 19, 553–568.
Scott, D. L., Wolfe, F., and Huizinga, T. W. (2010). Rheumatoid arthritis. Lancet 376, 1094–1108. doi: 10.1016/S0140-6736(10)60826-4
Smolen, J. S., and Aletaha, D. (2015). Rheumatoid arthritis therapy reappraisal: strategies, opportunities and challenges. Nat. Rev. Rheumatol. 11, 276–289. doi: 10.1038/nrrheum.2015.8
Smolen, J. S., Aletaha, D., and McInnes, I. B. (2016). Rheumatoid arthritis. Lancet 388, 2023–2038. doi: 10.1016/S0140-6736(16)30173-8
Somers, T. J., Wren, A. A., Blumenthal, J. A., Caldwell, D., Huffman, K. M., and Keefe, F. J. (2014). Pain, physical functioning, and overeating in obese rheumatoid arthritis patients: do thoughts about pain and eating matter ? J. Clin. Rheumatol. 20, 244–250. doi: 10.1097/RHU.0000000000000124
Stajich, J. E., Block, D., Boulez, K., Brenner, S. E., Chervitz, S. A., Dagdigian, C., et al. (2002). The Bioperl toolkit: perl modules for the life sciences. Genome Res. 12, 1611–1618. doi: 10.1101/gr.361602
Stofkova, A., Skurlova, M., Kiss, A., Zelezna, B., Zorad, S., and Jurcovicova, J. (2009). Activation of hypothalamic NPY, AgRP, MC4R, AND IL-6 mRNA levels in young Lewis rats with early-life diet-induced obesity. Endocr. Regul. 43, 99–106.
Telenti, A., Pierce, L. C., Biggs, W. H., di Iulio, J., Wong, E. H., Fabani, M. M., et al. (2016). Deep sequencing of 10,000 human genomes. Proc. Natl. Acad. Sci. U. S. A. 113, 11901–11906. doi: 10.1073/pnas.1613365113
The 1000 Genomes Project Consortium (GPC), Abecasis, G., Auton, A., Brooks, L., DePristo, M., Durbin, R., et al. (2012). An integrated map of genetic variation from 1.092 human genomes. Nature 491, 56–65. doi: 10.1038/nature11632
Theofanopoulou, C., Gastaldon, S., O’Rourke, T., Samuels, B. D., Martins, P. T., Delogu, F., et al. (2017). Self-domestication in Homo sapiens: insights from comparative genomics. PLoS One 12:e0185306. doi: 10.1371/journal.pone.0185306
Thornorsteinsson, V., Magnusson, S., Hellman-Erlingsson, S., Gutmundsdottir, B. R., and Arnason, A. (2004). Congenital deficiency of coagulation factor VII in an Icelandic family. Laeknabladid 90, 385–388.
Townsend, R. R., McGinnis, P. A., Tuan, W. M., and Thrasher, K. (1994). Case report: bilateral adrenal pheochromocytoma. Am. J. Med. Sci. 308, 123–125. doi: 10.1097/00000441-199408000-00013
Trovato, G. M. (2014). Sustainable medical research by effective and comprehensive medical skills: overcoming the frontiers by predictive, preventive and personalized medicine. EPMA J. 5:14. doi: 10.1186/1878-5085-5-14
Um, S., Lee, J. H., and Seo, B. M. (2018). TGF-β2 downregulates osteogenesis under inflammatory conditions in dental follicle stem cells. Int. J. Oral Sci. 10:29. doi: 10.1038/s41368-018-0028-8
Varzari, A., Deyneko, I. V., Vladei, I., Grallert, H., Schieck, M., Tudor, E., et al. (2019). Genetic variation in TLR pathway and the risk of pulmonary tuberculosis in a Moldavian population. Infect. Genet. Evol. 68, 84–90. doi: 10.1016/j.meegid.2018.12.005
Varzari, A., Tudor, E., Bodrug, N., Corloteanu, A., Axentii, E., and Deyneko, I. V. (2018). Age-specific association of CCL5 gene polymorphism with pulmonary tuberculosis: a case-control study. Genet. Test. Mol. Biomarkers 22, 281–287. doi: 10.1089/gtmb.2017.0250
Vasiliev, G., Chadaeva, I., Rasskazov, D., Ponomarenko, P., Sharypova, E., Drachkova, I., et al. (2021). A bioinformatics model of human diseases on the basis of differentially expressed genes (of domestic versus wild animals) that are orthologs of human genes associated with reproductive-potential changes. Int. J. Mol. Sci. 22:2346. doi: 10.3390/ijms22052346
Waardenberg, A. J., Basset, S. D., Bouveret, R., and Harvey, R. P. (2015). CompGO: an R package for comparing and visualizing Gene Ontology enrichment differences between DNA binding experiments. BMC Bioinformatics 16:275. doi: 10.1186/s12859-015-0701-2
Wang, L., Wang, C., Jia, X., and Yu, J. (2018). Circulating exosomal miR-17 inhibits the induction of regulatory t cells via suppressing TGFBR II expression in rheumatoid arthritis. Cell. Physiol. Biochem. 50, 1754–1763. doi: 10.1159/000494793
Wu, J., Wu, M., Li, L., Liu, Z., Zeng, W., and Jiang, R. (2016). dbWGFP: a database and web server of human whole-genome single nucleotide variants and their functional predictions. Database 2016:baw024. doi: 10.1093/database/baw024
Yang, X., Zhang, H., Shang, J., Liu, G., Xia, T., Zhao, C., et al. (2018). Comparative analysis of the blood transcriptomes between wolves and dogs. Anim. Genet. 49, 291–302. doi: 10.1111/age.12675
Yoo, S. S., Jin, C., Jung, D. K., Choi, Y. Y., Choi, J. E., Lee, W. K., et al. (2015). Putative functional variants of XRCC1 identified by RegulomeDB were not associated with lung cancer risk in a Korean population. Cancer Genet. 208, 19–24. doi: 10.1016/j.cancergen.2014.11.004
Zerbino, D. R., Wilder, S. P., Johnson, N., Juettemann, T., and Flicek, P. R. (2015). The Ensembl regulatory build. Genome Biol. 16:56. doi: 10.1186/s13059-015-0621-5
Zheng, B., Hu, L., Song, X., Wu, Z., Cai, R., He, L., et al. (2014). Analgesic effect of different moxibustion durations in rheumatoid arthritis rats. J. Tradit. Chin. Med. 34, 90–95. doi: 10.1016/s0254-6272(14)60060-1
Keywords: rheumatoid arthritis, human, gene, promoter, TATA box, candidate SNP marker, self-domestication syndrome, RNA-seq verification
Citation: Klimova NV, Oshchepkova E, Chadaeva I, Sharypova E, Ponomarenko P, Drachkova I, Rasskazov D, Oshchepkov D, Ponomarenko M, Savinkova L, Kolchanov NA and Kozlov V (2021) Disruptive Selection of Human Immunostimulatory and Immunosuppressive Genes Both Provokes and Prevents Rheumatoid Arthritis, Respectively, as a Self-Domestication Syndrome. Front. Genet. 12:610774. doi: 10.3389/fgene.2021.610774
Received: 27 September 2020; Accepted: 17 May 2021;
Published: 22 June 2021.
Edited by:
Yuriy L. Orlov, I.M. Sechenov First Moscow State Medical University, RussiaReviewed by:
Ranajit Das, Yenepoya University, IndiaElvira Galieva, Novosibirsk State University, Russia
Anatoliy Ivashchenko, Al-Farabi Kazakh National University, Kazakhstan
Copyright © 2021 Klimova, Oshchepkova, Chadaeva, Sharypova, Ponomarenko, Drachkova, Rasskazov, Oshchepkov, Ponomarenko, Savinkova, Kolchanov and Kozlov. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mikhail Ponomarenko, pon@bionet.nsc.ru