- 1Department of Pathology, School of Medicine, São Paulo State University (UNESP), Botucatu, Brazil
- 2Molecular Genetics and Bioinformatics Laboratory, Experimental Research Unit (Unipex), School of Medicine, São Paulo State University (UNESP), Botucatu, Brazil
- 3Human Genome and Stem Cell Research Center, University of São Paulo, São Paulo, Brazil
- 4Department of Genetics and Evolutionary Biology, Biosciences Institute, University of São Paulo, São Paulo, Brazil
- 5Centro Universitário Sudoeste Paulista, Avaré, Brazil
- 6Departamento de Química, Faculdade de Filosofa, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, Brazil
- 7Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
- 8Laboratório de Saúde Humana In Silico, Programa de Pós-Graduação em Saúde e Desenvolvimento Humano, Universidade La Salle, Canoas, Brazil
- 9Departamento de Clínica Médica, Disciplina de Alergia e Imunologia Clínica, Faculdade de Medicina da Universidade de São Paulo, São Paulo, Brazil
- 10Laboratório de Imunologia, Instituto do Coração (InCor), LIM19, Hospital das Clínicas da Faculdade de Medicina da Universidade de São Paulo, (HCFMUSP), São Paulo, Brazil
- 11Instituto de Investigação em Imunologia, Instituto Nacional de Ciências e Tecnologia-iii (INCT), São Paulo, Brazil
- 12Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of São Paulo, São Paulo, Brazil
- 13Prevent Senior Institute, São Paulo, Brazil
- 14Department of Pathology, School of Medicine, University of Sao Paulo, Sao Paulo, Brazil
Background: Although aging correlates with a worse prognosis for Covid-19, super elderly still unvaccinated individuals presenting mild or no symptoms have been reported worldwide. Most of the reported genetic variants responsible for increased disease susceptibility are associated with immune response, involving type I IFN immunity and modulation; HLA cluster genes; inflammasome activation; genes of interleukins; and chemokines receptors. On the other hand, little is known about the resistance mechanisms against SARS-CoV-2 infection. Here, we addressed polymorphisms in the MHC region associated with Covid-19 outcome in super elderly resilient patients as compared to younger patients with a severe outcome.
Methods: SARS-CoV-2 infection was confirmed by RT-PCR test. Aiming to identify candidate genes associated with host resistance, we investigated 87 individuals older than 90 years who recovered from Covid-19 with mild symptoms or who remained asymptomatic following positive test for SARS-CoV-2 as compared to 55 individuals younger than 60 years who had a severe disease or died due to Covid-19, as well as to the general elderly population from the same city. Whole-exome sequencing and an in-depth analysis of the MHC region was performed. All samples were collected in early 2020 and before the local vaccination programs started.
Results: We found that the resilient super elderly group displayed a higher frequency of some missense variants in the MUC22 gene (a member of the mucins’ family) as one of the strongest signals in the MHC region as compared to the severe Covid-19 group and the general elderly control population. For example, the missense variant rs62399430 at MUC22 is two times more frequent among the resilient super elderly (p = 0.00002, OR = 2.24).
Conclusion: Since the pro-inflammatory basal state in the elderly may enhance the susceptibility to severe Covid-19, we hypothesized that MUC22 might play an important protective role against severe Covid-19, by reducing overactive immune responses in the senior population.
Introduction
Although a diverse clinical spectrum has been described among patients with Covid-19, robust data show that increasing age correlates with more severe disease and a higher frequency of deaths worldwide (1). While the elderly have a higher prevalence of comorbidities such as cardiovascular diseases, diabetes, and cancer - which are also independently associated with a higher risk of severe Covid-19 (2), increasing age is still the most significant risk factor for Covid-19 mortality (3). To yield clues about this infection susceptibility, the comparison of extremely older people presenting mild symptoms and young adults with a very severe outcome may contribute with relevant observations.
Covid-19 severity among the elderly may be related to immunosenescence, changes in cytokine patterns, activation of inflammatory pathways, and impaired innate and adaptive immune responses (4–6). Also, comorbidities in older individuals are strongly associated with an increased risk of Covid-19 complications (7, 8). Alternatively, older individuals are more likely to have been exposed to other corona and influenza viruses during their lifespan, even by vaccination, increasing their odds of defeating SARS-CoV-2 (9). For instance, centenarians exposed to the 1918 H1N1 influenza virus might present some protection against SARS-CoV-2 infection (10).
The MHC (Major Histocompatibility Complex) region contains more than 200 genes, many of them related to immunity. Therefore, it is a natural candidate for influencing infectious disease susceptibility and severity. MHC genes influence different levels of the immune response against viruses, such as genes encoding cytokines and molecules of the complement system, which may influence Covid-19 severity and cytokine storm (11, 12), genes encoding membrane-associated mucins (MUC22) (13), and genes encoding molecules that mediate NK cell responses (HLA-G, HLA-E, MICA, and MICB) (14–17). Some of the MHC genes are particularly important to the antigen presentation pathway. HLA-A, HLA-B, and HLA-C encode the heavy chain of the MHC-I molecule, responsible for binding the intracellular antigens and presenting them on the cell surface to the T cell receptor (TCR) of CD8 T lymphocytes. Likewise, HLA-DRA, HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1, and others, encode the MHC-II molecule, responsible for binding the exogenous antigens, usually internalized by antigen-presenting cells such as macrophages, and presenting them on the cell surface to CD4 T lymphocytes. These genes are highly polymorphic, with hundreds to thousands of alleles for each locus (18). Such diversity influences antigen presentation since different MHC molecules may present a different subset of antigens (19).
Because of the unusually high polymorphism and extensive paralogy of the MHC region, particularly at the HLA classical class I and II genes, the MHC region requires specialized tools to align short reads correctly and, thus, call genotypes and haplotypes properly (20, 21). Additionally, allele frequencies vary across populations, as clearly documented in previous studies focusing on HLA genes and Covid-19 (22–26). Associations between HLA genotype and disease severity extend to other unrelated viruses, such as HIV and dengue (27, 28). Some MHC variants have already been reported to be associated with Covid-19 severity. HLA-G variant rs9380142 was associated with Covid-19 critical illness (29) and HLA-E allele E*01:01 with Covid-19 severity, particularly in patients requiring intensive care (30). The CCHCR1 locus was associated with critical illness in Covid-19 (29).
We hypothesized that differences in MHC can predispose to a severe or mild clinical course in Covid-19 despite age. Therefore, aiming to verify the existence of MHC differences between extreme opposed outcomes for Covid-19, we have analyzed exomes of Brazilian convalescents by SARS-COV-2 grouped according to Covid-19 clinical status and age: a group of super elderly patients (>= 94 yo) recovered from Covid-19 with mild to moderate symptoms (without ventilation support) as compared to younger adults (mean age <= 52) with severe disease (with ventilation support). We also compared patients with a previously whole-genome sequenced (WGS) census-based sample of elderly individuals from the same city (São Paulo, Brazil), sampled before the current pandemic (31). Brazilians are a highly admixed population composed of tri-hybrid proportions of European (average 73%), African (18%), and Native American (9%) ancestries (31). This was taken into account and therefore global genomic ancestry was controlled when performing the association study.
Materials and methods
Definition of the groups
This survey included 225 patients with Covid-19, as illustrated in Figure 1. Diagnostic tests (RT-PCR) confirmed the positive SARS-CoV-2 infection in all individuals. The samples were collected between June and October 2020, before new SARS-CoV-2 variants were reported in Brazil (especially Gamma) and before the onset of the Brazilian vaccination program against Covid-19.
Figure 1 Distribution of Brazilian individuals with SARS-CoV-2 infection according to Covid-19 severity, and the definition of the larger groups MILD and SEVERE Covid-19.
Covid-19 severity was classified according to the clinical spectrum of the World Health Organization’s updated guideline for Covid-19 treatment (https://www.covid19treatmentguidelines.nih.gov/overview/clinical-spectrum/). Patients that were asymptomatic or presented mild symptoms were grouped into MILD Covid-19; deceased and/or hospitalized in ICU requiring ventilation support were grouped into SEVERE Covid-19 (Figure 1). The SEVERE group is significantly younger than the MILD group (Supplementary Table S1, p< 10-5).
We also considered a special group named super elderly (>= 94 yo) who recovered from Covid-19, with 72 super elderly with mild Covid-19 and 15 with moderate symptoms. We retrieved clinical data regarding the progression of Covi-19 and diagnostic test results.
These Covid-19 groups were compared with MHC data from a previously whole-genome sequenced (WGS) sample of Brazilian elderly individuals (>= 65 yo), known as the SABE cohort (31), collected before the SARS-CoV-2 outbreak and representative of the general elderly population in the same city. Age, sex, and mean genetic ancestry distributions for each group are displayed in Supplementary Table S1 and were used as co-variables in regression models.
Exome and whole-genome sequencing
We obtained full exomes from DNA extracted from patients’ samples with SARS-CoV-2 infection. We used the Nextera Rapid Capture Custom Enrichment Kit or the Nextera Flex Kit (Illumina, San Diego, CA, USA) for library preparation and the IDT xgen-V1 kit for capture following manufacturer protocols. Whole-exome sequencing was performed on the NovaSeq 6000 equipment (Illumina, USA) with a 150-base paired-end dual index read format. Reads were aligned to the human reference GRCh38 using Burrow–Wheeler Aligner (BWA), algorithm MEM (https://github.com/lh3/bwa/tree/master/bwakit). We also called genotypes using GATK HaplotypeCaller (version 4.0.9). We used the genotypes obtained in this step to infer the genetic ancestry. The pipeline used for alignment, variant calling, variant refinement, and genetic ancestry assessment is detailed elsewhere (22).
For the general elderly population (SABE), whole-genome sequencing was performed previously (31). Although SARS-Cov-2 infection is unknown for this cohort, this data provides a baseline for the frequency of each polymorphism in the general elderly population from São Paulo.
MHC genotyping and haplotyping
The MHC region is prone to genotyping errors because of alignment bias in paralogous and highly polymorphic genes (20, 21, 32). The HLA classical class I and II genes are the most impacted ones by alignment bias, and conventional NGS analysis workflows are not suitable for genotyping them. We used a customized workflow to circumvent this issue and get reliable genotypes and haplotypes in the MHC region. We used HLA-mapper (version 4) (20) to optimize read alignment along the MHC region (Supplementary Figure S1). The input for HLA-mapper is the BAM file obtained in the previous step (for exomes or whole-genomes). After applying HLA-mapper to correct the alignments, we called genotypes using GATK 4 HaplotypeCaller. Then, we selected only the variants that overlapped the region captured by the Exome, with no more than 5% of missing alleles in the Exomes. After, we refined the variants using the standard Variant Quality Score Recalibration (VQSR) supplemented with known variants from HLA genes. To obtain phased variants for each gene, we first phased closely located variants using WhatsHap (33). Then, we combined phase sets using Shapeit 4 (34). The final product is a phased VCF with SNPs throughout the MHC.
HLA allele calling
For HLA-A, HLA-B, HLA-C, HLA-E, HLA-F, HLA-G, MICA, MICB, HLA-DOA, HLA-DOB, HLA-DMA, HLA-DMB, HLA-DRA, HLA-DPA1, HLA-DPB1, TAP1, and TAP2, we obtained the complete exonic sequences for each individual by converting the phased VCF obtained in the previous step into complete CDS sequences using vcfx transcript (www.castelli-lab.net/apps/vcfx). We also translated these sequences into protein sequences (the allotypes) using Emboss transeq. We called HLA alleles (3-field and 2-field resolution) directly from these CDS and predicted protein sequences, comparing them with the ones reported in the IPD-IMGT/HLA database (35). Because of exome probe-capturing bias in some MHC regions, we imputed HLA-DRB1, HLA-DQA1, and HLA-DQB1 2-field alleles instead of calling alleles directly from the VCF data, as discussed in the next section.
Quality control for MHC variants and HLA imputation for HLA-DRB1, HLA-DQA1, and HLA-DQB1
For exomes, we noticed a probe-capturing bias in some MHC regions, which is not an unexpected issue, especially for HLA genes (32, 36). This bias is quite strong for HLA-DRB1, HLA-DQA1, and HLA-DQB1, as illustrated in Supplementary Figure S2. Although the HLA-mapper optimization corrects most of the alignment errors in HLA genes, the absence of sequences from one chromosome (the capture bias) led to genotyping bias and allele call errors. This is particularly problematic when comparing exomes and whole genomes because this error occurs only in the former.
To circumvent this issue, we only selected from the exomes the variants with an average proportion of reads among alleles in heterozygous sites (i.e., the allele balance) over 0.3, represented as a red line in Supplementary Figure S2. This procedure might have eliminated some important variants, but it allowed us to compare exomes and whole genomes by avoiding variants prone to genotyping errors. In addition, we selected only variants with a frequency of at least 1%, either among patients or among the general elderly population. Accordingly, 2,346 SNPs were selected across the MHC and were considered for all subsequent analyses.
The capture bias discussed above prevented the direct call of HLA alleles for HLA-DRB1, HLA-DQA1, and HLA-DQB1. We applied an imputation method with HIBAG 1.5 (37) to call 2-field resolution alleles (Supplementary Figure S1). First, we built a reference panel based on whole-genome data from Brazilians, the 1000Genomes, and HGDP datasets, using the same pipeline as presented in Supplementary Figure S1. The selected variants for the imputation model are bi-allelic variants present in both the reference panel and exomes and show an average proportion of reads among alleles in heterozygous sites over 0.30 in exomes (Supplementary Figure S2). After imputation, the incompatibility between imputed HLA alleles and direct calls was as follows: HLA-A (4.1%), HLA-B (2.8%), HLA-C (2.2%), HLA-DRB1 (20.9%), HLA-DQA1 (23%), and HLA-DQB1 (28.4%). Because of that, we opted to consider only the direct calls for HLA-A, HLA-B, and HLA-C, and only the imputed alleles from HLA-DRB1, HLA-DQA1, and HLA-DQB1.
Statistical analyses for association
The usual threshold for genome-wide significance in GWAS studies is P< 10-8, defined based on the average number of segregation blocks in European genomes. However, here, we are focusing on 2,346 SNPs across the MHC (not the full genome), in a different population (admixed Brazilians), and in a region with different LD patterns than the rest of the genome. Therefore, we calculated the number of different segregation blocks observed across our data by using Haploview and the confidence intervals algorithm (38). We detected exactly 100 segregation blocks with all bi-allelic markers with minimum allele frequencies of 2%. Therefore, we set alpha = 0.05/100 = 0.0005 as the threshold for detecting an associated variant despite multiple tests (the red lines in the Manhattan plots). We also report candidates that reach a 10-fold higher threshold, alpha < 0.005 (the blue line in the Manhattan plots) to avoid missing potential variants associated with Covid 19 severity.
We used plink2 to fit a logistic regression that considers each variable site, allotype, or amino acid residue as an independent marker. We created a plink-format table file containing a column for each allele of a SNP, every allotype, each amino acid in a specific position, and the dosage observed for the samples (from 0 to 2). The regression analysis, performed in R, considered sex and genetic ancestry as covariables in all comparisons. We did not include age as covariable to adjust P-values because of the super elderly. To evaluate the amino acid residues, we first aligned the predicted protein sequence of all individuals. While we considered all SNPs that have passed the filter described above, for the allotype and amino acid residues, we considered only the following genes: HLA-A, HLA-B, HLA-C, HLA-E, HLA-F, HLA-G, MICA, MICB, HLA-DRA, HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1, HLA-DOA, HLA-DOB, HLA-DMA, HLA-DMB, TAP1, and TAP2.
Modeling of HLA structures and analysis of immunogenic regions and spike epitopes
Aiming to enhance our understanding of underlying mechanisms for HLA alleles associated with disease severity and the mechanisms underlying the associations, we have predicted the HLA molecule structure and the SARS-CoV-2 peptides that can bind to these HLA versions. The detailed methods are in the supplementary material.
Results
Covid-19 demographic data
Supplementary Table S1 presents demographic data and mean genome-wide genetic ancestry for each group. The mean age for the MILD group (66.9 years) is significantly higher than the SEVERE group (51.3 years), p < 10-5. There are more women in the MILD group (58.3%) and among the super elderly (74.7%) than in the SEVERE Covid-19 (43.6%).
All groups present similar proportions of genetic ancestry, except the SEVERE group which has a greater African and Native American ancestry than the MILD or the general elderly population from the same city (Supplementary Table S1). On one hand, ancestry might be related to Covid-19 severity, as observed for some Covid-19 comorbidities. For example, diabetes is a risk factor for severe Covid-19 (39) and is more frequent in individuals with higher African and Native-American ancestry in some populations (40). On the other hand, the lower socioeconomic status of a Brazilian citizen is correlated with a higher African and Native American ancestry. Most of the severe cases came from public hospitals, which, on average, are of poorer quality and lower efficiency, and then enriched for individuals with lower socioeconomic status (41). Nevertheless, the reported association results were controlled for genetic ancestry, which allowed adjustment to some extent for the socioeconomic scores.
Extreme outcomes for Covid-19 severity
The comparison between the SEVERE and MILD groups revealed three missense candidate variants that are 2-3 times more frequent in the SEVERE group than in MILD, coinciding with genes HLA-A, HLA-DOB, and TAP2 (Figure 2 and Table 1). The HLA-A variant is also significantly less frequent in the MILD group than in the general elderly population (p = 0.0066), and HLA-DOB and TAP2 variants are also significantly overrepresented in the SEVERE group compared to the general elderly population (p = 0.0009 and p = 0.0064, respectively), according with Table 1.
Figure 2 Manhattan plot illustrating differences among patients with MILD and SEVERE Covid-19, and the general elderly population from the same city. The general elderly population consists of whole-genomes (WGS) of 1,170 elders from São Paulo city, with unknown status regarding Covid-19. We considered only variants with a minor allele frequency of 1% in at least one of the groups, with no more than 5% of missing alleles in both groups, and with a ratio between read depth for each allele in heterozygous sites ranging from 0.3 to 0.5 for Exomes (the MILD and SEVERE groups). The red line marks the threshold for detecting a variant associated with the phenotype by calculating the number of segregation blocks observed in our data. The blue line marks the suggestive threshold for a candidate variant, p < 0.005. Most MUC22 variants are in Linkage Disequilibrium (r2 > 0.8).
Table 1 Polymorphisms across the MHC region (SNPs and indels) that are associated with Covid-19 severity in Brazilian patients.
Probably because of the relatively small and different sample size of both the MILD and SEVERE groups, some variants are highlighted when compared to the general elderly population but not when comparing the Covid-19 groups, as, the susceptibility missense variants rs1264457 at HLA-E and rs2228111 at TAP1, among others. On the other hand, the protective missense variants at MUC22 are significantly overrepresented among patients with MILD Covid-19 (Figure 2 and Table 1).
We also evaluated whether variants are associated with Covid-19 severity in younger patients by removing the super elderly from the MILD group. The resulting smaller sample size did not allow the identification of candidate variants in this comparison. However, the pattern observed in Table 1 was maintained.
We explored the allotypes and amino acid frequencies in different groups (Table 2). DOB*01:02 and the amino acid that defines this allotype, 18Q, are overrepresented in the SEVERE group compared to the MILD and the general elderly population. This amino acid exchange is related to the rs2071554 variant described in Table 1. HLA-E*01:03 and the main amino acid exchange that composes this allotype, 128G, is significantly overrepresented in the SEVERE group when compared to the general elderly population (p < 0.005) and the MILD group (p < 0.05). This amino acid exchange is related to rs1264457 (Table 1). All variants that define the allotype MICA*008 are overrepresented in the MILD group when compared to the general population (Table 2).
Two amino acid residues at HLA-A, 86R and 87N (full-length protein) or 62R and 63N (mature protein), are significantly overrepresented in the SEVERE group when compared to the MILD group (p < 0.005), and overrepresented in the general elderly population when compared to the MILD group (p < 0.01). Likewise, HLA-A residue 156/W is less frequent in the MILD group compared to the SEVERE (p < 0.05) and the general elderly population (p < 0.005). Residues 62R and 63N are associated with many HLA-A allotypes, including A*25, A*26, A*33, A*34, A*66, A*68, and A*69. Residue 156/W is associated with allotypes A*25, A*26, A*34, A*43, A*66, and A*68.
The linkage disequilibrium (LD) pattern in Figure S7 indicates that most Covid-19-associated MUC22 polymorphisms are in strong LD. Likewise, the MICA variants (all associated with MICA*008) are in strong LD. MUC22 and MICA are independent signals. The signals from HLA-DOB and TAP2 might not be independent. There is no LD between the HLA-A variant and other relevant variants across the MHC.
Super elderly patients recovered from Covid-19 as compared to the elderly general population
The strongest signal coincides with gene MUC22, rs62399430, two times more frequent among the super elderly than in the SEVERE group (p = 0.0057, OR=0.30). Moreover, when super elderly are compared to the general elderly population from the same city, we detected candidate variants in two genes, MUC22 and PSORS1C1/CDSN (Figure 3). Most of the signals coincide with gene MUC22. Variants rs62399430, rs11753789, and rs12110785 (p < 0.0002, OR > 2.0) are, in general, two times more frequent among super elderly recovered from Covid-19. Two of these are missense variants. Most MUC22 variants are in Linkage Disequilibrium (r2> 0.8, Figure S7). These variants are also overrepresented in the MILD Covid-19 group, which includes most of the super elderly patients and younger patients with mild Covid-19. Another candidate variant for protection is rs145583110, an intronic variant from PSORS1C1 or exonic for CDSN, which is 3 times more frequent among the super elderly than in the general population (p = 0.0018, OR = 3.85).
Figure 3 Manhattan plot illustrating differences between Brazilian centenarians (>90 y) recovered from Covid-19 and the general elderly population. The general population consists of whole-genomes (WGS) of 1,170 elders from São Paulo city (> 65 y), with unknown status regarding Covid-19. The centenarian group includes exomes of 87 patients more than 90 years old that presented mild Covid-19 symptoms. We considered only variants with a minor allele frequency of 1% in at least one of the groups, with no more than 5% of missing alleles in both groups, and with a ratio between read depth for each allele in heterozygous sites ranging from 0.3 to 0.5 for Exomes. The red line marks the threshold for detecting a variant associated with the phenotype by calculating the number of segregation blocks observed in our data. The blue line marks the suggestive threshold for a candidate variant, p < 0.005. Most MUC22 variants are in Linkage Disequilibrium (r2 > 0.8).
The allotype and amino acid residue frequencies reveal no relevant association when comparing super elderly with MILD Covid-19 and younger patients with SEVERE Covid-19, or when super elderly are compared to the general elderly population.
Discussion
Here we investigated the polymorphisms across the MHC region associated with Covid-19 disease severity in patients with extreme phenotypes: young adults with severe Covid-19, and super elderly individuals with mild Covid-19. All the samples were collected between June and October 2020, before new SARS-CoV-2 variants were reported in Brazil (especially Gamma) and before the onset of the Brazilian vaccination program against Covid-19. This data may shed some light on the mechanisms underlying SARS-CoV-2 resistance, particularly for the earlier SARS-CoV-2 strains in unvaccinated individuals. All samples from the control elderly Brazilian population were collected before the SARS-CoV-2 outbreak.
We applied a bioinformatics pipeline to correct alignments and call reliable genotypes and HLA alleles and detected some associated and candidate variants that influence infection severity, some related to genes from the antigen presentation pathway and others from different pathways. This section will focus on the strongest hits and the frequent candidate variants associated with the phenotypes, particularly MUC22, HLA-A, and HLA-DOB.
MUC22 variants and mild Covid-19
We detected missense MUC22 variants associated with mild Covid-19 when evaluating all patients with mild symptoms, and the super elderly recovered from Covid-19. MUC genes encode mucins, and 16 different mucins have been identified in the lung (42). Mucins are high-molecular-weight glycoproteins that can be secreted or anchored to the cell membrane (transmembrane mucins) (42). MUC22 is a member of the mucins’ family. It encodes transmembrane mucins expressed in the bronchi of the lungs and participates in the inflammatory and innate immune response (43, 44). Airway mucus comprises water, antimicrobial proteins, serum protein transudates, and mucus glycoproteins. It protects and lubricates the respiratory tract. However, excessive mucus production is related to inflammatory lung diseases (45), which are found in severe cases of Covid-19. Overexpression of MUC1 and MUC5AC mucins, for instance, play a key role in Covid-19 symptoms and may contribute to the high viscosity of airway mucus, leading to airflow obstruction and respiratory distress (46).
MUC22 is up-regulated in infections with respiratory syncytial vírus (47). Some studies have been pointing to mucin´s role in Covid-19 (48, 49). The outcome of SARS-CoV-2 infection may be correlated with a signature of shed mucins in circulation from infected lung or respiratory tract epithelial cells (48). MUC22 polymorphisms have been associated with diffuse panbronchiolitis (43) and asthma in Latinos (50).
One may argue that these variants are related to longevity and not with protection against severe Covid-19. However, this variant is not correlated with longevity because (a) the SABE sample refers to a census-based cohort of elderly Brazilians with an average age of 75, and thus variant associated with longevity would already be more frequent, and (b) their frequency in SABE is very similar to frequencies observed in European and Latin American populations (https://www.ncbi.nlm.nih.gov/snp/rs62399430#frequency_tab).
Except for rs146685560, all MUC22 protective variants are correlated with higher expression of miR-6891 in many tissues, including the esophagus and lung (gtexportal.org). miR-6891 is co-expressed with MUC22 in the alveolus (pneumocyte type II). Interestingly, miR-6891 is encoded in the MHC, and targets the ORF3a gene from SARS-CoV-2 (51), which encodes a sodium or calcium ion channel protein involved in replication and pathogenesis (52). Most importantly, miR-6891-5p is upregulated in Calu3 cells infected with SARS-CoV-2 (53). During the early stages of SARS-CoV-2 infection, ORF3a directs the host’s immune response (54), may induce lysosomal evasion (55, 56) and could promote cytokine storms by activating the NF-kB signaling and NLRP3 inflammasomes pathways (57). Therefore, we can hypothesize that a higher expression of miR-6891-5p, associated with all MUC22 protective variants, may contribute to less severe symptoms during SARS-CoV-2 infection.
One possible explanation for higher miR-6891-5p expression is linkage with HLA-B since the MIR6891 gene coincides with an HLA-B intron. In the present study, there is a clear association between the protective variant rs62399430/T and some HLA-B alleles, such as B*35:01, B*35:03, B*48:02, and B*51:01. All of these HLA-B alleles are listed as high mRNA expressing alleles (58–60). Thus, higher HLA-B mRNA expression might be correlated with higher miR-6891 expression, and MUC22 variants are tagging this phenotype.
Also, MUC22 is approximately 100 kb from CCHCR1 gene, the most important signal in the MHC region for Covid-19 susceptibility, according to the Covid-19 Host Genetics Consortium (Covid19hg - https://app.covid19hg.org/). All CCHCR1 variants associated with Covid-19 (29, 61) are intronic and were not captured by our genotyping method (whole-exome sequencing). The exomic CCHCR1 variants included in our survey do not correlate with Covid-19 severity. This is expected since the meta-analysis provided by Covid19hg indicates that CCHCR1 is associated with Covid-19 susceptibility but not with severity when comparing hospitalized and non-hospitalized patients (as performed here). Moreover, studying LD among the most relevant variants from MUC22 and CCHCR1, we detected a weak LD (r2 < 0.3, D’ < 0.7). Covid19hg effort detected no relevant signal from MUC22, including the SNPs described here. Although our MUC22 findings are not cross-validated by the Covid19hg, we must consider that we are evaluating a very different cohort, aged and super elderly individuals with mild Covid-19 and younger patients with severe Covid-19. The signal from MUC22 is much stronger in the super elderly group. The frequency of rs62399430 among the super elderly with mild Covid-19 is 2x higher than the observed among Europeans and in the general elderly populations and 3.8x higher than in Africa. Therefore, considering study designs and cohort ancestries, the MUC22 signal might be ancestry-specific and independent of CCHCR1.
HLA-DOB*01:02/rs2071554 and severe Covid-19
The HLA-DO molecule is a heterodimer formed by two heavy chains, HLA-DOA and HLA-DOB. HLA-DO is a non-classical MHC-II molecule that does not present peptides on the cell surface, but it is required to efficiently load endosomal peptides onto MHC-II molecules (62–66). Thus, modifications in HLA-DOB may directly influence antigen presentation in the MHC class II pathway.
The HLA-DOB*01:02 allele has also been associated with an increased risk of death in patients with non-small cell lung cancer; reduced median survival time (67, 68), with type 1 diabetes (69), which is a major comorbidity related to severe Covid-19 (39), and with resistance to SARs-CoV-2 infection (22). DOB*01:02 has a different signal peptide, which may influence cellular localization and trafficking of the protein (15), possibly leading to inadequate antigen presentation.
HLA-DOB*01:02 is rare (0.6%) among SARs-CoV-2 resistant individuals (22), has similar frequencies of around 6% among patients with mild Covid-19 or the control elderly population, and reaches 13.6% in patients with severe Covid-19 (Table 1). The frequency of DOB*01:02 among the super elderly is similar to the general elderly population.
DOB*01:02 is frequent in Africa (reaching the same frequency as the SEVERE group) and less frequent in Europe. Although the percentage of Native-American and African ancestries are much higher in the SEVERE group than others, the associations described here are adjusted for genetic ancestry and sex. Because DOB*01:02 frequencies vary among different biogeographic regions, other studies are unlikely to find similar results unless addressing African or admixed populations comparable to the Brazilian one.
HLA-A, TAP1, and TAP2 polymorphisms and severe Covid-19
We detected three different associations involving genes from the class I antigen presentation pathway. These associations involve TAP1 and TAP2, which participate in the peptide pumping from the cytoplasm to the endoplasmic reticulum, and gene HLA-A, which will present these peptides on the cell surface to T CD8 lymphocytes (Tables 1, 2). High expression levels of TAP1 and TAP2 are correlated with the amount of virus in lung tissue (70). The TAP2 signal might be a hitchhiking association due to linkage with HLA-DOB*01:02 (Figure S7).
For HLA-A, the amino acid residues at positions 62 and 63 (mature protein) depend on the haplotype formed by four different variants, rs1059455, rs1064588, rs2230991, and rs199474424. Their presence is associated with severe Covid-19. The fact that (a) this combination of amino acids only occurs when there is a specific haplotype, (b) this haplotype occurs in many different HLA-A alleles, and (c) allele frequencies vary in different populations, may explain why we did not detect any association between HLA-A allotypes with Covid-19 severity and why previous surveys described different results or no association (23, 71–77).
One possible mechanism for these associations is different binding affinities to SARs-CoV-2 peptides coupled with the subset of peptides pumped by TAP. The higher frequency of 62R-63N in the SEVERE group is related to alleles A*33:03, A*68:01, and A*68:02. A*68 alleles are among the strong binders for SARs-CoV-2 peptides (78, 79) and among the best binders for respiratory viruses (70).
To investigate whether residues 62R/63N could be interfering with the subset of antigens presented by HLA-A, we performed an in silico prediction of the antigen processing pathway impact on the presentation ability of the alleles carrying 62R/63N. Since the set of MHC alleles that an individual presents defines the ligandome on its cell surface, we predicted the peptidome of inspected alleles and compared them with the corresponding immunogenic regions from the SARS-CoV-2 spike protein. The comparison did not provide evidence that the investigated alleles lack the potential to present T cell epitopes already described for SARS-CoV-2 sequences (Supplementary Figures S3, S4). Additionally, we extracted the frequency of response for each predicted peptide from each 62R/63N sample alleles to compare their average values of immunogenicity frequencies with other alleles associated with good and bad outcomes in Covid-19. Again, the alleles carrying 62R/63N and overrepresented in the severe group exhibited similar numbers, indicating that the presentation ability was not responsible for the impaired response (Supplementary Figure S5).
To infer if the mutations could be interfering with immunogenicity triggering, we also looked for alterations present in the 62R/63N alleles that could impact the TCR interaction surface of the MHC cleft. The HLA structural models were screened, looking for shared features in regions usually contacted by complementarity-determining region 3 (CDR3) loops (Supplementary Figure S6). Hierarchical clustering analysis showed that four out of eight investigated alleles presented an electrostatic potential distribution fingerprint in the probed area. This physicochemical element was already described as pivotal to cytotoxicity elicitation, as described previously (80).
Interestingly, the clustered alleles were HLA-A*33:01, A*33:03, A*68:01, and A*68:02, the ones with higher frequencies among patients with Severe Covid-19 (Figure 4). These analyses emphasize the need for a deeper investigation when we are dealing with HLA alleles and their involvement with immunogenic issues. Only looking for ligandome predictions can underestimate the whole importance of these structures in T cell stimulation.
Figure 4 Top view of HLA-A molecules and hierarchical clusterization of HLA-A alleles. In (A) there is a MHC structure in both the ribbon and surface visualizations from the region that interacts with the TCR, respectively. The area in orange represents the zone proximal about the residues 62 and 63. In (B) we can see the cluster dendrogram of the analyzed MHCs. Under each allele name, there is the surface image that was generated for electrostatic potential distribution. In the right branch, we can see the cluster of alleles carrying rs199474424 (please refer to Figure 2), HLA-A*33:01, A*33:03, A*68:01, and A*68:02, with their respective shared molecular fingerprint.
Among the alleles with increased frequency in the SEVERE group, A*68:02 is particularly frequent in Africa, A*62:01 among Native Americans, and A*33:03 in Asia (www.allelefrequencies.net). Although the SEVERE group from Brazil presents a higher African and Native American ancestry than the MILD group (Supplementary Table S1), the associations presented here (62R/63N, p = 0.0017, OR=2.7) are adjusted for ancestry and sex. This p-value is much lower, 0.0003, and significant even after correction for multiple tests within the HLA-A locus when not adjusted for ancestry. Therefore, population stratification should be ruled out as the main issue leading to these results. Since the associated alleles are most frequent in non-European populations, particularly Native Americans and Africans, it is unlikely that any study addressing European ancestry samples (23, 25, 74, 75, 77, 81–83) would find similar results. In fact, most of the previous studies evaluate European patients and find different results than the one presented here. Nevertheless, a frequency analysis of HLA alleles among Covid-19 infected patients from Saudi Arabia found HLA-A*68 among the most common alleles associated with severe disease (84), and A*68 is relatively common in Saudi Arabia (www.allelefrequencies.net). These results demonstrate the importance of addressing admixed populations such as Brazilians and other less-studied population samples.
Conclusions
In short, here, we performed an in-depth analysis of the MHC region in a cohort of unvaccinated super elderly individuals with Covid-19 that presented mild, or no symptoms compared with a group of younger patients with severe Covid-19 and/or a lethal outcome. We used a method to call genotypes and haplotypes in the MHC that minimizes alignment and genotyping errors. Interestingly, the strongest signals in the MHC region for candidate variants protecting against severe Covid-19 coincide with gene MUC22. Missense variants at MUC22 are more frequent among the super elderly and in the MILD group than in the SEVERE group and the general elderly population. We hypothesized that MUC22 might play an important protective role against severe Covid-19. Functional studies must be placed to evaluate the true impact of such variants on MUC22 function.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: European Genome-phenome Archive (EGA), under accession number EGAS0000100637.
Ethics statement
This study was approved by the Committee for Ethics in Research of the Institute of Biosciences at the University of São Paulo (CAAE 34786620.2.0000.5464). The patients/participants provided their written informed consent to participate in this study.
Author contributions
EC, MC, MN, and MZ contributed to the conceptualization. MC, LM, and MVRS contributed to data curation. EC, MN, MOS, NS, KN, IO, EA, ES, and GV contributed to the formal analysis. MZ contributed to funding acquisition. MC, MN, MOS, EC-N, and KS contributed to the investigation. EC, MN, MOS, and KN contributed to the methodology. MZ contributed to the project administration. EC, MN, and MOS contributed the software. EC, MN, MOS, and KN contributed to visualization. EC, MZ, MC, MN, KS, and MOS contributed to writing–original draft. NS, RP, VAOC, CC, CM-J, GV, DM, LM, MVRS, JW, JE, VRC, JM, EN, JK, RB, MH, LD’A, AR-F, PB, AD, MD, PS, MP-B, and MZ contributed to writing–review, and editing. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the São Paulo Research Foundation (FAPESP/Brazil) (grant numbers 2013/08028-1, 2014/50931-3, 2019/19998-8, and 2020/09702-1), the National Council for Scientific and Technological Development (CNPq) (grant number 465355/2014-5), and JBS S.A. (grant number 69004). FAPESP/Brazil (Grant numbers 2013/17084-0 and 2017/19223-0) and the United States National Institutes of Health (NIH) (R01 GM075091) supported the development of the HLA and KIR pipeline and the genetic ancestry approach. This study was also supported by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brasil (CAPES)-Finance Code 001 and Fleury Group (Project NP-565). The funders were not involved in the study design, collection, analysis, interpretation of data, the writing of this article, or the decision to submit it for publication.
Acknowledgments
The authors are extremely grateful to all volunteers for their participation and collaboration, the nurses for sample collection, the technical team for the material process and data analysis, and the Fleury Laboratory for serology tests. Special thanks to Brazilian Senator Mara Gabrilli for financial support and to JBS S.A. for the additional funding. The funders were not involved in the study design, collection, analysis, interpretation of data, the writing of this article, or the decision to submit it for publication.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2022.975918/full#supplementary-material
References
1. Dhama K, Patel SK, Natesan S, Vora KS, Iqbal Yatoo M, Tiwari R, et al. COVID-19 in the elderly people and advances in vaccination approaches. Hum Vaccines Immunother (2020) 16(12):2938–43. doi: 10.1080/21645515.2020.1842683
2. Bajgain KT, Badal S, Bajgain BB, Santana MJ. Prevalence of comorbidities among individuals with COVID-19: A rapid review of current literature. Am J Infect Control (2021) 49(2):238–46. doi: 10.1016/j.ajic.2020.06.213
3. Ho FK, Petermann-Rocha F, Gray SR, Jani BD, Katikireddi SV, Niedzwiedz CL, et al. Is older age associated with COVID-19 mortality in the absence of other risk factors? general population cohort study of 470,034 participants. PloS One (2020) 15(11):e0241824. doi: 10.1371/journal.pone.0241824
4. López-Otín C, Blasco MA, Partridge L, Serrano M, Kroemer G. The hallmarks of aging. Cell (2013) 153(6):1194–217. doi: 10.1016/j.cell.2013.05.039
5. Fuentes E, Fuentes M, Alarcón M, Palomo I. Immune system dysfunction in the elderly. Acad Bras Ciênc (2017) 89:285–99. doi: 10.1590/0001-3765201720160487
6. Amore S, Puppo E, Melara J, Terracciano E, Gentili S, Liotta G. Impact of COVID-19 on older adults and role of long-term care facilities during early stages of epidemic in Italy. Sci Rep (2021) 11(1):12530. doi: 10.1038/s41598-021-91992-9
7. Yuki K, Fujiogi M, Koutsogiannaki S. COVID-19 pathophysiology: A review. Clin Immunol Orlando Fla (2020) 215:108427. doi: 10.1016/j.clim.2020.108427
8. Parasher A. COVID-19: Current understanding of its pathophysiology, clinical presentation and treatment. Postgrad Med J (2021) 97(1147):312–20. doi: 10.1136/postgradmedj-2020-138577
9. Debisarun PA, Gössling KL, Bulut O, Kilic G, Zoodsma M, Liu Z, et al. Induction of trained immunity by influenza vaccination - impact on COVID-19. PloS Pathog (2021) 17(10):e1009928. doi: 10.1371/journal.ppat.1009928
10. Poulain M, Chambre D, Pes GM. Centenarians exposed to the Spanish flu in their early life better survived to COVID-19. Aging (2021) 13(18):21855–65. doi: 10.18632/aging.203577
11. Charitos P, Heijnen IAFM, Egli A, Bassetti S, Trendelenburg M, Osthoff M. Functional activity of the complement system in hospitalized COVID-19 patients: A prospective cohort study. Front Immunol (2021) 12:765330. doi: 10.3389/fimmu.2021.765330
12. Karki R, Kanneganti TD. The “cytokine storm”: molecular mechanisms and therapeutic prospects. Trends Immunol (2021) 42(8):681–705. doi: 10.1016/j.it.2021.06.001
13. Linden SK, Sutton P, Karlsson NG, Korolik V, McGuckin MA. Mucins in the mucosal barrier to infection. Mucosal Immunol (2008) 1(3):183–97. doi: 10.1038/mi.2008.5
14. Castelli EC, de Almeida BS, Muniz YCN, Silva NSB, Passos MRS, Souza AS, et al. HLA-G genetic diversity and evolutive aspects in worldwide populations. Sci Rep (2021) 11(1):23070. doi: 10.1038/s41598-021-02106-4
15. Hervier B, Russick J, Cremer I, Vieillard V. NK cells in the human lungs. Front Immunol (2019) 10:1263. doi: 10.3389/fimmu.2019.01263
16. Raulet DH, Gasser S, Gowen BG, Deng W, Jung H. Regulation of ligands for the NKG2D activating receptor. Annu Rev Immunol (2013) 31:413–41. doi: 10.1146/annurev-immunol-032712-095951
17. Borst L, van der Burg SH, van Hall T. The NKG2A-HLA-E axis as a novel checkpoint in the tumor microenvironment. Clin Cancer Res Off J Am Assoc Cancer Res (2020) 26(21):5549–56. doi: 10.1158/1078-0432.CCR-19-2095
18. Radwan J, Babik W, Kaufman J, Lenz TL, Winternitz J. Advances in the evolutionary understanding of MHC polymorphism. Trends Genet TIG (2020) 36(4):298–311. doi: 10.1016/j.tig.2020.01.008
19. Wieczorek M, Abualrous ET, Sticht J, Álvaro-Benito M, Stolzenberg S, Noé F, et al. Major histocompatibility complex (MHC) class I and MHC class II proteins: Conformational plasticity in antigen presentation. Front Immunol (2017) 8:292. doi: 10.3389/fimmu.2017.00292
20. Castelli EC, Paz MA, Souza AS, Ramalho J, Mendes-Junior CT. Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures. Hum Immunol (2018) 79(9):678–84. doi: 10.1016/j.humimm.2018.06.010
21. Brandt DYC, Aguiar VRC, Bitarello BD, Nunes K, Goudet J, Meyer D. Mapping bias overestimates reference allele frequencies at the HLA genes in the 1000 genomes project phase I data. G3 Bethesda Md (2015) 5(5):931–41. doi: 10.1534/g3.114.015784
22. Castelli EC, de Castro MV, Naslavsky MS, Scliar MO, Silva NSB, Andrade HS, et al. MHC variants associated with symptomatic versus asymptomatic SARS-CoV-2 infection in highly exposed individuals. Front Immunol (2021) 12:742881. doi: 10.3389/fimmu.2021.742881
23. Iturrieta-Zuazo I, Rita CG, García-Soidán A, de Malet Pintos-Fonseca A, Alonso-Alarcón N, Pariente-Rodríguez R, et al. Possible role of HLA class-I genotype in SARS-CoV-2 infection and progression: A pilot study in a cohort of covid-19 Spanish patients. Clin Immunol Orlando Fla (2020) 219:108572. doi: 10.1016/j.clim.2020.108572
24. Nguyen A, David JK, Maden SK, Wood MA, Weeder BR, Nellore A, et al. Human leukocyte antigen susceptibility map for severe acute respiratory syndrome coronavirus 2. J Virol (2020) 94(13):e00510–20. doi: 10.1128/JVI.00510-20
25. Schetelig J, Heidenreich F, Baldauf H, Trost S, Falk B, Hoßbach C, et al. Individual HLA-a, -b, -c, and -DRB1 genotypes are no major factors which determine COVID-19 severity. Front Immunol (2021) 12:698193. doi: 10.3389/fimmu.2021.698193
26. Fischer JC, Schmidt AG, Bölke E, Uhrberg M, Keitel V, Feldt T, et al. Association of HLA genotypes, AB0 blood type and chemokine receptor 5 mutant CD195 with the clinical course of COVID-19. Eur J Med Res (2021) 26(1):107. doi: 10.1186/s40001-021-00560-4
27. Stephens H a. F, Klaythong R, Sirikong M, Vaughn DW, Green S, Kalayanarooj S, et al. HLA-a and -b allele associations with secondary dengue virus infections correlate with disease severity and the infecting viral serotype in ethnic thais. Tissue Antigens (2002) 60(4):309–18. doi: 10.1034/j.1399-0039.2002.600405.x
28. MacDonald KS, Fowke KR, Kimani J, Dunand VA, Nagelkerke NJ, Ball TB, et al. Influence of HLA supertypes on susceptibility and resistance to human immunodeficiency virus type 1 infection. J Infect Dis (2000) 181(5):1581–9. doi: 10.1086/315472
29. Pairo-Castineira E, Clohisey S, Klaric L, Bretherick AD, Rawlik K, Pasko D, et al. Genetic mechanisms of critical illness in COVID-19. Nature (2021) 591(7848):92–8. doi: 10.1038/s41586-020-03065-y
30. Vietzen H, Zoufaly A, Traugott M, Aberle J, Aberle SW, Puchhammer-Stöckl E. Deletion of the NKG2C receptor encoding KLRC2 gene and HLA-e variants are risk factors for severe COVID-19. Genet Med (2021) 23(5):963–7. doi: 10.1038/s41436-020-01077-7
31. Naslavsky MS, Scliar MO, Yamamoto GL, Wang JYT, Zverinova S, Karp T, et al. Whole-genome sequencing of 1,171 elderly admixed individuals from são paulo, Brazil. Nat Commun (2022) 13(1):1004. doi: 10.1038/s41467-022-28648-3
32. Kiyotani K, Mai TH, Nakamura Y. Comparison of exome-based HLA class I genotyping tools: identification of platform-specific genotyping errors. J Hum Genet (2017) 62(3):397–405. doi: 10.1038/jhg.2016.141
33. Martin M, Patterson M, Garg S, Fischer SO, Pisanti N, Klau GW, et al. WhatsHap: fast and accurate read-based phasing. bioRxiv (2016), 085050. doi: 10.1101/085050
34. Delaneau O, Zagury JF, Robinson MR, Marchini JL, Dermitzakis ET. Accurate, scalable and integrative haplotype estimation. Nat Commun (2019) 10(1):5436. doi: 10.1038/s41467-019-13225-y
35. Robinson J, Barker DJ, Georgiou X, Cooper MA, Flicek P, Marsh SGE. IPD-IMGT/HLA database. Nucleic Acids Res (2020) 48(D1):D948–55. doi: 10.1093/nar/gkq998
36. Wang VG, Kim H, Chuang JH. Whole-exome sequencing capture kit biases yield false negative mutation calls in TCGA cohorts. PloS One (2018) 13(10):e0204912. doi: 10.1371/journal.pone.0204912
37. Zheng X, Shen J, Cox C, Wakefield JC, Ehm MG, Nelson MR, et al. HIBAG–HLA genotype imputation with attribute bagging. Pharmacogenomics J (2014) 14(2):192–200. doi: 10.1038/tpj.2013.18
38. Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, et al. The structure of haplotype blocks in the human genome. Science. (2002) 296(5576):2225–9. doi: 10.1126/science.1069424
39. Hartmann-Boyce J, Rees K, Perring JC, Kerneis SA, Morris EM, Goyder C, et al. Risks of and from SARS-CoV-2 infection and COVID-19 in people with diabetes: A systematic review of reviews. Diabetes Care (2021) 44(12):2790–811. doi: 10.2337/dc21-0930
40. Chande AT, Rishishwar L, Conley AB, Valderrama-Aguirre A, Medina-Rivas MA, Jordan IK. Ancestry effects on type 2 diabetes genetic risk inference in Hispanic/Latino populations. BMC Med Genet (2020) 21(Suppl 2):132. doi: 10.1186/s12881-020-01068-0
41. da Silva NN, Favacho VBC, Boska Gde A, Andrade Eda C, das Merces NP, de Oliveira MAF. Access of the black population to health services: integrative review. Rev Bras Enferm (2020) 73:1–9. doi: 10.1590/0034-7167-2018-0834
42. Ballester B, Milara J, Cortijo J. Mucins as a new frontier in pulmonary fibrosis. J Clin Med (2019) 8(9):1447. doi: 10.3390/jcm8091447
43. Hijikata M, Matsushita I, Tanaka G, Tsuchiya T, Ito H, Tokunaga K, et al. Molecular cloning of two novel mucin-like genes in the disease-susceptibility locus for diffuse panbronchiolitis. Hum Genet (2011) 129(2):117–28. doi: 10.1007/s00439-010-0906-4
44. Rose MC, Voynow JA. Respiratory tract mucin genes and mucin glycoproteins in health and disease. Physiol Rev (2006) 86(1):245–78. doi: 10.1152/physrev.00010.2005
45. Evans CM, Koo JS. Airway mucus: the good, the bad, the sticky. Pharmacol Ther (2009) 121(3):332–48. doi: 10.1016/j.pharmthera.2008.11.001
46. Lu W, Liu X, Wang T, Liu F, Zhu A, Lin Y, et al. Elevated MUC1 and MUC5AC mucin protein levels in airway mucus of critical ill COVID-19 patients. J Med Virol (2021) 93(2):582–4. doi: 10.1002/jmv.26406
47. Baños-Lara MDR, Piao B, Guerrero-Plata A. Differential mucin expression by respiratory syncytial virus and human metapneumovirus infection in human epithelial cells. Mediators Inflamm (2015) 2015:347292. doi: 10.1155/2015/347292
48. Bose M, Mitra B, Mukherjee P. Mucin signature as a potential tool to predict susceptibility to COVID-19. Physiol Rep (2021) 9(1):e14701. doi: 10.14814/phy2.14701
49. Smet A, Breugelmans T, Michiels J, Lamote K, Arras W, De Man JG, et al. A dynamic mucin mRNA signature associates with COVID-19 disease presentation and severity. JCI Insight (2021) 6(19):e151777. doi: 10.1172/jci.insight.151777
50. Galanter JM, Gignoux CR, Torgerson DG, Roth LA, Eng C, Oh SS, et al. Genome-wide association study and admixture mapping identify different asthma-associated loci in latinos: the genes-environments & admixture in Latino americans study. J Allergy Clin Immunol (2014) 134(2):295–305. doi: 10.1016/j.jaci.2013.08.055
51. Saçar Demirci MD, Adan A. Computational analysis of microRNA-mediated interactions in SARS-CoV-2 infection. PeerJ (2020) 8:e9369. doi: 10.7717/peerj.9369
52. Castaño-Rodriguez C, Honrubia JM, Gutiérrez-Álvarez J, DeDiego ML, Nieto-Torres JL, Jimenez-Guardeño JM, et al. Role of severe acute respiratory syndrome coronavirus viroporins e, 3a, and 8a in replication and pathogenesis. mBio (2018) 9(3):e02325–17. doi: 10.1128/mBio.02325-17
53. Demirci YM, Saçar Demirci MD. Circular RNA-MicroRNA-MRNA interaction predictions in SARS-CoV-2 infection. J Integr Bioinforma (2021) 18(1):45–50. doi: 10.1515/jib-2020-0047
54. Silvas JA, Vasquez DM, Park JG, Chiem K, Allué-Guardia A, Garcia-Vilanova A, et al. Contribution of SARS-CoV-2 accessory proteins to viral pathogenicity in K18 human ACE2 transgenic mice. J Virol (2021) 95(17):e0040221. doi: 10.1128/JVI.00402-21
55. Chen D, Zheng Q, Sun L, Ji M, Li Y, Deng H, et al. ORF3a of SARS-CoV-2 promotes lysosomal exocytosis-mediated viral egress. Dev Cell (2021) 56(23):3250–63. doi: 10.1016/j.devcel.2021.10.006
56. Miao G, Zhao H, Li Y, Ji M, Chen Y, Shi Y, et al. ORF3a of the COVID-19 virus SARS-CoV-2 blocks HOPS complex-mediated assembly of the SNARE complex required for autolysosome formation. Dev Cell (2021) 56(4):427–442.e5. doi: 10.1016/j.devcel.2020.12.010
57. Siu KL, Yuen KS, Castaño-Rodriguez C, Ye ZW, Yeung ML, Fung SY, et al. Severe acute respiratory syndrome coronavirus ORF3a protein activates the NLRP3 inflammasome by promoting TRAF3-dependent ubiquitination of ASC. FASEB J Off Publ Fed Am Soc Exp Biol (2019) 33(8):8865–77. doi: 10.1096/fj.201802418R
58. Yamamoto F, Suzuki S, Mizutani A, Shigenari A, Ito S, Kametani Y, et al. Capturing differential allele-level expression and genotypes of all classical HLA loci and haplotypes by a new capture RNA-seq method. Front Immunol (2020) 11:941. doi: 10.3389/fimmu.2020.00941
59. Johansson T, Yohannes DA, Koskela S, Partanen J, Saavalainen P. HLA RNA sequencing with unique molecular identifiers reveals high allele-specific variability in mRNA expression. Front Immunol (2021) 12:629059. doi: 10.3389/fimmu.2021.629059
60. Aguiar VRC, César J, Delaneau O, Dermitzakis ET, Meyer D. Expression estimation and eQTL mapping for HLA genes with a personalized pipeline. PloS Genet (2019) 15(4):e1008091. doi: 10.1371/journal.pgen.1008091
61. Pathak GA, Karjalainen J, Stevens C, Neale BM, Daly M, Ganna A, et al. A first update on mapping the human genetic architecture of COVID-19. Nature. (2022) 608(7921):E1–10. doi: 10.1038/s41586-022-04826-7
62. Poluektov Y, Kim A, Sadegh-Nasseri S. HLA-DO and its role in MHC class II antigen presentation. Front Immunol (2013) 4:260. doi: 10.3389/fimmu.2013.00260
63. van Ham M, van Lith M, Griekspoor A, Neefjes J. What to do with HLA-DO? Immunogenetics (2000) 51(10):765–70. doi: 10.1007/s002510000208
64. Liljedahl M, Kuwana T, Fung-Leung WP, Jackson MR, Peterson PA, Karlsson L. HLA-DO is a lysosomal resident which requires association with HLA-DM for efficient intracellular transport. EMBO J (1996) 15(18):4817–24. doi: 10.1002/j.1460-2075.1996.tb00862.x
65. Naruse TK, Kawata H, Inoko H, Isshiki G, Yamano K, Hino M, et al. The HLA-DOB gene displays limited polymorphism with only one amino acid substitution. Tissue Antigens (2002) 59(6):512–9. doi: 10.1034/j.1399-0039.2002.590608.x
66. Denzin LK, Sant’Angelo DB, Hammond C, Surman MJ, Cresswell P. Negative regulation by HLA-DO of MHC class II-restricted antigen processing. Science (1997) 278(5335):106–9. doi: 10.1126/science.278.5335.106
67. Wang H, Zhao S, Zhang X, Jia K, Deng J, Zhou C, et al. Major histocompatibility complex class II molecule in non-small cell lung cancer diagnosis, prognosis and treatment. OncoTargets Ther (2019) 12:7281–8. doi: 10.2147/OTT.S214231
68. Pu X, Hildebrandt MA, Lu C, Roth JA, Stewart DJ, Zhao Y, et al. Inflammation-related genetic variations and survival in patients with advanced non-small cell lung cancer receiving first-line chemotherapy. Clin Pharmacol Ther (2014) 96(3):360–9. doi: 10.1038/clpt.2014.89
69. He C, Hamon S, Li D, Barral-Rodriguez S, Ott J. MHC fine mapping of human type 1 diabetes using the T1DGC data. Diabetes Obes Metab (2009) 11(Suppl 1):53–9. doi: 10.1111/j.1463-1326.2008.01003.x
70. Desterke C, Turhan AG, Bennaceur-Griscelli A, Griscelli F. HLA-dependent heterogeneity and macrophage immunoproteasome activation during lung COVID-19 disease. J Transl Med (2021) 19(1):290. doi: 10.1186/s12967-021-02965-5
71. Shkurnikov M, Nersisyan S, Jankevic T, Galatenko A, Gordeev I, Vechorko V, et al. Association of HLA class I genotypes with severity of coronavirus disease-19. Front Immunol (2021) 12. doi: 10.3389/fimmu.2021.641900
72. Wang W, Zhang W, Zhang J, He J, Zhu F. Distribution of HLA allele frequencies in 82 Chinese individuals with coronavirus disease-2019 (COVID-19). HLA (2020) 96(2):194–6. doi: 10.1111/tan.13941
73. Littera R, Campagna M, Deidda S, Angioni G, Cipri S, Melis M, et al. Human leukocyte antigen complex and other immunogenetic and clinical factors influence susceptibility or protection to SARS-CoV-2 infection and severity of the disease course. Sardinian Experience. Front Immunol (2020) 11:605688. doi: 10.3389/fimmu.2020.605688
74. The Severe Covid-19 GWAS Group, Ellinghaus D., Degenhardt F., Bujanda L., Buti M., Albillos A., et al. Genomewide association study of severe covid-19 with respiratory failure. N Engl J Med (2020) 383(16):1522–34. doi: 10.1056/NEJMoa2020283
75. Novelli A, Andreani M, Biancolella M, Liberatoscioli L, Passarelli C, Colona VL, et al. HLA allele frequencies and susceptibility to COVID-19 in a group of 99 Italian patients. HLA. (2020) 96(5):610–4. doi: 10.1111/tan.14047
76. Nguyen A, Yusufali T, Hollenbach JA, Nellore A, Thompson RF. Minimal observed impact of HLA genotype on hospitalization and severity of SARS-CoV-2 infection. HLA (2022). doi: 10.1101/2021.12.22.21268062
77. Langton DJ, Bourke SC, Lie BA, Reiff G, Natu S, Darlay R, et al. The influence of HLA genotype on the severity of COVID-19 infection. HLA (2021) 98(1):14–22. doi: 10.1111/tan.14284
78. Barquera R, Collen E, Di D, Buhler S, Teixeira J, Llamas B, et al. Binding affinities of 438 HLA proteins to complete proteomes of seven pandemic viruses and distributions of strongest and weakest HLA peptide binders in populations worldwide. HLA (2020) 96(3):277–98. doi: 10.1111/tan.13956
79. Balz K, Chen M, Kaushik A, Cemic F, Heger V, Renz H, et al. Homologies between SARS-CoV-2 and allergen proteins may direct T cell-mediated heterologous immune responses. Res Sq (2020) 86873. rs.3.rs-. doi: 10.21203/rs.3.rs-86873/v1
80. Vianna P, Mendes MFA, Bragatte MA, Ferreira PS, Salzano FM, Bonamino MH, et al. pMHC structural comparisons as a pivotal element to detect and validate T-cell targets for vaccine development and immunotherapy-a new methodological proposal. Cells (2019) 8(12):E1488. doi: 10.3390/cells8121488
81. Lorente L, Martín MM, Franco A, Barrios Y, Cáceres JJ, Solé-Violán J, et al. [HLA genetic polymorphisms and prognosis of patients with COVID-19]. Med Intensiva (2021) 45(2):96–103. doi: 10.1016/j.medin.2020.08.004
82. Amoroso A, Magistroni P, Vespasiano F, Bella A, Bellino S, Puoti F, et al. HLA and AB0 polymorphisms may influence SARS-CoV-2 infection and COVID-19 severity. Transplantation (2021) 105(1):193–200. doi: 10.1097/TP.0000000000003507
83. Ben Shachar S, Barda N, Manor S, Israeli S, Dagan N, Carmi S, et al. MHC haplotyping of SARS-CoV-2 patients: HLA subtypes are not associated with the presence and severity of COVID-19 in the Israeli population. J Clin Immunol (2021) 41(6):1154–61. doi: 10.1007/s10875-021-01071-x
Keywords: human leukocyte antigens, immune response, major histocompatibility complex (MCH), HLA, SARS-CoV-2, MUC22, COVID-19, resistant genetic variants
Citation: Castelli EC, de Castro MV, Naslavsky MS, Scliar MO, Silva NSB, Pereira RN, Ciriaco VAO, Castro CFB, Mendes-Junior CT, Silveira EdS, de Oliveira IM, Antonio EC, Vieira GF, Meyer D, Nunes K, Matos LRB, Silva MVR, Wang JYT, Esposito J, Cória VR, Magawa JY, Santos KS, Cunha-Neto E, Kalil J, Bortolin RH, Hirata MH, Dell’Aquila LP, Razuk-Filho A, Batista-Júnior PB, Duarte-Neto AN, Dolhnikoff M, Saldiva PHN, Passos-Bueno MR and Zatz M (2022) MUC22, HLA-A, and HLA-DOB variants and COVID-19 in resilient super-agers from Brazil. Front. Immunol. 13:975918. doi: 10.3389/fimmu.2022.975918
Received: 22 June 2022; Accepted: 15 September 2022;
Published: 25 October 2022.
Edited by:
Martijn van Griensven, Maastricht University, NetherlandsReviewed by:
Pierre-Antoine Gourraud, Université de Nantes, FranceShabnam Salimi, University of Washington, United States
Copyright © 2022 Castelli, de Castro, Naslavsky, Scliar, Silva, Pereira, Ciriaco, Castro, Mendes-Junior, Silveira, de Oliveira, Antonio, Vieira, Meyer, Nunes, Matos, Silva, Wang, Esposito, Cória, Magawa, Santos, Cunha-Neto, Kalil, Bortolin, Hirata, Dell’Aquila, Razuk-Filho, Batista-Júnior, Duarte-Neto, Dolhnikoff, Saldiva, Passos-Bueno and Zatz. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mayana Zatz, bWF5YXphdHpAdXNwLmJy; Erick C. Castelli, ZXJpY2suY2FzdGVsbGlAdW5lc3AuYnI=; Mateus V. de Castro, bWF0ZXVzdmlkaWdhbEBob3RtYWlsLmNvbQ==
†These authors have contributed equally to this work