
94% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
BRIEF RESEARCH REPORT article
Front. Immunol. , 17 February 2025
Sec. T Cell Biology
Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1488851
This article is part of the Research Topic New Avenues for the Development of Advanced Immunotherapies: Capitalizing on Studies of the B and T Cell Receptor Repertoire View all 8 articles
We describe the establishment and current content of the ImmuneCODE™ database, which includes hundreds of millions of T-cell Receptor (TCR) sequences from over 1,400 subjects exposed to or infected with the SARS-CoV-2 virus, as well as over 160,000 high-confidence SARS-CoV-2-associated TCRs. This database is made freely available, and the data contained in it can be used to assist with global efforts to understand the immune response to the SARS-CoV-2 virus and develop new interventions.
The emergence of SARS-CoV-2 in December of 2019 and the ensuing pandemic declared by the WHO at the end of January 2020 created an urgent need to understand the disease and its causative agent. Initial studies showed a strong T-cell based adaptive immune response to the virus (1–3), which is unsurprising given the ability of CD8+ and CD4+ T cells to recognize viral antigens presented by MHC class I and class II molecules. Antigen recognition by T cells leads to killing of infected cells, generation of T-cell memory against viral antigens, and the development of anti-viral antibodies (4). More detailed characterizations of the cellular immune response could aid the development of new interventions, soe applied our previously described Adaptive immunosequencing assay (5–7) and MIRA™ tool (8, 9) to deepen the understanding of the adaptive immune response to SARS-CoV-2 infection in support of COVID-19 research.
To generate these data, we partnered with Microsoft, Illumina, Labcorp/Covance, and health organizations across the world to generate the ImmuneCODE database described herein. These data are being made freely available to the scientific community so that any researcher, public health official or organization can utilize the data to accelerate ongoing global efforts to develop better diagnostics, vaccines, and therapeutics, as well as to answer important questions about the virus.
The database consists of two distinct but related datasets. (A) The immunosequencing dataset includes 1,486 deeply-sampled TCRβ repertoires from subjects who at the time of sampling either had been exposed to, were actively suffering from, or had recovered from COVID-19. These data originate from two sources (Table 1): ImmuneRACE (Immune Response Action to COVID-19 Events), a prospective study enrolling participants across the U.S. to decode how immune systems detect and respond to the virus, which includes self-reported demographic and clinical data; and more than a thousand de-identified geographically and ethnically diverse patient blood samples collected by institutions around the world. (B) The MIRA dataset maps TCRs to the SARS-Cov-2 virus epitopes they bind, and includes data obtained from exposed subjects and naïve controls. In total, the MIRA dataset includes more than 160,000 high-confidence SARS-CoV-2-associated TCRs. The data include varying degrees of demographic and clinical information (as allowed by each institution and corresponding IRB).
Table 1. List of available samples per dataset, including number of samples, institution and description of sample type and source.
By associating T-cell signatures with disease and outcomes, the ImmuneCODE database may inform our understanding of the immune response to the virus and help researchers around the world accelerate their work in basic and applied immunology, thus contributing to the development of new therapeutic and preventive measures.
The ImmuneRACE study is a prospective, single group, multi-cohort, exploratory study of unselected eligible participants exposed to, infected with, or recovering from COVID-19 (NCT04494893). Participants, aged 18 to 89 years and residing in 24 different geographic areas across the United States, were consented and enrolled via a virtual study design. Cohorting was based on participant-reported clinical history following the completion of both a screening survey and study questionnaire.
Cohort 1 included participants exposed within 2 weeks of study entry to someone with a confirmed COVID-19 diagnosis, either based on positive PCR testing or clinician diagnosis. Cohort 2 participants included those clinically diagnosed by a physician or with positive laboratory confirmation of active SARS-CoV-2 infection via PCR testing; no participants were shared between Cohort 1 and Cohort 2. Cohort 3 included participants previously diagnosed with COVID-19 disease who have been deemed recovered based on two consecutive negative nasopharyngeal or oropharyngeal (NP/OP) PCR tests, clearance by a healthcare professional, or the resolution of symptoms related to their initial COVID-19 diagnosis. The ImmuneRACE study was approved by Western Institutional Review Board (WIRB reference number 1–1281891-1, Protocol ADAP-006). All participants were consented for sample collection and metadata use via electronic informed consent processes.
Both whole blood and serum and a nasopharyngeal or oropharyngeal swab were collected from participants by trained mobile phlebotomists. Blood samples were shipped frozen or at room temperature to Adaptive Biotechnologies for processing, including, but not limited to, DNA extraction, and TCRβ analysis via the Adaptive immunosequencing assay (Adaptive Biotechnologies, Seattle, WA) from DNA extracted from blood samples. No batch effects were observed between samples shipped frozen vs. at room temperature. NP/OP swabs and serum were sent to Covance/Labcorp for further testing. An electronic questionnaire was administered to collect information pertaining to the participant’s medical history, symptoms, and diagnostic tests performed for COVID-19 disease. Participants had the option to undergo additional blood draws and questionnaires over 2 months.
Whole blood samples were collected in K2EDTA tubes based on each institution’s protocol and supervised by their respective Institutional Review Board. Samples were stored at the institution and sent to Adaptive as frozen whole blood, isolated PBMC or DNA extracted from either sample type for TCRβ analysis via the Adaptive immunosequencing assay (see Table 1). Samples provided by the NIAID were collected under approval by Comitato Etico Provinciale (protocol NP-4000), by Comitato Etico, Ospedale San Gerardo Monza (protocol COVID-STORM) and by Comitato Etico Pavia Fondazione IRCCS Policlinico San Matteo, Pavia (protocol 20200037677). Whole blood samples from DLS (Discovery Life Sciences, Huntsville, AL) were collected under Protocol DLS13 for collection of remnant clinical samples. From Bloodworks Northwest (Seattle, WA), volunteer donors recovered from COVID-19 were consented and collected under the Bloodworks Research Donor Collection Protocol BT001. Samples were processed for PBMC and donor data reported by the Biological Products division of Bloodworks NW under standard operating procedures.
A subset of the samples were processed for both T-cell receptor variable beta chain repertoire sequencing and MIRA, and another subset was processed only by one of these approaches. For each subject included in the dataset, subject_id (alternately coded as Subject) can be used to determine which assays were used to process which samples.
Immunosequencing of the CDR3 regions of human TCRβ chains was performed using the Adaptive assay as previously described (5–7). In brief, as much as 18 μg of extracted genomic DNA was amplified in a bias-controlled multiplex PCR with primers targeting all known TCRβ V and J genes on either side of the CDR3 region, followed by high-throughput sequencing. Sequences were collapsed and filtered in order to identify and quantitate the absolute abundance of each unique TCRβ CDR3 region, with additional bias correction performed computationally based on inline synthetic control molecules. In order to quantify the proportion of T cells out of total nucleated cells input for sequencing, or T-cell fraction, a panel of reference genes present in all nucleated cells was amplified simultaneously.
To identify antigen-specific TCRs, T cells derived post-expansion from either of the above input cell types were used for MIRA. Antigen-specific TCRs were identified as previously described (8, 9). Briefly, T cells were incubated overnight with MIRA peptide pools, and the antigen-specific subset was identified by CD137 upregulation. Following addition of peptides, cells were incubated at 37°C for ~18 hours. At the end of the incubation, replicate wells of cells were harvested from the culture and pooled and then stained with antibodies for analysis and sorting by flow cytometry. Cells were then washed and suspended in PBS containing FBS (2%), 1mM EDTA and 4,6-diamidino-2-phenylindole (DAPI) for exclusion of non-viable cells. Cells were acquired and sorted using a FACS Aria (BD Biosciences) instrument. Sorted antigen-specific (CD3+CD8+CD137+) T cells were pelleted and lysed in RLT Plus buffer for nucleic acid isolation; a small number of experiments sorted on CD4+ rather than CD8+ since the MIRA peptides in those experiments were selected for presentation by MHC class II molecules. Analysis of flow cytometry data files was performed using FlowJo (Ashland, OR).
RNA was isolated using AllPrep DNA/RNA mini and/or micro kits, according to manufacturer’s instructions (Qiagen). RNA was reverse transcribed to cDNA using Vilo kits (Life Technologies). TCRβ amplification, sequencing and clonotype determination were performed as described in the ‘T-cell receptor variable beta chain sequencing’ section above.
T-cell populations were exposed to pooled peptides or transgenes in a combinatoric format, similar to the approach described in (9). According to the MIRA panel design, each antigen is strategically placed in a subset of K unique pools while being omitted from the remaining pools (total pools = N). This design allows for antigens to be placed into a unique combination of N choose K “addresses,” where each address is a unique set of K pools; this allows for parallel testing of more antigens as the number of replicate pools (N) increases. In order to estimate an empirical false discovery rate and gauge assay quality, we left >40% of the unique addresses empty to assess the rate at which clones are spuriously sorted and detected in K pools with no query antigen present (hereafter referred to as invalid TCR associations).
T cells were aliquoted into 11 pools, and activated T cells were sorted using T-cell markers after overnight stimulation, as described previously (9). These putative antigen responding cells were set aside to characterize the T-cell clonotypes present in each sorted pool using the Adaptive immunosequencing assay. After immunosequencing, we examined the behavior of T-cell clonotypes by tracking the read counts of each unique TCRβ sequence across each sorted pool. True antigen-specific clones should be specifically enriched in a unique occupancy pattern that corresponds to the presence of one of the query antigens in K pools. We have reported on methods to map antigens to TCR clonotypes previously (9); we also developed a non-parametric Bayesian model to compute the posterior probability that a given clonotype is statistically associated with a certain antigen in a given experiment. This model uses the available read counts of TCRs to estimate a mean-variance relationship within a given experiment and as well as the probability that a clone will have zero read counts due to incomplete sampling of low frequency clones. This model takes the observed read counts of a clonotype across all N pools and estimates the posterior probability of a clone responding to all possible N choose K addresses and an additional hypothesis that a clone is activated in all pools (truly activated, but not specific to any of our query antigens). To define high-confidence antigen-clone associations, we identified TCR clonotypes assigned to a query antigen from this model with a posterior probability >= 0.9.
The ImmuneCODE database includes both TCRβ repertoires and MIRA data, and is being shared through the immuneACCESS® data portal at https://clients.adaptivebiotech.com/pub/covid-2020 (DOI 10.21417/ADPT2020COVID). Subjects described in this article can be identified by selecting samples with the “ImmunoCODERelease” tag value “002”.
The immueRACE study aimed to enroll 1,000 subjects who have been exposed to, are currently infected with, or have recovered from COVID-19. The current release of the database includes T-cell repertoire data from the first 160 participants in the study (including multiple samples from some subjects). This release also includes T-cell repertoire data from 1,326 subjects from global collaborators (Table 1).
These data were generated from participant samples using the Adaptive immunosequencing assay (5–7). They include a list of unique TCRβ rearrangements found in each analyzed sample, a count for each rearrangement, TCR sequence features, and sample-level metadata. These samples were not experimentally enriched for SARS-CoV-2-specific T cells, so the majority of TCRβ rearrangements are not involved in the adaptive immune response to this virus. The data can be exported using dedicated links on the immuneACCESS data portal shown above; see Supplementary Material for details.
The sequences in each TCR repertoire are annotated with a large set of features that may be useful across different research contexts; Supplementary Table 1 describes these annotations. In addition, Supplementary Table 2 provides sample-level metadata, which varies by source and participant but usually includes de-identified subject IDs, COVID-19 status, age in years, and sex. Supplementary Figure 1 summarizes key demographic features of this table (age, sex, and race) by showing their distributions.
Antigen-associated TCRs were identified using the “Multiplex Identification of Antigen-Specific T-Cell Receptors Assay” (MIRA; 8, 9). MIRA is a high-throughput multiplex assay, enabling the identification of TCRs that bind to large numbers of query antigens (hundreds to thousands at a time and in parallel) by combining immune assays with T-cell receptor sequencing. We use cell sorting based on the upregulation of activation markers to separate a population of antigen-reactive T cells. This positive population is sequenced via the Adaptive TCRβ immunosequencing assay, and clonotypes from activated T cells are identified by enrichment in the positive population compared to a sample of unenriched or unsorted T cells.
With the goal of identifying SARS-CoV-2-specific TCRs, we interrogated T-cell repertoires from both healthy donors and COVID-19 patients. Input cell types varied and included PBMCs from healthy donors or COVID-19 patients, as well as naïve T cells from healthy donors. To maximize TCR yield per experiment, we expanded T cells from both types of input cells for 8-13 days, yielding > 1 billion T cells per donor. When starting with PBMCs from either healthy donors or COVID-19 patients, T cells were expanded polyclonally with soluble anti-CD3. When starting with naïve CD8+ T cells from healthy donors, T cells were expanded following co-culture with monocyte-derived dendritic cells loaded with a pool of all peptides derived from SARS-CoV-2. The point of these expansions is to ensure that every T cell clone is present in every pool of the MIRA experiment. While cell expansion can change the frequencies of different T cell clones relative to the original sample, this does not matter for MIRA as long as clones of interest make it into all of the pools.
We used two different MIRA approaches: peptide- or transgene-based. Both enable the identification of antigen-associated TCRs, however the transgene-based approach enables identification of TCRs that bind epitopes encoded and presented by APCs following expression upon transfection of transgenes. This approach enables us to distinguish the subset of TCRs that respond to endogenously-presented epitopes rather than those that only respond to exogenously loaded peptides; binding or activation following a multimer stain or incubation with peptides may not accurately reflect whether a T cell is specific to an endogenously presented epitope. The underlying assumption for any immunological assay involving multimers or exogenously loaded peptides is that the epitope being tested is actually a presented epitope. For well-characterized epitopes this assumption is reasonable, however when querying large numbers of novel epitopes from a novel virus (SARS-CoV-2, for example) the risk for false positives (defined as TCRs specific to a never-before tested peptide that was exogenously loaded) is higher.
In total, the MIRA dataset includes more than 160,000 high-confidence SARS-CoV-2-associated TCRs. Results from a typical experiment are shown in Figure 1, where each column is a different antigen and the y-axis shows the number of unique TCRs assigned to that antigen. The rightmost 40% of the figure represents MIRA addresses (defined subsets of pools) to which no particular antigen was assigned; these provide a negative control, and can be contrasted with the clear enrichment signal present in several assigned addresses. These TCR-antigen associations are based on statistical modeling of MIRA data; the TCR-antigen pairings have not been verified by separate functional assays, although we expect that most of them are true binders. These MIRA callsets are made available as a set of downloadable files that can be accessed through the immuneACCESS data portal; see Supplementary Material for details.
Figure 1. Results from a typical MIRA experiment. Hundreds of “addresses” (subsets of pools) are queried in parallel, and each point on the x-axis corresponds to one address. The height of each bar shows the number of TCR sequences confidently assigned to that address by our statistical model. The addresses on the left side of the figure correspond to SARS-CoV-2 epitopes, while the addresses on the right side were not assigned to any particular antigen.
The dataset includes experiments from four MIRA panels. Two of these panels, named “minigene_Set1” and “minigene_Set2”, targeted large protein sequences intended to narrow down which parts of the genome generally elicit an immune response. The other two panels, named “C19_cI” and “C19_cII”, targeted individual peptides or small groups of peptides presented by class I and class II HLA alleles, respectively. Most of the MIRA data included in this release corresponds to the C19_cI panel.
Supplementary Tables 3 through 10 describe the MIRA data included in the database, as follows: Supplementary Table 3 (subject-metadata.csv) includes available metadata for each sample from subjects included in the MIRA experiments (including the four minigene and peptide panels described above). HLA types are provided when available. Missing values are generally represented with “N/A”, except for HLA types, where missing data is represented as an empty string. Note that the metadata contained in this file relates to the MIRA results, and is distinct from the TCRβ repertoire metadata referenced in Supplementary Tables 1 and 2. Supplementary Table 4 (orfs.csv) includes the genomic location of the MIRA targets as per GenBank (10). Supplementary Table 5 (minigene-hits.csv) contains counts of the number of unique TCRs that bound to targets within the “minigene_Set1” and “minigene_Set2” MIRA panels, while Supplementary Table 6 (minigene- detail.csv) describes the identity of the TCRs bound per target for both minigene MIRA panels. Similarly, Supplementary Tables 7 and 8 (peptide-hits-ci.csv and peptide-hits-cii.csv) contain counts of the number of unique TCRs that bound to targets within the “C19_cI” and “C19_cII” peptide MIRA panels, while Supplementary Tables 9 and 10 (peptide-detail-ci.csv and peptide-detail-cii.csv) describe the identity of the TCRs bound per target in these panels. TCRs in Supplementary Tables 7 and 9 (“ci/CI”) are from CD8+ T cells, whereas TCRs in Supplementary Tables 8 and 10 (“cii/cII”) are from CD4+ T cells.
To assist in the understanding of the adaptive immune response to SARS-CoV-2, we generated the ImmuneCODE database, which includes a dataset of TCR rearrangements observed in individuals exposed to, infected with, or recovered from COVID-19, and describes the ability of a subset of these TCRs to recognize SARS-CoV-2 epitopes. These data are provided to the scientific community with the goal of contributing to research efforts to develop novel interventions to prevent and treat COVID-19 infections. Though it is not the focus of this paper, the online portal currently includes tools for interacting with the data and integrating new data, although these are not guaranteed to be as stable as the database itself over time, and we expect that most researchers will benefit from downloading the data and building it into their analyses locally.
This resource was first made available to the scientific community in September 2020. Since then, it has been used by researchers around the world to advance our knowledge of SARS-CoV-2 infections and the immunological defenses against them. The ImmuneCODE data have been used to inform vaccine development (11), elucidate immunopathological signatures (12), benchmark tools for predicting TCR-peptide-MHC binding (13), and build diagnostic models of SARS-CoV-2 based on TCR repertoires (14), among many other applications. Any investigator that sequences a TCRβ repertoire can cross-reference it with our MIRA data to identify TCRs that may be responding to SARS-CoV-2 antigens (though a sequence match alone is not sufficient to prove specificity).
The database does have some limitations. All datasets currently available for download are from samples collected prior to September 2020, so they do not reflect any changes in the immune response that may have resulted from subsequent evolution of the virus. In addition, both the immune repertoires and the MIRA data are restricted to TCRβ sequences; the cognate TCRα sequences needed to construct a full T-cell receptor are not included, and a TCRβ sequence alone does not fully determine the antigen specificity of a T-cell clone. The majority of the MIRA data are from CD8+ T cells binding peptides presented by MHC class I molecules. This emphasis provides a rich understanding of which viral epitopes may be most involved in cytotoxic effector responses by CD8+ T cells, but the database is less informative about the targets of CD4+ T cells.
As SARS-CoV-2 continues to circulate and evolve, research into how it affects, and is affected by, the immune system will be ongoing. We anticipate that the ImmuneCODE database will continue to be a valuable resource for these investigations.
The original contributions presented in the study are publicly available. This data can be found here: https://clients.adaptivebiotech.com/pub/covid-2020.
The studies involving humans were approved by Samples provided by the NIAID were collected under approval by Comitato Etico Provinciale (protocol NP-4000), by Comitato Etico, Ospedale San Gerardo Monza (protocol COVID-STORM) and by Comitato Etico Pavia Fondazione IRCCS Policlinico San Matteo, Pavia (protocol 20200037677). Whole blood samples from DLS (Discovery Life Sciences, Huntsville, AL) were collected under Protocol DLS13 for collection of remnant clinical samples. From Bloodworks Northwest (Seattle, WA), volunteer donors recovered from COVID-19 were consented and collected under the Bloodworks Research Donor Collection Protocol BT001. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
SN: Conceptualization, Data curation, Software, Writing – original draft. MV: Data curation, Writing – original draft. MK: Data curation, Methodology, Writing – review & editing. JD: Data curation, Writing – review & editing. IK: Data curation, Writing – review & editing. ES: Project administration, Writing – review & editing. TC: Software, Writing – review & editing. KB: Resources, Writing – review & editing. MP: Formal analysis, Writing – review & editing. RG: Formal analysis, Writing – review & editing. TS: Supervision, Writing – review & editing. CG: Software, Writing – review & editing. SS: Investigation, Writing – review & editing. CC: Investigation, Writing – review & editing. FN: Investigation, Writing – review & editing. MM: Investigation, Writing – review & editing. OD: Investigation, Writing – review & editing. KD: Investigation, Writing – review & editing. GC-T: Investigation, Writing – review & editing. SB: Investigation, Writing – review & editing. VS: Investigation, Writing – review & editing. GM: Investigation, Writing – review & editing. JG: Investigation, Writing – review & editing. JH: Investigation, Writing – review & editing. LN: Investigation, Writing – review & editing. JM-L: Investigation, Writing – review & editing. BH: Supervision, Writing – review & editing. JC: Conceptualization, Supervision, Writing – review & editing. HR: Conceptualization, Supervision, Writing – review & editing.
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. The ISB INCOV study supported by Dept. of Health and Human Services, Office of the Assistant Secretary for Preparedness and Response, Biomedical Advanced Research and Development Authority, under Contract No. HHSO100201600031C. L. D. Notarangelo is supported by the Division of Intramural Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health. Sample collection in Brescia and Pavia was supported by Regione Lombardia, Italy. Sample collections from i+12/CNIO were supported by CRIS foundation.
The ImmuneCODE database is the result of collaboration between many individuals and organizations working together to advance global understanding of SARS-Cov2 and COVID-19. We are grateful for the support and participation of all participants. We would like to thank the COVID Clinicians who collected samples for the NIAID in Brescia, Monza and Pavia, Italy: Drs. Luisa Imberti, Eugenia Quiros-Roldan, Alessandra Sottini and Luisa Brugnoni (ASST Spedali Civili, Brescia), Andrea Biondi, Paolo Bonfanti, Laura Rachele Bettini, Mariella D’Angio’ (University of Milano Bicocca-Fondazioni MBBM Ospedale San Gerardo, Monza) and Riccardo Castagnoli, Daniela Montagna, Amelia Licari, Gian Luigi Marseglia (IRCCS Policlinico San Matteo, Pavia). We also thank Drs. Helen Su (NIAID, NIH, Bethesda), Clifton Dalgard and Andrew Snow (USUHS, Bethesda) for help with robotic extraction of DNA samples provided by NIAID. In addition, we would like to thank Caitlin Jirovsky, Matthew Bird and Rohit Nariya for operational involvement and Evan Delay, Adam Skrzekut and Dr. David Lin for oversight and management. We would also like to thank Covance/LabCorp and Illumina for their partnership in the ImmuneRACE study.
SN, MV, MK, JD, IK, ES, TC, KB, MP, RG, TS, BH, and HR have a financial interest in Adaptive Biotechnologies. CG. and JC. have a financial interest in Microsoft. JM-L is a consultant for Adaptive Biotechnologies in projects outside of COVID-19. Funding for the ISB INCOV project from BARDA was managed by Merck; Merck had no role in planning the research or writing the paper.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1488851/full#supplementary-material
1. Grifoni A, Weiskopf D, Ramirez SI, Mateus J, Dan JM, Moderbacher CR, et al. Targets of T cell responses to SARS-CoV-2 coronavirus in humans with COVID-19 disease and unexposed individuals. Cell. (2020) 181:1489–1501.e15. doi: 10.1016/j.cell.2020.05.015
2. Weiskopf D, Schmitz KS, Raadsen MP, Grifoni A, Okba NMA, Endeman H, et al. Phenotype and kinetics of SARS-CoV-2–specific T cells in COVID-19 patients with acute respiratory distress syndrome. Sci Immunol. (2020) 5. doi: 10.1126/sciimmunol.abd2071
3. Sekine T, Perez-Potti A, Rivera-Ballesteros O, Strålin K, Gorin JB, Olsson A, et al. Robust T cell immunity in convalescent individuals with asymptomatic or mild COVID-19. Cell. (2020) 183:158–168.e14. doi: 10.1016/j.cell.2020.08.017
4. Azkur AK, Akdis M, Azkur D, Sokolowska M, Van De Veen W, Brüggen M, et al. Immune response to SARS-CoV-2 and mechanisms of immunopathological changes in COVID-19. Allergy. (2020) 75:1564–81. doi: 10.1111/all.14364
5. Robins HS, Campregher PV, Srivastava SK, Wacher A, Turtle CJ, Kahsai O, et al. Comprehensive assessment of T-cell receptor β-chain diversity in αβ T cells. Blood. (2009) 114:4099–107. doi: 10.1182/blood-2009-04-217604
6. Carlson CS, Emerson RO, Sherwood AM, Desmarais C, Chung MW, Parsons JM, et al. Using synthetic templates to design an unbiased multiplex PCR assay. Nat Commun. (2013) 4:2680. doi: 10.1038/ncomms3680
7. Robins H, Desmarais C, Matthis J, Livingston R, Andriesen J, Reijonen H, et al. Ultra-sensitive detection of rare T cell clones. J Immunol Methods. (2012) 375:14–9. doi: 10.1016/j.jim.2011.09.001
8. Klinger M, Kong K, Moorhead M, Weng L, Zheng J, Faham M. Combining next-generation sequencing and immune assays: a novel method for identification of antigen-specific T cells. PloS One. (2013) 8:e74231. doi: 10.1371/journal.pone.0074231
9. Klinger M, Pepin F, Wilkins J, Asbury T, Wittkop T, Zheng J, et al. Multiplex identification of antigen-specific t cell receptors using a combination of immune assays and immune receptor sequencing. PloS One. (2015) 10:e0141561. doi: 10.1371/journal.pone.0141561
10. Sayers EW, Cavanaugh M, Clark K, Ostell J, Pruitt KD, Karsch-Mizrachi I. Genbank. Nucleic Acids Res. (2019) 47:D94–9. doi: 10.1093/nar/gky989
11. Kleanthous H, Silverman JM, Makar KW, Yoon I-K, Jackson N, Vaughn DW. Scientific rationale for developing potent RBD-based vaccines targeting COVID-19. NPJ Vaccines. (2021) 6:128. doi: 10.1038/s41541-021-00393-6
12. Sacco K, Castagnoli R, Vakkilainen S, Liu C, Delmonte OM, Oguz C, et al. Immunopathological signatures in multisystem inflammatory syndrome in children and pediatric COVID-19. Nat Med. (2022) 28:1050–62. doi: 10.1038/s41591-022-01724-3
13. Gao Y, Gao Y, Fan Y, Zhu C, Wei Z, Zhou C, et al. Pan-Peptide Meta Learning for T-cell receptor–antigen binding recognition. Nat Mach Intelligence. (2023) 5:236–49. doi: 10.1038/s42256-023-00619-3
Keywords: SARS-CoV-2, COVID-19, T cell, TCR repertoire, immune response, cellular immunity
Citation: Nolan S, Vignali M, Klinger M, Dines JN, Kaplan IM, Svejnoha E, Craft T, Boland K, Pesesky MW, Gittelman RM, Snyder TM, Gooley CJ, Semprini S, Cerchione C, Nicolini F, Mazza M, Delmonte OM, Dobbs K, Carreño-Tarragona G, Barrio S, Sambri V, Martinelli G, Goldman JD, Heath JR, Notarangelo LD, Martinez-Lopez J, Howie B, Carlson JM and Robins HS (2025) A large-scale database of T-cell receptor beta sequences and binding associations from natural and synthetic exposure to SARS-CoV-2. Front. Immunol. 16:1488851. doi: 10.3389/fimmu.2025.1488851
Received: 30 August 2024; Accepted: 23 January 2025;
Published: 17 February 2025.
Edited by:
Pieter Meysman, University of Antwerp, BelgiumReviewed by:
Kilian Schober, University Hospital Erlangen, GermanyCopyright © 2025 Nolan, Vignali, Klinger, Dines, Kaplan, Svejnoha, Craft, Boland, Pesesky, Gittelman, Snyder, Gooley, Semprini, Cerchione, Nicolini, Mazza, Delmonte, Dobbs, Carreño-Tarragona, Barrio, Sambri, Martinelli, Goldman, Heath, Notarangelo, Martinez-Lopez, Howie, Carlson and Robins. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Harlan S. Robins, aHJvYmluc0BhZGFwdGl2ZWJpb3RlY2guY29t
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.