Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 22 July 2022
Sec. Applied Genetic Epidemiology
This article is part of the Research Topic Advancing the Understanding of Emergence of SARS-CoV-2 Genetic Variants and COVID-19 Vaccine Efficacy: Essential Clinical and Molecular Insights and Breakthroughs View all 39 articles

Sequence similarity of SARS-CoV-2 and humans: Implications for SARS-CoV-2 detection

Heng Li,Heng Li1,2Xiaoping HongXiaoping Hong1Liping DingLiping Ding1Shuhui MengShuhui Meng1Rui LiaoRui Liao1Zhenyou Jiang
Zhenyou Jiang3*Dongzhou Liu
Dongzhou Liu1*
  • 1Department of Rheumatology and Immunology, Shenzhen People’s Hospital, The Second Clinical Medical College of Jinan University, Shenzhen, China
  • 2Integrated Chinese and Western Medicine Postdoctoral Research Station, Jinan University, Guangzhou, China
  • 3Department of Microbiology and Immunology, School of Medicine, Jinan University, Guangzhou, China

Detecting severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) needs human samples, which inevitably contain trace human DNA and RNA. Sequence similarity may cause invalid detection results; however, there is still a lack of gene similarity analysis of SARS-CoV-2 and humans. All publicly reported complete genome assemblies in the Entrez genome database were collected for multiple sequence alignment, similarity and phylogenetic analysis. The complete genomes showed high similarity (>99.88% sequence identity). Phylogenetic analysis divided these viruses into three major clades with significant geographic group effects. Viruses from the United States showed considerable variability. Sequence similarity analysis revealed that SARS-CoV-2 has 612 similar sequences with the human genome and 100 similar sequences with the human transcriptome. The sequence characteristics and genome distribution of these similar sequences were confirmed. The sequence similarity and evolutionary mutations provide indispensable references for dynamic updates of SARS-CoV-2 detection primers and methods.

Introduction

SARS-CoV-2 was first reported in December 2019 (McDonald 2021) and has spread rapidly worldwide, bringing severe social and economic problems to many countries (Biskanaki, Rallis et al., 2020; Miller, Becker et al., 2020; Castro, Kim et al., 2021). By June 2021, the SARS-CoV-2 pandemic had swollen to more than 170 million confirmed cases with a mortality rate of 2–3.4% (Yan et al., 2020a; Szcześniak, Gładka et al., 2021). High genetic infectivity, a large percent of asymptomatic cases and variability were major drivers of the epidemic (He, Qin et al., 2020; Wei, Geng et al., 2020). The reproduction number (R0) of SARS-CoV-2 was calculated to be 5.34 times higher than in SARS-CoV (3.1/0.58), and its latency was longer (Abdelrahman, Li et al., 2020). Generally, one infected patient can cause up to 5.7 further confirmed cases (Yan et al., 2020b; Kucharski, Russell et al., 2020; Wu, Leung et al., 2020). In addition, the infection of medical staff causes enormous losses of medical resources (Rongqing, Li et al., 2020; Liu, Ouyang et al., 2021), which worsens the pandemic. As a positive-sense (+) ssRNA-enveloped virus, the genome of SARS-CoV-2 is highly variable, and many mutant strains have been reported from different countries (including N501Y, D614G) (Leung, Shum et al., 2021; Mohammad, Alshawaf et al., 2021; Odeyale et al., 2021).

Viral mutations can affect the detection of SARS-CoV-2 (Peñarrubia, Ruiz et al., 2020; Dong, Wang et al., 2021; Hasan, Hossain et al., 2021). Currently, mainstream detection methods are based on the specificity of SARS-CoV-2 sequences. However, the specificity of commonly used detection primers for SARS-CoV-2 variants of concern remains unclear. The coronavirus genome is a single-stranded, positive-sense RNA ranging from 26 to 32 kilobases. SARS-CoV-2, severe acute respiratory syndrome (SARS) coronavirus and the Middle East respiratory syndrome (MERS) coronavirus are potentially lethal to humans among various coronaviruses (Drosten, Günther et al., 2003; Rota, Oberste et al., 2003; Zumla, Hui et al., 2015). From November 2002 to July 2003, SARS-CoV-1 coronavirus was responsible for more than 8000 cumulative infections (Prompetchara, Ketloy et al., 2020) and 774 deaths (9.6%) in 37 countries (Peiris, Yuen et al., 2003). MERS coronavirus has caused 2494 infections (Jiang, Xia et al., 2020) and 858 known deaths (35%) since September 2012 (https://www.who.int/emergencies/mers-cov/en/). The prevalence and lethality of coronaviruses pose a significant threat to human beings. Novel viral mutations could cause the failure of virus detection and the invalidity of vaccines. Gene sequence alignment of SARS-CoV-2 and humans is still absent in previous studies.

Mastering viral gene signatures and trends in genetic changes are necessary and ongoing efforts to maintain the dynamic update of viral detection methods and avoid viral detection escape. Although there were 998,314 nucleotide sequences related to SARS-CoV-2 in the NCBI Virus database by 01 November 2021 (https://www.ncbi.nlm.nih.gov/sars-cov-2/), only 92 sequences are recorded in the genomic form. We report the genetic relationship of 92 available genomes of SARS-CoV-2. The distribution of mutation sites was determined by multiple sequence alignments and constructed an evolutionary tree. The similarity of SARS-CoV-2 and human genes was quantified. This study provides essential information about the evolution and detection of SARS-CoV-2 from a new perspective.

Materials and methods

Data sources

This study analyzed all the SARS-CoV-2 genome assemblies publicly available in the Entrez genome databases by 01 November 2021 (https://www.ncbi.nlm.nih.gov/genome). By searching for " Severe acute respiratory syndrome coronavirus 2″, we collected 92 complete genome sequences. The related data were downloaded from GenBank (https://www.ncbi.nlm.nih.gov/genbank/). Information on viruses, such as genome size, GC, accession, CDS, release date, GenBank FTP resources, etc., are present in Supplementary Table S1.

Genome analysis and comparison

ClustalW software (version 2.0.10) was used for sequence alignment, using the slow alignment setting. Similar sequences between SARS-CoV-2 and the human genome were searched using BLASTN, with the human genome assembly GRCh38.p13 as reference (Annotation Release 109.20200228) and MN908947.3 as a query. Word size (7), match/mismatch score (1, −1), and Gap costs (1,2) were used as parameters. The E value is 25, excluding repeated sequences.

Phylogenetic analysis

Phylogenetic analysis of the complete SARS-CoV-2 genomes was conducted using MEGA software (version 7.0.14) with 1000 bootstrap replicates, employing the Fast Minimum Evolution method.

Statistical analysis

Statistical analysis was conducted using SPSS 17.0 software (SPSS, Chicago, United States). The t-test was applied while comparing groups. The significance level was set at p < 0.05. GraphPad Prism 5 was used to generate graphics.

Results

Sources and distribution of complete SARS-CoV-2 genomes

Although many viral sequences have been reported, they are all presented as gene segments rather than genomic data. We collected 92 SARS-CoV-2 genome assemblies reported publicly in the Entrez genome database by 01 November 2021. All the genome sequences were complete, ranging from 29782 to 29903 nt in length. The related information is presented in Supplementary Table S1, including genome accession number, GC%, CDS, release date and GenBank resources.

These genome samples came from 9 countries. However, 60% were from China, 31% were from the United States, and only 9.8% were from other countries (Figure 1A). These countries are scattered around the world without apparent aggregation. Although a few countries are geographically contiguous, due to the barrier of mountains and rivers, the exchange of travelers is mainly dependent on airports (Figure 1B). The complete genome MN908947.3 from Wuhan city, first reported in March 2020, served as a reference. The sequences of these genomes are highly similar, and the sequence identity is higher than 99.88% (Figure 2A). The number of mutation sites was 0–12, with an average of 3.41.

FIGURE 1
www.frontiersin.org

FIGURE 1. The proportion (A) and geographic locations (B) of 92 full-length sequenced SARS-CoV-2 genome assemblies.

FIGURE 2
www.frontiersin.org

FIGURE 2. Sequence alignment of 92 full-length SARS-CoV-2 genomes. The first reported genome (MN908947.3) in Wuhan city was used as a reference. The bar on the right represents the total number of mutated bases for each genome. The bottom line converged all the mutation sites of 92 SARS-CoV-2 genomes.

Sequence alignment revealed the mutation frequency of SARS-CoV-2 genomes

It is worth noting that there were 299 mutation sites in these genomes in total, with an average of one mutation site per 100 nt. The largest sequence stretch without recorded mutations was ∼1000 nt. To clarify the landscape of individual genes, we performed a statistical analysis of the number and frequency of mutation sites for each gene (Figure 3). ORF1a and ORF1b contain 16 nonstructural proteins. As expected, ORF1a and ORF1b had the largest number of mutation sites as the longest sequences. However, their mutation frequency (mismatch/100 nt) was not high. By contrast, ORF10 and ORF8 had the first and second highest mutation frequencies, but the number of mutation sites was lower due to the short length. Analysis of the coding region also showed no mutations in ORF6, ORF7a and ORF7b (Figure 1B), indicating highly conserved. Secondly, structural protein membrane (M) and spike (S) also had lower mutation rates. S-protein contains receptor binding domain mediating viral invasion into host cells (Hoffmann, Kleine-Weber et al., 2020). Finally, mutations outside those genes are equally of concern, as there are unknown genes with unidentified functions, and all genes are subject to dynamic evolution. For example, ORF3d was identified and characterized by Nelson et al. as a novel overlapping gene in SARS-CoV-2 (Nelson, Ardern et al., 2020).

FIGURE 3
www.frontiersin.org

FIGURE 3. The landscape of gene mutations in the analyzed SARS-CoV-2 genomes. (A) The number of mismatched base pairs in each gene. (B) Mismatch rate in each gene.

Phylogenetic analysis revealed the evolutionary relationship of SARS-CoV-2 genomes

Based on differences in genome sequences, we performed phylogenetic analysis of the SARS-CoV-2 genomes. The results showed three genomes (MT163716.1/USA/WA3-UW1/2020, MT126808.1 BRA/SP02/2020 and MT066156.1/ITA/INMI1/2020) formed independent branches (Figure 4). The remaining 89 genomes formed three major clades. Clade 1 contained only four genomes, and each of them was from a different country. Clade 2 contained 25 genomes, all from the United States CDC-Cruise A. Clade 3 contained the largest number of genomes, with 60 genomes from five countries. This evidence reveals the relationship and genetic distances between different mutant viruses.

FIGURE 4
www.frontiersin.org

FIGURE 4. Phylogenetic analysis of full-length SARS-CoV-2 genomes.

Characterization of sequence similarity of SARS-CoV-2 and human

For SARS-CoV-2 detection, RNA needs to be reverse transcribed into DNA for sequencing, so foreign RNA/DNA could cause interference (Figure 5A). It has been reported that host (human) readings were mixed in the results of SARS-CoV-2 detection using patients’ bronchoalveolar lavage fluid samples or re-cultured viruses (Lu, Zhao et al., 2020). Since SARS-CoV-2 detection is mainly based on human samples, human genes are the primary interference source. After filtering the low complexity regions, we identified 612 similar sequences between SARS-CoV-2 and the human genome. The loci of these sequences are equally distributed in each chromosome, except for fewer on chromosomes Y, 22, 19, and 11 (Figure 6A). The similar sequences ranges in length from 33 to 212 nt, with an average sequence length of 77.9 nt and a median of 75 nt. The average sequence identity is 72.55%. The length of the consistent sequences ranges from 31 to 132 nt, with an average sequence length of 55.4 nt and a median of 53 nt (Figure 6B). The average gap rate was calculated to be 2.68%. These sequences are distributed in both the plus and minus strands at a ratio close to 1:1 (Figure 6C).

FIGURE 5
www.frontiersin.org

FIGURE 5. (A) SARS-CoV-2 sampling and processing flow. (B) Distribution of similar sequences between the SARS-CoV-2 genome and human genome/transcriptome.

FIGURE 6
www.frontiersin.org

FIGURE 6. Sequence characteristics of similar loci between the SARS-CoV-2 and human genomes. (A) Distribution of similar sequences in the human chromosome. (B) Length of similar and consensus sequences. (C) The proportion of identity and gap. (D) Distribution of similar sequences in sense and antisense strands.

Notably, there were fewer similar sites in the human transcriptome than in the human genome (100 < 612), but their characteristics were consistent. The sequence similarity was close (72 vs. 71%) and the gap ratios were 3% for both (Figure 7). This result suggested that SARS-CoV-2 shares more sequence similarity with the human genome than with the transcriptome, indicating that admixed human DNA is more likely to affect the virus detection results and that there may be less interference from reverse transcription of human RNA.

FIGURE 7
www.frontiersin.org

FIGURE 7. Sequence characteristics of similar loci between the SARS-CoV-2 genome and human transcriptome. (A) Length of similar and consensus sequences. (B) The proportion of identity and gap.

We conducted enrichment analysis for genes where the similar sites were located. GO analysis enriched three significant terms: integral component of Golgi membrane (GO:0030173), vesicle-mediated transport to the plasma membrane (GO:0098876) and ubiquitin ligase complex (GO:0000151) respectively (p < 0.01). In contrast, KEGG analysis only obtained one pathway: estrogen signaling pathway (p < 0.01) (Supplementary Table S2). The results showed that these genes are not closely related.

Discussion

Previous studies revealed that several viral mutations of SARS-CoV-2 may affect related detection and treatment strategies. Starr et al. showed that a single amino acid mutation in the receptor-binding domain (RBD) of SARS-CoV-2 entirely blocked the binding of the REGN-COV2 antibody (Starr, Greaney et al., 2021). Many mutants (E484K, N501Y and K417N) resulted in a more substantial loss of neutralizing activity of antibodies (Collier, De Marco et al., 2021; Tegally, Wilkinson et al., 2021; Wang, Schmidt et al., 2021). Specific mutants (E484K, T95I, del142-144, and D614G) were confirmed to cause vaccine breakthrough infections (Hacisuleyman, Hale et al., 2021). Multiple antibody combinations effectively protected against SARS-CoV-2 immune escape brought about by single-site mutations (Ku, Xie et al., 2021). However, not all mutations increase the risk of viral escape. For example, the D614G spike mutation increases SARS-CoV-2 susceptibility to neutralization by monoclonal antibodies and convalescent sera (Weissman, Alameh et al., 2021). The studies of the above mutants help us understand the challenges brought by virus mutation, but their objects are parts of the genome. Unlike previous studies on single amino acid or single-gene mutations, this study focuses on complete genome sequences, providing a landscape of SARS-CoV-2 genome mutations.

This study reports the alignment results and phylogenetic analysis of the existing complete SARS-CoV-2 genome sequences. Similarity analysis of SARS-CoV-2 and human whole genome/transcriptome sequences uncovered hundreds of similar sites. These results provide important information for SARS-CoV-2 detection and potential gene recombination possibilities.

Mutations are frequent in the genome of SARS-CoV-2, with an average of one mutation per 100 nt. However, base deletions are uncommon. Phylogenetic analysis indicated that except for the relatively unique genomes USA/WA3-UW1/2020, BRA/SP02/2020 and ITA/INMI1/2020 (MT163716.1, MT126808.1 and MT066156.1), the other SARS-CoV-2 genomes were divided into three clades. MT163716.1, MT126808.1 and MT066156.1 are from different countries, forming three separate branches at the base of the evolutionary tree. Clade Ⅰ includes four genomes from different countries and is at a certain evolutionary distance between them. Finally, clades Ⅱ and Ⅲ comprise most of the other genomes (including 25 and 60). From the base of the evolutionary tree (MT163716.1) to the three clades, all clades contain genomes from the United States. Based on current data, the genomes from the United States span the largest evolutionary distance. The virus similarity in each country is higher, indicating that the impact of travel restrictions is significant.

Subsequently, we analyzed similar sequences between the SARS-CoV-2 genome and the human genome/transcriptome. The analysis showed that SARS-CoV-2 has 612 similar sites to the human genome and 100 similar sites to the human transcriptome. We found that ∼70% of the similar sequences were completely identical and may influence detection primers. If the detection targets include these similar sites, the similar fragments may interfere with the Q-PCR results. The change of virus sequence may change the target site of detection, so it is necessary to carry out a genome-wide sequence comparison. Some commonly used SARS-CoV-2 detection primers are consistent with some human genes. For example, forward primer 1 ab: CCCTGTGGTTTACACTAA is consistent with the chromosome sequence 44831798 CCTGTGGGTTTACACT 44831814 of Homo sapiens isolate CHM13 chromosome 6 (sequence ID: NC_060930.1). The complementary fragments have the possibility of a mismatch in the PCR process. As viral genes mutate, the corresponding detection primers change. The detection targets need to avoid similar fragments to eliminate the interference caused by mismatches. Therefore, we believe that these sequences have the potential to interfere with viral detection and are not suitable as detection targets. Whether these similar viral sequences affect the expression or inheritance of human genes requires further investigation.

Our results suggest that the genomes of SARS-CoV-2 and humans contain many short similar sequences, with a sequence identity of ∼70%. The average length is 55.4 nt, long enough to contain detection primers. Thus, these sequences may cause interference in the process of virus detection. Although no recombination has been reported, sequence similarity provides a basis for recombination, and this inference may change if mixed sequences are identified.

SARS-CoV-2 has profoundly affected human society for several years. In turn, the rapid multiplication that comes with the pandemic accelerated its genetic variation. It is reported that SARS-CoV-2 could spread among animals, including pet cats and dogs (Chandler, Bevins et al., 2021; Dileepan, Di et al., 2021; Doerksen, Lu et al., 2021), and the wide range of hosts will increase its survivability. Gene interaction between virus and hosts is worth our vigilance, and the knowledge of sequence similarity is necessary to rule out spurious results to improve assay accuracy.

Conclusion

This work investigates the geographical distribution, mutational characteristics and phylogenetic relationship of complete SARS-CoV-2 genomes. Several hundred similar gene sequences of SARS-CoV-2 and humans with high concordance were identified. The sequence length (median 75 nt) and sequence identity (72.55%) may potentially interfere with the binding of primers and templates in virus detection. Although SARS-CoV-2 genomic integration has not been reported, the risk of recombination through endogenous transposons warrants vigilance. The interference of these similar sequences with virus detection requires excellent attention, and the interaction and influence on human genes require further investigation.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.

Author contributions

HL and DL designed the study and wrote the paper. HL and MS collected all the sequence information. HL, LD and XH analyzed the data. JZ and DL revised the manuscript. The manuscript was approved by all authors.

Funding

This work was funded by the National Natural Science Foundation of China (No. 81971464), the China National Postdoctoral Program for Innovative Talents (BX20200151), the Sanming Project of Medicine in Shenzhen (SZSM201512019), the Research and Development Projects in Key Fields of Guangdong Science and Technology Department (2019B020229001).

Acknowledgments

Thanks for the support of the Shenzhen Fund for Guangdong Provincial High-level Clinical Key Specialties (No. SZXK011).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.946359/full#supplementary-material

References

Abdelrahman, Z., Li, M., and Wang, X. (2020). Comparative review of SARS-CoV-2, SARS-CoV, MERS-CoV, and influenza A respiratory viruses. Front. Immunol. 11, 552909. doi:10.3389/fimmu.2020.552909

PubMed Abstract | CrossRef Full Text | Google Scholar

Biskanaki, F., Rallis, E., Andreou, Ε., Sfyri, Ε., Tertipi, Ν., and Kefala, V. (2020). Social-economic impact of COVID-19 pandemic on aesthetic centers in Greece. J. Cosmet. Dermatol. 19 (9), 2165–2168. doi:10.1111/jocd.13517

PubMed Abstract | CrossRef Full Text | Google Scholar

Castro, M. C., Kim, S., Barberia, L., Ribeiro, A. F., Gurzenda, S., Ribeiro, K. B., et al. (2021). Spatiotemporal pattern of COVID-19 spread in Brazil. Science 372 (6544), 821–826. doi:10.1126/science.abh1558

PubMed Abstract | CrossRef Full Text | Google Scholar

Chandler, J. C., Bevins, S. N., Ellis, J. W., Linder, T. J., Tell, R. M., Jenkins-Moore, M., et al. (2021). SARS-CoV-2 exposure in wild white-tailed deer (Odocoileus virginianus). Proc. Natl. Acad. Sci. U. S. A. 118 (47), e2114828118. doi:10.1073/pnas.2114828118

PubMed Abstract | CrossRef Full Text | Google Scholar

Collier, D. A., De Marco, A., Ferreira, I. A. T. M., Meng, B., Datir, R. P., Walls, A. C., et al. (2021). Sensitivity of SARS-CoV-2 B.1.1.7 to mRNA vaccine-elicited antibodies. Nature 593 (7857), 136–141. doi:10.1038/s41586-021-03412-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Dileepan, M., Di, D., Huang, Q., Ahmed, S., Heinrich, D., Ly, H., et al. (2021). Seroprevalence of SARS-CoV-2 (COVID-19) exposure in pet cats and dogs in Minnesota, USA. Virulence 12 (1), 1597–1609. doi:10.1080/21505594.2021.1936433

PubMed Abstract | CrossRef Full Text | Google Scholar

Doerksen, T., Lu, A., Noll, L., Almes, K., Bai, J., Upchurch, D., et al. (2021). Near-complete genome of SARS-CoV-2 delta (AY.3) variant identified in a dog in Kansas, USA. Viruses 13 (10), 2104. doi:10.3390/v13102104

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, H., Wang, S., Zhang, J., Zhang, K., Zhang, F., Wang, H., et al. (2021). Structure-based primer design minimizes the risk of PCR failure caused by SARS-CoV-2 mutations. Front. Cell Infect. Microbiol. 11, 741147. doi:10.3389/fcimb.2021.741147

PubMed Abstract | CrossRef Full Text | Google Scholar

Drosten, C., Günther, S., Preiser, W., van der Werf, S., Brodt, H. R., Becker, S., et al. (2003). Identification of a novel coronavirus in patients with severe acute respiratory syndrome. N. Engl. J. Med. 348 (20), 1967–1976. doi:10.1056/NEJMoa030747

PubMed Abstract | CrossRef Full Text | Google Scholar

Hacisuleyman, E., Hale, C., Saito, Y., Blachere, N. E., Bergh, M., Conlon, E. G., et al. (2021). Vaccine breakthrough infections with SARS-CoV-2 variants. N. Engl. J. Med. 384 (23), 2212–2218. doi:10.1056/NEJMoa2105000

PubMed Abstract | CrossRef Full Text | Google Scholar

Hasan, R., Hossain, M. E., Miah, M., Hasan, M. M., Rahman, M., and Rahman, M. Z. (2021). Identification of novel mutations in the N gene of SARS-CoV-2 that adversely affect the detection of the virus by reverse transcription-quantitative PCR. Microbiol. Spectr. 9 (1), e00545–00521. doi:10.1128/spectrum.00545-21

CrossRef Full Text | Google Scholar

He, C., Qin, M., and Sun, X. (2020). Highly pathogenic coronaviruses: Thrusting vaccine development in the spotlight. Acta Pharm. Sin. B 10 (7), 1175–1191. doi:10.1016/j.apsb.2020.05.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Hoffmann, M., Kleine-Weber, H., Schroeder, S., Kruger, N., Herrler, T., Erichsen, S., et al. (2020). SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor. Cell 181 (2), 271–280. e278. doi:10.1016/j.cell.2020.02.052

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, S., Xia, S., Ying, T., and Lu, L. (2020). A novel coronavirus (2019-nCoV) causing pneumonia-associated respiratory syndrome. Cell. Mol. Immunol. 17 (5), 554. doi:10.1038/s41423-020-0372-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Ku, Z., Xie, X., Davidson, E., Ye, X., Su, H., Menachery, V. D., et al. (2021). Author Correction: Molecular determinants and mechanism for antibody cocktail preventing SARS-CoV-2 escape. Nat. Commun. 12 (1), 4177. doi:10.1038/s41467-021-24440-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kucharski, A. J., Russell, T. W., Diamond, C., Liu, Y., Edmunds, J., Funk, S., et al. (2020). Early dynamics of transmission and control of COVID-19: A mathematical modelling study. Lancet. Infect. Dis. 20 (5), 553–558. doi:10.1016/S1473-3099(20)30144-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Leung, K., Shum, M. H., Leung, G. M., Lam, T. T., and Wu, J. T. (2021). Early transmissibility assessment of the N501Y mutant strains of SARS-CoV-2 in the United Kingdom, October to November 2020. Euro Surveill. 26 (1), 2002106. doi:10.2807/1560-7917.ES.2020.26.1.2002106

CrossRef Full Text | Google Scholar

Liu, J., Ouyang, L., Yang, D., Han, X., Cao, Y., Alwalid, O., et al. (2021). Epidemiological, clinical, radiological characteristics and outcomes of medical staff with COVID-19 in wuhan, China: Analysis of 101 cases. Int. J. Med. Sci. 18 (6), 1492–1501. doi:10.7150/ijms.54257

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, R., Zhao, X., Li, J., Niu, P., Yang, B., Wu, H., et al. (2020). Genomic characterisation and epidemiology of 2019 novel coronavirus: Implications for virus origins and receptor binding. Lancet 395 (10224), 565–574. doi:10.1016/S0140-6736(20)30251-8

PubMed Abstract | CrossRef Full Text | Google Scholar

McDonald, L. T. (2021). Healing after COVID-19: Are survivors at risk for pulmonary fibrosis? Am. J. Physiol. Lung Cell. Mol. Physiol. 320 (2), L257–l265. doi:10.1152/ajplung.00238.2020

PubMed Abstract | CrossRef Full Text | Google Scholar

Miller, I. F., Becker, A. D., Grenfell, B. T., and Metcalf, C. J. E. (2020). Disease and healthcare burden of COVID-19 in the United States. Nat. Med. 26 (8), 1212–1217. doi:10.1038/s41591-020-0952-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Mohammad, A., Alshawaf, E., Marafie, S. K., Abu-Farha, M., Abubaker, J., and Al-Mulla, F. (2021). Higher binding affinity of furin for SARS-CoV-2 spike (S) protein D614G mutant could be associated with higher SARS-CoV-2 infectivity. Int. J. Infect. Dis. 103, 611–616. doi:10.1016/j.ijid.2020.10.033

PubMed Abstract | CrossRef Full Text | Google Scholar

Nelson, C. W., Ardern, Z., Goldberg, T. L., Meng, C., Kuo, C. H., Ludwig, C., et al. (2020). Dynamically evolving novel overlapping gene as a factor in the SARS-CoV-2 pandemic. Elife 9, e59633. doi:10.7554/eLife.59633

PubMed Abstract | CrossRef Full Text | Google Scholar

Odeyale, R., Tulp, O., Einstein, G., and Chance, C. (2021). Will the emergence of the more highly infective mutant strains of SARS-cov-2 impose a greater strain on health care professionals in 2021? FASEB J. 35 (Suppl 1). doi:10.1096/fasebj.2021.35.S1.03289

CrossRef Full Text | Google Scholar

Peiris, J. S., Yuen, K. Y., Osterhaus, A. D., and Stohr, K. (2003). The severe acute respiratory syndrome. N. Engl. J. Med. 349 (25), 2431–2441. doi:10.1056/NEJMra032498

PubMed Abstract | CrossRef Full Text | Google Scholar

Peñarrubia, L., Ruiz, M., Porco, R., Rao, S. N., Juanola-Falgarona, M., Manissero, D., et al. (2020). Multiple assays in a real-time RT-PCR SARS-CoV-2 panel can mitigate the risk of loss of sensitivity by new genomic variants during the COVID-19 outbreak. Int. J. Infect. Dis. 97, 225–229. doi:10.1016/j.ijid.2020.06.027

PubMed Abstract | CrossRef Full Text | Google Scholar

Prompetchara, E., Ketloy, C., and Palaga, T. (2020). Immune responses in COVID-19 and potential vaccines: Lessons learned from SARS and MERS epidemic. Asian pac. J. Allergy Immunol. 38 (1), 1–9. doi:10.12932/AP-200220-0772

PubMed Abstract | CrossRef Full Text | Google Scholar

Rongqing, Z., Li, M., Song, H., Chen, J., Ren, W., Feng, Y., et al. (2020). Early detection of severe acute respiratory syndrome coronavirus 2 antibodies as a serologic marker of infection in patients with coronavirus disease 2019. Clin. Infect. Dis. 71 (16), 2066–2072. doi:10.1093/cid/ciaa523

PubMed Abstract | CrossRef Full Text | Google Scholar

Rota, P. A., Oberste, M. S., Monroe, S. S., Nix, W. A., Campagnoli, R., Icenogle, J. P., et al. (2003). Characterization of a novel coronavirus associated with severe acute respiratory syndrome. Science 300 (5624), 1394–1399. doi:10.1126/science.1085952

PubMed Abstract | CrossRef Full Text | Google Scholar

Starr, T. N., Greaney, A. J., Addetia, A., Hannon, W. W., Choudhary, M. C., Dingens, A. S., et al. (2021). Prospective mapping of viral mutations that escape antibodies used to treat COVID-19. Science 371 (6531), 850–854. doi:10.1126/science.abf9302

PubMed Abstract | CrossRef Full Text | Google Scholar

Szcześniak, D., Gładka, A., Misiak, B., Cyran, A., and Rymaszewska, J. (2021). The SARS-CoV-2 and mental health: From biological mechanisms to social consequences. Prog. Neuropsychopharmacol. Biol. Psychiatry 104, 110046. doi:10.1016/j.pnpbp.2020.110046

PubMed Abstract | CrossRef Full Text | Google Scholar

Tegally, H., Wilkinson, E., Giovanetti, M., Iranzadeh, A., Fonseca, V., Giandhari, J., et al. (2021). Detection of a SARS-CoV-2 variant of concern in South Africa. Nature 592 (7854), 438–443. doi:10.1038/s41586-021-03402-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Z., Schmidt, F., Weisblum, Y., Muecksch, F., Barnes, C. O., Finkin, S., et al. (2021). mRNA vaccine-elicited antibodies to SARS-CoV-2 and circulating variants. Nature 592 (7855), 616–622. doi:10.1038/s41586-021-03324-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Wei, Z.-Y., Geng, Y.-J., Huang, J., and Qian, H. Y. (20202019). Pathogenesis and management of myocardial injury in coronavirus disease. Eur. J. Heart Fail 22 (11), 1994–2006. doi:10.1002/ejhf.1967

PubMed Abstract | CrossRef Full Text | Google Scholar

Weissman, D., Alameh, M. G., de Silva, T., Collini, P., Hornsby, H., Brown, R., et al. (2021). D614G spike mutation increases SARS CoV-2 susceptibility to neutralization. Cell Host Microbe 29 (1), 23–31.e4. e24. doi:10.1016/j.chom.2020.11.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, J. T., Leung, K., and Leung, G. M. (2020). Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in wuhan, China: A modelling study. Lancet 395 (10225), 689–697. doi:10.1016/S0140-6736(20)30260-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, Y., Shin, W. I., Pang, Y. X., Meng, Y., Lai, J., You, C., et al. (2020a). The first 75 Days of novel coronavirus (SARS-CoV-2) outbreak: Recent advances, prevention, and treatment. Int. J. Environ. Res. Public Health 17 (7), E2323. doi:10.3390/ijerph17072323

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, Y., Shin, W. I., Pang, Y. X., Meng, Y., Lai, J., You, C., et al. (2020b). The first 75 Days of novel coronavirus (SARS-CoV-2) outbreak: Recent advances, prevention, and treatment. Int. J. Environ. Res. Public Health 17 (7), 2323. doi:10.3390/ijerph17072323

PubMed Abstract | CrossRef Full Text | Google Scholar

Zumla, A., Hui, D. S., and Perlman, S. (2015). Middle East respiratory syndrome. Lancet 386 (9997), 995–1007. doi:10.1016/S0140-6736(15)60454-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: coronavirus, SARS-CoV-2 detection, mutation, COVID-19, coronavirus-COVID-19

Citation: Li H, Hong X, Ding L, Meng S, Liao R, Jiang Z and Liu D (2022) Sequence similarity of SARS-CoV-2 and humans: Implications for SARS-CoV-2 detection. Front. Genet. 13:946359. doi: 10.3389/fgene.2022.946359

Received: 17 May 2022; Accepted: 06 July 2022;
Published: 22 July 2022.

Edited by:

Madhusudhanan Narasimhan, University of Texas Southwestern Medical Center, United States

Reviewed by:

Ioannis Trougakos, National and Kapodistrian University of Athens, Greece
Conrad Fischer, Barry University, United States

Copyright © 2022 Li, Hong, Ding, Meng, Liao, Jiang and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhenyou Jiang, dGp6aHlAam51LmVkdS5jbg==; Dongzhou Liu, bGl1X2R6MjAwMUBzaW5hLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.