Skip to main content

ORIGINAL RESEARCH article

Front. Public Health, 14 December 2023
Sec. Infectious Diseases: Epidemiology and Prevention
This article is part of the Research Topic SARS-CoV-2: Virology, Epidemiology, Diagnosis, Pathogenesis, and Control View all 24 articles

The increase in SARS-CoV-2 lineages during 2020–2022 in a state in the Brazilian Northeast is associated with a number of cases

Moises Thiago de Souza Freitas,
Moises Thiago de Souza Freitas1,2*Ludmila Oliveira Carvalho SenaLudmila Oliveira Carvalho Sena3Kiyoshi Ferreira FukutaniKiyoshi Ferreira Fukutani4Cliomar Alves dos SantosCliomar Alves dos Santos3Francisco das Chagas Barros NetoFrancisco das Chagas Barros Neto5Julienne Sousa RibeiroJulienne Sousa Ribeiro5Erica Santos dos ReisErica Santos dos Reis1Valdir de Queiroz BalbinoValdir de Queiroz Balbino6Srgio de S Paiva LeitoSérgio de Sá Paiva Leitão7Marcus Vinicius de Arago BatistaMarcus Vinicius de Aragão Batista2Michael Wheeler LipscombMichael Wheeler Lipscomb8Tatiana Rodrigues de MouraTatiana Rodrigues de Moura1
  • 1Health Sciences Graduate Program, Federal University of Sergipe, Aracaju, Brazil
  • 2Parasitic Biology Graduate Program, Federal University of Sergipe, São Cristóvão, Brazil
  • 3Health Foundation Parreiras Horta, Central Laboratory of Public Health (LACEN/SE), Sergipe State Health Secretariat, Aracaju, Brazil
  • 4Department of Microbiology and Immunology, Geisel School of Medicine at Dartmouth, Hanover, NH, United States
  • 5Center for Biological and Health Sciences, Federal University of Sergipe, São Cristóvão, Brazil
  • 6Department of Genetics, Federal University of Pernambuco, Recife, Brazil
  • 7Academic Unit of Serra Talhada, Rural Federal University of Pernambuco, Serra Talhada, Brazil
  • 8Department of Pharmacology, University of Minnesota, Minneapolis, MN, United States

SARS-CoV-2 has caused a high number of deaths in several countries. In Brazil, there were 37 million confirmed cases of COVID-19 and 700,000 deaths caused by the disease. The population size and heterogeneity of the Brazilian population should be considered in epidemiological surveillance due to the varied tropism of the virus. As such, municipalities and states must be factored in for their unique specificities, such as socioeconomic conditions and population distribution. Here, we investigate the spatiotemporal dispersion of emerging SARS-CoV-2 lineages and their dynamics in each microregion from Sergipe state, northeastern Brazil, in the first 3 years of the pandemic. We analyzed 586 genomes sequenced between March 2020 and November 2022 extracted from the GISAID database. Phylogenetic analyses were carried out for each data set to reconstruct evolutionary history. Finally, the existence of a correlation between the number of lineages and infection cases by SARS-CoV-2 was evaluated. Aracaju, the largest city in northeastern Brazil, had the highest number of samples sequenced. This represented 54.6% (320) of the genomes, and consequently, the largest number of lineages identified. Studies also analyzed the relationship between mean lineage distributions and mean monthly infections, daily cases, daily deaths, and hospitalizations of vaccinated and unvaccinated patients. For this, a correlation matrix was created. Results revealed that the increase in the average number of SARS-CoV-2 variants was related to the average number of SARS-CoV-2 cases in both unvaccinated and vaccinated individuals. Thus, our data indicate that it is necessary to maintain epidemiological surveillance, especially in capital cities, since they have a high rate of circulation of resident and non-resident inhabitants, which contributes to the dynamics of the virus.

Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in China in late 2019 and rapidly spread across the globe, leading the World Health Organization (WHO) to declare a pandemic state on 11 March 2020 (1, 2). The virus has been widespread, causing waves of infections in almost all regions of the world (3). The first cases were confirmed in the state of São Paulo in February 2020. After that, actions were taken by the Ministry of Health in order to contain the emerging epidemic (4). As of today, 37.9 million cases in Brazil have resulted in 706,531 deaths, representing a mortality rate of 441.3 individuals per 100,000 inhabitants (accessed on 28 October 2023; available in https://covid.saude.gov.br//). This high mortality rate is related to the lack of a national policy against the disease, the increasing population mobility, especially in large urban centers, the return of face-to-face work activities, difficulties in implementing individual and community preventive measures to reduce the spread of COVID-19, and delays in vaccination have contributed to the emergence and spread of SARS-CoV-2 variants of concern (VOCs) across the country over time (5).

In Brazil, the pandemic was characterized by the co-circulation of multiple variants over time (6). The emergence of new variants was directly related to adaptive mutations in the viral genome that modified the pathogenic potential of SARS-CoV-2. A single amino acid change can dramatically affect a virus’s ability to evade the immune system and complicate the clinical status of infected individuals (7). Alpha (B.1.1.7), Beta (B.1.351), Gamma (P.1), Delta (B.1.617.2), and omicron (B.1.1.529) lineages were important variants associated with greater transmissibility or virulence, reduced neutralization by antibodies obtained through natural infection or vaccination, ability to avoid detection, and/or decreased therapeutic or vaccination efficacy (8).

Monitoring SARS-CoV-2 was possible due to recent technological and scientific advances in genome sequencing and bioinformatics tools, allowing almost real-time genomic surveillance and tracking the emergence and replacement dynamics of variant emergence and prevalence among populations (9). Several studies focusing on genomic surveillance have provided crucial information to understand the dynamics of SARS-CoV-2 lineages in the states or regions of Brazil due to the large differences between inter- and intra-state population sizes, concentration, and dynamics of human movement (10, 11). This proposal has been shown to be relevant to determine the spread of the virus based on the specific characteristics of the state in a refined resolution (11). In the Brazilian Northeast, 7.4 million cases of COVID-19 and 136,000 deaths have already been reported. Bahia, Ceará, and Pernambuco are the states with the highest incidence of cases and deaths in the region. In Sergipe state, 363,329 individuals were diagnosed with COVID-19, resulting in 6,539 deaths (accessed on 28 October 2023; available at https://covid.saude.gov.br//). At the moment, a single study has been identified in the literature related to genomic surveillance in Sergipe, and this analyzed genomes sequenced between March 2020 and February 2021 (5). This demonstrates the necessity to implement new research aimed at understanding the effects of the pandemic.

Therefore, this study aimed to assess the dynamics of SARS-CoV-2 variants from 2020 to 2022 in the state of Sergipe within Brazil. Knowledge gained would identify viral evolutionary patterns and behavior as it relates to epidemiological impacts.

Methods

Study area

Sergipe is located in northeastern Brazil and has a land area of 21,938,188 km2 and an estimated population of 2,338,474 inhabitants. The state has 75 municipalities and is divided into 13 microregions (Agreste de Itabaiana, Agreste de Lagarto, Aracaju, Baixo Cotinguiba, Boquim, Carira, Cotinguiba, Estância, Japaratuba, Nossa Senhora das Dores, Propriá, Sergipana do Sertão do São Francisco, and Tobias Barreto) (Figure 1). The microregion of Aracaju is made up of the capital (Aracaju), and the municipalities of Barra dos Coqueiros, Nossa Senhora do Socorro, and São Cristóvão, forming the metropolitan region of Aracaju, which represents approximately 36% of the state population1.

FIGURE 1
www.frontiersin.org

Figure 1. State map of Brazil with emphasis on Sergipe state, showing the 13 microregions.

Data collection

Full-length SARS-CoV-2 genomes from February 2020 to November 2022 were obtained from the GISAID database2. Only complete genomes and complete collection data were used. The sequences were evaluated individually, considering the lineage, which was determined by the Pangolin software3, municipality, and collection date. Soon after, the genomes were separated by year, giving rise to three data sets. The Circos program (12) was used to visualize the distribution of the genomes by strains and municipalities.

In order to correlate the number of lineages of SARS-CoV-2 with the average of infections by months, daily cases, daily deaths, and admissions of vaccinated and unvaccinated patients, the data were uploaded to a cross-country database of COVID-194 (13, 14). The Pearson correlation test was performed using a native stats (V.4.0.3) package available in R software, and the grouped stacked bars with the abundance of lineage between months were performed and represented using the ggplot2 package (15, 16) and the correlation matrix was performed using corrplot package (17). All the differences with p-values <0.05 were considered statistically significant.

Phylogenetic analyses

Multiple sequence alignment was carried out using MAFFT v.7 with auto and add fragments parameters (18). The sequence from Wuhan-Hu-1 (NC_045512.2) was then added as an outgroup. Subsequently, the maximum likelihood (ML) phylogenetic trees were built using IQ-TREE v2.1.2 (19). The nucleotide substitution models TN + F, GTR + F + I + I + R4, and TIM + F + I + I + R3 were selected using ModelFinder in IQ-TREE2 v2.1.2 for the SARS-CoV-2 genomes of 2020, 2021, and 2022, respectively (20). Clade support was estimated using 1,000 replicates of bootstrap. The tree was visualized and edited using the iTOL v.4 tool (21). The haplotype network was created with PopART software version 1.7 (22) using the median-joining method to identify the existence of shared haplotypes.

Spatial analysis

The maps to represent the spatial distribution of SARS-CoV-2 lineages were constructed using the QGIS software version 3.18.2, with the cartographic projection corresponding to the Universal Reference System SIRGAS 2000. The cartographic projection used corresponded to the Universal Transverse Mercator (UTM) system, Terra Datum horizontal model (SIRGAS 2000) to segment by municipalities and states were collected from the databases of the Brazilian Institute of Geography and Statistics (IBGE).

Results

Genomic surveillance of SARS-CoV-2 variants in Sergipe

For this analysis, 586 SARS-CoV-2 viral genomes were evaluated and classified into 36 variant lineages (Figure 2). Sequences have been distributed in 47 municipalities, representing 62.7% of the total. Most of the genomes obtained from the GISAID database have their origin in the Aracaju microregion, as can be seen in Table 1.

FIGURE 2
www.frontiersin.org

Figure 2. Timeline of genomes sequenced for the SARS-CoV-2 virus in Sergipe state, Brazil (SARS-CoV-2 lineages in the state from March 2020 to November 2022, obtained from GISAID database).

TABLE 1
www.frontiersin.org

Table 1. Distribution of SARS-CoV-2 genomes obtained from the GISAID database divided by microregions of Sergipe.

In 2020, five lineages were detected circulating in Sergipe, B.1 (11 sequences, 36.7%) was the most frequent, followed by B.1.1 (6 sequences, 20%), B.1.1.33 (6 sequences, 20%), B.1.1.28 (4 sequences, 13.3%), and B.1.212 (3 sequences, 10%) (Figure 2). A total of 30 genomes were available on the GISAID database. In total, 19 of those 30 genomes were related to samples from Aracaju (Supplementary Figure S1). Genomic sequences have also been observed in 10 other municipalities (Figure 3). At first, B.1 was identified in the state on 12 March 2020 during the first wave. This sample belongs to an individual who resided in Aracaju with a travel history to Europe (Spain).

FIGURE 3
www.frontiersin.org

Figure 3. Sergipe state map showing the clinical distribution of the SARS-CoV-2 lineages in 2020 by municipality. Colors represent the municipality and lineages downloaded from the GISAID database.

For 2021, 406 sequences were used to create the datasets and subsequently classified into 16 viral variant lineages. In total, 212 samples were identified as the P.1 gamma variant, representing approximately 52.2% (Figure 2). Initially, the circulation of this variant was registered on 17 January 2021 in the municipality of Aracaju during the second wave. Delta sequences have been registered in Sergipe between January and September. P.1.2 (56 sequences, 13.8%) and P.2 Zeta variant (50 sequences, 12.3%) were also highly represented (Figure 4). This variant was predominant in infection cases from September and December. A total of 57 genomes of the AY.* lineages were found in the GISAID database. This is distributed in four strains (AY.34.1.1, AY.99.1, AY.99.2, and AY.101). AY.99.2 (45 sequences, 11.1%) was prevalent during this period. Lineages AY.34.1.1, AY.99.1, and AY.101 represented approximately 2.9% of the total genomes (Figure 2). All other strains identified in 2021 represent approximately 10.6% (43 sequences). Aracaju was the municipality with the highest number of strains circulating when compared to other localities (Figure 4). Lineages were also identified in 37 other cities (Supplementary Figure S2).

FIGURE 4
www.frontiersin.org

Figure 4. Sergipe state map showing the clinical distribution of the SARS-CoV-2 lineages in 2021 by municipality. Colors represent the municipality and lineages downloaded from the GISAID database.

An alignment with 150 genomes was created using the genomes of 2022, and it was possible to identify 18 lineages distributed in 27 municipalities (Figure 5) (Supplementary Figure S3). In January, nine sublineages of the Omicron variant were identified as circulating. The first variant sample detected was on 3 January 2022. The lineage BA.1.1 (33 sequences, 22%) was the most frequent during the third wave, followed by BA.1 (28 sequences, 18.7%), BA.5.2 (21 sequences, 14%), and BA.5.2.1 (18 sequences, 12%). All other lineages identified represented approximately 33.3% (50 sequences) (Figure 2).

FIGURE 5
www.frontiersin.org

Figure 5. Sergipe state map showing the clinical distribution of the SARS-CoV-2 lineages in 2022 by municipality. Colors represent the municipality and lineages downloaded from the GISAID database.

Evolutionary analysis of SARS-CoV-2 lineages

The maximum likelihood phylogenetic tree was constructed to confirm the SARS-CoV-2 variant classification that circulated between February 2020 and November 2022 in the state of Sergipe. Considering the sequences from 2020, the phylogenetic analysis suggested five distinct well-supported groups (B.1, B.1.1, B.1.212, B.1.1.28, and B.1.1.33) (Figure 6). The haplotype network has been constructed with the purpose of characterizing the ancestral relationships maintained between the lineages. Five well-defined clades (B.1, B.1.1, B.1.212, B.1.1.28, and B.1.1.33) were identified. Notably, haplotype sharing was not observed among the sequences from these different strains (Figure 7).

FIGURE 6
www.frontiersin.org

Figure 6. Maximum likelihood tree of 30 SARS-CoV-2 whole-genome sequences available at the GISAID database using the TN + F evolutive model. The colors assigned to the branches are related to their respective lineages and the samples are labeled according to their sampling dates and virus lineages.

FIGURE 7
www.frontiersin.org

Figure 7. Haplotype network, obtained with PopART software, showing relationships among haplotypes of the SARS-CoV-2 genomes available in 2020. Each circle represents one haplotype. Diameters of the circles correspond to the frequencies of the respective haplotypes. The numbers of dashes display mutational steps (each dash stands for one single nucleotide mutation). Small black circles represent hypothetical (missing) haplotypes.

For the genomes from 2021, the ML tree revealed five main well-supported clades. One clade was composed only of the delta variant. N.9, B.1.1.28, and P.2 lineages were divided into different clades with significant support values (Figure 8). B.1.1.28 was identified as a common ancestor of P.1 and P.2. A clade represented by sublineages relative to P.1 (P.1.7, P.1.14, and P.1.2) was observed. However, sequences belonging to lineages B.1.1, B.1.1.33, and P.4 have not demonstrated significant bootstrap values. P.7 clade showed high support value, and its genetic pattern is associated with the P.2 lineage. The haplotype network revealed five heterogeneous clusters, where P.1 was associated with P.1.2, P.1.14, and P.1.7. Another cluster was observed with P.2 and P.7 lineages. N.9, B.1.1.28, and all AY.* remained isolated in the phylogenetic tree (Figure 9).

FIGURE 8
www.frontiersin.org

Figure 8. Maximum likelihood tree of 406 SARS-CoV-2 whole-genome sequences available at the GISAID database using the GTR + F + I + I + R4 evolutive model. The colors assigned to the branches are related to their respective lineages and the samples are labeled according to their sampling dates and virus lineages.

FIGURE 9
www.frontiersin.org

Figure 9. Haplotype network, obtained with PopART software, showing relationships among haplotypes of the SARS-CoV-2 genomes available in 2021. Each circle represents one haplotype. Diameters of the circles correspond to the frequencies of the respective haplotypes. The numbers of dashes display mutational steps (each dash stands for one single nucleotide mutation). Small black circles represent hypothetical (missing) haplotypes.

Analyzing the genomes from 2022, it was possible to observe differences, revealing two distinct clades (Figure 10); both groups were significantly supported. A clade was formed by genomes from lineages BA.5, BA.5.1, BA.5.2, BA.5.2.1, BA.2, BA.2.12.1, BA.2.56, BA.4, and BA.4.1, and another clade was formed by genomes from lineages BA.1, BA.1.1, BA.1.14, BA.1.14.1, BA.1.14.2, BA.1.15, BA.1.17.2, BA.1.1.1, and BA.1.1.14. The haplotype network also suggested two clusters. One cluster was composed by BA.5, BA.5.1, BA.5.2, BA.5.2.1, BA.2, BA.2.12.1, BA.2.56, BA.4, and BA.4.1. On the other hand, the other cluster was formed by lineages BA.1, BA.1.1, BA.1.14, BA.1.14.1, BA.1.14.2, BA.1.15, BA.1.17.2, BA.1.1.1, and BA.1.1.14. In addition, it was observed genomes from different lineages sharing haplotypes, such as BA.1.1|10,322,472, BA.1.1|10,322,464, BA.1.14|10,322,474, BA.5.2.1|15,279,654, BA.5.2.1|15,202,013, BA.5.2.1|15,802,440, and BA.5.2.1|15,802,439 (Figure 11). In the BA.1.1|10,322,472 genome, genetic patterns associated with BA.1 and BA.1.1 have been identified, suggesting the maintenance of ancestral relationship.

FIGURE 10
www.frontiersin.org

Figure 10. Maximum likelihood tree of 150 SARS-CoV-2 whole-genome sequences available at the GISAID database using the TIM + F + I + I + R3 evolutive model. The colors assigned to the branches are related to their respective lineages and the samples are labeled according to their sampling dates and virus lineages.

FIGURE 11
www.frontiersin.org

Figure 11. Haplotype network, obtained with PopART software, showing relationships among haplotypes of the SARS-CoV-2 genomes available in 2022. Each circle represents one haplotype. Diameters of the circles correspond to the frequencies of the respective haplotypes. The numbers of dashes display mutational steps (each dash stands for one single nucleotide mutation). Small black circles represent hypothetical (missing) haplotypes.

Correlation analysis of SARS-CoV-2 lineages and infection cases in Sergipe

A correlation matrix analysis was employed to examine the relationship between the average distributions of viral lineages and the average number of individuals infected, as indicated by registered cases reported by global epidemics (Figure 12). Furthermore, we explored relationships among the averages of hospital admissions in vaccinated and unvaccinated patients as well as the averages of daily deaths and daily cases (Figure 12). Notably, the averages of registered cases exhibited a correlation cluster among all reported cases as well as between vaccinated and unvaccinated patients. Additionally, the averages of daily cases and deaths showed a direct correlation. Hospital admissions in vaccinated patients were associated with clusters of registered cases, while hospital admissions in unvaccinated patients were linked to daily deaths and daily cases. The observed increase in the number of variant lineages during the study period was directly correlated with the averages of infections.

FIGURE 12
www.frontiersin.org

Figure 12. Correlation matrix analysis of epidemiological data and SARS-CoV-19 lineages. The distribution means of lineage and epidemiological data per month were obtained from a cross-country COVID-19 database. All variables were tested for correlation using the Pearson correlation test. The size and color of each circle indicate the strength of the correlation coefficient; red indicates a positive correlation and blue indicates a negative correlation. A significant correlation (p-value <0.05) is indicated by a black frame. The matrix was created using the corrplot package in R.

Discussion

This is a pioneering study of the state of Sergipe. The evolutionary history of circulating SARS-CoV-2 genomes over the last 3 years was reconstructed using phylogenetic analyses based on ML and the median-joining method. Our data support that B.1 was the first lineage detected in Sergipe (as recorded in Aracaju on 12 March 2020). However, Gurgel et al. (23) report in their study that the first case of COVID-19 in the state of Sergipe may have occurred a few months earlier, as samples from asymptomatic individuals sent for blood tests between the months of January and April 2020 by reasons unrelated to COVID-19 showed the presence of SARS-CoV-2 immunoglobulins (IgM and IgG) before the notification of clinical cases in the state. The country registered its first COVID-19 case in late February in São Paulo (4). In Bahia, the first confirmed SARS-CoV-2 infections occurred on 28 February, being the first case in the Northeast region5.

The lineages B.1, B.1.1, B.1.1.33, B.1.1.28, and B.1.212 were prevalent until mid-August 2020. In an analysis carried out by Dos Santos et al. (5), B.1 (58.5%), B.1.1.33 (17.1%), B.1.1.119 (12.2%), B.1.1.28 (9.8%), and B.1.212 (2.4%) were dominant from March to August 2020 in Sergipe. In late 2020, the variants zeta (P.2) (24) and gamma (P.1) (25), descendants of lineage variants B.1.1.28, emerged and were associated with the second phase of the pandemic. P.1 was identified in the state of Amazonas in mid-December 2020, with a proposed emergence around November (25, 26).

P.1 was dominant in infection cases by SARS-CoV-2, having their circulation on 17 January 2021. Some studies proposed that the emergence of three mutations E484K, N501Y, and K417T in the Spike protein allowed the virus to escape from the host immune response (2729). In mid-January 2021, samples from 11 suspected cases and their contact reporting a travel history to/from Amazonas state were screened at the Central Laboratory of Health of the Bahia state (LACEN-BA). Genetic evidence has confirmed for the first time the circulation of the P.1 in the Brazilian Northeast (30). The study conducted by Dos Santos et al. (5) also confirms that P.1 circulated in Aracaju (Sergipe) on 17 January 2021 in a sample belonging to a resident from the city of Manaus (Amazonas) who traveled to Sergipe to visit his family (5).

Our results from the evolutionary analysis showed that B.1.1.28 is a common ancestor of P.1 and P.2. In the studies conducted by Varela et al. (31) and Harvey et al. (32), it was indicated that P.1 and P.2 descend from B.1.1.28 although they have different times of appearance and share the S: E484K mutation. A cluster formed by P.1, P.1.1, P.1.2, and P.1.7 demonstrates the ancestral relationship between these lineages, as reported by Varela et al. (31) and Machado et al. (11). In the present study, a shared ancestry between P.2 and P.7 was also observed similar to that suggested by Lamarca et al. (33).

Assessing the results, it was suggested that AY.99.2 (11.1%) became dominant in cases from September to December 2021. There are signs that the delta variant emerged in October 2020 on the Asian continent as has been classified by WHO (33). In Brazil, the first community-sustained transmission chains of the delta variant were registered in June 2021 in the state of Rio de Janeiro (34), and it has been widely detected in other Brazilian states over time (see footnote 5). Among the delta variants, the AY.99.2 was the most dominant, reaching 58% of all sublineages sampled during the period (35). Some evidence demonstrate that AY.99.2 emerged in Brazil; the first SARS-CoV-2 genomes from this lineage available in the GISAID database are from samples collected in April 2021 in the northeastern state of Ceará (36). Studies have identified mutations in the spike protein of the delta sublineages found in Brazil, and the most common mutations mentioned are T19R, T95I, E156G, DEL157/158, L452R, T478K, D614G, Q677H, P681R, D950N, V1104L, and L1265F7. Some of these mutations can be related to viral fitness advantages such as enhanced viral entry, pathogenesis, and immune escape (37, 38). Despite that, the number of hospitalizations declined from 6.9% (January to June 2021) to 3.6% (July to December 2021) during the community transmission of the delta variant with the progression of vaccination in the second half of 2021 in Sergipe (39).

The phylogenetic tree revealed that the samples of the delta variant formed a monophyletic clade. In an analysis using the neighbor-joining method with genomes from the delta lineage, a compatible structuring with a monophyletic clade was shown, and the omicron variant emerged from it (40).

In this study, the lineage BA.1.1 was reported as most frequent, followed by BA.1, BA.5.2, and BA.5.2.1. Genomic surveillance detected that in February 2022, the omicron variant was majority; 99.8% of the samples analyzed around the country being positive for the variant (see footnote 5). From January to September, BA.1 (4,253 genomes) and BA.1.1 (2,521 genomes) were also prevalent in infection cases by SARS-CoV-2 in the northeast Brazil (see footnote 5). Since the beginning of the pandemic, the SARS-CoV-2 genome has been rapidly evolving. This is mostly due to the inherent polymerase mistakes and host immune selection factors (32). The omicron variant is the most mutated variant containing more than 60 mutations in its genome. In total, 32 of these mutations lie within the receptor binding domain (RBD) of the spike protein (41, 42). The large number of mutations associated with the spike RBD domain can be related to infectivity rates, high transmission capacity, and rapid dispersal potential (26). Among the mutations identified in the omicron variants, 14 are exclusive and found in all the omicron variants (43).

Our phylogenetic tree for omicron VOC suggests the existence of two main clades, one composed of the sublineages linked to BA.1 and the other associated with BA.2, BA.4, and BA.5 variants. In a study developed by Veneziano et al. (44) with Omicron SARS-CoV-2 genomes in Italy, a group composed of BA.1 and another by BA.2, BA.4, and BA.5 lineages was identified. Six mutations have been identified in all the omicron variants, excluding omicron BA.1: Del24-26, V213G, T376A, S371F, D405N, and R408S (43). These mutations may have contributed to the structure of the clades of the phylogenetic tree and also in the haplotype network, as observed in other organisms (45, 46). Notably, our study presents a limitation due to the fact that in some months of 2020 and 2022, there were no records of SARS-CoV-2 genomes in the GISAID database. However, there is agreement between our genomic surveillance results and those observed in other states of Brazil.

Finally, the increase in the average number of SARS-CoV-2 lineages during the studied periods is related to the average number of infections in both unvaccinated and vaccinated individuals. Tarkowski et al. (47) demonstrated that vaccinated individuals presented higher levels of IgG against viral proteins of spike protein-1 (S1) and receptor-binding domain (RBD), which resulted in a better immune response to B.1 and P.1 variants although immune activation is less noticeable in response to the B.1.617.2 variant. A similar study revealed differences in the efficiency of humoral activity in vaccinated individuals against B.1.617.1, B.1.617.2, B.1.351, and P.1 lineages due to mutations in the spike protein (S) (48). Unvaccinated individuals are intrinsically associated with daily cases and deaths. Martins-Filho et al. (39) studied the dynamics of hospitalizations and the predominance of delta and omicron variants in the Northeast of Brazil and found that during the circulation of the delta variant (July to December 2021), the majority of deaths occurred in people who were not vaccinated or who had not completed the vaccination schedule. Furthermore, in 2020, vaccines were scarce, with high hospitalization rates (46% of the population with active infection) (39).

Conclusion

Our data suggest a correlation between the increase in the mean number of variant lineage strains and the mean number of infections in unvaccinated and vaccinated individuals. It is important to note that 3 years after the beginning of the SARS-CoV-2 pandemic, and despite the availability of several vaccines for 2 years, the restrictive measures to contain SARS-CoV-2 spreading were met with several challenges among most countries. In particular, the recent variants are generating new outbreaks of infection, even in countries where the level of vaccinations is high. However, it becomes necessary for continuous monitoring of the most predominant SARS-CoV-2 lineages as well as their specific dynamic and processes of evolution. Therefore, this knowledge gain and continual analysis of variant lineages is imperative for epidemiologists to define public health measures, perform adequate diagnostic tests, and strategically employ vaccines (49).

Despite the number of positive cases of COVID-19 in Sergipe, these did not have minimum values to be submitted for genetic sequencing; and we were unable to establish a stratified correlation between the number of lineages and the severity of COVID-19 cases in both vaccinated and unvaccinated individuals. This correlation could have demonstrated how lineage variability impacts the severity of infections.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.

Author contributions

MF and VQ: conceptualization. MF, KF, LS, FN, and JR: investigation. MF, KF, and ER: methodology. MF and KF: formal analysis. MF: project administration. MF and TM: writing – original draft. MF, TM, ML, MA, SS, KF, JR, LS, and CS: writing – review and editing. All authors contributed to the article and approved the submitted version.

Acknowledgments

We thank Marilda Agudo Mendonça Teixeira de Siqueira and Paola Cristina Resende for coordinating sequencing at the Genomics Platform of the Respiratory Virus and Measles Laboratory at the Osvaldo Cruz Foundation (Fiocruz-RJ).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2023.1222152/full#supplementary-material

Footnotes

References

1. Wu, F, Zhao, S, Yu, B, Chen, YM, Wang, W, Song, ZG, et al. A new coronavirus associated with human respiratory disease in China. Nature. (2020) 579:265–9. doi: 10.1038/s41586-020-2008-3

PubMed Abstract | CrossRef Full Text | Google Scholar

2. World Health Organization (WHO). WHO director-General’s opening remarks at the media briefing on COVID-19 – 11 march 2020. (2020). Available at: https://www.who.int/director-general/speeches/detail/who-director-general-s-opening-remarks-at-themedia-briefing-on-covid-19-11-march-2020. (Accessed April 2, 2023)

Google Scholar

3. He, X, Lau, EHY, Wu, P, Deng, X, Wang, J, Hao, X, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat Med. (2020) 26:672–5. doi: 10.1038/s41591-020-0869-5

CrossRef Full Text | Google Scholar

4. Jesus, JG, Sacchi, C, Candido, DDS, Claro, IM, Sales, FCS, Manuli, ER, et al. Importation and early local transmission of COVID-19 in Brazil, 2020. Rev Inst Med Trop Sao Paulo. (2020) 62:e30. doi: 10.1590/s1678-9946202062030

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Dos Santos, CA, Bezerra, GVB, de Azevedo, MARRA, Alves, JC, Tanajura, DM, and Martins-Filho, PR. SARS-CoV-2 genomic surveillance in Northeast Brazil: timing of emergence of the Brazilian variant of concern P1. J Travel Med. (2021) 28:1–3. doi: 10.1093/jtm/taab066

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Alcantara, LCJ, Nogueira, E, Shuab, G, Tosta, S, Fristch, H, Pimentel, V, et al. SARS-CoV-2 epidemic in Brazil: how the displacement of variants has driven distinct epidemic waves. Virus Res. (2022) 315:198785. doi: 10.1016/j.virusres.2022.198785

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Rodriguez-Morales, AJ, Cardona-Ospina, JA, Gutiérrez-Ocampo, E, Villamizar-Peña, R, Holguin-Rivera, Y, Escalera-Antezana, JP, et al. Clinical, laboratory and imaging features of COVID-19: a systematic review and meta-analysis. Travel Med Infect Dis. (2020) 34:101623. doi: 10.1016/j.tmaid.2020.101623

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Galán-Huerta, KA, Flores-Treviño, S, Salas-Treviño, D, Bocanegra-Ibarias, P, Rivas-Estilla, AM, Pérez-Alba, E, et al. Prevalence of SARS-CoV-2 variants of concern and variants of interest in COVID-19 breakthrough infections in a Hospital in Monterrey. Mexico Viruses. (2022) 14:154. doi: 10.3390/v14010154

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Oude Munnink, BB, Sikkema, RS, Nieuwenhuijse, DF, Molenaar, RJ, Munger, E, Molenkamp, R, et al. Transmission of SARS-CoV-2 on mink farms between humans and mink and back to humans. Science. (2021) 371:172–7. doi: 10.1126/science.abe5901

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Giovanetti, M, Slavov, SN, Fonseca, V, Wilkinson, E, Tegally, H, Patané, JSL, et al. Genomic epidemiology of the SARS-CoV-2 epidemic in Brazil. Nat Microbiol. (2022) 7:1490. doi: 10.1038/s41564-022-01191-z

CrossRef Full Text | Google Scholar

11. Machado, LC, Dezordi, FZ, de Lima, GB, de Lima, RE, Silva, LCA, Pereira, LM, et al. Spatiotemporal transmission of SARS-CoV-2 lineages during 2020-2021 in Pernambuco – Brazil. MedRxiv. (2023) 25:831. doi: 10.1101/2023.01.25.23284831

CrossRef Full Text | Google Scholar

12. Krzywinski, M, Schein, J, Birol, I, Connors, J, Gascoyne, R, Horsman, D, et al. Circos: an information aesthetic for comparative genomics. Genome Res. (2009) 19:1639–45. doi: 10.1101/gr.092759.109

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Hasell, J, Mathieu, E, Beltekian, D, Macdonald, B, Giattino, C, Ortiz-Ospina, E, et al. A cross-country database of COVID-19 testing. Sci Data. (2020) 7:345. doi: 10.1038/s41597-020-00688-8

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Fukutani, KF, Barreto, ML, Andrade, BB, and Queiroz, ATL. Correlation between SARS-Cov-2 vaccination, COVID-19 incidence and mortality: tracking the effect of vaccination on population protection in real time. Front Genet. (2021) 12:679485. doi: 10.3389/fgene.2021.679485

CrossRef Full Text | Google Scholar

15. Wickham, H. ggplot2: Elegant graphics for data analysis. Berlin: Springer (2016).

Google Scholar

16. Césare, N, Mota, TF, Lopes, FFL, Lima, ACM, Luzardo, R, Quintanilha, LF, et al. Longitudinal profiling of the vaccination coverage in Brazil reveals a recent change in the patterns hallmarked by differential reduction across regions. Int J Infect Dis. (2020) 98:275. doi: 10.1016/j.ijid.2020.06.092

CrossRef Full Text | Google Scholar

17. Wei, T, and Simko, V. R package ‘corrplot’: visualization of a correlation matrix. (Version 0.92). (2021), Available at: https://github.com/taiyun/corrplot

Google Scholar

18. Katoh, K, Rozewicki, J, and Yamada, KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. (2019) 20:1160–6. doi: 10.1093/bib/bbx108

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Minh, BQ, Schmidt, HA, Chernomor, O, Schrempf, D, Woodhams, MD, von Haeseler, A, et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. (2020) 37:1530–4. doi: 10.1093/molbev/msaa015

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Kalyaanamoorthy, S, Minh, BQ, Wong, TKF, Von Haeseler, A, and Jermiin, LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. (2017) 14:587–9. doi: 10.1038/nmeth.4285

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Letunic, I, and Bork, P. Interactive tree of life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. (2019) 47:W256–9. doi: 10.1093/nar/gkz239

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Leigh, JW, and Bryant, D. PopART: full-feature software for haplotype network construction. Methods Ecol Evol. (2015) 6:1110–6. doi: 10.1111/2041-210x.12410

CrossRef Full Text | Google Scholar

23. Gurgel, RQ, de Sá, LC, Souza, DRV, Martins, AF, Matos, ILS, Lima, AGA, et al. SARS-CoV-2 has been circulating in northeastern Brazil since February 2020: evidence for antibody detection in asymptomatic patients. J Infect. (2021) 82:186–230. doi: 10.1016/j.jinf.2020.11.037

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Voloch, CM, da Silva, FR, LGP, DA, Cardoso, CC, Brustolini, OJ, Gerber, AL, et al. Genomic characterization of a novel SARS-CoV-2 lineage from Rio de Janeiro. Brazil J Virol. (2021) 95:e00119–21. doi: 10.1128/JVI.00119-21

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Faria, NR, Mellan, TA, Whittaker, C, Claro, IM, Candido, DDS, Mishra, S, et al. Genomics and epidemiology of the P.1 SARS-CoV-2 lineage in Manaus. Brazil Sci. (2021) 372:815. doi: 10.1126/science.abh2644

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Naveca, FG, Nascimento, V, de Souza, VC, Corado, AL, Nascimento, F, Silva, G, et al. COVID-19 in Amazonas, Brazil, was driven by the persistence of endemic lineages and P.1 emergence. Nat Med. (2021) 27:1230–8. doi: 10.1038/s41591-021-01378-7

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Nonaka, CKV, Franco, MM, Gräf, T, de Lorenzo Barcia, CA, de Ávila, MRN, de Sousa, KAF, et al. Genomic evidence of SARS-CoV-2 reinfection involving E484K spike mutation. Brazil Emerg Infect Dis. (2021) 27:1522–4. doi: 10.3201/eid2705.210191

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Resende, PC, Bezerra, JF, Teixeira Vasconcelos, RH, Arantes, I, Appolinario, L, Mendonça, AC, et al. Severe acute respiratory syndrome coronavirus 2 P.2 lineage associated with reinfection case, Brazil, June-October 2020. Emerg Infect Dis. (2021) 27:1789–94. doi: 10.3201/eid2707.210401

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Greaney, AJ, Starr, TN, Gilchuk, P, Zost, SJ, Binshtein, E, Loes, AN, et al. Complete mapping of mutations to the SARS-CoV-2 spike receptor-binding domain that escape antibody recognition. Cell Host Microbe. (2021) 29:44–57.e9. doi: 10.1016/j.chom.2020.11.007

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Tosta, S, Giovanetti, M, Brandão Nardy, V, Reboredo de Oliveira da Silva, L, Kelly Astete Gómez, M, Gomes Lima, J, et al. Short report: early genomic detection of SARS-CoV-2P.1 variant in Northeast Brazil. PLoS Negl Trop Dis. (2021) 15:e0009591. doi: 10.1371/journal.pntd.0009591

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Varela, APM, Prichula, J, Mayer, FQ, Salvato, RS, Sant’Anna, FH, Gregianini, TS, et al. SARS-CoV-2 introduction and lineage dynamics across three epidemic peaks in southern Brazil: massive spread of P.1. Infect Genet Evol. (2021) 96:105144. doi: 10.1016/j.meegid.2021.105144

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Harvey, WT, Carabelli, AM, Jackson, B, Gupta, RK, Thomson, EC, Harrison, EM, et al. SARS-CoV-2 variants, spike mutations and immune escape. Nat Rev Microbiol. (2021) 19:409–24. doi: 10.1038/s41579-021-00573-0

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Lamarca, AP, de Almeida, LGP, Francisco, RDS, Jr Lima, LFA, Scortecci, KC, Perez, VP, et al. Genomic surveillance of SARS-CoV-2 tracks early interstate transmission of P.1 lineage and diversification within P.2 clade in Brazil. PLoS Negl Trop Dis. (2021) 15:e0009835. doi: 10.1371/journal.pntd.0009835

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Singh, J, Rahman, SA, Ehtesham, NZ, Hira, S, and Hasnain, SE. SARS-CoV-2 variants of concern are emerging in India. Nat Med. (2021) 27:1131–3. doi: 10.1038/s41591-021-01397-4

CrossRef Full Text | Google Scholar

35. Chen, C, Nadeau, S, Yared, M, Voinov, P, Xie, N, Roemer, C, et al. CoV-Spectrum: analysis of globally shared SARS-CoV-2 data to identify and characterize new variants. Bioinformatics. (2021) 38:1735–7. doi: 10.1093/bioinformatics/btab856

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Romano, CM, de Oliveira, CM, da Silva, LS, and Levi, JE. Early emergence and dispersal of Delta SARS-CoV-2 lineage AY.99.2 in Brazil. Front Med. (2022) 9:930380. doi: 10.3389/fmed.2022.930380

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Grant, RA, Morales-Nebreda, L, Markov, NS, Swaminathan, S, Querrey, M, Guzman, ER, et al. Circuits between infected macrophages and T cells in SARS-CoV-2 pneumonia. Nature. (2021) 590:635–41. doi: 10.1038/s41586-020-03148-w

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Li, D, Edwards, RJ, Manne, K, Martinez, DR, Schäfer, A, Alam, SM, et al. In vitro and in vivo functions of SARS-CoV-2 infection-enhancing and neutralizing antibodies. Cells. (2021) 184:4203. doi: 10.1016/j.cell.2021.06.021

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Martins-Filho, PR, Quintans-Júnior, LJ, BDS, S, Barboza, WS, Cavalcante, TF, et al. Dynamics of hospitalizations and in-hospital deaths from COVID-19 in Northeast Brazil: a retrospective analysis based on the circulation of SARS-CoV-2 variants and vaccination coverage. Epidemiol Health. (2022) 44:e2022036. doi: 10.4178/epih.e2022036

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Kandeel, M, Mohamed, MEM, Abd El-Lateef, HM, Enugopala, KN, and El-Beltagi, HS. Omicron variant genome evolution and phylogenetics. J Med Virol. (2022) 94:1627. doi: 10.1002/jmv.27515

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Tsang, AK, Cheng, PK, Mak, GC, Leung, PK, Yip, PC, Lam, ET, et al. Unusual high number of spike protein mutations for the SARS-CoV-2 strains detected in Hong Kong. J Clin Virol. (2022) 148:105081. doi: 10.1016/j.jcv.2022.105081

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Wang, L, and Cheng, G. Sequence analysis of the emerging SARS-CoV-2 variant omicron in South Africa. J Med Virol. (2021) 94:1728–33. doi: 10.1002/jmv.27516

CrossRef Full Text | Google Scholar

43. Caputo, E, and Mandrich, L. Structural and phylogenetic analysis of SARS-CoV-2 spike glycoprotein from the most widespread variants. Life. (2022) 12:1245. doi: 10.3390/life12081245

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Veneziano, C, Marascio, N, De Marco, C, Quaresima, B, Biamonte, F, Trecarichi, EM, et al. The spread of SARS-CoV-2 omicron variant in CALABRIA: a spatio-temporal report of viral genome evolution. Viruses. (2023) 15:408. doi: 10.3390/v15020408

PubMed Abstract | CrossRef Full Text | Google Scholar

45. de Souza Freitas, MT, Ríos-Velasquez, CM, Costa, CR, Figueirêdo, CA Jr, Aragão, NC, da Silva, LG, et al. Phenotypic and genotypic variations among three allopatric populations of Lutzomyia umbratilis, main vector of Leishmania guyanensis. Parasit Vectors. (2015) 8:448. doi: 10.1186/s13071-015-1051-7

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Lima Costa, CR, Jr, FMT, Santiago Figueirêdo, CA, Jr, ANC, da Silva, LG, Marcondes, CB, et al. Genetic structuring and fixed polymorphisms in the gene period among natural populations of Lutzomyia longipalpis in Brazil. Parasit Vectors. (2015) 8:193. doi: 10.1186/s13071-015-0785-6

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Tarkowski, M, de Jager, W, Schiuma, M, Covizzi, A, Lai, A, Gabrieli, A, et al. Anti-SARS-CoV-2 immunoglobulin isotypes, and neutralization activity against viral variants, according to BNT162b2-vaccination and infection history. Front Immunol. (2021) 12:793191. doi: 10.3389/fimmu.2021.793191

CrossRef Full Text | Google Scholar

48. Liu, C, Ginn, HM, Dejnirattisai, W, Supasa, P, Wang, B, Tuekprakhon, A, et al. Reduced neutralization of SARS-CoV-2 B.1.617 by vaccine and convalescent serum. Cells. (2021) 184:4220. doi: 10.1016/j.cell.2021.06.020

CrossRef Full Text | Google Scholar

49. Borges, LP, Martins, AF, de Melo, MS, de Oliveira, MGB, Neto, JMR, Dósea, MB, et al. Seroprevalence of SARS-CoV-2 IgM and IgG antibodies in an asymptomatic population in Sergipe. Brazil Rev Panam Salud Publica. (2020) 44:e108. doi: 10.26633/RPSP.2020.108

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: phylogenetic analysis, SARS-CoV-2 genomes, epidemiology, genomic surveillance, Sergipe

Citation: Freitas MTS, Sena LOC, Fukutani KF, Santos CAD, Neto FDCB, Ribeiro JS, dos Reis ES, Balbino VQ, Sá Paiva Leitão S, de Aragão Batista MV, Lipscomb MW and Moura TR (2023) The increase in SARS-CoV-2 lineages during 2020–2022 in a state in the Brazilian Northeast is associated with a number of cases. Front. Public Health. 11:1222152. doi: 10.3389/fpubh.2023.1222152

Received: 13 May 2023; Accepted: 15 November 2023;
Published: 14 December 2023.

Edited by:

Severino Jefferson Ribeiro da Silva, University of Toronto, Canada

Reviewed by:

Diogo Antonio Tschoeke, Federal University of Rio de Janeiro, Brazil
Colby T. Ford, University of North Carolina at Charlotte, United States

Copyright © 2023 Freitas, Sena, Fukutani, Santos, Neto, Ribeiro, dos Reis, Balbino, Sá Paiva Leitão, de Aragão Batista, Lipscomb and Moura. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Moises Thiago de Souza Freitas, moisesfrts@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.