Molecular Evolution of Human Norovirus GII.2 Clusters

Li, Xingguang; Liu, Haizhou; Rife Magalis, Brittany; Kosakovsky Pond, Sergei L.; Volz, Erik M.

doi:10.3389/fmicb.2021.655567

ORIGINAL RESEARCH article

Front. Microbiol. , 22 March 2021

Sec. Virology

Volume 12 - 2021 | https://doi.org/10.3389/fmicb.2021.655567

This article is part of the Research Topic Evolution & Genomic Adaptation of Emerging and Re-emerging RNA viruses View all 16 articles

Molecular Evolution of Human Norovirus GII.2 Clusters

$\r\nXingguang Li*$ Xingguang Li^1*

Haizhou Liu²

Brittany Rife Magalis³

Sergei L. Kosakovsky Pond³

Erik M. Volz⁴

¹Department of Hospital Office, The First People’s Hospital of Fangchenggang, Fangchenggang, China
²Centre for Emerging Infectious Diseases, The State Key Laboratory of Virology, Wuhan Institute of Virology, University of Chinese Academy of Sciences, Wuhan, China
³Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, United States
⁴MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom

Background: The human norovirus GII.2 outbreak during the 2016–2017 winter season was of unprecedented scale and geographic distribution.

Methods: We analyzed 519 complete VP1 gene sequences of the human norovirus GII.2 genotype sampled during the 2016–2017 winter season, as well as prior (dating back to 1976) from 7 countries. Phylodynamic analyses of these sequences were performed using maximum likelihood and Bayesian statistical frameworks in order to estimate viral evolutionary and population dynamics associated with the outbreak.

Results: Our results revealed an increase in the genetic diversity of human norovirus GII.2 during the recent Asian outbreak and diversification was characterized by at least eight distinct clusters. Bayesian estimation of viral population dynamics revealed a highly fluctuating effective population size, increasing in frequency during the past 15 years.

Conclusion: Despite an increasing viral diversity, we found no evidence of an elevated evolutionary rate or significant selection pressure in human norovirus GII.2, indicating viral evolutionary adaptation was not responsible for the volatility of or spread of the virus during this time.

Introduction

Human norovirus (HuNoV) is a pathogenic agent that contributes substantially to the global burden of sporadic cases and outbreaks of acute gastroenteritis across all settings and age groups in humans (Ahmed et al., 2014). HuNoV is a non-enveloped, single-stranded, positive-sense RNA virus from the genus Norovirus in the family Caliciviridae (Glass et al., 2009). HuNoV is classified into at least 10 genogroups (GI–GX) and 48 genotypes, based on phylogenetic analyses of the capsid gene (Zheng et al., 2006; Vinje, 2015; Chhabra et al., 2019, 2020). Among these different genogroups, genotypes belonging to the GI, GII, and GIV are primarily responsible for the acute gastroenteritis cases in humans (Kroneman et al., 2013; Vinje, 2015). GI and GII strains can be further stratified into nine and twenty-six genotypes, respectively (Kroneman et al., 2013; Chhabra et al., 2019, 2020).

Genotype GII.2 strains recently emerged, causing large outbreaks of acute gastroenteritis in Japan and China during the 2016–2017 winter season, mainly in childcare centers (Ao et al., 2017, 2018; Hata et al., 2018; Nagasawa et al., 2018). Many other countries also reported GII.2-dominated HuNoV outbreaks during the 2016–2017 winter season (Bidalot et al., 2017; Niendorf et al., 2017; Tohma et al., 2017; Medici et al., 2018; Thanusuwannasak et al., 2018).

To gain a better understanding of genetic variation during the recent HuNoV epidemic, it is essential to analyze the VP1 gene, which is crucial for viral attachment and entry, as well as the production of neutralizing antibodies (Prasad et al., 1999; Tan et al., 2004; Chakravarty et al., 2005). Therefore, in order to gain a more comprehensive understanding of the viral evolutionary factors responsible for the emergence and spread accompanying the recent HuNoV epidemic, we performed a detailed analysis of the complete VP1 gene of the GII.2 strains with known sampling dates and geographic locations. Our findings suggest that close monitoring of the global spread of this emergent GII.2 strains is necessary for the prevention and mitigation of HuNoV acute gastroenteritis outbreaks.

Materials and Methods

Sequence Data Set

All available complete ORF2 region sequences (at least 1,626 bp in length) of GII.2 strains with known sampling date and geographic location were retrieved from GenBank¹ on 28 April 2018. Sequences were aligned using MAFFT v7.409 (Katoh and Standley, 2013) and then adjusted manually in BioEdit v7.2.5 (Hall, 1999). The best-fit model of nucleotide substitution for this data set was identified according to the Bayesian information criterion in jModelTest v2.1.10 (Darriba et al., 2012). Given the phylogenetic scope of our analysis, we chose to exclude models that allowed a proportion of invariable sites (Jia et al., 2014). For this data set, the SYM + Γ₄ model provided the best fit and was used in genetic distance calculations and tree reconstruction.

Phylogenetic, Likelihood-Mapping, and Genetic Distance Analyses

To evaluate the phylogenetic signal for this data set, we performed a likelihood mapping analysis (Strimmer and von Haeseler, 1997) using TREE-PUZZLE v5.3.rc16 (Schmidt et al., 2002). We then inferred the phylogeny using maximum likelihood (ML) in PhyML v3.1 (Guindon and Gascuel, 2003). Bootstrap support values were calculated using 1,000 pseudoreplicates. Methods for identifying and defining phylogenetic clusters differ between studies. In the present study, we identify them on the basis of bootstrap support (cut-off of 99%) for groupings with more than two sequences. Genetic distances between and within clusters were calculated in MEGA v5.05 (Tamura et al., 2011) using the maximum composite likelihood model (Tamura et al., 2004) with 1,000 bootstrap replicates.

Evolutionary Rate, Time Origin, and Past Population Dynamic Inferences

To evaluate the temporal structure of this data set, we plotted root-to-tip genetic distances against date of sampling within the ML tree using TempEst v1.5 (Rambaut et al., 2016). We also estimated the evolutionary rate and the time to the most recent common ancestor for this data set using least-squares dating in LSD v0.3beta (To et al., 2016), a Gamma-Poisson mixture model in treedater package (Volz and Frost, 2017), and ML dating in TreeTime package (Sagulenko et al., 2018), respectively. We then employed a Bayesian phylogenetic approach using Markov chain Monte Carlo (MCMC) sampling to estimate the rate of evolution and the time to the most recent common ancestor for all sequences in BEAST v1.8.2 (Drummond et al., 2012). Evolutionary rates were estimated using a strict molecular clock model. A non-parametric coalescent Bayesian skygrid tree prior model was employed for demographic inference. The overall evolutionary rate was given an uninformative continuous-time Markov chain (CTMC) reference prior. The MCMC analysis was run for 500 million steps, with samples drawn every 50,000 steps. The first 1,000 samples in each chain were removed as burn-in. All parameters had effective sample sizes greater than 200, which is indicative of sufficient sampling. We ran multiple chains to check for convergence to the target distribution using Tracer v1.7.1 (Rambaut et al., 2018). Trees were summarized as maximum clade credibility (MCC) trees using TreeAnnotator v1.8.2 after discarding the first 10% as burn-in, and then visualized and annotated in FigTree v1.4.3². We also employed the skygrowth package in R to estimate the effective population size for the VP1 gene of the HuNoV GII.2 strains (Volz and Didelot, 2018).

Analysis of Selection

The evidence of gene-wide episodic positive selection along the backbone of the phylogenetic tree was performed using BUSTED, a test for episodic diversifying selection (Murrell et al., 2015), implemented in HyPhy software package (Pond et al., 2005). Initial branch lengths within the fixed MCC tree topology were estimated using the GTR (general time reversible) nucleotide model. A likelihood ratio test (LRT) was used to compare optimized likelihoods under the unconstrained, alternative model wherein the ratio of non-synonymous to synonymous rates (dN/dS) is allowed to exceed 1 (positive selection) and under the constrained (dN/dS = 1), null model (neutrality). A related test, RELAX (Wertheim et al., 2015) was used to investigate relaxation or intensification of selective regimes within each of the eight highly supported clades as compared to the persisting viral lineages. External branches were placed in the nuisance category to avoid dN/dS estimate inflation owing to transient polymorphisms. The selection intensity parameter (k) was estimated for each clade and the LRT used to compare the null model in which k is constrained to 1 (i.e., the same dN/dS distribution on test and reference branches) to an alternative model in which k is a free parameter for each clade. p ≤ 0.05 were considered significant evidence in favor of the alternative selection model for both tests.

Results

Sequence Data Set Information

This data set included 519 complete ORF2 region sequences of GII.2 strains from Australia (n = 3); the Mainland of China (n = 302); Hong Kong (n = 22); Japan (n = 166); Malaysia (n = 1); Netherlands (n = 10); Taiwan (n = 9); United Kingdom (n = 1); and United States (n = 5), with sampling dates between 1976 and 2017 (Table 1 and Supplementary Table S1).

TABLE 1

Table 1. Geographic source and sampling year of human norovirus GII.2 sequences used in the present study.

Phylogenetic Analysis

The phylogeny of this data set, inferred using maximum likelihood, indicated the presence of eight transmission clusters (Figures 1, 2, Supplementary Figure S1, Table 2, and Supplementary Table S2). Sequences from Cluster I (n = 4) were found in Netherlands and Japan between 1999 and 2002. Cluster II sequences (n = 6) were found in Japan between 2000 and 2006. Sequences from Cluster III (n = 21) were collected from Japan between 2004 and 2006. Cluster IV sequences (n = 25) were found in Japan between 2007 and 2015, whereas Cluster V sequences (n = 64) were collected from Japan and Taiwan between 2008 and 2015. Cluster VI included eighteen sequences collected from Japan and one sequence collected from United States between 2010 and 2012. The four sequences from Cluster VII were collected from Japan, Taiwan, Hong Kong, and the Mainland of China between 2015 and 2017. Cluster VIII sequences (n = 357) were found in Australia, the Mainland of China, Hong Kong, Japan, Taiwan, and United States between 2016 and 2017.

FIGURE 1

Figure 1. Maximum-likelihood phylogeny of HuNoV GII.2 strains. The two circles of color show geographic location (inner) and phylogenetic cluster (outer).

FIGURE 2

Figure 2. Geographic distribution of HuNoV GII.2 clusters identified in the present study. Each GII.2 cluster identified in this study is color-coded, as shown on the left. The figure was created using Adobe Illustrator CS5 version 15.0.0 software, based on the maps obtained from Craft MAP website (craftmap.box-i.net/).

TABLE 2

Table 2. Classification and geographic source of human norovirus GII.2 sequences used in the present.

The remaining 19 sequences (designated as Ungrouped) were scattered throughout the main GII.2 genotype and had been sampled in Japan (n = 8), Malaysia (n = 1), Netherlands (n = 7), United Kingdom (n = 1), and United States (n = 2) between 1976 and 2014.

Of the 302 Chinese strains in our analysis, all were found within either cluster VII (n = 1) or cluster VIII (n = 301) and all were sampled within either 2016 (n = 133) or 2017 (n = 169). However, the 166 Japanese strains were interspersed throughout the phylogeny and were sampled within a wide range (1997, 2000, 2002, and 2004–2016.

Likelihood-Mapping and Evolutionary Divergence Analysis

Our likelihood-mapping analysis revealed significant differences in phylogenetic signal and the amount of evolutionary information between each cluster in our data set (Supplementary Figure S2). The most notable result of the likelihood-mapping analysis revealed that 45.4% of the quartets from cluster III were distributed in the center of the triangle, indicating a relative strong star-like phylogenetic signal reflecting a new cluster, which might be due to exponential epidemic spread. Intriguingly, cluster III was characterized by the lowest genetic divergence among the eight distinct HuNoV GII.2 clusters (Supplementary Figure S3), which further confirmed that cluster III was most likely a new cluster, in accordance with likelihood-mapping analysis. Likewise, 38.5% of the quartets from cluster VI were distributed in the center of the triangle and showed lower genetic divergence. Conversely, 12.3% of the quartets from cluster VIII were distributed in the center of the triangle and showed higher genetic divergence than cluster VI. Notably, 43.5% of the quartets from cluster IV were distributed in the center of the triangle, but with higher genetic divergence than cluster VI. In contrast, 0, 0, 3.9, and 0% of the quartets from clusters I, II, V, and VII were distributed in the center of the triangle, but with higher genetic divergence, indicating the presence of a strong phylogenetic signal in the four clusters. Also of note, 3.4% of the quartets from the overall epidemics were distributed in the center of the triangle, but with lower genetic divergence (4.1%). We also observed that the smallest genetic distance separated clusters III and IV (3.9%), whereas the largest was between clusters I and VIII (10.1%). Notably, all the genetic distances between each of the eight clusters are higher than within each of the eight clusters, except for the genetic distance between clusters III and IV. We can also observe mixed genetic distances between and within each of the eight clusters (Supplementary Figure S3).

Evolutionary Rate and Time Origin Analyses

The correlation between root-to-tip distances and sampling time indicated a strong temporal signal (R² = 0.9359), without any clear outlier sequences (Figure 3). This result suggests a relatively clocklike pattern of molecular evolution, with an estimated evolutionary rate of 3.29 × 10^–3 substitutions per site per year and the most recent common ancestor occurring in 1970.37. We also analyzed the correlation between genetic distances and sampling time intervals, which indicated a strong linear regression signal for sampling time intervals > 12 years (Supplementary Figure S4). The estimated evolutionary rate and the time to the most recent common ancestor using a Gamma-Poisson mixture model were 2.76 × 10^–3 substitutions per site per year and 1968.60, respectively. The estimated evolutionary rate and the time to the most recent common ancestor using maximum likelihood dating were 2.48 × 10^–3 substitutions per site per year (95% credibility interval: 2.18 × 10^–3–2.78 × 10^–3) and 1969.0 (95% credibility interval: 1966.35–1971.65), respectively. The estimated evolutionary rate and the time to the most recent common ancestor using least-squares dating were 2.63 × 10^–3 substitutions per site per year (95% credibility interval: 2.30 × 10^–3–2.82 × 10^–3) and 1968.61 (95% credibility interval: 1965.27–1970.84), respectively. In our Bayesian phylogenetic analysis, we estimated a evolutionary rate of 3.00 × 10^–3 substitutions per site per year (95% credibility interval: 2.67 × 10^–3–3.35 × 10^–3). And the time to the most recent common ancestor of 1968.89 (95% credibility interval: 1965.49–1972.06). The age of each GII.2 cluster was also estimated in the analysis (Supplementary Table S3 and Supplementary Figure S5). The divergences between the sequences from clusters VI and VIII were estimated to have occurred in 2007.96 (95% credibility interval: 2007.10–2008.75) and 2012.39 (95% credibility interval: 2011.42–2013.44), respectively. And the time to the most recent common ancestor of 2004.61 (95% credibility interval: 2003.22–2005.83) for clusters VI and VIII.

FIGURE 3

Figure 3. Regression of the root-to-tip genetic distance against year of sampling for the HuNoV GII.2 sequences. Colors indicate different geographic locations.

Analysis of Past Population Dynamics

We further investigated the past population dynamics of this data set using a Bayesian skygrid plot. The effective population size of the VP1 gene in the HuNoV GII.2 strains experienced very high volatility of population size, especially for the past 15 years (Figure 4). The effective population size of the VP1 gene in the HuNoV GII.2 strains was also estimated using MCMC method in skygrowth package (Supplementary Figure S6), the estimates of the effective population size were consistent with the Bayesian skygrid model. The estimates of the phylogenetic relationships among the HuNoV GII.2 sequences using Bayesian coalescent framework were consistent with maximum likelihood tree reconstruction (Figure 5).

FIGURE 4

Figure 4. Bayesian skygrid demographic reconstruction of HuNoV GII.2. The vertical axis shows the effective number of infections (N_e) multiplied by mean viral generation time (τ). The solid line and shaded region represent the median and 95% credibility interval, respectively, of the inferred N_eτ through time.

FIGURE 5

Figure 5. Maximum-clade-credibility tree estimated from complete VP1 gene sequences of HuNoV GII.2. Light blue horizontal bars represent 95% credibility intervals for estimates of node times. Posterior probabilities are shown at the node. Each cluster is shown with different colors.

Analysis of Selection

Given the observed variations in evolutionary dynamics for this data set, we sought to investigate the selective pressures driving the persistence of HuNoV GII.2. No evidence of episodic positive selection was detected among the persisting viral lineages (Supplementary Table S4, p = 0.18); however, significant differences in selective strength among outbreaks when compared to these persisting lineages were found (p = 0.04). Clusters II, III, V, and VII exhibited intensified selection (k > 2), whereas sister clusters VI and VIII were considered to have relaxed in selective strength (k < 0.5). Clusters I and IV were characterized by relatively similar selective regimes in comparison to the backbone (k = 0.72 and 0.6, respectively). The results suggested that the evolutionary dynamics of the VP1 gene of HuNoV GII.2 strains in this data set were influenced largely by natural evolutionary forces, rather than adaptation to selective pressure from the host immune system, which was consistent with the previous study (Tohma et al., 2017).

Discussion

Our evolutionary analyses, based on all of the available complete VP1 sequences of HuNoV GII.2 genotype that included country of origin and year of sampling, revealed the presence of at least eight independent clusters. Cluster VII was restricted to countries within Asia, whereas cluster VIII had dispersed to Asia and North America. The recent sequences from cluster VIII outbreak formed a highly distinct sister group to cluster VII. As such, the putative HuNoV GII.2 reservoirs for cluster VIII outbreak sequences might be distinct from that of cluster VII.

Our estimates of the substitution rate for this data set were reassuringly consistent across different methods and models. The mean estimated rates of HuNoV GII.2 molecular evolution using TempEst v1.5 and BEAST v1.8.2 were 3.00 × 10^–3 and 3.29 × 10^–3 substitutions per site per year, respectively. These were also similar to the mean estimates previously reported for HuNoV GII.2 using BEAST v2.4.5, which was 3.26 × 10^–3 substitutions per site per year. However, the mean estimated rates of HuNoV GII.2 molecular evolution using LSD v0.3beta, TreeTime package, and treedater package were 2.63 × 10^–3, 2.48 × 10^–3, and 2.76 × 10^–3 substitutions per site per year, respectively, which were lower than the results using TempEst v1.5 and BEAST v1.8.2. In summary, despite the extensive and prolonged human-to-human transmission in the recent HuNoV GII.2 outbreak, HuNoV GII.2 was not mutating at an unusually elevated rate. Thus, we found little evolutionary signal that the virus has enhanced its virulence and/or transmissibility in humans during the recent HuNoV GII.2 outbreak. Coalescent-based demographic inference revealed a rapidly increasing population size for HuNoV GII.2 from late 2015 to early 2017 (Figure 4), consistent with the outbreak during this period of time.

Our analyses of selection did not reveal episodic positive selection representative of continual adaptation, nor was the more recent human norovirus GII.2 outbreak characterized by intensified selective pressure. Thus, the results from the present study seem to be consistent with earlier reports (Kobayashi et al., 2016; Parra et al., 2017; Nagasawa et al., 2018) and suggest that the recent evolution of HuNoV GII.2 has been dominated by genetic drift. Furthermore, the results also suggested a relationship between selection strength variation and geographical spread, as Clusters VI and VIII were the only clusters to contain sequences outside of Asia after 2010.

Our study has several limitations. First, our analysis was based on all of the available complete VP1 sequences of HuNoV GII.2 genotype that included country of origin and year of sampling, and we did not randomly collected sequences, we may therefore have underestimated the total number of infections. Second, the number of sample size is limited, which may provide an incomplete picture of phylogenetic relationship of the virus, therefore, which may have underestimated the total number of phylogenetic clusters. Additional sequences are critically needed to investigate the phylogenetic cluster changes over time, estimate its date of emergence, monitor local transmission, and infer past population dynamics of the virus. Additional sequences will also help to reveal the presence of additional phylogenetic clusters. Third, there are probably biases in geographic and temporal distributions of the VP1 sequences presented in the present study, given that our study relies on data from different countries or stages that is varied significantly, which may introduce bias our estimates. Despite the aforementioned limitations, we believe our findings are reliable and robust.

Taken together, we analyzed 519 complete VP1 gene sequences of the human norovirus GII.2 genotype. Most of the samples were collected during the outbreak during the 2016–2017 winter season from the Mainland of China. We explored several aspects about geographic distribution, demographic analysis and selective pressure of the GII.2 norovirus strains. This information is important for surveillance of viral strains with pandemic potential. Based on the phylogenetic analysis of these sequences, we showed an increase in the genetic diversity of human norovirus GII.2 for at least eight distinct clusters, and the geographic and gene divergence varied among clusters. Our results sufficiently demonstrated an increase in the genetic diversity of human norovirus GII.2 during the recent Asian outbreak and a highly fluctuating effective population size. We also indicated that viral evolutionary adaptation was not responsible for the volatility of or spread of the virus during this time. Our results also emphasize the importance of ongoing genetic and demographic surveillance of the epidemic, especially, sampled during the epidemic and across all the geographical range of the virus. These efforts, combined with epidemiological investigation, clinical recognition, and population movement data, are required to trace the history of recent epidemic waves and to assess adaptability and selection in HuNoV GII.2 strains. This is increasingly important for surveillance of viral strains with pandemic potential, and will further guide research on vaccines and therapeutic targets.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.

Author Contributions

XL conceived and designed the study, performed the experiments, and drafted the manuscript. XL, BR, and EV analyzed the data. XL, HL, BR, EV, and SK interpreted data and provided critical comments. All authors reviewed and approved the final manuscript.

Funding

This study was supported by a grant from the National Key Research and Development Program of China (2018YFC1603803) to HL, and this study was also supported by grants from U01 GM110749 (NIH/NIGMS) and R01 GM093939 (NIH/NIGMS) to SK.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2021.655567/full#supplementary-material

Supplementary Figure 1 | Phylogenetic relationships of complete VP1 gene sequences from HuNoV GII.2. The tree was inferred using a maximum-likelihood analysis of 519 complete VP1 gene sequences. Bootstrap values were shown at the nodes. Clusters were shown with different colors. The scale bar showed nucleotide substitutions per site.

Supplementary Figure 2 | Likelihood-mapping analyses of complete VP1 gene sequences from HuNoV GII.2. Likelihood-mapping analyses were performed for this data set, and also each of eight clusters defined by the maximum-likelihood phylogenetic analyses of complete VP1 gene sequences from HuNoV GII.2.

Supplementary Figure 3 | Average pairwise genetic distances within and between HuNoV GII.2 clusters using ggplot2 package. Green and orange colors indicated within and between human norovirus GII.2 clusters.

Supplementary Figure 4 | Sampling time interval with respective genetic distance between pairs of HuNoV GII.2 clusters. Green and orange colors indicated within and between HuNoV GII.2 clusters.

Supplementary Figure 5 | Estimated time to the most recent common ancestor of each HuNoV GII.2 cluster. Clusters were shown with different colors.

Supplementary Figure 6 | Estimated effective population size through time using Bayesian Gibbs-within-Metropolis MCMC and fast maximum a posteriori algorithm. Estimated effective population size through time was based on the tree in Supplementary Figure 1.

Footnotes

References

Ahmed, S. M., Hall, A. J., Robinson, A. E., Verhoef, L., Premkumar, P., Parashar, U. D., et al. (2014). Global prevalence of norovirus in cases of gastroenteritis: a systematic review and meta-analysis. Lancet Infect. Dis. 14, 725–730. doi: 10.1016/s1473-3099(14)70767-4

CrossRef Full Text | Google Scholar

Ao, Y., Cong, X., Jin, M., Sun, X., Wei, X., Wang, J., et al. (2018). Genetic analysis of reemerging GII.P16-GII.2 Noroviruses in 2016-2017 in China. J. Infect. Dis. 218, 133–143. doi: 10.1093/infdis/jiy182

PubMed Abstract | CrossRef Full Text | Google Scholar

Ao, Y., Wang, J., Ling, H., He, Y., Dong, X., Wang, X., et al. (2017). Norovirus GII.P16/GII.2-associated gastroenteritis, China, 2016. Emerg. Infect. Dis. 23, 1172–1175.

Google Scholar

Bidalot, M., Thery, L., Kaplon, J., De Rougemont, A., and Ambert-Balay, K. (2017). Emergence of new recombinant noroviruses GII.p16-GII.4 and GII.p16-GII.2, France, winter 2016 to 2017. Euro. Surveill. 22:30508.

Google Scholar

Chakravarty, S., Hutson, A. M., Estes, M. K., and Prasad, B. V. (2005). Evolutionary trace residues in noroviruses: importance in receptor binding, antigenicity, virion assembly, and strain diversity. J. Virol. 79, 554–568. doi: 10.1128/jvi.79.1.554-568.2005

PubMed Abstract | CrossRef Full Text | Google Scholar

Chhabra, P., De Graaf, M., Parra, G. I., Chan, M. C., Green, K., Martella, V., et al. (2019). Updated classification of norovirus genogroups and genotypes. J. Gen. Virol. 100, 1393–1406. doi: 10.1099/jgv.0.001318

PubMed Abstract | CrossRef Full Text | Google Scholar

Chhabra, P., Graaf, M., Parra, G. I., Chan, M. C., Green, K., Martella, V., et al. (2020). Corrigendum: updated classification of norovirus genogroups and genotypes. J. Gen. Virol. 101:893. doi: 10.1099/jgv.0.001475

PubMed Abstract | CrossRef Full Text | Google Scholar

Darriba, D., Taboada, G. L., Doallo, R., and Posada, D. (2012). jModelTest 2: more models, new heuristics and parallel computing. Nat. Methods 9:772. doi: 10.1038/nmeth.2109

PubMed Abstract | CrossRef Full Text | Google Scholar

Drummond, A. J., Suchard, M. A., Xie, D., and Rambaut, A. (2012). Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 29, 1969–1973. doi: 10.1093/molbev/mss075

PubMed Abstract | CrossRef Full Text | Google Scholar

Glass, R. I., Parashar, U. D., and Estes, M. K. (2009). Norovirus gastroenteritis. N. Engl. J. Med. 361, 1776–1785.

Google Scholar

Guindon, S., and Gascuel, O. (2003). A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52, 696–704. doi: 10.1080/10635150390235520

PubMed Abstract | CrossRef Full Text | Google Scholar

Hall, T. A. (1999). BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp. Ser. 41, 95–98.

Google Scholar

Hata, M., Nakamura, N., Kobayashi, S., Onouchi, A., Saito, T., Hirose, E., et al. (2018). Emergence of new recombinant noroviruses GII.P16-GII.2 and GII.P16-GII.4 in Aichi, Japan, during the 2016/17 season. Jpn. J. Infect. Dis. 71, 319–322. doi: 10.7883/yoken.jjid.2017.520

PubMed Abstract | CrossRef Full Text | Google Scholar

Jia, F., Lo, N., and Ho, S. Y. (2014). The impact of modelling rate heterogeneity among sites on phylogenetic estimates of intraspecific evolutionary rates and timescales. PLoS One 9:e95722. doi: 10.1371/journal.pone.0095722

PubMed Abstract | CrossRef Full Text | Google Scholar

Katoh, K., and Standley, D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780. doi: 10.1093/molbev/mst010

PubMed Abstract | CrossRef Full Text | Google Scholar

Kobayashi, M., Matsushima, Y., Motoya, T., Sakon, N., Shigemoto, N., Okamoto-Nakagawa, R., et al. (2016). Molecular evolution of the capsid gene in human norovirus genogroup II. Sci. Rep. 6:29400.

Google Scholar

Kroneman, A., Vega, E., Vennema, H., Vinje, J., White, P. A., Hansman, G., et al. (2013). Proposal for a unified norovirus nomenclature and genotyping. Arch. Virol. 158, 2059–2068. doi: 10.1007/s00705-013-1708-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Medici, M. C., Tummolo, F., Martella, V., De Conto, F., Arcangeletti, M. C., Pinardi, F., et al. (2018). Emergence of novel recombinant GII.P16_GII.2 and GII. P16_GII.4 Sydney 2012 norovirus strains in Italy, winter 2016/2017. New Microbiol. 41, 71–72.

Google Scholar

Murrell, B., Weaver, S., Smith, M. D., Wertheim, J. O., Murrell, S., Aylward, A., et al. (2015). Gene-wide identification of episodic selection. Mol. Biol. Evol. 32, 1365–1371. doi: 10.1093/molbev/msv035

PubMed Abstract | CrossRef Full Text | Google Scholar

Nagasawa, K., Matsushima, Y., Motoya, T., Mizukoshi, F., Ueki, Y., Sakon, N., et al. (2018). Genetic Analysis of human Norovirus strains in Japan in 2016-2017. Front. Microbiol. 9:1. doi: 10.3389/fmicb.2018.00001

PubMed Abstract | CrossRef Full Text | Google Scholar

Niendorf, S., Jacobsen, S., Faber, M., Eis-Hubinger, A. M., Hofmann, J., Zimmermann, O., et al. (2017). Steep rise in norovirus cases and emergence of a new recombinant strain GII.P16-GII.2, Germany, winter 2016. Euro. Surveill. 22:30447.

Google Scholar

Parra, G. I., Squires, R. B., Karangwa, C. K., Johnson, J. A., Lepore, C. J., Sosnovtsev, S. V., et al. (2017). Static and evolving Norovirus genotypes: implications for epidemiology and immunity. PLoS Pathog. 13:e1006136. doi: 10.1371/journal.ppat.1006136

PubMed Abstract | CrossRef Full Text | Google Scholar

Pond, S. L., Frost, S. D., and Muse, S. V. (2005). HyPhy: hypothesis testing using phylogenies. Bioinformatics 21, 676–679. doi: 10.1093/bioinformatics/bti079

PubMed Abstract | CrossRef Full Text | Google Scholar

Prasad, B. V., Hardy, M. E., Dokland, T., Bella, J., Rossmann, M. G., and Estes, M. K. (1999). X-ray crystallographic structure of the Norwalk virus capsid. Science 286, 287–290. doi: 10.1126/science.286.5438.287

PubMed Abstract | CrossRef Full Text | Google Scholar

Rambaut, A., Drummond, A. J., Xie, D., Baele, G., and Suchard, M. A. (2018). Posterior summarisation in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67, 901–904. doi: 10.1093/sysbio/syy032

PubMed Abstract | CrossRef Full Text | Google Scholar

Rambaut, A., Lam, T. T., Max Carvalho, L., and Pybus, O. G. (2016). Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2:vew007. doi: 10.1093/ve/vew007

PubMed Abstract | CrossRef Full Text | Google Scholar

Sagulenko, P., Puller, V., and Neher, R. A. (2018). TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol. 4:vex042.

Google Scholar

Schmidt, H. A., Strimmer, K., Vingron, M., and Von Haeseler, A. (2002). TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18, 502–504. doi: 10.1093/bioinformatics/18.3.502

PubMed Abstract | CrossRef Full Text | Google Scholar

Strimmer, K., and von Haeseler, A. (1997). Likelihood-mapping: a simple method to visualize phylogenetic content of a sequence alignment. Proc. Natl. Acad. Sci. U.S.A. 94, 6815–6819. doi: 10.1073/pnas.94.13.6815

PubMed Abstract | CrossRef Full Text | Google Scholar

Tamura, K., Nei, M., and Kumar, S. (2004). Prospects for inferring very large phylogenies by using the neighbor-joining method. Proc. Natl. Acad. Sci. U.S.A. 101, 11030–11035. doi: 10.1073/pnas.0404206101

PubMed Abstract | CrossRef Full Text | Google Scholar

Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M., and Kumar, S. (2011). MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 28, 2731–2739. doi: 10.1093/molbev/msr121

PubMed Abstract | CrossRef Full Text | Google Scholar

Tan, M., Hegde, R. S., and Jiang, X. (2004). The P domain of norovirus capsid protein forms dimer and binds to histo-blood group antigen receptors. J. Virol. 78, 6233–6242. doi: 10.1128/jvi.78.12.6233-6242.2004

PubMed Abstract | CrossRef Full Text | Google Scholar

Thanusuwannasak, T., Puenpa, J., Chuchaona, W., Vongpunsawad, S., and Poovorawan, Y. (2018). Emergence of multiple norovirus strains in Thailand, 2015-2017. Infect. Genet. Evol. 61, 108–112. doi: 10.1016/j.meegid.2018.03.021

PubMed Abstract | CrossRef Full Text | Google Scholar

To, T. H., Jung, M., Lycett, S., and Gascuel, O. (2016). Fast dating using least-squares criteria and algorithms. Syst. Biol. 65, 82–97. doi: 10.1093/sysbio/syv068

PubMed Abstract | CrossRef Full Text | Google Scholar

Tohma, K., Lepore, C. J., Ford-Siltz, L. A., and Parra, G. I. (2017). Phylogenetic analyses suggest that factors other than the capsid protein play a role in the epidemic potential of GII.2 Norovirus. mSphere 2:e00187-17.

Google Scholar

Vinje, J. (2015). Advances in laboratory methods for detection and typing of norovirus. J. Clin. Microbiol. 53, 373–381. doi: 10.1128/jcm.01535-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Volz, E. M., and Didelot, X. (2018). Modeling the growth and decline of pathogen effective population size provides insight into epidemic dynamics and drivers of antimicrobial resistance. Syst. Biol. 67, 719–728. doi: 10.1093/sysbio/syy007

PubMed Abstract | CrossRef Full Text | Google Scholar

Volz, E. M., and Frost, S. D. W. (2017). Scalable relaxed clock phylogenetic dating. Virus Evol. 3:vex025.

Google Scholar

Wertheim, J. O., Murrell, B., Smith, M. D., Kosakovsky Pond, S. L., and Scheffler, K. (2015). RELAX: detecting relaxed selection in a phylogenetic framework. Mol. Biol. Evol. 32, 820–832. doi: 10.1093/molbev/msu400

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng, D. P., Ando, T., Fankhauser, R. L., Beard, R. S., Glass, R. I., and Monroe, S. S. (2006). Norovirus classification and proposed strain nomenclature. Virology 346, 312–323. doi: 10.1016/j.virol.2005.11.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: human norovirus, genetic diversity, positive selection, virus evolution, phylogenetic analyses

Citation: Li X, Liu H, Rife Magalis B, Kosakovsky Pond SL and Volz EM (2021) Molecular Evolution of Human Norovirus GII.2 Clusters. Front. Microbiol. 12:655567. doi: 10.3389/fmicb.2021.655567

Received: 19 January 2021; Accepted: 15 February 2021;
Published: 22 March 2021.

Edited by:

Kai Huang, University of Texas Medical Branch at Galveston, United States

Reviewed by:

Silva Luciana Damascena, Evandro Chagas Institute, Brazil
Tao Jin, Guangdong Meige Gene Technology Co., Ltd., China

Copyright © 2021 Li, Liu, Rife Magalis, Kosakovsky Pond and Volz. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xingguang Li, eGluZ2d1YW5nbGVlQGhvdG1haWwuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Molecular Evolution of Human Norovirus GII.2 Clusters

Introduction

Materials and Methods

Sequence Data Set

Phylogenetic, Likelihood-Mapping, and Genetic Distance Analyses

Evolutionary Rate, Time Origin, and Past Population Dynamic Inferences

Analysis of Selection

Results

Sequence Data Set Information

Phylogenetic Analysis

Likelihood-Mapping and Evolutionary Divergence Analysis

Evolutionary Rate and Time Origin Analyses

Analysis of Past Population Dynamics

Analysis of Selection

Discussion

Data Availability Statement

Author Contributions

Funding

Conflict of Interest

Supplementary Material

Footnotes

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good