Genome Sequence Variations of Infectious Bronchitis Virus Serotypes From Commercial Chickens in Mexico

Kariithi, Henry M.; Volkening, Jeremy D.; Leyson, Christina M.; Afonso, Claudio L.; Christy, Nancy; Decanini, Eduardo Lucio; Lemiere, Stéphane; Suarez, David L.

doi:10.3389/fvets.2022.931272

ORIGINAL RESEARCH article

Front. Vet. Sci., 12 July 2022

Sec. Veterinary Epidemiology and Economics

Volume 9 - 2022 | https://doi.org/10.3389/fvets.2022.931272

This article is part of the Research TopicSequencing and Phylogenetic Analysis as a Tool in Molecular Epidemiology of Veterinary Infectious DiseasesView all 19 articles

Genome Sequence Variations of Infectious Bronchitis Virus Serotypes From Commercial Chickens in Mexico

Henry M. Kariithi^1,2^*

Jeremy D. Volkening³

Christina M. Leyson¹

Claudio L. Afonso³

Nancy Christy⁴

Eduardo Lucio Decanini⁴

Stéphane Lemiere⁵

David L. Suarez¹^*

¹Exotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, U.S. National Poultry Research Center, USDA-ARS, Athens, GA, United States
²Biotechnology Research Institute, Kenya Agricultural and Livestock Research Organization, Nairobi, Kenya
³BASE₂BIO, Oshkosh, WI, United States
⁴Boehringer Ingelheim Animal Health, Guadalajara, Mexico
⁵Boehringer Ingelheim Animal Health, Lyon, France

New variants of infectious bronchitis viruses (IBVs; Coronaviridae) continuously emerge despite routine vaccinations. Here, we report genome sequence variations of IBVs identified by random non-targeted next generation sequencing (NGS) of vaccine and field samples collected on FTA cards from commercial flocks in Mexico in 2019–2021. Paired-ended sequencing libraries prepared from rRNA-depleted RNAs were sequenced using Illumina MiSeq. IBV RNA was detected in 60.07% (n = 167) of the analyzed samples, from which 33 complete genome sequences were de novo assembled. The genomes are organized as 5'UTR-[Rep1a-Rep1b-S-3a-3b-E-M-4b-4c-5a-5b-N-6b]-3'UTR, except in eight sequences lacking non-structural protein genes (accessory genes) 4b, 4c, and 6b. Seventeen sequences have auxiliary S2' cleavage site located 153 residues downstream the canonically conserved primary furin-specific S1/S2 cleavage site. The sequences distinctly cluster into lineages GI-1 (Mass-type; n = 8), GI-3 (Holte/Iowa-97; n = 2), GI-9 (Arkansas-like; n = 8), GI-13 (793B; n = 14), and GI-17 (California variant; CAV; n = 1), with regional distribution in Mexico; this is the first report of the presence of 793B- and CAV-like strains in the country. Various point mutations, substitutions, insertions and deletions are present in the S1 hypervariable regions (HVRs I-III) across all 5 lineages, including in residues 38, 43, 56, 63, 66, and 69 that are critical in viral attachment to respiratory tract tissues. Nine intra-/inter-lineage recombination events are present in the S proteins of three Mass-type sequences, two each of Holte/Iowa-97 and Ark-like sequence, and one each of 793B-like and CAV-like sequences. This study demonstrates the feasibility of FTA cards as an attractive, adoptable low-cost sampling option for untargeted discovery of avian viral agents in field-collected clinical samples. Collectively, our data points to co-circulation of multiple distinct IBVs in Mexican commercial flocks, underscoring the need for active surveillance and a review of IBV vaccines currently used in Mexico and the larger Latin America region.

Introduction

Infectious bronchitis virus (IBV; type species of the Coronaviridae family) causes the acute, highly contagious avian infectious bronchitis (IB) disease that primarily affects the upper respiratory tract in chickens of any age, but which can also cause urogenital or enteric disease, resulting in decreased production depending on virus strain and co-infecting pathogens, age, vaccination history, and immune competency of chickens (1, 2). The virus is shed by naturally infected and/or vaccinated birds and is transmitted via respiratory discharges (acute phase) and feces (disease recovery phase) to susceptible naïve birds (3). After initial respiratory tract infection, the virus can be disseminated to other tissues including trachea, lungs, kidney, oviduct, alimentary, and proventriculus (2).

The non-segmented, positive single-stranded RNA genome (~ 27.6 kb in size) of IBV, which can also serve as a viral mRNA, comprises six genes flanked by 5′-/3′- untranslated regions (UTRs) (4, 5). Occupying about two thirds of the genomic 5′-end, gene 1 encodes the replicase/polymerase complex 1a and 1ab (Rep1a/1ab), which is produced by a programmed−1 ribosomal frameshifting mechanism that allows continuation of translation beyond the Rep1a stop codon. Rep1a/ab is proteolytically cleaved into 15 non-structural proteins by the virally-encoded accessory gene 3 (papain-like protease; PL ^pro) and gene 5 (3-C-like protease; 3CL ^pro) proteases (6, 7). Gene 2 encodes spike (S) glycoprotein, the largest structural and most divergent of all IBV proteins, which is proteolytically cleaved by cellular furin protease into subunits S1 and S2 (8). Genes 3 and 4 encode two accessory proteins and one membrane-binding structural protein each [3a/3b, and envelope (E), and 4b/4c, and membrane (M), respectively]. Gene 5 encodes accessory proteins 5a and 5b, while gene 6 encodes the structural nucleocapsid (N) protein and accessory protein 6. Accessory genes 3a, 3b, 4b, 4c, 5a, 5b, and 6b, which have a wide range of genomic configurations, are non-essential for virus replication (9). Accessory genes 4b, 4c, and 6b are rarely reported in the literature despite being present in many unpublished IBV genome sequences, but little is known about their roles in viral replication or pathogenesis (10, 11).

Whereas, spike subunit S2 is highly conserved amongst IBVs, the hypervariable regions (HVRs I-III) of the S1 harbor the majority of nucleotide (nt) heterogeneity between different strains (12, 13). Subunit S1 also contains the receptor-binding domain (RBD), which is essential for entering susceptible chicken cells and induction of host immune responses (14). New IBV variants, which can evade vaccine-induced immunity, continually emerge due to the mutations (caused by replication errors), and/or recombination events (caused by template switching) in the S1 sequences (15–17). Due to its close correlation with IBV serotypes, S1 sequence is used to classify IBV into 7 genogroups (GI-GVII) comprising at least 32 distinct lineages and several unassigned inter-lineage recombinants (1, 9, 18–20). “Genogroup” represents stable IBV categories and “lineage” is a descriptive dynamic serotype grouping (21).

The Mass-type serotype was first identified in the USA in the 1930s (22), but IBVs are now globally distributed in poultry, with regional-specific genotypes and serotypes that are often closely related to the live attenuated vaccine strains used in the regions, or are unique variants (15, 18, 23–25). Serotypes belonging to lineage GI-11 [South American 1 (SAI) serotypes] and lineage GI-16 (Q1-like serotypes) extensively circulate in South American chicken flocks (21). The uniquely South American SAI serotypes, which are associated with respiratory/enteric disease and reduced egg production, emerged in the 1950s in commercial poultry flocks in Brazil, and later spread to Argentina and Uruguay (18, 25). The Q1-like serotypes, which emerged in late 1970s in Asia, are more widespread than the SAI serotypes (18, 21). Both the SAI and Q1-like serotypes have low antigenic relatedness with the Mass-type vaccine strains (lineage GI-1) that are extensively used vaccination programs globally (16, 21). Other serotypes reported in South America include lineage GI-13 (793B or 4/91-like) in Brazil, Chile and Honduras (26–28). In Mexico, Mass-type (lineage GI-1), Holte/Iowa-97 (BL-56; lineage GI-3), Ark (lineage GI-9), and Q1-like serotypes have been reported in commercial flocks (29–31).

Live-attenuated (derived from virulent strains attenuated via serial passage in chicken eggs) or inactivated vaccines are routinely used in IB control (32–34). Routine use of serotype-specific live vaccines can result in evolution of novel variants (due to mutations, insertions, deletions, or recombination between co-infecting field and vaccine strains) that are serologically distinct from the vaccine strains, with potential negative impacts on vaccine efficacy (35–37). The apparent regionality of IBV diversity underscores the importance of viral characterization, which can be used to assess and properly deploy existing vaccines and potentially identify when new vaccines need to be developed.

In the current study, we sequenced 30 complete IBV genome sequences from clinical field samples collected from commercial flocks in Northern, Central and Southern Mexico in 2019–2021, along with 3 vaccine samples. We performed comparative analysis of sequence variation (vaccine vs. field sequences) and phylogenetic relationships with other IBVs, and assessed potential recombination events, point mutations, insertions, and deletions in the HVRs of the S glycoprotein. The data presented here expand current knowledge of the IBVs circulating in Mexico, which can inform vaccination strategies to control IB outbreaks in the country and in Latin America.

Materials and Methods

Type, Origin, and Processing of Samples

The samples used in this study were randomly collected from “apparently healthy” (i.e., no observable clinical signs of disease at the time of sampling) commercial broiler (aged 21–42 days) and layer (aged 7 weeks) chicken flocks in central, northern, and southern regions of Mexico between April 2019 and December 2021. The samples were derived from respiratory (choana and lung), immunological (spleen and bursa) and digestive (cloaca) tissues from 100 birds per flock using sterile flocked swabs and pooled (25 samples per pool) in sterile 1.5 mL viral transport media. Samples were then spotted on 4-sample-area (125 μl per area) Whatman Flinders Technology Associates (FTA) cards^® (Millipore-Sigma) within 24 h of collection and dried for at least 2 h at room temperature (RT; 15–25°C). After drying, each sample-spotted FTA card was individually enclosed in double leak-proof zip-lock plastic bags with Whatman desiccant packets (GE healthcare). In addition to field samples, three vaccine samples (i.e., live attenuated Mass-type, 4/91 variant and Mass-type/Connecticut recombinant strains) from Boehringer Ingelheim Animal Health (BIAH), Mexico, were also FTA-spotted. The detailed information about what flocks were vaccinated and with what type of vaccine is not publicly available because of propriety and confidentiality between BIAH and their clients. All samples were shipped to the Southeast Poultry Research Laboratory (SEPRL), USDA-ARS, Athens, GA, and stored at −80°C in a BSL-3 laboratory until further processing.

RNA Extraction

From each sample-spotted FTA card, 24, 3-mm disks (i.e., 6 disks per spotted area) were punched out using sterile disposable biopsy punches (Robbins Instruments, USA) and incubated for 30 min at room temperature in 240 μL of nuclease-free TE buffer (10 mM Tris-HCl; 0.1 mM EDTA, pH 8.0) to elute nucleic acids. Total RNA was extracted from 100 μL of the TE eluate using MagMAX™-96 AI/ND Viral RNA Isolation Kit (Thermo Fisher Scientific, MA, USA) on an automated KingFisher Magnetic Particle Processor (Thermo Fisher, USA) following manufacturer's instructions. To selectively deplete abundant host-specific RNAs (18S, 28S and mitochondrial) and bacterial rRNAs (16S/23S), extracted RNAs were treated with an in-house RNaseH host rRNA depletion protocol we have recently described (38).

Library Preparation and NGS

Sequencing libraries were prepared using sequence-independent, single-primer amplification (SISPA) as previously described (39). Briefly, cDNAs were synthesized from 10 μL of RNA using random K-8N primer with SuperScript ^TM IV First Strand synthesis Kit (Invitrogen, USA) and Klenow polymerase (NEB Inc., USA) kits, and then purified using Agencourt AMPure XP beads (Beckman Coulter Life Sciences, USA). Purified cDNAs were amplified by Phusion^® High-Fidelity PCR Kit (NEB Inc., USA), and used to prepare sequencing libraries by the Nextera ^TM DNA Flex kit (Illumina, USA), which were then quantified by Qubit™ dsDNA HS Assay Kit (Thermo-Fisher Scientific) and Agilent 4150 TapeStation HS D5000 System (Agilent Technologies, Inc.). Based on their concentrations and average fragment sizes, equimolar concentrations (4 nM, 8 μL of each library) of the libraries were pooled, then digested by incubation with 0.2 N NaOH (5 min at RT). Pooled libraries were further diluted to 10 pM final concentration, a control library added (5% PhiX library v 3) and paired-end (2 × 300 bp) sequencing performed using a 600-cycle MiSeq Reagent Kit v3 (Illumina, USA). Each NGS run consisted of 48 multiplexed samples.

Sequence Assembly

Customized workflows executed in Galaxy and Geneious Prime^® 2021.2.2 platforms were used to assess the quality of the raw NGS reads (by FastQC) and trimming of adaptors (by Cutadapt) as previously described (40–43). Briefly, the host (chicken) and PhiX control reads were filtered out by mapping the trimmed reads against chicken (Gallus gallus) and PhiX174 reference genomes using BWA-MEM v 0.7.15.1 (44) with standard parameters. Trimmed/filtered overlapping forward and reverse read-pair sets were merged with PEAR v 0.9.6.1 (45); an in-house wrapper tool was then used to identify and remove chimeric Nextera reads. After digital normalization of the reads via k-mer abundances using khmer package (46), de novo sequence assembly was performed using MIRA v3.4.0 (47). BWA-MEM/samtools was used to re-call consensus sequences from the NGS reads aligned to the de novo-generated contigs (minimum coverage depth to call a base set at 3X).

Sequence Annotation and Phylogenetic Analysis

Open reading frames (ORFs; used here to refer to contiguous nt stretches from start to stop codons without interrupting in-frame stop codons) in the assembled consensus sequences were determined and annotated using ORF Finder (minimum ORF size set at 50 NT including start and stop codons) within the Geneious Prime^® v 2022.1.1 (https://www.geneious.com). ORFs were confirmed by comparative analyses with annotated coronavirus (CoV) genes and genomes available at GenBank and/or PubMed. Classification of the assembled sequences was based on Valastro et al. (18). Putative protein domains were predicted using translated protein sequences with InterProScan v 2.0 (48) executed in Geneious Prime. For confirmation, putative annotations were aligned with similar annotations (coding regions and domains/motifs of translated amino acid sequences) of a Geneious Prime local database (containing reference sequences retrieved from GenBank and/or PubMed using BLASTp algorithm). ProPserver v1.0 (49) was used to predict putative cleavage site of the S glycoprotein at the border between subunits S1 and S2 (R-X-X-R↓S motif), and in S2′ site (R-X-R↓S motif) located upstream of subunit S2 ectodomain. Asparagine (N)-linked glycosylation sites in S1 protein sequences were predicted using NetNGlyc-1.0 (www.cbs.dtu.dk/services/NetNGlyc/) only sites with scores of at least 0.6 and supported by six of the nine predictive neural networks of the server were accepted.

The genome and specific gene sequences obtained from this study, together with sequences of representative serotypes within the IBV lineages [retrieved from the GenBank; lineages based on current IBV classification system (18)] were used for multiple sequence alignment using MAFFT v 7.490 (50). To minimize effects of poorly aligned regions, the multiply aligned sequences were trimmed using trimAl tool v 1.3 (51) with gappyout mode. Phylogenetic analysis was performed using maximum likelihood method executed in MEGA with 1,000 bootstrap replicates of the original data and the best model automatically identified by the software (52).

Analyses of Point Mutation, Insertions/Deletions, and Putative Recombination Events

Multiple alignments of amino acid sequences of subunit S1 HVRs I-III from this study, together with sequences of representative serotypes within IBV lineages, were analyzed for presence of deletions/insertions (hereafter abbreviated as indels) and point mutations. For recombination events analysis, SplitsTree5 v 5.0.0_alpha (53) was used to determine the likelihood of recombination events in the complete S-gene sequences. Recombination events were further examined using seven heuristic recombination detection algorithms (RDP, GENECONV, BootScan, MaxChi, Chimaera, SiSscan, and 3Seq) executed in the Recombination Detection Program 4 (RDP4) v4.101 software suite (54), at highest p-value of 0.05 with Bonferroni multiple correction and SEQ-GEN parametric data simulations. Confirmation of putative recombinant event was accepted only when the recombination breakpoints were detected by at least five of the seven algorithms, and with breakpoints of the transferred fragments (recombinant regions) supported by corrected p-values of ≤1 × 10–6.

Results

Sequencing Libraries and Sequencing Data

In this study, we analyzed three vaccine and 275 field samples derived from immunological (n = 126), respiratory (n = 141), digestive (n = 8) tissues, and the vaccine samples (n = 3). Average fragment length distribution of the adaptor-ligated libraries (TapeStation estimates) ranged from 425 to 608 bp, but actual post-NGS average fragment length distribution post-FastQC (excluding adaptor sequences) were shorter, a discrepancy attributable to the fact that shorter fragments tend to cluster more efficiently than longer fragments (55). Total trimmed/filtered read counts ranged from 16,541 to 1.3 million reads. Proportions of chicken-specific reads ranged from 0.82 to 77.8%, with only six samples having over 50% of the reads mapping to the host genome.

Detection of Viral and Pathogenic Bacterial RNAs

IBV RNA was detected in 60.07% (n = 167) of the analyzed samples. Fifty-five of the samples (20 immunological tissue samples, 32 respiratory tissue samples, and the 3 vaccine samples) had enough IBV-specific reads to allow for assembly of complete or nearly-complete genome or S-gene consensus sequences (Supplementary Table 1). In addition to IBVs, 10 spleen/bursa and nine choanal/lung tissues contained RNA of avian viruses belonging to families Astroviridae (chicken astrovirus, serogroup 1b), Birnaviridae (infectious bursal disease virus, genogroup 2b), Paramyxoviridae (avian paramyxovirus type 1, subgenotype V.1), Pneumoviridae (avian metapneumovirus subtype A), Reoviridae (avian rotavirus serogroups A, D, and F), and Picornaviridae (avian nephritis virus, chicken gallivirus A, chicken megrivirus group C-3, and sicinivirus type A). Picornaviruses, and in particular SiV, were overrepresented (detected together with IBV in 73.68% of the 19 samples). In addition to viral agents, avian pathogenic bacterial species were detected in 29 out of the 55 samples, including Bordetella, Enterococcus spp, Gallibacterium anatis, Salmonella enterica, and Streptococcus spp.

Assembly of IBV Genome Sequences

Thirty-three complete genome sequences, six partial genome sequences, three complete S-gene sequences, and 13 partial S-gene sequences of IBVs were assembled (Table 1). Further analyses were restricted to the 33 complete genome sequences. One sequence was obtained from a spleen/bursa tissue sample swabbed from a 7-week old layer chicken, while 21 and nine sequences were from choanal/lung and spleen/bursa tissue samples, respectively, swabbed from broiler chickens aged between 21 and 42 days. All 33 genome sequences were supported by sufficient read depths (median read depths of 10–1,175 X) and genome coverage (99.98–100%). However, 19 of the genome sequences missed short stretches at the 5′- and/or the 3′- termini, which is not unusual for randomly primed viral genome sequencing (56).

TABLE 1

Table 1. Sequence assembly coverage of 33 complete genome sequences assembled in this study.

Sequence Analyses

Genomic Organization and Features

All 33 complete sequences, with lengths ranging from 27,022 to 27,805 nt excluding poly(A) tails, contain the six IBV genes flanked by 5′- and 3′-UTRs (294–643 nt and 136–446 nt in length, respectively); 14 of the sequences, including the 3 vaccine sequences, have poly(A) tails of variable lengths (Supplementary Table 2). The genomes are organized as 5'UTR-[Rep1a-Rep1b-S-3a-3b-E-M-4b-4c-5a-5b-N-6b]-3'UTR, with 25 sequences having a cassette of seven “accessory” genes (3a, 3b, 4b, 4c, 5a, 5b, and 6b) interspersed variably downstream of gene 2 (S) genomic region. Genes 4b, 4c, and 6b are absent in eight sequences as follows: both genes 4b and 4c are absent in field sequence 2360/20 from Southern Mexico, 4b is absent in sequences 2359/20 and 2754/21 (from Southern Mexico) and 2723/21 (from Northern Mexico), and 6b is absent in the Mass-type vaccine strain 1616/19 and Mass-type/Conn recombinant vaccine strain 1623/19 sequences, and the field sequences 2523/21 and 2598/21 from Central and Northern Mexico, respectively.

Gene 1 (Rep1a/1ab Complex)

Lengths of gene 1 (containing 2 overlapping ORFs encoding Rep1a and Rep1ab) varies from 19,490 to 19,970 nt among the 33 sequences, but all sequences have a 4-nt overlap between Rep1a and 1b (Table 2). Whereas, the lengths of Rep1a vary (11,520–11,937 nt), 30 Rep1b sequences are 8,037 nt long; the field sequences 2721/21, 2353/20, and 2752/21 have comparatively shorter Rep1b lengths (7,935, 7,938, and 7,974 nt, respectively). Domain features and borders of accessory genes 2-16 produced from the cleavage of Rep1ab by the virally-encoded proteases PL ^pro (gene 3) and 3CL ^pro (gene 5) are shown in the Supplementary Table 3. As expected for CoVs (11, 57–60), cleavage sites and lengths of the accessory genes in Rep1a/1ab are conserved across all the sequences, including amino acid residues Q/S required by the 3CL ^pro for the cleavage of Rep1ab into Rep1a and 1ab, which releases products of genes10 (exonuclease; 145 amino acids), 11 (unknown function; 23 amino acids), and 12 (RdRp; 917 amino acids). Although the cleavage sites of PL ^pro are conserved in all sequenced, their length varies (1,529–1,619 amino acids) compared to the consistent lengths of the main CoV protease, 3CL ^pro (307 amino acid residues).

TABLE 2

Table 2. Nucleotide overlaps between genes and ORFs in the 33 IBV sequences analyzed in this study.

Gene 2 (S)

In all 33 sequences, gene 1 (Rep1a/1ab) and gene 2 (S) overlap by 50 nt (Table 2). Lengths of gene 2 (with a single ORF encoding S glycoprotein) vary, with the most sequences (n = 13) having 3,495 nt and others with either 3,510 nt (n = 7) or 3,498 nt (n = 6) (Supplementary Table 4). The shortest S genes are found in the Mass-type vaccine strain 1616/19 (3,462 nt) and the 4/91 vaccine variant 1619/19 (3,468 nt) sequences, and the field sequence 2598/21 (3,453 nt). Whereas, the lengths of subunit S1 vary (1,602–1,632 nt), all subunit S2 sequences have lengths of 1,878 nt, except the above-mentioned 3 sequences (1,851-nt long). Figure 1 illustrates general S glycoprotein features. The lengths of HVR I, II and III (residues 60–88, 115–142, and 275–293, respectively) vary among the 33 sequences; HVR I of 27 or 28 residues (in 11 and 22 sequences, respectively), HVR II of 25 or 27 residues (in 8 and 25 sequences, respectively), and HVR III of 18 residues (in all sequences).

FIGURE 1

Figure 1. Schematic and amino acid alignment of the overall features of the S glycoprotein. The scheme was constructed in Geneious Prime based on the S protein sequence of the field sequence 2602/21; all 33 S protein sequences in this study have similar general features (see Supplementary Table 4). Sequence IDs are shown on the alignments; the positions indicated at the top of the sequence alignments are in reference to ungapped amino acid residues; dots and dashes indicate identical and deleted amino acid residues, respectively. Features of subunit S1 include N-terminal signal peptide (SP), N-and C-terminal domains (S1-NTD/CTD), which harbor the receptor-binding domains (RBD), and hypervariable regions (HVR I-III). Shown is a 20- residue (running from positions P14–P6') motif furin S1/S2 cleavage site, consisting of a core region (8-residues; position P6–P2'), which harbors the canonical 4-residue motif (Rx[K/R]-R↓S); the core region is flanked by two solvent accessible regions (8-residue; P7 to P14, and a 4-residue; P6 to P2'). Note the backward and forward numbering of P1-P14 and P1'-P6', respectively, starting at the conserved R immediately upstream of the cleavage S1/S2 site. The auxiliary S2′ (PS(G)SPRxR↓S) cleavage site positioned 153-residues downstream the primary S1/S2 cleavage site is also shown. Subunit S2 domains include fusion peptide (FP), heptad repeat regions (HR1 and HR2), transmembrane domain (TM), and intracellular (IC) tail. Purple vertical bars represent predicted N-linked glycosylation sites.

All 33 S-gene sequences contain the canonical 4 amino acid (aa) residue furin recognition consensus motif R-X-[K/R]-R↓S at the S1/S2 cleavage site (X is any residue, ↓ is cleavage position and underlined residues are conserved)—Figure 1. The canonical motif is within a 20 aa region (positions P14 to P6', with backward and forward numbering of P1-P14 and P1'-P6', respectively, starting at the conserved R immediate of the R↓S cleavage position), consisting of a core region (8 aa; P6-P2') flanked by two solvent accessible regions (8 aa P7-P14, and a 4 aa; P6-P2') (61). The S1/S2 cleavage motif contains critical physical properties required for furin cleavage and fusion efficiency (61), including absence of the acidic cysteine residue in the core region (P1-P6), presence of a positively-charged residue at position P4 (R is favored), hydrophilic residues in regions flanking the S1/S2 site (positions P7-P10 and P3′-P6′), small hydrophilic residue at position P1′ (S is preferred) and hydrophobic-aliphatic residue at position P2′ (V is favored). There are however, two exemptions in the 8 aa P6-P2′ region: firstly, nine sequences have the hydrophobic F-residue at P3 (instead of preferred hydrophilic residue), and secondly, sequences 2731/21 and 2960/21 have the hydrophilic T-residue at P2′ (instead of the preferred hydrophobic V or I residues). These exceptions are not unusual as the specific interactions of the residues in P2′ and P3 within the furin cleavage pocket are unclear (61).

An auxiliary S2′ cleavage (motif PSxSPRxR↓S) is present in 17 sequences positioned 153 amino acid residues downstream of the primary S1/S2 cleavage site (Supplementary Table 4 and Figure 1). Further, all sequences contain a membrane-fusion peptide (FP) in the conserved region flanked by 2 cysteine (C) residues located immediately downstream of the S2′ cleavage site, with the consensus motif CTAGPLGF/(T)XKDLXC (Figure 1); underlined residues C, D, L, and C are conserved across CoVs (62). Numbers of N-linked glycosylation sites (on conserved consensus NXS/T motif) varies among the 33 sequences (22–27 sites), with subunit S1 having more sites (n = 12–16) compared to subunit S2 (n = 10–12); these numbers are within the range of 19–39 sites reported in CoVs (9).

Gene 3 (3a/3b and E)

A one-nt overlap occurs between gene 2 (S) and gene 3 (accessory genes 3a-3b and E) in 30 of the 33 sequences. In the Mass-type vaccine 1616/19 and the 4/91 vaccine variant 1619/19 sequences, and the field sequence 2598/21, there is a 26-nt non-coding region (hereafter abbreviated as VncRNA to refer to genomic region between adjacent genes or ORFs without any ORF) between the two genes (Table 2). Within gene 3, there is a 1-nt overlap between 3a and 3b, while 3b and E overlap by 17, 20, and 38 nt in six, 26 and one sequences, respectively. In all sequences, 3a and 3b are of the same lengths (174 and 195 nt, respectively), but the length of the structural E varies (285–330 nt) among the sequences (Supplementary Table 2). Differences in lengths of E-gene is expected due to its extreme divergence in CoVs (9). The E protein sequences in all sequences contained two conserved cysteine residues at position 45 and 46, which serve as E protein palmitoylation sites (63).

Gene 4 (M, 4b/4c)

The overlap between gene 3 and gene 4 (M and accessory genes 4b/4c) varies among the sequences (Table 2). Twelve sequences have a 20 nt overlap and nine sequences each have 23-nt and 29-nt overlaps, while the field sequence 2944/21 and the 4/91 vaccine variant 1619/19 sequence have 4 and 8-nt overlaps between the two genes, respectively. There is a 16-nt VncRNA region between the two genes in sequence 2930/21. Gene 4 ORFs vary in length; 654–690 nt (M gene), 102–327 nt (4b), and 132–171 nt (4c; 25 4c are 171 nt in length). Amongst the 3 gene 4 ORFs, there is no overlap of M and 4b in 27 sequences. Sequences 2721/21 and 2731/21 have VncRNAs of 21 nt and 13 nt between M and 4b, respectively. Sequences 2359/20, 2360/20, 2723/2021, and 2754/21 lack 4b; sequence 2360/20 lacks both 4b and 4c (Supplementary Table 2). In 28 sequences, 4b and 4c overlap by 80 nt; the two 2 accessory genes overlap by 17 and 95 nt in sequences 2731/21 and 2743/21, respectively.

Gene 5 (5a and 5b)

In all the sequences, gene 4 and gene 5 (5a-5b) overlap by 17 nt; 5a and 5b are 198 and 249 nt in length, respectively, all overlapping by 4-nt (see Table 2 and Supplementary Table 2). The presence and apparent conservation of 5a and 5b across all the 33 sequences in this study agree with reports that all avian CoVs contain gene 5, whose protein products are postulated to contribute to virus/host interactions (64).

Gene 6 (N and 6b)

Genes 5 and 6 (N and 6b) overlap by 58 nt in 29 of the 33 sequences (6b is absent in the Mass-type vaccine 1616/19 and Mass-type/Conn recombinant vaccine 1623/19 sequences, and field sequences 2523/21 and 2598/21) (Table 2). Lengths of VncRNAs between N and 6b varies among the sequences; 8 nt (n = 16 sequences), 23 nt (n = 11 sequences), 17 nt (in sequence 2960/21), and 27 nt (in sequence 2725/21)—Supplementary Table 2. Whereas, all N gene sequences are of the same length (1,230 nt), the lengths of 6b varies widely from 129 to 321 nt amongst the sequences.

Sequence Comparison

Relationships Between Vaccine and Field Sequences

Supplementary Table 5 presents the results of comparative pairwise nt sequence identities of vaccine vs. field sequences identified in this study. Amongst the three vaccines, the highest identity (100%) is between gene 3 (3a and 3b) the Mass-type vaccine 1616/19 and Mass-type/Conn recombinant vaccine 1623/19 sequences, and the lowest (77.31%) between subunit S1-gene of sequences of the Mass-type/Conn recombinant vaccine 1623/19 and the 4/91 vaccine variant 1619/19.

Comparing the vaccine vs. field sequences, the highest identities (99.43–100%) are between genes 3 and 4, and 5a and 5b of field sequence 2598/21 and their homologs in the vaccine sequences. The lowest identity is between 6b (60%) and the 3'-UTR (64.1%) of the field sequences 2723/21 and 2360/20, respectively, and their homologous genomic regions in the 4/91 vaccine variant 1619/19 sequence. For the S1-gene sequence, which is used for IBV classification (18), the highest identity (98.14%) is between field sequence 2860/21 and the Mass-type vaccine 1616/19 sequence, while lowest (75.78%) is between the field sequence 2944/21 and the 4/91 vaccine variant 1619/19 sequence. The two field sequences also showed similar identities in their complete S-gene sequences (highest and lowest identities of 98.15 and 80.15%, respectively) to the vaccine sequence.

Overall, the most conserved genomic regions amongst the 33 sequences are Rep1ab (88.5–93.24% nt identity) and 5b (93.98–100% nt identity), while the least conserved region are 6b (60–94.22% nt identity) and 4c (74.27–100% nt identity).

Relationships With Other Serotypes

Relationships between the 33 sequences in this study with other IBVs are summarized in Table 3 (complete S-gene sequences) and Supplementary Table 6 (complete genome sequence). Phylogenetic trees based on nt sequences of complete S-gene, S1-gene, and HVRs I-III classified the 33 sequences in this study within five different lineages (Figure 2).

TABLE 3

Table 3. BLASTn results of the 33 complete S-gene sequences in this study.

FIGURE 2

Figure 2. Maximum likelihood phylogenetic tree of nt sequences of subunit S1-gene, hypervariable regions I-III (HVRs I-III) of the S1-gene, and complete S-gene using GTR model in MEGA 6. The 33 sequences in this study (3 vaccine sequences highlighted in blue; 30 field sequences; color-coded based on sampling regions in Mexico) clustered with serotypes in 5 lineages of IBVs, i.e., lineages GI-1 (Mass-type serotypes; n = 8 sequences), GI-3 (Holte/Iowa/97 serotypes; n = 2 sequences), GI-9 (Ark-like serotypes; n = 8 sequences), GI-13 (793B also known as 4/91 serotypes; n = 14 sequences), and GI-17 (CAVs; n = 1 sequence). The analysis involved 94 sequences. All positions with <95% site coverage were eliminated. The final datasets had 1,576, 757, and 3,431 positions for the complete S1-gene, HVRs I-III, and complete S-gene, respectively.

Lineage GI-1 (Mass-Type). As shown in Table 3, six and two field and vaccine sequences are closely related to three different Mass-type strains, and the Mass-type/Conn recombinant vaccine 1623/19 sequence is 98.45% similar to the pathogenic field strain MN696791/ck/T&T/18RS1461-3/14 isolated from a vaccinated broiler with respiratory disease (65). The Mass-type vaccine 1616/19 sequence and field sequence 2598/21 from Northern Mexico are 99.51 and 98.51% identical to strain MK937828/ck/CN/I1124/16, respectively. The remaining five field sequences from southern Mexico (2602/21, 2754/21, 2743/21, 2748/21, 2860/21), are 99.46–99.89% identical to a Brazilian government-licensed Ma5 vaccine strain [KY626045/BR/Ma5/16; (66)]. Phylogenetic clustering based on complete S-gene, S1-gene and HVRs I-III mirrored the BLASTn results, where the above-mentioned five sequences segregate in a subclade containing the Brazilian Ma5 and Canadian/European/Brazilian H120 vaccine strains (Figure 2). Based on the HVR I-III, the Mass-type vaccine 1616/19 and Mass-type/Conn recombinant vaccine 1623/19 sequences are in a subclade containing the 2016 Chinese strain and a representative of Mass-type viruses (i.e., AY561711/Mas41), while the field sequence 2598/21 is in a distinct subclade containing the T&T and Connecticut vaccine strains. All of the 8 Mass-type sequences in this study contained an S2′ site downstream of the primary S1/S2 cleavage site (Supplementary Table 4 and Figure 1).

Lineage GI-3 (Holte/Iowa-97). Based on complete S-gene sequences, viruses 2960/21 and 2731/21 from Central Mexico match closest (87.52 and 87.66% nt identities) to strain GU393334/ck/US/Gray/60 belonging to lineage GI-3, which comprises of respirotropic and nephropathogenic Holte, JMK, Gray and Iowa-97 viruses (18, 67). However, BLASTn searches using complete genome sequences returned closest hit (93.39 and 92.99%) to the pathogenic field strain MH779860/ck/USA/Ark99/14 of lineage GI-9 (Supplementary Table 6). From Figure 2, complete S-gene does not phylogenetically place the two sequences with serotypes in either lineages GI-3 or GI-9; rather, the sequences are in a distinct subclade within a larger clade containing California variants (CAVs; lineage GI-17). However, based on the S1-gene and HVRs I-III, these sequences cluster with lineage GI-3 serotypes, but in a distinct subclade containing strain AF352831/BL-56/96, which has been previously described as uniquely Mexican (25).

Lineage GI-9 (Ark-Like). The North American lineage GI-9 serotypes were implicated in the rolling reactions in vaccinated flocks and the persistence of respiratory syndromes in flocks (25). Based on the complete S-gene sequences (Table 4), four sequences from Central Mexico (2930/21, 2961/21, 2562/21, and 2563/21), and four from Southern Mexico (2742/21, 2956/21, 2359/20, and 2360/20), are all closely-related to DQ458217/US/AL/4614/98 (94.19–94.65% nt identities), an Ark-DPI-derived vaccine virus originally isolated from a 40-day-old chicken with respiratory disease (68). Complete genome sequences returned similar hits, except that seven sequences are closest to an Ark pathogenic field strain (94.07–99.82% nt identity), while an attenuated ArkGA vaccine virus (MH779857/ck/USA/ArkGA-P20/15) is the closest match to sequence 2961/21 with 93.7% nt identity (Supplementary Table 6). Based on complete S-/S1-gene and HVRs I-III, all 8 sequences phylogenetically cluster in separate subclade distinct from other Ark-like strains, supported by bootstrap values of 100% (S-/S-1 genes) and 83% (HVRs I-III). As in the case of Mass-type viruses, all 8 Ark-like sequences contained the S2′ cleavage (Supplementary Table 4 and Figure 1).

TABLE 4

Table 4. Recombination events in the complete S-gene sequences in this study.

Lineage GI-13 (793B, 4/91 or CR88). Genome sequences of 13 field samples and the 4/91 vaccine variant 1619/19 match closest to and cluster with lineage GI-13 serotypes (see Table 4, Supplementary Table 6, and Figure 2). Although field and vaccine 793B-like sequences have a near global presence, they have rarely been reported in Latin America. Both complete genome and S-gene of the 4/91 vaccine variant 1619/19 is closest MN548285/ck/UK/CR88/11 (99.94% and 99.98% nt identities, respectively). The 13 field sequences match to different strains depending on sampling regions (Table 4). Sequences 2353/20, 2354/20, 2592/21, 2752/21, 2753/21, and 2826/21 from Southern Mexico are 99.86–100% identical to KP036503/ck/CH/LHB/121010/12 based on complete S-gene sequences, while complete genome sequences are 99.87–99.97% identical to isolate MZ367369/ck/Belgium/4134 001/19. Six of the seven sequences from Northern Mexico (2721/21, 2723/21, 2725/21, 2818/21, and 2819/21) have nt identities of 99.54–99.63% to isolate KP118880/ck/CH/LHB/130927/13, while the field sequence 2833/21 is 99.1% identical to isolate JN192154/ck/4/91. Sequence 2523/21, from Central Mexico, is 99.28% similar to strain KP118891/ck/CH/LHLJ/111246/11 based on the complete S-gene, but the complete genome sequence is 92.96% identical MT665806/ck/Canada/17-038913/17. The BLASTn results are reflected in the phylogenetic clustering based on complete S-gene, S1-gene and the HVRs I-III, where the vaccine sequence group with CR88 strain is in a distinct subclade away from the field sequences (Figure 2).

Lineage GI-17 (California Variants; CAVs). Complete S-gene sequence of the field sequence 2944/21 from Northern Mexico is 99.54% identical to a Delmarva (DMV) virus isolated from broiler chickens [MK878536/ck/GA9977/19; (69)]—Table 4. The complete genome sequence is 96.04% identical to strain MN512437/ck/Can/18-048430/17. Phylogenetically, the S-/S1-gene trees group the Mexican field sequence with strains GA9977 and MT176422/ck/PA/P1810234-KD/18 in a subcluster distinct from other Canadian and USA viruses, but the HVRs I-III tree does not include the P1810234-KD strain (Figure 2). Again, like in the Mass-type and Ark-like viruses, the CAV-like sequence 2944/21 contained S2′ cleavage (Supplementary Table 4 and Figure 1).

We also performed phylogenetic analyses based on complete genome and Rep1ab (Figure 3), structural genes E, M and N (Supplementary Figure 1), and 3ab, 5a/b, and 6b (Supplementary Figure 2). Notably, none of the 33 sequences cluster with lineage GI-11 (SAI serotypes), which have been described as serologically and phylogenetically unique to South America (29–31). However, the previously described Mexican SAI viruses group in distinct subclades within larger clades containing Mass-type (based on nt sequences of the complete genome, Rep1b and 3a/b) and 793B (based on nt sequences of the complete S-gene, structural genes E/M and 5a/b) strains.

FIGURE 3

Figure 3. Maximum likelihood phylogenetic tree nt sequences of complete genome, Rep1a, and Rep1b using GTR model in MEGA 6. The 3 vaccine sequences are highlighted in blue color; the 30 field sequences are color-coded based on sampling regions in Mexico. The analysis involved 83 sequences. All positions with <95% site coverage were eliminated. The final dataset had 26,895 (complete genome), 11,782 (Rep1a), and 8,037 (Rep1b) positions.

Indels and Mutations in the HVRs of S1-Gene Sequence

Alignment of translated S1 HVR I (residues 37–88), HVR II (residues 115–146) and HVR III (residues 282–301) sequences revealed considerable variations among the 33 IBVs (Figure 4).

FIGURE 4

Figure 4. Analyses of the S1 glycoprotein HVRs I- III (amino acid residues 37-301). The phylogenetic tree was constructed using the JTT matrix-based model in MEGA and involved 67 sequences and a final dataset consisting of 253 positions. The 33 sequences in this study (3 vaccine sequences highlighted in blue; 30 field sequences; color-coded based on sampling regions in Mexico) assembled in this study are aligned with representatives of serotypes belonging to 5 IBV lineages. Dots and dashes in the alignments indicate identical and deleted amino acid residues, respectively. Amino acid residues critical for attachment of the Mass-type prototype (M41; GenBank accession No. AY851295) to chicken respiratory tract tissues are boxed in black [i.e., N38, H43, S56, P63, I66, and T69 (14)]; red and blue boxes indicate deleted and inserted amino acid residues, respectively.

HVR I. Six residues in HVR I of the Mass-type prototype AY851295/M41 (i.e., N38, H43, S56, P63, I66, and T69) are critical for viral attachment to respiratory tract tissues (14). Within the Mass-type sequences, field sequence 2598/21 from Northern Mexico, and the Mass-type vaccine 1616/19 and Mass-type/Conn recombinant vaccine 1623/19 sequences, have an asparagine in position 38 (N38) similar to the M41 sequence, while all five Mass-type sequences from Southern Mexico have an aspartic acid at position 38 (D38) similar to the Brazilian Ma5 vaccine, UK H120 and 2006 Mass41 sequences. The consensus histidine at position 43 (H43) is conserved across all 33 sequences, except in the 4/91 vaccine variant 1619/19 sequence (lineage GI-13), which has a substitution of histidine with tyrosine (H43Y) similar to the CR88 strain from UK. Serine and threonine residues at positions 56 (S56 and T56) are conserved amongst the 793B-like and Mass-type sequences, respectively, but varies among the Ark-like sequences. Further, the critical amino acid at position 63 (S63) is conserved in the sequences across the five IBV lineages used in the alignment, except in field sequences 2731/21 and 2960/21 (from Central Mexico) and sequence 2944/21 (from Northern Mexico), which have S63G substitution, and sequence 2598/21 (from Northern Mexico) with an S63R substitution. Position 66 is largely conserved across sequences in all 5 lineages, while position 69 appear to be lineage-specifically conserved, except in the Mass-type vaccine 1616/19 and Mass-type/Conn recombinant vaccine 1623/19 sequences, which have T69 compared the six field sequences from Southern (n = 5) and Northern (n = 1) Mexico in the Mass-type group.

In addition to substitutions, HVR I has three instances of deletion. One is in field sequences 2731/21 and 2960/21 from Central Mexico (in lineage GI-3) with a 2-residue deletion (positions 61–62), which is also present in the Mexican BL-56 strain. Another instance is a 4-residue deletion (positions 57–60) in the field sequence 2944/21 from Northern Mexico (lineage GI-17), which is also present in the DMV strains MK878536/ck/GA9977/19 and MN512438/ck/Can/18-049707/17. The third is a 3-residue deletion (positions 60 to 62) in field Mass-type sequence 2598/21 from Northern Mexico, which is present in strain QKE31017/T&T/18RS1461-8/14) and three Conn vaccine strains from USA.

HVR II. Most amino acid variations in HVR II are between the positions 116–121 and 138–147 (Figure 4). For example, in lineage GI-13, all the six field sequences from Northern Mexico, and the 4/91 vaccine variant 1619/19 sequence have N117 (similar to CR88 strains from UK and a field variant from Ireland), while all six field sequences from Southern Mexico have S117 (similarly present in a pathogenic 4/91 strain from UK and two other Chinese strains). The Mass-type vaccine 1616/19 and Mass-type/Conn recombinant vaccine 1623/19 sequences, and all the field sequences in lineage GI-1 (one sequence from northern and five from Southern Mexico) have a 2-residue deletion between positions 117 and 120. Additionally, some of the field sequences from Northern (n = 5) and Central (n = 2) Mexico have a G145R substitution, also present in the Canadian and USA DMV-like strains. All eight field Ark-like sequences (four each from Central and Southern Mexico), and the CAV-like sequence 2944/21 from Northern Mexico have a single residue insertion between positions 142 and 143.

HVR III. One of the variations in the HVR III is in the Ark-like sequences, where all eight field sequences have E284V substitution compared to the USA pathogenic field strain AYA44731/US/Ark99. This substitution is also present in the CAV viruses, including in field sequence 2944/21 from Northern Mexico. Another variation is in the 4/91 vaccine variant 1619/19 sequence and the Holte/Iowa-97-like field sequence 2731/21 from Central Mexico, which have a single amino acid (alanine) insertion at position 296, which is also present in the CR88 strains AJ618986/UK/FR-88061-88 and MN548285/ck/UK/CR88/11.

Recombination Events

Neighbor-net analyses showed likelihood of intra-/inter-lineage recombination events between S-gene sequences. Figure 5 represents a networked relationships among 70 S-gene sequences (including the 33 sequences from this study). Verification of recombination breakpoints using RDP4 resulted in identification of nine recombination events (Table 4), all of which reflected the results presented in Figure 5. All nine recombination events were statistically (corrected p-values of ≤1 × 10 ⁻⁶) supported by at least six of the seven algorithms used for the analyses. Eight of the recombination events are located in the subunit S2 (in regions containing FP and/or HR1/2 domains), while the recombination event in the field sequence 2598/21 is in the N-terminal receptor domain, which contains HVRs I and II (Table 4).

FIGURE 5

Figure 5. Reticulate network using complete S-gene sequences constructed using SplitsTree5 v 5.0.0 (53). The network predicts putative evolutionary histories of the IBVs, where the internal nodes and the edges correspond to ancestral taxa and patterns of descents, respectively. Nodes with more than two parents represent reticulate events (e.g., recombination, horizontal gene transfer, or hybridization). “Split” is a partition of the taxa into two subsets with the edges separating the taxa subsets of the split from those on the other side of the split (the length of the edge in the network is proportional to the weight of the split it is associated with, which is analogous to branches in conventional phylogenetic trees). (A) The input contained 70 taxa and 101 trees constructed using Splits Network Algorithm with default options to obtain a splits network with 153 nodes and 165 edges. In the figure inset (B), the input consisted of a subset of 28 taxa and 101 trees (from a splits network with 66 nodes and 72 edges). The recombination events of the sequences in panel B of this figure as determined using RDP4 are shown in Table 4.

Three recombination events are in Mass-type, i.e., in the Mass-type/Conn recombinant vaccine 1623/19 sequence, and the field sequences 2860/21 and 2598/21 from Southern and Northern Mexico, respectively. In all three events, both “major parent” sequence (i.e., sequence in other viruses most closely related to the sequence surrounding the recombinant/transferred regions) and “minor parent” sequence (i.e., sequence in other viruses most closely related to the recombinant regions) are Mass-type, with 99.8–100% nt identities between the recombinant region and the “parental” sequences (Table 4). Four other recombination events, all identified in Holte/Iowa-97-like sequences 2731/21 and 2960/21, and Ark-like sequences 2563/21 and 2930/21, all from Central Mexico, have their “parental” sequences in viruses from different lineages (Table 4). One of the recombination events (involving the 4/91 vaccine variant 1619/19 sequence) has field sequence 2833/21 as “major parent” and an unknown “minor” parent inferred to be a 1996 Mass-type vaccine strain (FJ904716/US/Conn46 vaccine/96). In this event, the nt identities between the recombinant region and the “parental” sequences is low (96–96.8%), which probably imply the recombinant regions could have accumulated further mutations, or that the recombination event happened long before the isolation and eventual divergence of the recombinant and parental viruses.

Discussion

Infectious bronchitis is arguably one of the most important avian respiratory diseases in Mexico, and IBVs are commonly found in flocks, which, despite being vaccinated with the government-approved Mass-type vaccines, exhibit respiratory clinical signs consistent with IB. Ark-like, Mass-type, Holte/Iowa-97, and SAI strains have been previously identified in Mexico using molecular and serological assays (18, 29–31). Most of the Mexican variants have been reported from commercial flocks, but the likelihood of spillover to backyard poultry via introduction of surplus chickens from commercial enterprises, and via use of attenuated vaccines, cannot be ruled out. It is notable that all the IBVs reported in the current study were obtained from clinical samples from apparently healthy flocks. This observation may be due to mild clinical signs. Additionally, a lack of clinical signs of viral infection could be due to persistent (i.e., latent, chronic or slow) infection where the virus is not cleared from infected birds but remains in specific host cells. Although some studies have suggested occurrence of this type of infection for IBVs (70), the phenomenon remains to be convincingly demonstrated.

Studies have demonstrated the versatility of FTA cards as a low-cost option for collection, storage, transportation and preservation of genetic materials from field samples for the surveillance of various viral agents (71–74). Our study demonstrates that, despite the weakness of FTA cards in yielding lower quantities of mostly fragmented RNAs, resulting data is of sufficiently high quality to allow the assembly of full-length viral genomes, even when the clinical samples are prepared under field conditions. This opens up the advantages of FTA cards to NGS-based diagnostics of avian viruses via direct RNA sequencing from field-collected samples without the need of passage in eggs or transportation in liquid media. Avoiding egg-passage is advantageous for several reasons, including biosafety issues (there is no live virus manipulation, amplification, escape issues), and the absence of selective virus amplification (passage may select variants with a minority representation in the host, or introduce genetic changes).

High abundance of host/bacterial RNAs in samples prepared under field conditions, which, from our experiences and those of others, can constitute over 95% of NGS reads (38), results in low sensitivity in detection of viral RNAs. These challenges notwithstanding, our optimized protocols increased NGS sensitivity (high levels of virus-specific reads vs. low levels of host-/bacteria-specific reads), which produced complete genome and/or S-gene sequences (with optimal read coverage depth and coverage). Some of the assembled genome sequences were missing nt at the 5′- and/or 3′- termini, but all coding regions had sufficient depth coverage (≥10X). Furthermore, within single NGS runs (48 multiplexed samples per run), we obtained complete and/partial genome/gene sequences of various RNA viruses belonging to six taxonomic families, some of which were co-isolated with IBV.

Although the general genome organization (5′UTR-[Rep1a-Rep1b-S-3a-3b-E-M-4b-4c-5a-5b-N-6b]-3′UTR) slightly differs from many previously reported IBVs regarding presence of 4b, 4c, and 6b, it is consistent with some IBVs previously reported in various geographical regions across the globe (11, 75–77). The three accessory genes are thought to be strain-dependent/species-specific, but their presence and/or genomic organization are not a prerequisite to production of viable virus progeny, and their role in viral pathogenesis remains undetermined (11, 78). The presence of these auxiliary and apparently non-essential accessory genes is attributable to the IBV's flexible genome, which tolerates not only for stable insertion of novel genes, but also for reorganization (78). The genomic organization of the essential genes is however canonically conserved, including the overlaps between the six genes, except in overlaps between genes 3 and 4 in the 4/91 vaccine variant 1619/19 sequence and the field sequence 2944/21 from Northern Mexico. Most of the variations in the genome organization are in overlaps among the accessory genes and in the presence of VncRNAs in gene 4 (between the structural M and 4b) and gene 6 (between the structural N and the 6b). The sources of gene overlaps remain unknown, but one school of thought is that they result from either utilization of previously unused ORFs to create novel genes, or by adjusting locations of start/stop codons into sequences of existing genes (79).

Both the length and key physical properties at specific positions within the 20-residue region harboring the S1/S2 cleavage motif in all the 33 sequences are evidence that the S-glycoprotein of the viruses undergo sufficient furin-mediated cleavage and viral-host membrane fusion (61). In addition to the S1/S2 cleavage, all eight Mass-type, eight Ark-like and the single CAV viruses assembled in this study contain the auxiliary S2'cleavage site downstream the primary S1/S2 cleavage site, which results in a 153-amino acid-long dual-cleavage peptide. Although it does not directly influence IBV pathogenesis, this peptide is postulated to influence the S glycoprotein conformation in some CoVs; deletion of the peptide was demonstrated to affect fusion and recovery of Vero cell-adapted Baudette mutants (8). Although the S2' site is not absolutely required, it is thought to enhance viral infection and replication in some CoVs (80). Nevertheless, it remains unclear why the S2' cleavage site is absent in the sequences belonging to lineages GI-3 (n = 2 sequences) and GI-13 (n = 14 sequences), regardless of the sample types.

Results of the phylogenetic classification agreed with the comparative pairwise nt sequence homologies between the vaccine vs. field sequences as well as the homologies shared with other IBVs in genomic databases. To the best of our knowledge, 793B- and CAV-like strains have not been previously reported in Mexico. However, there are unpublished reports f the use of 793B vaccines in Mexico. CAVs are indigenously North America currently. CAVs are indigenously North America currently consisting of 12 published viruses from Pennsylvania, California and Alabama, causing respiratory, renal and reproductive diseases (18). Except the 4/91 vaccine variant 1619/19 sequence and the field sequence 2523/21 from Central Mexico, 793B-like sequences are from Southern and Northern Mexico (n = 6 sequences each). The only CAV-like sequence is the field sequence 2944/21 from Northern Mexico, which clustered with, among others, isolate MN512437/ck/Can/18-048430/17. The Canadian 18-048430 virus is similar to a DMV strain originally isolated from a virus outbreak in a commercial broiler flock in the Delmarva peninsula in 2011 and the DMV-like strain MK878536/ck/GA9977/19 isolated from broiler chickens in Georgia, USA in 2019 (69, 81). None of the 33 sequences in this study clustered with indigenous SAI viruses based on the S-/S1 and HVRs I-III trees. However, based on complete genome and other specific genomic regions, the SAI viruses clustered in distinct subclades within larger clades containing strains from other lineages, which, coupled with the clustering observed from S-/S1 and HVRs I-III trees, could be interpreted to imply decreased prevalence of the SAI viruses, resulting in the emergence of new, more fit field variants.

Since the coronaviruses can undergo recombination to result in novel variants, and since changes in the spike protein gene can result in shifts in antigenicity and/or tropism, we screened specifically for mutations and recombination in the S-gene (82). We noted various indels and point mutations in the subunit S1 HVRs I-III across the five IBV lineages. The most notable are in the HVR I (residues 38–69), which has been experimentally demonstrated in some IBVs to be critical for binding of S glycoprotein to host's respiratory tract tissues (14). Examples include 2-residue deletions between 60–63 and P63G substitutions in the sequences from Central (2731/21 and 2960/21) and Northern (2944/21) Mexico. Another example is a 3-residue deletion (between positions 69-63) coupled with a P63R substitution in sequence 2598/21 (Northern Mexico). Further, a H43Y (found in the sequence of the 4/91 vaccine variant 1619/19 in our study) was postulated to enhance the fitness of ArkDPI vaccine strain (via efficiency in binding to tracheal membranes) (83). Whether these changes in the S1 spike result in subsequent change in host cell binding, tissue tropism, or evasion of neutralizing antibodies remains to be determined.

The recombination events, presumably between vaccine and field viruses (intra- and inter-lineages), were found in the HVRs 1 and II of subunit S1 (in the field sequence 2598/21) and in the FP, HRs 1 and 2 domains of subunit S2 (in two vaccine and six field sequences). Recombination in the HVRs and in the above-mentioned domains, coupled with the various indels and mutations found in these regions, could modulate the S glycoprotein fusion properties, with the potential of variants to not only broaden their tissue-tropism (and possibly host-range), but also to efficiently adapt to naïve hosts as previously suggested (82). Additionally, these variants can potentially be a result of immune selection pressures (36). An example of this is the DE072 vaccine, which when first identified, showed more relatedness to a Dutch variant that to the US variants, but after undergoing a decade-long use, the strain evolved to a poor vaccine candidate (24).

Conclusions

This study has demonstrated FTA cards as adoptable, low-cost option for untargeted discovery and full-length sequencing of avian viruses from field-collected clinical samples from various tissues. Our data demonstrates that multiple distinct IBV serotypes/strains are co-circulating in Mexican commercial chickens, with the high likelihood of intra- and inter-lineage recombination, as well as indels and point mutations which are potentially driving the generation of new subpopulations of field variants capable spreading and adapting to chicken populations in the country. It should be further investigated whether strains emerging from the commercial enterprises may have spilled over to backyard poultry, and whether they have further evolved into strains that are distantly related to the predominantly SAI strains. More importantly, our data reiterates the need for enhanced surveillance of IBVs in Mexico and the Latin America region, as well as a review of the vaccines currently used in the control of IBVs.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/genbank/, OM912676, OM912677, OM912678, OM912679, OM912680, OM912681, OM912682, OM912683, OM912684, OM912685, OM912686, OM912687, OM912688, OM912689, OM912690, OM912691, OM912692, OM912693, OM912694, OM912695, OM912696, OM912697, OM912698, OM912699, OM912700, OM912701, OM912702, OM912703, OM912704, OM912705, OM912706, OM912707, and OM912708.

Author Contributions

CA, ED, and HK: conceptualization. CA, DS, and ED: funding acquisition. DS, SL, NC, and ED: project administration and coordination. HK: methodology and writing—original draft. HK, JV, and CL: data curation and formal analyses. CL, DS, and CA: writing—review and editing. All authors read and agreed to the published version of the manuscript.

Funding

This research was funded by Agricultural Research Service (ARS), USDA CRIS, and Oak Ridge Institute for Science and Education (ORISE) postdoctoral appointment. The funders had no role in data analyses or interpretation, or in the writing of the manuscript.

Conflict of Interest

JV and CA were employed by BASE₂BIO.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The authors acknowledge Dawn Williams-Coplin, Jesse Gallagher (SEPRL), and the field sampling teams (BIAH) for technical assistance in sample collection and preparation. The authors also acknowledge the contributions made by the BIAH field sample collection team (Alberto Carrillo, Alejandro Hori, Amauri Carrillo, David Garcia, and Marcos Jimenez).

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fvets.2022.931272/full#supplementary-material

Supplementary Table 1. Background information and sequencing data of the 55 samples in which IBV RNA was detected in this study. The quantification of the dsDNA (HiFi PCR products), sequencing libraries, proportions of the host (chicken) reads, and the IBV-specific reads obtained from the NGS data are shown. Complete genome sequences (n = 33), partial genome sequences (n = 6), complete S-gene sequences (n = 5), and partial S-gene sequences (n = 11) were assembled from NGS reads in the current study. Other identified microbial agents (viral and bacterial) that are of avian interest based on literature and expert knowledge are listed in the table.

Supplementary Table 2. Genes, coding regions and nt lengths of the 33 IBV sequences identified in this study. The nt lengths (and gene intervals in brackets) of each of the genes and the 5'-/3'-UTRs are shown.

Supplementary Table 3. The 15 accessory proteins (2–15) that are proteolytically processed from Rep1ab by the virally-encoded PL ^pro and 3CL ^pro. For each of the 33 IBV sequences assembled in this study, the size (amino acid) of each cleavage product are indicated. The position (amino acid residue) of each product on the Rep1a (2–11) and Rep1b (12–16) are shown in brackets.

Supplementary Table 4. Analysis of the S-gene nt sequences assembled in this study. The arrow (↓) indicates the S1/S2 proteolytic cleavage site. The conserved cysteine-flanked region in the S2 is indicated (amino acid residues in bold and underlined). The numbers in subscript indicate the position of the amino acid residues in the S protein sequence.

Supplementary Table 5. Comparative pairwise homologies of IBV gene nt sequences between the field and vaccine sequences assembled in this study. Field sequences with the highest pairwise homologies to the vaccine sequences are in bold font, while sequences with the lowest are underlined. The lineage (serotype) classification is based on S1-gene sequence according to Valastro et al. (18).

Supplementary Table 6. BLASTn results of the 33 complete genome nt sequences assembled in this study. The lineage (serotype) classification is based on S1-gene sequence according to Valastro et al. (18). GenBank accession numbers are shown for the best BLASTn hit for each sequence.

Supplementary Figure 1. Maximum likelihood phylogenetic tree of nt sequences of the envelope (E), membrane (M) and nucleocapsid (N) genes using T92 model in MEGA 6. The 3 vaccine sequences assembled in this study are highlighted in blue color; the 30 field sequences are color-coded based on sampling regions in Mexico. The analysis involved 74 sequences. All positions with <95% site coverage were eliminated. The final datasets for the E, M, and N genes had 304, 659, and 1,215 positions, respectively.

Supplementary Figure 2. Maximum likelihood phylogenetic tree of nt sequences of accessory genes in (A) gene 3 (3a/3b), gene 5 (5a/5b) and gene 6 (6b) using T92 model in MEGA 6. The 3 vaccine sequences assembled in this study are highlighted in blue color; the 30 field sequences are color-coded based on sampling regions in Mexico. The analysis involved 74 (3a/b and 5a/b) and 63 (6b) sequences. All positions with less than 95% site coverage were eliminated. The final datasets for the 3a/b, 5a/b, and 6 genes had 365, 443, and 130 positions, respectively.

References

1. Jackwood MW, de Wit S. CHAPTER 4: Infectious Bronchitis. In: Swayne DE, Boulianne CM, Logue CM, McDougald LR, Nair V, Suarez DL, de Wit S, Grimes T, Johnson D, Kromm M, et al., editors. Diseases of Poultry. Hoboken, NJ: John Wiley & Sons, Ltd. (2020). p. 167–88.

2. Najimudeen SM, Hassan MSH, Cork SC, Abdul-Careem MF. Infectious bronchitis coronavirus infection in chickens: multiple system disease with immune suppression. Pathogens. (2020) 9:779. doi: 10.3390/pathogens9100779

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Ignjatović J, Sapats S. Avian infectious bronchitis virus. Rev Sci Tech Off Int Epiz. (2000) 19:493–508. doi: 10.20506/rst.19.2.1228

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Weiss SR, Navas-Martin S. Coronavirus pathogenesis and the emerging pathogen severe acute respiratory syndrome coronavirus. Microbiol Mol Biol Rev. (2005) 69:635–64. doi: 10.1128/MMBR.69.4.635-664.2005

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Ziebuhr J. The coronavirus replicase. In: Enjuanes LJ, editor. Coronavirus Replication and Reverse Genetics. Current Topics in Microbiology and Immunology. Heidelberg: Springer-Verlag GmbH (2005). p. 57–94.

6. Malone B, Urakova N, Snijder EJ, Campbell EA. Structures and functions of coronavirus replication–transcription complexes and their relevance for SARS-CoV-2 drug design. Nat Rev Mol Cell Biol. (2022) 23:21–39. doi: 10.1038/s41580-021-00432-z

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Brierley I, Boursnell M, Binns M, Bilimoria B, Blok V, Brown T, et al. An efficient ribosomal frame-shifting signal in the polymerase-encoding region of the coronavirus IBV. EMBO J. (1987) 6:3779–85. doi: 10.1002/j.1460-2075.1987.tb02713.x

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Yamada Y, Liu DX. Proteolytic activation of the spike protein at a novel RRRR/S motif is implicated in furin-dependent entry, syncytium formation, and infectivity of coronavirus infectious bronchitis virus in cultured cells. J Virol. (2009) 83:8744–58. doi: 10.1128/JVI.00613-09

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Masters PS. The molecular biology of coronaviruses. Adv Virus Res. (2006) 66:193–292. doi: 10.1016/S0065-3527(06)66005-3

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Hodgson T, Britton P, Cavanagh D. Neither the RNA nor the proteins of open reading frames 3a and 3b of the coronavirus infectious bronchitis virus are essential for replication. J Virol. (2006) 80:296–305. doi: 10.1128/JVI.80.1.296-305.2006

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Feng K, Chen T, Zhang X, Shao G, Cao Y, Chen D, et al. Molecular characteristic and pathogenicity analysis of a virulent recombinant avian infectious bronchitis virus isolated in China. Poult Sci. (2018) 97:3519–31. doi: 10.3382/ps/pey237

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Cavanagh D, Davis PJ, Mockett AA. Amino acids within hypervariable region 1 of avian coronavirus IBV (Massachusetts serotype) spike glycoprotein are associated with neutralization epitopes. Virus Res. (1988) 11:141–50. doi: 10.1016/0168-1702(88)90039-1

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Moore K, Jackwood M, Hilt D. Identification of amino acids involved in a serotype and neutralization specific epitope with in the s1 subunit of avian infectious bronchitis virus. Arch Virol. (1997) 142:2249–56. doi: 10.1007/s007050050239

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Promkuntod N, Van Eijndhoven R, De Vrieze G, Gröne A, Verheije M. Mapping of the receptor-binding domain and amino acids critical for attachment in the spike protein of avian coronavirus infectious bronchitis virus. Virology. (2014) 448:26–32. doi: 10.1016/j.virol.2013.09.018

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Moreno A, Franzo G, Massi P, Tosi G, Blanco A, Antilles N, et al. A novel variant of the infectious bronchitis virus resulting from recombination events in Italy and Spain. Avian Pathol. (2017) 46:28–35. doi: 10.1080/03079457.2016.1200011

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Cook JK, Jackwood M, Jones R. The long view: 40 years of infectious bronchitis research. Avian Pathol. (2012) 41:239–50. doi: 10.1080/03079457.2012.680432

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Zúñiga S, Cruz JL, Sola I, Mateos-Gómez PA, Palacio L, Enjuanes L. Coronavirus nucleocapsid protein facilitates template switching and is required for efficient transcription. J Virol. (2010) 84:2169–75. doi: 10.1128/JVI.02011-09

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Valastro V, Holmes EC, Britton P, Fusaro A, Jackwood MW, Cattoli G, et al. S1 gene-based phylogeny of infectious bronchitis virus: an attempt to harmonize virus classification. Infect Genet Evol. (2016) 39:349–64. doi: 10.1016/j.meegid.2016.02.015

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Kingham B, Keeler C Jr, Nix W, Ladman B, Gelb J Jr. Identification of avian infectious bronchitis virus by direct automated cycle sequencing of the S-1 gene. Avian Dis. (2000) 44:325–35. doi: 10.2307/1592547

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Ma T, Xu L, Ren M, Shen J, Han Z, Sun J, et al. Novel genotype of infectious bronchitis virus isolated in China. Vet Microbiol. (2019) 230:178–86. doi: 10.1016/j.vetmic.2019.01.020

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Marandino A, Pérez R. Genetic and antigenic diversity of infectious bronchitis virus in South America. Avian Dis. (2021) 65:622–8. doi: 10.1637/aviandiseases-D-21-00103

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Fabricant J. The early history of infectious bronchitis. Avian Dis. (1998) 42:648–50. doi: 10.2307/1592697

CrossRef Full Text | Google Scholar

23. Cavanagh D. Coronavirus avian infectious bronchitis virus. Vet Res. (2007) 38:281–97. doi: 10.1051/vetres:2006055

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Sjaak de Wit JJ, Cook JK, van der Heijden HM. Infectious bronchitis virus variants: a review of the history, current situation and control measures. Avian Pathol. (2011) 40:223–35. doi: 10.1080/03079457.2011.566260

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Jackwood MW. Review of infectious bronchitis virus around the world. Avian Dis. (2012) 56:634–41. doi: 10.1637/10227-043012-Review.1

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Villarreal L, Sandri TL, Souza S, Richtzenhain LJ, De Wit J, Brandao PE. Molecular epidemiology of avian infectious bronchitis in Brazil from 2007 to 2008 in breeders, broilers, and layers. Avian Dis. (2010) 54:894–8. doi: 10.1637/9218-121709-Reg.1

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Guzmán M, Sáenz L, Hidalgo H. Molecular and antigenic characterization of GI-13 and GI-16 avian infectious bronchitis virus isolated in Chile from 2009 to 2017 regarding 4/91 vaccine introduction. Animals. (2019) 9:656. doi: 10.3390/ani9090656

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Cook JK, Orbell SJ, Woods MA, Huggins MB. Breadth of protection of the respiratory tract provided by different live-attenuated infectious bronchitis vaccines against challenge with infectious bronchitis viruses of heterologous serotypes. Avian Pathol. (1999) 28:477–85. doi: 10.1080/03079459994506

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Gelb J Jr, Ladman B, Tamayo M, Gonzalez M, Sivanandan V. Novel infectious bronchitis virus S1 genotypes in Mexico 1998–1999. Avian Dis. (2001) 45:1060–3. doi: 10.2307/1592889

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Callison S, Jackwood M, Hilt D. Molecular characterization of infectious bronchitis virus isolates foreign to the United States and comparison with United States isolates. Avian Dis. (2001) 45:492–9. doi: 10.2307/1592994

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Escorcia M, Jackwood M, Lucio B, Petrone V, Lopez C, Fehervari T, et al. Characterization of Mexican strains of avian infectious bronchitis isolated during 1997. Avian Dis. (2000) 44:944–7. doi: 10.2307/1593069

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Cavanagh D. Severe acute respiratory syndrome vaccine development: experiences of vaccination against avian infectious bronchitis coronavirus. Avian Pathol. (2003) 32:567–82. doi: 10.1080/03079450310001621198

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Finney PM, Box PG, Holmes HC. Studies with a bivalent infectious bronchitis killed virus vaccine. Avian Pathol. (1990) 19:435–50. doi: 10.1080/03079459008418698

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Jackwood MW. Current and Future Recombinant Viral Vaccines for Poultry. In: Schultz RD, editor. Advances in Veterinary Medicine. London: Academic Press (1999). p. 517–22 doi: 10.1016/S0065-3519(99)80038-X

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Cavanagh D, Davis P, Cook J. Infectious bronchitis virus: evidence for recombination within the Massachusetts serotype. Avian Pathol. (1992) 21:401–8. doi: 10.1080/03079459208418858

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Jackwood MW, Hall D, Handel A. Molecular evolution and emergence of avian gammacoronaviruses. Infect Genet Evol. (2012) 12:1305–11. doi: 10.1016/j.meegid.2012.05.003

PubMed Abstract | CrossRef Full Text | Google Scholar

37. McKinley ET, Jackwood MW, Hilt DA, Kissinger JC, Robertson JS, Lemke C, et al. Attenuated live vaccine usage affects accurate measures of virus diversity and mutation rates in avian coronavirus infectious bronchitis virus. Virus Res. (2011) 158:225–34. doi: 10.1016/j.virusres.2011.04.006

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Parris DJ, Kariithi HM, Suarez DL. Non-target RNA depletion strategy to improve sensitivity of next-generation sequencing for the detection of RNA viruses in poultry. J. Vet. Diagn. Invest. (in press). doi: 10.1177/10406387221102430

CrossRef Full Text | Google Scholar

39. Chrzastek K, Lee D, Smith D, Sharma P, Suarez DL, Pantin-Jackwood M, et al. Use of Sequence-Independent, Single-Primer-Amplification (SISPA) for rapid detection, identification, and characterization of avian RNA viruses. Virology. (2017) 509:159–66. doi: 10.1016/j.virol.2017.06.019

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Afgan E, Baker D, Batut B, Van Den Beek M, Bouvier D, Cech M, et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. (2018) 46:W537–44. doi: 10.1093/nar/gky379

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Dimitrov KM, Sharma P, Volkening JD, Goraichuk IV, Wajid A, Rehmani SF, et al. A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses. Virol J. (2017) 14:72. doi: 10.1186/s12985-017-0741-5

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, et al. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. (2012) 28:1647–9. doi: 10.1093/bioinformatics/bts199

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. (2011) 17:10–2. doi: 10.14806/ej.17.1.200

CrossRef Full Text | Google Scholar

44. Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. (2009) 25:1754–60. doi: 10.1093/bioinformatics/btp324

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Zhang J, Kobert K, Flouri T, Stamatakis A. PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics. (2013) 30:614–20. doi: 10.1093/bioinformatics/btt593

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Crusoe MR, Alameldin HF, Awad S, Boucher E, Caldwell A, Cartwright R, et al. The khmer software package: enabling efficient nucleotide sequence analysis. F1000Res. (2015) 4:900. doi: 10.12688/f1000research.6924.1

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Chevreux B, Wetter T, Suhai S. Genome sequence assembly using trace signals and additional sequence information. In: The German Conference on Bioinformatics, GCB'99. Hanover: Citeseer (1999). p. 45–56.

48. Jones P, Binns D, Chang H-Y, Fraser M, Li W, McAnulla C, et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. (2014) 30:1236–40. doi: 10.1093/bioinformatics/btu031

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Duckert P, Brunak S, Blom N. Prediction of proprotein convertase cleavage sites. Protein Eng Des Sel. (2004) 17:107–12. doi: 10.1093/protein/gzh013

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. (2013) 30:772–80. doi: 10.1093/molbev/mst010

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. (2009) 25:1972–3. doi: 10.1093/bioinformatics/btp348

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis v6. Mol Biol Evol. (2013) 30:2725–9. doi: 10.1093/molbev/mst197

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. (2006) 23:254–67. doi: 10.1093/molbev/msj030

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Martin DP, Murrell B, Golden M, Khoosal A, Muhire B. RDP4: detection and analysis of recombination patterns in virus genomes. Virus Evol. (2015) 1:vev003. doi: 10.1093/ve/vev003

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Gohl DM, Magli A, Garbe J, Becker A, Johnson DM, Anderson S, et al. Measuring sequencer size bias using REcount: a novel method for highly accurate Illumina sequencing-based quantification. Genome Biol. (2019) 20:85. doi: 10.1186/s13059-019-1691-6

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Alfson KJ, Beadles MW, Griffiths A. A new approach to determining whole viral genomic sequences including termini using a single deep sequencing run. J Virol Methods. (2014) 208:1–5. doi: 10.1016/j.jviromet.2014.07.023

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Fang SG, Shen H, Wang J, Tay FP, Liu DX. Proteolytic processing of polyproteins 1a and 1ab between non-structural proteins 10 and 11/12 of Coronavirus infectious bronchitis virus is dispensable for viral replication in cultured cells. Virology. (2008) 379:175–80. doi: 10.1016/j.virol.2008.06.038

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Liu X, Su J, Zhao J, Zhang G. Complete genome sequence analysis of a predominant infectious bronchitis virus (IBV) strain in China. Virus Genes. (2009) 38:56–65. doi: 10.1007/s11262-008-0282-5

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Zhao F, Zou N, Wang F, Guo M, Liu P, Wen X, et al. Analysis of a QX-like avian infectious bronchitis virus genome identified recombination in the region containing the ORF 5a, ORF 5b, and nucleocapsid protein gene sequences. Virus Genes. (2013) 46:454–64. doi: 10.1007/s11262-013-0884-4

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Hemida M, Barta J, Ojkic D, Yoo D. Complete genomic sequence of turkey coronavirus. Virus Res. (2008) 135:237–46. doi: 10.1016/j.virusres.2008.03.020

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Tian S. A 20 residues motif delineates the furin cleavage site and its physical properties may influence viral fusion. Biochem Insights. (2009) 2:BCI-S2049. doi: 10.4137/BCI.S2049

CrossRef Full Text | Google Scholar

62. Madu IG, Roth SL, Belouzard S, Whittaker GR. Characterization of a highly conserved domain within the severe acute respiratory syndrome coronavirus spike protein S2 domain with characteristics of a viral fusion peptide. J Virol. (2009) 83:7411–21. doi: 10.1128/JVI.00079-09

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Corse E, Machamer CE. The cytoplasmic tail of infectious bronchitis virus E protein directs Golgi targeting. J Virol. (2002) 76:1273–84. doi: 10.1128/JVI.76.3.1273-1284.2002

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Casais R, Davies M, Cavanagh D, Britton P. Gene 5 of the avian coronavirus infectious bronchitis virus is not essential for replication. J Virol. (2005) 79:8065–78. doi: 10.1128/JVI.79.13.8065-8078.2005

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Brown Jordan A, Fusaro A, Blake L, Milani A, Zamperin G, Brown G, et al. Characterization of novel, pathogenic field strains of infectious bronchitis virus (IBV) in poultry in Trinidad and Tobago. Transbound Emerg Dis. (2020) 67:2775–88. doi: 10.1111/tbed.13637

PubMed Abstract | CrossRef Full Text | Google Scholar

66. Brandão PE, Taniwaki SA, Berg M, Hora AS. Complete genome of avian coronavirus vaccine strains Ma5 and BR-I. Genome Announc. (2017) 5:e00201–17. doi: 10.1128/genomeA.00201-17

PubMed Abstract | CrossRef Full Text | Google Scholar

67. Thor SW, Hilt DA, Kissinger JC, Paterson AH, Jackwood MW. Recombination in avian gamma-coronavirus infectious bronchitis virus. Viruses. (2011) 3:1777–99. doi: 10.3390/v3091777

PubMed Abstract | CrossRef Full Text | Google Scholar

68. Toro H, Van Santen V, Li L, Lockaby S, Van Santen E, Hoerr F. Epidemiological and experimental evidence for immunodeficiency affecting avian infectious bronchitis. Avian Pathol. (2006) 35:455–64. doi: 10.1080/03079450601028811

PubMed Abstract | CrossRef Full Text | Google Scholar

69. Goraichuk IV, Kulkarni AB, Williams-Coplin D, Suarez DL, Afonso CL. First complete genome sequence of currently circulating infectious bronchitis virus strain DMV/1639 of the GI-17 lineage. Microbiol Resour Announc. (2019) 8:e00840–19. doi: 10.1128/MRA.00840-19

PubMed Abstract | CrossRef Full Text | Google Scholar

70. Naqi S, Gay K, Patalla P, Mondal S, Liu R. Establishment of persistent avian infectious bronchitis virus infection in antibody-free and antibody-positive chickens. Avian Dis. (2003) 47:594–601. doi: 10.1637/6087

PubMed Abstract | CrossRef Full Text | Google Scholar

71. Cortes AL, Montiel ER, Gimeno IM. Validation of Marek's disease diagnosis and monitoring of Marek's disease vaccines from samples collected in FTA^® cards. Avian Dis. (2009) 53:510–6. doi: 10.1637/8871-041009-Reg.1

PubMed Abstract | CrossRef Full Text | Google Scholar

72. Kraus RHS, van Hooft P, Waldenström J, Latorre-Margalef N, Ydenberg RC, Prins HHT. Avian influenza surveillance with FTA cards: field methods, biosafety, and transportation issues solved. J Vis Exp. (2011) 54:e2832. doi: 10.3791/2832

PubMed Abstract | CrossRef Full Text | Google Scholar

73. Perozo F, Villegas P, Estevez C, Alvarado I, Purvis LB. Use of FTA^® filter paper for the molecular detection of Newcastle disease virus. Avian Pathol. (2006) 35:93–8. doi: 10.1080/03079450600597410

PubMed Abstract | CrossRef Full Text | Google Scholar

74. Józwiak M, Wyrostek K, Domańska-Blicharz K, Olszewska-Tomczyk M, Smietanka K, Minta Z. Application of FTA^® cards for detection and storage of avian influenza virus. J Vet Res. (2016) 60:1–6. doi: 10.1515/jvetres-2016-0001

CrossRef Full Text | Google Scholar

75. Reddy VR, Theuns S, Roukaerts ID, Zeller M, Matthijnssens J, Nauwynck HJ. Genetic characterization of the Belgian nephropathogenic infectious bronchitis virus (NIBV) reference strain B1648. Viruses. (2015) 7:4488–506. doi: 10.3390/v7082827

PubMed Abstract | CrossRef Full Text | Google Scholar

76. Abozeid HH, Paldurai A, Khattar SK, Afifi MA, El-Kady MF, El-Deeb AH, et al. Complete genome sequences of two avian infectious bronchitis viruses isolated in Egypt: evidence for genetic drift and genetic recombination in the circulating viruses. Infect Genet Evol. (2017) 53:7–14. doi: 10.1016/j.meegid.2017.05.006

PubMed Abstract | CrossRef Full Text | Google Scholar

77. Hewson KA, Ignjatovic J, Browning GF, Devlin JM, Noormohammadi AH. Infectious bronchitis viruses with naturally occurring genomic rearrangement and gene deletion. Arch Virol. (2011) 156:245–52. doi: 10.1007/s00705-010-0850-6

PubMed Abstract | CrossRef Full Text | Google Scholar

78. de Haan CA, Volders H, Koetzner CA, Masters PS, Rottier PJ. Coronaviruses maintain viability despite dramatic rearrangements of the strictly conserved genome organization. J Virol. (2002) 76:12491–502. doi: 10.1128/JVI.76.24.12491-12502.2002

PubMed Abstract | CrossRef Full Text | Google Scholar

79. Simon-Loriere E, Holmes EC, Pagán I. The effect of gene overlapping on the rate of RNA virus evolution. Mol Biol Evol. (2013) 30:1916–28. doi: 10.1093/molbev/mst094

PubMed Abstract | CrossRef Full Text | Google Scholar

80. Cheng J, Zhao Y, Xu G, Zhang K, Jia W, Sun Y, et al. The S2 subunit of QX-type infectious bronchitis coronavirus spike protein is an essential determinant of neurotropism. Viruses. (2019) 11:972. doi: 10.3390/v11100972

PubMed Abstract | CrossRef Full Text | Google Scholar

81. Hassan MS, Ojkic D, Coffin CS, Cork SC, Van Der Meer F, Abdul-Careem MF. Delmarva (DMV/1639) infectious bronchitis virus (IBV) variants isolated in eastern Canada show evidence of recombination. Viruses. (2019) 11:1054. doi: 10.3390/v11111054

PubMed Abstract | CrossRef Full Text | Google Scholar

82. Fang SG, Shen S, Tay FP, Liu D. Selection of and recombination between minor variants lead to the adaptation of an avian coronavirus to primate cells. Biochem Biophys Res Commun. (2005) 336:417–23. doi: 10.1016/j.bbrc.2005.08.105

PubMed Abstract | CrossRef Full Text | Google Scholar

83. Leyson C, França M, Jackwood M, Jordan B. Polymorphisms in the S1 spike glycoprotein of Arkansas-type infectious bronchitis virus (IBV) show differential binding to host tissues and altered antigenicity. Virology. (2016) 498:218–25. doi: 10.1016/j.virol.2016.08.030

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: lineage, NGS, mutation, recombination, hypervariable region, vaccine

Citation: Kariithi HM, Volkening JD, Leyson CM, Afonso CL, Christy N, Decanini EL, Lemiere S and Suarez DL (2022) Genome Sequence Variations of Infectious Bronchitis Virus Serotypes From Commercial Chickens in Mexico. Front. Vet. Sci. 9:931272. doi: 10.3389/fvets.2022.931272

Received: 28 April 2022; Accepted: 20 June 2022;
Published: 12 July 2022.

Edited by:

Nicola Pugliese, University of Bari Aldo Moro, Italy

Reviewed by:

Joanna Sajdewicz-Krukowska, National Veterinary Research Institute (NVRI), Poland
Cao Yong Chang, Sun Yat-sen University, China

Copyright © 2022 Kariithi, Volkening, Leyson, Afonso, Christy, Decanini, Lemiere and Suarez. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Henry M. Kariithi, aGVucnkua2FyaWl0aGlAa2Fscm8ub3Jn; David L. Suarez, ZGF2aWQuc3VhcmV6QHVzZGEuZ292

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.