- 1Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY, United States
- 2Exotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, United States National Poultry Research Center, Agricultural Research Service, United States Department of Agriculture, Athens, GA, United States
- 3Biotechnology Research Institute, Kenyan Agricultural and Livestock Research Organization, Nairobi, Kenya
- 4BASE2BIO, Oshkosh, WI, United States
- 5Department of Pathology, College of Veterinary Medicine, University of Georgia, Athens, GA, United States
Co-infections of avian species with different RNA viruses and pathogenic bacteria are often misdiagnosed or incompletely characterized using targeted diagnostic methods, which could affect the accurate management of clinical disease. A non-targeted sequencing approach with rapid and precise characterization of pathogens should help respiratory disease management by providing a comprehensive view of the causes of disease. Long-read portable sequencers have significant potential advantages over established short-read sequencers due to portability, speed, and lower cost. The applicability of short-read random sequencing for direct detection of pathogens in clinical poultry samples has been previously demonstrated. Here we demonstrate the feasibility of long-read random sequencing approaches to identify disease agents in clinical samples. Experimental oropharyngeal swab samples (n = 12) from chickens infected with infectious bronchitis virus (IBV), avian influenza virus (AIV) and Mycoplasma synoviae (MS) and field-collected clinical oropharyngeal swab samples (n = 11) from Kenyan live bird markets previously testing positive for Newcastle disease virus (NDV) were randomly sequenced on the MinION platform, and the results were validated by comparison with real-time PCR and short-read random sequencing on the Illumina MiSeq platform. In the swabs from experimental infections, each of the three agents in every RT-qPCR-positive sample (Ct range 19–34) was detectable within 1 h on the MinION platform, except for AIV in one sample (Ct = 36.21). Nine of 12 IBV-positive samples were assigned genotypes within 1 h, as were five of 11 AIV-positive samples. MinION relative abundances of the test agents (AIV, IBV and MS) were highly correlated with RT-qPCR Ct values (R range −0.82 to −0.98). In field-collected clinical swab samples, NDV (Ct range 12–37) was detected in all eleven samples within 1 h of MinION sequencing, with 10 of 11 samples accurately genotyped within 1 h.
All NDV-positive field samples were found to be co-infected with one or more additional respiratory agents. These results demonstrate that MinION sequencing can provide rapid and sensitive non-targeted detection and genetic characterization of co-existing respiratory pathogens in clinical samples with similar performance to the Illumina MiSeq.
Introduction
Respiratory diseases are a continual significant threat to the global poultry industry (1). Newcastle disease virus (NDV), infectious bronchitis virus (IBV), avian influenza virus (AIV), Ornithobacterium rhinotracheale (ORT) (2), Mycoplasma synoviae (MS) and M. gallisepticum (MG) have been isolated from different avian species presenting similar clinical respiratory disease (3–7). Co-infections with these microbial pathogens produce respiratory disease complexes and complicate accurate disease diagnosis, when using target-specific approaches (5, 8, 9). For example, a commercial broiler flock, first diagnosed as infected with IBV, based on serological assays, was also co-infected with an atypical velogenic NDV, which was overlooked by relying on a single, target-specific detection approach (8). Experimental studies have shown that vaccine strains of IBV prolonged shedding of low pathogenic AIV (LPAIV) type H9N2 and increased the severity of clinical signs and postmortem lesions (10). Progressive pneumonia is a problem in commercial broiler flocks where ORT and H9N2 were primarily isolated; but it was difficult to establish the primary cause of the disease due to mixed infections (5). Co-infections complicate respiratory disease diagnostics and currently, diagnostic approaches to characterize the co-infecting viral and bacterial respiratory pathogens from chicken samples require both classical and molecular diagnostic tools.
One common approach to identify avian pathogens is isolation in embryonating eggs from specific-pathogen-free (SPF) chickens (11). However, coexistence of avian respiratory pathogens (i.e., NDV and AIV) in the same sample might present a diagnostic problem as it is possible to observe overwhelming growth of one agent over the other during isolation causing a biased characterization of clinical samples (12–14). Furthermore, some microbial pathogens associated with respiratory diseases such as Mycoplasmas are difficult and time consuming to culture under laboratory conditions.
A variety of polymerase chain reaction (PCR)-based rapid multiplexed diagnostic assays have been used for detection and molecular epidemiology of respiratory co-infecting pathogens (15–18). However, these assays were developed for specific pathogens, precluding the possibility of detecting unknown pathogens in the samples (19). Additionally, these conventional assays are sensitive to genetic variation, and mismatches on the pathogen's target sequence can lead to false negative results (20).
For decades, pathogen diagnostics and sequencing have been separate endeavors, with sequencing following diagnostics via PCR. Sanger sequencing has historically been the gold standard for sequence-based characterization of pathogens, but this approach is time consuming and expensive for complete identification of coinfecting agents in clinical samples (21, 22). More recently, the next generation sequencing (NGS) platforms have changed this paradigm by providing the possibility for simultaneous diagnostic testing and sequencing of novel and re-emerging pathogens directly from a clinical sample (23). We have recently optimized conditions for efficient detection of multiple respiratory pathogens in poultry by directly sequencing clinical samples with Illumina short-read sequencing (24–26). The widespread application of these sequencing platforms for routine diagnostics is still limited by the long processing time, the bioinformatics expertise required to analyze random sequencing data, and the high cost; hence the need to improve recent alternative diagnostic methods that counter these challenges.
Targeted sequencing on the long-read MinION platform (Oxford Nanopore Technologies) (27) has recently been used to increase the utility of high-throughput sequencing as a tool for avian pathogen characterization (28, 29). The ability to perform near-real-time sequence analysis of long DNA molecules reduces the time from sample collection to outcome. MinION-based targeted sequencing has been used to genetically type respiratory pathogens such as NDV (30), IBV (31), AIV (32), and infectious laryngotracheitis virus (ILTV) (33). Recently, a random strand-switching approach was used to identify novel avian paramyxoviruses (APMVs) from cultured samples (34). However, a target-independent, multiplexed, single assay for these respiratory pathogens from uncultured swab samples has not been fully developed. Multiplexed, time- and cost-effective assays that require minimal equipment would be useful in rapidly diagnosing infections and co-infections. In the current study, a multiplexed, random sequencing approach based on the MinION nanopore sequencer was developed and compared to RT-qPCR and Illumina MiSeq for the detection of viral and bacterial co-infections in commercial poultry. Additionally, automated bioinformatics pipelines were developed for the rapid characterization of samples by non-experts.
Materials and methods
Samples
Clinical oropharyngeal swab samples (hereafter referred to as “clinical samples”) were obtained from chickens from live bird markets in Kenya (n = 11) and submitted to the Southeast Poultry Research Laboratory (SEPRL), Athens, Georgia, USA as previously described (35). A second batch of archived chicken oropharyngeal swab samples (n = 12), collected using standard procedures during an experimental coinfection study at SEPRL (hereafter referred to as “experimental samples”), was also used in the current study (Supplementary Table 1). Allantoic fluid obtained from SPF eggs was used as a negative control for both sets of samples.
RT-qPCR assay on experimental swab samples
Total RNA was extracted from 200 μl of each of the virus isolation media used to collect the swab samples using the MagMax RNA extraction kit (Thermo Fisher Scientific, Waltham, MA, USA) as per manufacturer's instructions, and stored at −80°C until further use. The experimental samples were tested by three separate previously described RT-qPCR assays to detect IBV (spike gene) (36), AIV (matrix gene) (37), and Mycoplasma (16S-23S intergenic spacer region) (38) using the AgPath ID, One step RT-PCR kit (Thermo Fisher Scientific, Waltham, MA, USA) performed on the Applied Biosystems 7500 FAST.
MinION sample preparation
cDNA synthesis
For cDNA synthesis, prior to MinION library preparation, a reaction mixture of 11.5 μl of total RNA (~50 ng RNA), 0.5 μl of 250 nM random hexamers (New England Biolabs, Ipswich, MA) and 1 μl of 10 mM dNTPs was incubated at 65°C for 5 min and chilled on ice for 1 min, followed by the addition of 7 μl of cDNA synthesis mix including SuperScript IV (Thermo Fisher Scientific, Waltham, MA, USA) according to the manufacturer's instructions. The reaction mixture was incubated at 23°C for 10 min and then at 55°C for 10 min for cDNA synthesis. The reaction was terminated at 80°C for 10 min, and then chilled on ice. To remove residual RNA, the cDNA solution was incubated with 1 μl of RNase H at 37°C for 20 min according to the manufacturer's instructions.
Second strand DNA synthesis
The cDNA was immediately used for second strand synthesis. Briefly, 20 μl of cDNA solution was mixed with 10 μl of NEBNext (New England Biolabs, Ipswich, MA) second strand synthesis reaction buffer, 5 μl of NEBNext second strand enzyme mix and 45 μl of nuclease free water (NFW), incubated at 16°C for 1 h and cooled at 4°C. dsDNA was purified with AMPure XP beads (Beckman Coulter, Indianapolis, Indiana) at a bead: DNA volumetric ratio of 1.8:1 and eluted in 52 μl of NFW.
Adapter ligation
dsDNA was repaired and dA-tailed using 45 μl of dsDNA, 7 μl of Ultra II End-prep reaction buffer, 3 μl Ultra II End-prep enzyme mix (NEBNext Ultra End Repair/dA-Tailing Module, New England Biolabs) and 5 μl Nuclease-free water. The reaction mixture was incubated at 20°C for 5 min and 5 min at 65°C. AMPure bead purification was performed at 1:1 volumetric ratio of beads: DNA according to manufacturer's protocol. 15 μl of end-prepped DNA was mixed with 5 μl barcode adapter (1-96 barcoding kit, ONT) and 20 μl blunt/TA ligase master mix. The reaction mixture was incubated for 15 min at room temperature (RT). The adapter-ligated DNA was purified with AMPure bead at 0.4:1 volumetric ratio of bead: DNA and eluted in 26 μl NFW.
PCR-based barcoding was performed using 25 μl of adapter-ligated dsDNA, 2 μl of barcode and 50 μl of Long-Amp Taq 2X Master mix (New England Biolabs, Ipswich, MA). A reaction mixture of 100 μl was used for amplicon synthesis with the following conditions: denaturation at 95°C for 3 min; 17 cycles of denaturation at 95°C for 15 s, annealing at 62°C for 15 s and extension at 65°C for 90 s, and final extension of 65°C for 90 s and chilled at 4°C.
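As a worked check of the bead cleanups described above, the following minimal sketch computes the AMPure bead volume for each step from the stated component volumes and bead:DNA ratios. It assumes that beads are added to the full reaction volume, which the text does not state explicitly; the function name and data layout are hypothetical and only illustrate the arithmetic.

```python
# Illustrative arithmetic only. Volumes and ratios come from the protocol text;
# the helper name and the assumption that beads are added to the whole
# reaction volume are ours.

def bead_volume_ul(sample_volume_ul: float, bead_to_dna_ratio: float) -> float:
    """Return the bead volume (in microlitres) for a bead:DNA volumetric ratio."""
    return sample_volume_ul * bead_to_dna_ratio


# (step, component volumes in microlitres, bead:DNA volumetric ratio)
cleanups = [
    ("second strand synthesis", [20, 10, 5, 45], 1.8),
    ("end repair / dA-tailing", [45, 7, 3, 5], 1.0),
    ("barcode adapter ligation", [15, 5, 20], 0.4),
]

for step, components, ratio in cleanups:
    reaction_volume = sum(components)
    beads = bead_volume_ul(reaction_volume, ratio)
    print(f"{step}: {reaction_volume} ul reaction -> {beads:.1f} ul beads")
```

Under this assumption the three cleanups use 144, 60, and 16 μl of beads for reaction volumes of 80, 60, and 40 μl, respectively.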
Library preparation and sequencing
The barcoded dsDNA was purified using AMPure beads (bead: DNA, 1:1 volumetric ratio), end-repaired and dA-tailed, purified again (bead: DNA, 1.6:1), and adapter ligated by using 60 μl of 700 ng pooled (equal volume) barcoded sample, 10 μl of Adapter Mix (AMX 1D), 20 μl of NEBNext Quick Ligation Reaction Buffer (5X) and 10 μl of Quick T4 DNA Ligase (New England Biolabs, Ipswich, MA). The reaction mixture was mixed gently by flicking the tube and incubated for 10 min at RT. After bead purification, the prepared DNA library was eluted in 15 μl of elution buffer.
A FLO-MIN106 R9.4 flow cell (27) was equilibrated to RT for 10 min and then primed with running buffer as per manufacturer's instructions. The DNA libraries were prepared by combining 12 μL of the library pool with 2.5 μL NFW, 35 μL RBF (Running Buffer Fuel), and 25.5 μL library loading beads. After the MinION Platform QC run, the DNA library was loaded into the MinION flow cell via the SpotON port. The standard 1D sequencing protocol was initiated using the MinKNOW software v.5.12.
MiSeq sample preparation
The same clinical and experimental samples were processed for MiSeq sequencing for a side-by-side comparison of sequencing data obtained from MinION and MiSeq. Briefly, DNA libraries were prepared from total RNA (used for MinION library synthesis) (n = 25) using the KAPA Stranded RNA-Seq Kit (Roche Sequencing Solutions, Inc., CA, USA) according to manufacturer's recommendations. Concentrations and size distributions (bp) of the cDNA in the KAPA libraries were assessed with the Qubit® dsDNA HS Assay Kit (Thermo Fisher Scientific, Waltham, MA) and the Agilent 2100 Bioanalyzer (Agilent Technologies Inc., Germany), respectively. Paired-end sequencing of the diluted pooled libraries (10 μl each; 4 nM final concentration) was performed on an Illumina MiSeq platform for 39 h using the 300 cycle MiSeq Reagent Kit v2 (Illumina, USA) according to manufacturer's instructions (39).
MinION data analysis
Raw FAST5 data was basecalled, demultiplexed, and trimmed using Guppy 6.1.2, model “dna_r9.4.1_450bps_hac,” barcode set “EXP-PBC096,” with “--detect_mid_strand_barcodes” and “--trim_barcodes,” without “--require_barcodes_both_ends.” Reads were assigned taxonomic classifications using KrakenUniq (v0.5.8) (40) modified with local patches, against a hierarchical set of databases containing vector/contaminant sequences, host genome (Gallus gallus GRCg6a), human genome (GRCh38.p13), and the BASE2BIO LLC (Oshkosh, WI, USA) untargeted database of microbial reference sequences. Classifications were further adjusted using a patched version of the “krakenuniq-filter” script, adjusting assignments up the taxonomic tree until the k-mer specificity was 0.05 for viral taxa and 0.25 for all other taxa. Each identified taxon was then further verified by BLASTn (41) search of a random subset of taxon-assigned reads against the full GenBank “nt” database and subsequent lowest common ancestor assignment by in-house tools. For taxa of interest (i.e., NDV, IBV, and AIV), genotypes were called using the standard BASE2BIO genotyping module and curated agent databases. De novo assemblies of non-host reads were performed using MEGAHIT (v1.2.9) (42) with default settings and a minimum contig length of 500 bp. All steps involving read mapping were performed with minimap2 (v2.24-r1122) (43). Tabulation, summarization, and visualization of correlation analysis results were performed in R v4.1.1 (https://www.R-project.org).
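The lowest common ancestor step mentioned above can be illustrated with a minimal sketch. The in-house tools are not described in this article, so the code below is not the authors' implementation; it shows only the general technique on a small, simplified toy taxonomy whose node names and structure are assumptions made for illustration.

```python
# Toy child -> parent taxonomy (simplified and illustrative; not the study's database).
PARENT = {
    "root": None,
    "Viruses": "root",
    "Coronaviridae": "Viruses",
    "Avian coronavirus": "Coronaviridae",
    "Infectious bronchitis virus": "Avian coronavirus",
    "Turkey coronavirus": "Avian coronavirus",
    "Paramyxoviridae": "Viruses",
    "Newcastle disease virus": "Paramyxoviridae",
}


def lineage(taxon):
    """Return the path from a taxon up to the root, starting with the taxon itself."""
    path = []
    while taxon is not None:
        path.append(taxon)
        taxon = PARENT[taxon]
    return path


def lowest_common_ancestor(taxa):
    """Return the deepest node shared by the lineages of all given taxa."""
    taxa = list(taxa)
    if not taxa:
        raise ValueError("need at least one taxon")
    shared = set(lineage(taxa[0]))
    for taxon in taxa[1:]:
        shared &= set(lineage(taxon))
    # Walking upward from the first taxon, the first shared node is the deepest one.
    for node in lineage(taxa[0]):
        if node in shared:
            return node


# BLAST hits of one read that agree only at the species level:
print(lowest_common_ancestor(["Infectious bronchitis virus", "Turkey coronavirus"]))
# BLAST hits spanning two virus families:
print(lowest_common_ancestor(["Infectious bronchitis virus", "Newcastle disease virus"]))
```

In this toy tree, hits that disagree below the species node are assigned to the species, while hits from different families fall back to the shared "Viruses" node, so ambiguous reads are reported at a more conservative taxonomic level.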
MiSeq data analysis
Analysis of Illumina MiSeq data was performed as described above for the MinION but with the following differences: Data was trimmed using Trim Galore (https://github.com/FelixKrueger/TrimGalore) v0.6.7 to remove residual adapter sequences and low-quality 3' ends. Steps involving read mapping were performed using BWA MEM (v0.7.17-r1188) (44) with default settings.
Results
Rapid detection of pathogens from clinical samples
A total of twenty-three samples were used in this study, along with two negative controls. Samples 1 to 11 were clinical samples and 13 to 24 were experimental samples (Supplementary Table 1). In Table 1, comparative characterization of the clinical samples with MinION vs. MiSeq is presented as total reads, classified reads (belonging to chicken and microbial genomes), the genotype calling based on the coverage breadth (3x) and the identification of major respiratory pathogens, which included a selected list of known targeted and non-targeted agents obtained from the avian disease literature and identified using a relative read abundance threshold of >1%. In addition to the full 8 h MinION run, a subset of reads corresponding to the first hour of the MinION sequencing run was analyzed separately to provide insight into the potential for rapid turnaround times. Prior to sequencing, the clinical samples 1 to 11 were NDV-positive using RT-qPCR targeting the Matrix gene, with Ct values ranging from 12.04 to 36.59. NDV was detected in all 11 samples in the first hour of MinION sequencing. Ten of the 11 samples had sufficient genome sequence coverage to accurately assign the genotype after 1 h; all 11 were correctly genotyped in the MiSeq run. In addition to genotype, determination of the fusion cleavage site amino acid motif and subsequent virulence classification was possible for some or all samples depending on platform and run time (MiSeq: 11/11; MinION 8 h: 6/11; MinION 1 h: 3/11). The consensus sequences assembled from pathogen-specific reads showed presence of virulent NDV as predicted by the presence of the amino acid motif (RRQKR↓F) at the cleavage site of the Fusion (F) protein (Supplementary Table 2). The co-infecting pathogens were present at the arbitrarily designated 1% threshold on at least one of the platforms; however, additional agents were detected in additional samples at levels below this cutoff.
All 11 clinical samples contained other known avian respiratory pathogens, which were detected in addition to NDV at significant levels (>1% of total microbial reads), including Mycoplasma (M. gallisepticum, M. synoviae, and M. pullorum), Avibacterium sp., Gallibacterium sp., and Ornithobacterium rhinotracheale.
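The virulence prediction from the F cleavage site described above can be sketched as follows. The rule encoded here is the commonly cited criterion of at least three basic residues (R or K) at positions 113 to 116 together with phenylalanine at position 117; this criterion and the function name are assumptions for illustration, since the article does not state the exact rule applied by the genotyping module.

```python
BASIC_RESIDUES = set("RK")


def classify_f_cleavage_site(motif_112_117: str) -> str:
    """Classify a six-residue F cleavage-site motif covering positions 112-117.

    Assumed rule (not taken from the article): at least three basic residues
    at positions 113-116 plus F at position 117 indicates a virulent-type site.
    """
    if len(motif_112_117) != 6:
        raise ValueError("expected six residues covering positions 112-117")
    residues_113_116 = motif_112_117[1:5]
    basic_count = sum(aa in BASIC_RESIDUES for aa in residues_113_116)
    has_f117 = motif_112_117[5] == "F"
    if basic_count >= 3 and has_f117:
        return "virulent-type"
    return "low-virulence-type"


# Motif reported for the clinical samples (RRQKR followed by F at position 117).
print(classify_f_cleavage_site("RRQKRF"))
# A motif typical of low-virulence strains, included for contrast.
print(classify_f_cleavage_site("GRQGRL"))
```

A motif-based call of this kind is only as reliable as the consensus sequence across the cleavage site, which is consistent with virulence classification being possible for fewer samples after shorter MinION runs.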
Table 1. Pathogen identification from MinION and MiSeq sequencing from clinical samples collected from Kenya.
Rapid detection of pathogens from experimental samples
MinION sequencing of the twelve experimental samples and a negative template control is summarized in Table 2 as total reads, median read length, classified read counts, chicken host reads, fraction of host reads from total reads, read count for each pathogen after 8 h and 1 h of sequencing, and coverage breadth across the whole microbial genomes (AIV, IBV and MS). The fraction of host reads ranged from 0.80 to 0.96 of the total reads. The non-chicken reads (from the 8 h and 1 h sequencing runs) were classified as microbial reads belonging to IBV, AIV and MS, alone or combined. These experimental samples were also tested using RT-qPCR for the presence of the suspected pathogens, and Ct values for the respective pathogen are included in Table 2. The Ct values for IBV fell in a narrow range from 21.94 to 25.87. IBV reads were detected in all twelve samples in the first hour of MinION sequencing. Read depth was sufficient (3x) to accurately assign IBV genotypes in 9/12 samples after 1 h, and in 12/12 samples after 8 h (as well as in the MiSeq run). For AIV, reads were detected in 10/11 RT-qPCR-positive samples after 1 and 8 h. Full HA/NA subtypes were assigned for 5/11 samples after 1 h, 6/11 after 8 h, and 11/11 in the MiSeq run. For MS, reads were detected after 1 h in 10/10 RT-qPCR-positive samples. No target agents were detected in the mock control. Of all agent/sample combinations, only one sample contained a single MS-specific read in the full 8 h sequencing run while being RT-qPCR-negative for the agent.
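The read-level summaries reported above (fraction of host reads, and agent reads as a share of microbial reads) reduce to simple ratios. The sketch below uses synthetic counts, not values from Table 2, and a hypothetical function name; the 1% reporting threshold is the one described for the clinical samples and is applied here only as an illustration.

```python
def summarize_sample(total_reads, host_reads, microbial_counts, threshold=0.01):
    """Return host fraction, per-taxon relative abundance, and taxa above threshold.

    Relative abundance is computed out of all microbial reads, not all reads.
    """
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    host_fraction = host_reads / total_reads
    microbial_total = sum(microbial_counts.values())
    if microbial_total == 0:
        return host_fraction, {}, []
    abundance = {taxon: n / microbial_total for taxon, n in microbial_counts.items()}
    reported = sorted(taxon for taxon, frac in abundance.items() if frac > threshold)
    return host_fraction, abundance, reported


# Synthetic read counts for illustration only.
counts = {"IBV": 600, "AIV": 300, "MS": 95, "other": 5}
host_fraction, abundance, reported = summarize_sample(100_000, 90_000, counts)
print(f"host fraction: {host_fraction:.2f}")
print("taxa above 1% of microbial reads:", reported)
```

With a high host fraction, the number of microbial reads available per sample shrinks, so low-abundance agents can fall below a fixed relative-abundance threshold even when they are present.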
Table 2. Microbial reads obtained at different hours (h) from random sequencing using MinION on experimental oropharyngeal swab samples.
Reference-based and de novo genome determination
In addition to detection and genotyping, rapid NGS can provide detailed sequence information on the pathogens found in a sample. For each experimental sample, Table 2 lists the breadth of genome coverage (minimum 3x depth) calculated from read alignment. After 1 h of MinION sequencing, 3/12 samples had IBV coverage breadth > 50%, corresponding approximately with a Ct cutoff of 22. After 8 h, this fraction increased to 11/12. For MS, 9/10 samples had >50% coverage after 1 h, and 10/10 had >75% coverage after 8 h. For AIV, 3/11 samples had >50% coverage after a single hour, while 6/11 had similar coverage after 8 h.
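The breadth-of-coverage metric used above (fraction of the reference covered at a minimum depth of 3x) can be sketched as follows. The per-position depth list is assumed to come from a read alignment; the function name and the example depths are hypothetical.

```python
def coverage_breadth(depths, min_depth=3):
    """Return the fraction of reference positions covered by at least min_depth reads.

    `depths` holds one read-depth value per reference position, including zeros.
    """
    if not depths:
        raise ValueError("depths must contain one value per reference position")
    covered = sum(1 for depth in depths if depth >= min_depth)
    return covered / len(depths)


# Synthetic depths for a ten-position toy reference (illustration only).
example_depths = [0, 1, 3, 5, 10, 2, 3, 0, 4, 7]
print(f"breadth at 3x: {coverage_breadth(example_depths):.0%}")
```

Positions with zero depth must be included in the input; otherwise the breadth is overestimated because uncovered positions drop out of the denominator.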
Correlation of MinION and MiSeq sequencing with RT-qPCR on AIV, IBV, and MS
To model the relationship between RT-qPCR Ct values and NGS read abundance, relative read abundance (out of all microbial reads) for each of the four agents with RT-qPCR data available was plotted on a log2 scale against the known Ct values (Figures 1A–H). A linear least-squares model was fit to the data, and the lower 95% confidence interval of this model was used to estimate the lowest Ct value at which an agent read would be detected on average under several different sets of experimental assumptions. This model was also used to estimate the expected Ct value of an agent that is seen at an abundance of one read per thousand microbial reads. The estimated Ct thresholds at which a single read (MinION/MiSeq sequencing) per thousand microbial reads would be observed with 95% confidence at different run times and levels of host contamination, given current experimental conditions and assuming 12 multiplexed samples, were determined to be 27/27.5 for AIV, 26.5/26 for IBV and 36/36.5 for MS (Figures 1A–F, purple horizontal line). The three agents used in the experimental study all have strong correlation between Ct and log2 abundance (between −0.82 and −0.98). AIV has the strongest correlation, and the largest range of Ct values (19–36) (Figures 1A,B). The other two agents in this study, IBV (Figures 1C,D) and MS (Figures 1E,F), had slightly weaker correlation, but also significantly smaller Ct ranges (22–24 and 29–34, respectively).
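The modeling step described above can be sketched in a few lines. The original analysis was performed in R and used the lower 95% confidence bound of the fit; the Python sketch below is a simplified stand-in that fits an ordinary least-squares line to log2 relative abundance versus Ct, reports Pearson's R, and inverts the point estimate of the line (no confidence bound) to find the Ct at one read per thousand microbial reads. The data are synthetic and idealized, with abundance halving for each additional cycle, and all function names are hypothetical.

```python
import math


def fit_line(x, y):
    """Ordinary least-squares fit y = intercept + slope * x, plus Pearson's R."""
    n = len(x)
    if n < 2 or n != len(y):
        raise ValueError("x and y must have equal length of at least two")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r


def ct_at_abundance(slope, intercept, abundance):
    """Ct at which the fitted line predicts the given relative abundance."""
    return (math.log2(abundance) - intercept) / slope


# Synthetic, idealized data (not from the study): abundance halves per cycle.
ct_values = [20, 24, 28, 32]
rel_abundance = [2.0 ** -(ct - 18) for ct in ct_values]
log2_abundance = [math.log2(a) for a in rel_abundance]

slope, intercept, r = fit_line(ct_values, log2_abundance)
ct_threshold = ct_at_abundance(slope, intercept, 1 / 1000)
print(f"slope={slope:.2f}, intercept={intercept:.2f}, R={r:.2f}")
print(f"Ct at one read per thousand microbial reads: {ct_threshold:.2f}")
```

Because Ct is itself a log2-scale measurement of template quantity, a linear fit on log2 abundance is the natural model, and a slope near −1 would correspond to read abundance tracking template quantity proportionally.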
Figure 1. MinION and MiSeq read abundance vs RT-PCR Ct; for Avian Influenza virus (A,B); for Infectious bronchitis virus (C,D); for Mycoplasma synoviae (E,F) in experimental samples and for Newcastle disease virus (G,H) in clinical samples. Black line indicates best-fit linear regression model. Red/blue horizontal lines mark Ct thresholds at which a single read would be estimated to be observed with 95% confidence at different run times and levels of host contamination, given current experimental conditions, and assuming 12 multiplexed samples. Purple horizontal line marks Ct threshold corresponding to on average one agent read per thousand microbial reads at 95% confidence.
For the clinical samples, correlation of NDV relative abundance and Ct is generally poor (Figures 1G,H). The overlap of taxon IDs between MiSeq and MinION is shown in Figure 2. Nearly parallel lines representing the number of taxon identifications indicated a relationship (as expected) between the overlap in detection and both taxon abundance and MinION run length. The comparison of per-taxon relative read abundances between Illumina MiSeq and MinION sequencing runs in combined clinical and experimental samples showed strong correlation (R = 0.85); the identified taxa are highlighted with different color-coded symbols in Figure 3. Here, for simplicity, the comparison was done with a selected number of viral and bacterial respiratory disease-causing agents. In addition, there was extremely high correlation (R = 0.95) between relative abundance estimates from MinION and MiSeq sequencing data from the experimental swab samples (Figure 4). The comparison of per-taxon relative read abundances between Illumina MiSeq and MinION sequencing runs of clinical samples showed a moderate correlation (R = 0.79), as shown in Figure 5.
Figure 2. Overlaps of taxon IDs between MiSeq and MinION runs as a function of taxon abundance and MinION run length. A minimum k-mer count of 20 and a minimum b-score (subsampled BLAST agreement) of 0.7 was calculated from IDs.
Figure 3. Correlation of per-taxon relative read abundances between Illumina MiSeq and MinION runs. Individual trend lines show least-squares regression models. Dashed black line indicates the identity (1:1) relationship. Data used was from experimental and clinical samples. Pearson's R value is calculated from the combined dataset.
Figure 4. Correlation of per-taxon relative read abundances between Illumina MiSeq and MinION runs of experimental samples. Individual trend lines show least-squares regression models. Dashed black line indicates the identity (1:1) relationship. Pearson's R value is calculated from the combined dataset.
Figure 5. Correlation of per-taxon relative read abundances between Illumina MiSeq and MinION runs of clinical samples. Individual trend lines show least-squares regression models. Dashed black line indicates the identity (1:1) relationship. Pearson's R value is calculated from the combined dataset.
Discussion
Sequence-based pathogen characterization approaches have evolved rapidly and are broadly accepted in the global research community. This study demonstrates the utility of random amplification for untargeted MinION nanopore sequencing to achieve accurate identification and preliminary genetic characterization of viral and bacterial agents in co-infected clinical and experimental samples. We have shown that the MinION platform, as expected, produces much smaller read output than the MiSeq platform; however, in terms of positive-negative detection and sequence-based agent identification, it can approach the sensitivity of the MiSeq-based approach. This identification of the pathogen, when present at moderate abundance, can be achieved with runtimes as short as 60 min, providing advantages in terms of cost, speed of data acquisition and processing time per sample in clinical settings. We have demonstrated strong quantitative correlation both between sequencing platforms and between sequence read abundance and RT-qPCR Ct values, indicating that detection sensitivity of nanopore sequencing is limited primarily by sequencing depth rather than any inherent weakness of the platform. A major advantage of this platform is the ability to adjust run times to suit requirements, even during a run. Therefore, it lends itself to cost optimization by balancing run times, multiplexing depth, and host depletion optimization to achieve target sensitivity levels.
Rapid and accurate detection and characterization of the microbial pathogens present in clinical samples has long been a major goal in diagnostic settings, and numerous advances have been made to improve tests. However, no single diagnostic test is perfect and varying scenarios often require a variety of diagnostic tests. For rapid identification of avian respiratory pathogens, single or multiplexed PCR-based diagnostic assays have been developed and widely used. However, the target-specific nature of these assays makes them vulnerable to failure because of the ability of these pathogens to change, causing false negative results (20). Additionally, PCR-based rapid assays provide limited or no additional genetic information about the detected pathogens. Recently, a strand-switching based random MinION sequencing approach has been used on cultured viral pathogens (45). The work described here is a step further toward rapid characterization, as it demonstrates accurate identification and genetic typing of viral and bacterial pathogens from clinical samples. It also demonstrates that a single targeted assay for the suspected viral pathogen (NDV) would have failed to identify co-infecting pathogens including A. paragallinarum, M. pullorum, and G. anatis, which would have remained undetected in these clinical samples without additional testing. However, these bacterial pathogens were detected by untargeted MinION sequencing and in most samples confirmed by MiSeq sequencing.
Although multiplexed PCR based assays have been developed to detect multiple respiratory pathogens in a single assay, this incrementally improved approach still requires a prediction of what pathogens (or genetic variant in case of RNA viruses) are in a sample and will not identify unknown agents, which would result in the incomplete characterization of the clinical samples (8). The presence of more than one pathogen in clinical samples is known to occur and the diversity in the genetic material makes it necessary to perform additional assays to identify some of the pathogens despite the availability of multiplexed assays for respiratory viral pathogens. This MinION sequencing approach is target independent, which may reduce the chances of failure in detection of pathogens due to genetic change. Additionally, the ability to use total RNA (rRNA and mRNA) provides an opportunity to detect pathogens both from genomic viral RNA as well as the rRNAs of replicating bacterial pathogens and mRNA of replicating DNA viruses.
It has been reported that upgraded nanopore sequencing flow cells are capable of achieving as high as 95% raw accuracy (46). It is likely that, as the sequencing technology and base calling algorithms improve, single-read accuracy will increase, further diminishing data analysis challenges specific to noisy long-read data as compared to short-read sequencing. Short-read-based metagenomics studies on platforms such as Illumina have experienced widespread use due to the high accuracy of sequencing. Although Sanger and Illumina (sequencing-by-synthesis) sequencing platforms are considered the gold standard in terms of accuracy (23), these approaches have limitations. Sanger sequencing is necessarily target specific and Illumina-based sequencing shares some of the same limitations as MinION in terms of data management. It can become challenging to analyze hundreds of thousands to millions of individual reads due to the computational power and time required. In our current study, an automated pipeline running on cloud resources analyzed each sample in parallel in an average of 2.6 h (minimum 1.5, maximum 4.8) with no user intervention. This workflow overcomes the challenges associated with the lack of computational resources and speed of data analysis.
The primary current limitation of the non-targeted metagenomic sequencing assay is its lower sensitivity compared with targeted amplification. However, this study demonstrates that samples with Ct values into the 30s can be reliably detected from randomly amplified samples in as little as 1 h of sequencing time, often with depth sufficient to yield the genotypic classification. This observation is supported by linear modeling of NGS abundance and RT-qPCR thresholds, which backs the conclusion that at least some agents with Ct values > 30 should be reliably identifiable under similar experimental conditions. The sensitivity of detection in these experimental samples was hampered both by a high degree of host contamination, which varied from 63 to 99% in the experimental MinION run, as well as multiplexed library sizes that varied by several orders of magnitude. Improvements in handling these challenges would be expected to significantly increase the sensitivity beyond that already observed and improve the reliability of the approach as a diagnostic tool.
One of the biggest challenges in sequencing-based diagnostics is the presence of nucleic acids in clinical samples from both host and pathogens, which may be sourced from a variety of genomic material. The total RNA sequencing approach adopted in this study allows the capture of a broad population of RNAs from the clinical oral swab samples. Although oropharyngeal swab samples have comparatively lower host and commensal bacterial populations, there is still a background of many chicken reads. Because clinical samples were collected at different time points and the quality of RNA may be low as well, these samples should be sequenced for around 8 h. It is notable that although no pre-enrichment approach was used to detect microbial RNA in the clinical samples, only 60 min of MinION sequence data was sufficient to detect all the test co-infecting viral and bacterial pathogens from the experimental samples. Specific reduction in the rRNA of the chicken host will further increase the utility of this approach in recovery of viral genomes from metagenomic samples. At the moment, several companies offer kits for elimination of host and bacterial ribosomal RNAs, and the utilization of those is likely to improve the sensitivity of detection of pathogens (47). An RNaseH approach with probes targeted to rRNA from both chickens and bacteria has shown a significant increase in sensitivity and can potentially be used with this MinION approach as well (24). The utility of the assay for other species has not been tested, but this protocol does not include any pre-enrichment of pathogen RNA, which could also make it useful for human pathogens.
Conclusions
The presence of important viral and bacterial pathogens in respiratory clinical samples of chickens was detected by direct extraction of total RNA followed by the use of Oxford Nanopore MinION or Illumina MiSeq sequencing technologies. NDV and various bacterial respiratory agents were detected in chickens from Kenyan live bird markets with both technologies. The MinION platform provided a rapid but still accurate characterization of the co-infecting viral and bacterial pathogens in experimental swab samples. Extensive testing on diverse clinical samples will further evaluate the viability of this protocol for diagnostic settings. In addition, because this MinION-based approach provides for rapid, multiplexed, and cost-effective detection of viral and bacterial pathogens in clinical samples with sufficient sensitivity for many applications, it represents a legitimate alternative for diagnostic laboratories that cannot afford more expensive equipment for next-generation diagnostics. Based on this work and related studies, the goal of a cost-effective, sensitive, and untargeted NGS-based diagnostic tool appears one step closer to reality.
Data availability statement
The datasets in this study are deposited in the SRA repository under BioProject PRJNA900571 with the accession numbers SRR22262942-SRR22262990. These datasets can be accessed at the link below: https://www.ncbi.nlm.nih.gov/sra/PRJNA900571.
Author contributions
SB processed the egg-grown and clinical samples, created the MinION libraries, analyzed the MinION data, and wrote the manuscript. HK contributed to the preparation and analysis of NGS data and manuscript preparation. JV developed the MinION data analysis workflow and assisted with manuscript preparation. TT and CL helped with the RT-qPCR for IBV, AIV, and MS. MP-J, DS, and JS assisted in data interpretation and manuscript preparation. CA was involved in the design of the study, data analysis, data interpretation, and writing of the manuscript. All authors were involved in editing the manuscript and read and approved the final version.
Funding
This project was supported by USDA CRIS 6040-32000-072.
Acknowledgments
Drs. Mark Jackwood and Naola Ferguson from the Poultry Diagnostic and Research Center, College of Veterinary Medicine, University of Georgia, Athens, GA, United States helped provide the viral and bacterial reagents used to optimize the protocol.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fvets.2022.1073919/full#supplementary-material
Supplementary Table 1. Details of clinical and experimental oropharyngeal swab samples.
Supplementary Table 2. Details of comparison of de novo contigs assigned per taxon from MinION and MiSeq sequencing platforms.
References
1. Saif F, Glisson M, David ES. Diseases of Poultry 12th Edition. Ames, IA: Blackwell Publishers (2008).
2. Guo Y, Krauss S, Senne D, Mo I, Lo K, Xiong X, et al. Characterization of the pathogenicity of members of the newly established H9N2 influenza virus lineages in Asia. Virology. (2000) 267:279–88. doi: 10.1006/viro.1999.0115
3. Franca M, Howerth EW, Carter D, Byas A, Poulson R, Afonso CL, et al. Co-infection of mallards with low-virulence Newcastle disease virus and low-pathogenic avian influenza virus. Avian Pathol. (2014) 43:96–104. doi: 10.1080/03079457.2013.876530
4. Kammon A, Heidari A, Dayhum A, Eldaghayes I, Sharif M, Monne I, et al. Characterization of avian influenza and newcastle disease viruses from poultry in libya. Avian Dis. (2015) 59:422–30. doi: 10.1637/11068-032215-ResNote.1
5. Pan Q, Liu A, Zhang F, Ling Y, Ou C, Hou N, et al. Co-infection of broilers with ornithobacterium rhinotracheale and H9N2 avian influenza virus. BMC Vet Res. (2012) 8:104. doi: 10.1186/1746-6148-8-104
6. Agnew-Crumpton R, Vaz PK, Devlin JM, O'Rourke D, Blacker-Smith HP, Konsak-Ilievski B, et al. Spread of the newly emerging infectious laryngotracheitis viruses in Australia. Infect Genet Evol. (2016) 43:67–73. doi: 10.1016/j.meegid.2016.05.023
7. Kouakou AV, Kouakou V, Kouakou C, Godji P, Kouassi AL, Krou HA, et al. Prevalence of Newcastle disease virus and infectious bronchitis virus in avian influenza negative birds from live bird markets and backyard and commercial farms in Ivory-Coast. Res Vet Sci. (2015) 102:83–8. doi: 10.1016/j.rvsc.2015.07.015
8. Umali DV, Ito H, Shirota K, Ito T, Katoh H. Atypical velogenic Newcastle disease in a commercial layer flock in Japan. Poult Sci. (2015) 94:890–7. doi: 10.3382/ps/pev011x
9. Jones RC. Viral respiratory diseases (ILT, aMPV infections, IB): are they ever under control? Br Poult Sci. (2010) 51:1–11. doi: 10.1080/00071660903541378
10. Haghighat-Jahromi M, Asasi K, Nili H, Dadras H, Shooshtari A. Coinfection of avian influenza virus (H9N2 subtype) with infectious bronchitis live vaccine. Arch Virol. (2008) 153:651–5. doi: 10.1007/s00705-008-0033-x
11. Dufour-Zavala L. A Laboratory Manual for the Isolation, Identification, and Characterization of Avian Pathogens. Athens, GA: American Association of Avian Pathologists (2008).
12. Slemons R, Shieldcastle M, Heyman L, Bednarik K, Senne D. Type A influenza viruses in waterfowl in Ohio and implications for domestic turkeys. Avian Dis. (1991) 35:165–73. doi: 10.2307/1591309
13. Zowalaty MEE, Chander Y, Redig PT, El Latif HKA, Sayed MAE, Goyal SM. Selective isolation of Avian influenza virus (AIV) from cloacal samples containing AIV and Newcastle disease virus. J Vet Diagn Invest. (2011) 23:330–2. doi: 10.1177/104063871102300222
14. Costa-Hurtado M, Afonso CL, Miller PJ, Spackman E, Kapczynski DR, Swayne DE, et al. Virus interference between H7N2 low pathogenic avian influenza virus and lentogenic Newcastle disease virus in experimental co-infections in chickens and turkeys. Vet Res. (2014) 45:1. doi: 10.1186/1297-9716-45-1
15. Xie Z, Pang Y-s, Liu J, Deng X, Tang X, Sun J, et al. A multiplex RT-PCR for detection of type A influenza virus and differentiation of avian H5, H7, and H9 hemagglutinin subtypes. Mol Cell Probes. (2006) 20:245–9. doi: 10.1016/j.mcp.2006.01.003
16. Pang Y, Wang H, Girshick T, Xie Z, Khan MI. Development and application of a multiplex polymerase chain reaction for avian respiratory agents. Avian Dis. (2002) 46:691–9. doi: 10.1637/0005-2086(2002)046[0691:DAAOAM]2.0.CO;2
17. Xie Z, Luo S, Xie L, Liu J, Pang Y, Deng X, et al. Simultaneous typing of nine avian respiratory pathogens using a novel GeXP analyzer-based multiplex PCR assay. J Virol Methods. (2014) 207:188–95. doi: 10.1016/j.jviromet.2014.07.007
18. Tadese T, Potter AE, Fitzgerald S, Reed WM. Concurrent infection in chickens with fowlpox virus and infectious laryngotracheitis virus as detected by immunohistochemistry and a multiplex polymerase chain reaction technique. Avian Dis. (2007) 51:719–24. doi: 10.1637/0005-2086(2007)51[719:CIICWF]2.0.CO;2
19. Belák S, Thorén P, LeBlanc N, Viljoen G. Advances in viral disease diagnostic and molecular epidemiological technologies. Expert Rev Mol Diagn. (2009) 9:367–81. doi: 10.1586/erm.09.19
20. Cattoli G, De Battisti C, Marciano S, Ormelli S, Monne I, Terregino C, et al. False-negative results of a validated real-time PCR protocol for diagnosis of newcastle disease due to genetic variability of the matrix gene. J Clin Microbiol. (2009) 47:3791–2. doi: 10.1128/JCM.00895-09
21. Ansorge WJ. Next-generation DNA sequencing techniques. N Biotechnol. (2009) 25:195–203. doi: 10.1016/j.nbt.2008.12.009
22. Xiao YL, Kash JC, Beres SB, Sheng ZM, Musser JM, Taubenberger JK. High-throughput RNA sequencing of a formalin-fixed, paraffin-embedded autopsy lung tissue sample from the 1918 influenza pandemic. J Pathol. (2013) 229:535–45. doi: 10.1002/path.4145
23. Willner D, Furlan M, Haynes M, Schmieder R, Angly FE, Silva J, et al. Metagenomic analysis of respiratory tract DNA viral communities in cystic fibrosis and non-cystic fibrosis individuals. PLoS ONE. (2009) 4:e7370. doi: 10.1371/journal.pone.0007370
24. Parris DJ, Kariithi H, Suarez DL. Non-target RNA depletion strategy to improve sensitivity of next-generation sequencing for the detection of RNA viruses in poultry. J Vet Diagn Invest. (2022) 34:638–45. doi: 10.1177/10406387221102430
25. Kariithi HM, Volkening JD, Leyson CM, Afonso CL, Christy N, Decanini EL, et al. Genome sequence variations of infectious bronchitis virus serotypes from commercial chickens in Mexico. Front Vet Sci. (2022) 9:931272. doi: 10.3389/fvets.2022.931272
26. Kariithi HM, Christy N, Decanini EL, Lemiere S, Volkening JD, Afonso CL, et al. Detection and genome sequence analysis of avian metapneumovirus subtype a viruses circulating in commercial chicken flocks in Mexico. Vet Sci. (2022) 9:579. doi: 10.3390/vetsci9100579
27. Rutvisuttinunt W, Chinnawirotpisan P, Simasathien S, Shrestha SK, Yoon I-K, Klungthong C, et al. Simultaneous and complete genome sequencing of influenza A and B with high coverage by Illumina MiSeq platform. J Virol Methods. (2013) 193:394–404. doi: 10.1016/j.jviromet.2013.07.001
28. Brown BL, Watson M, Minot SS, Rivera MC, Franklin RB. MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach. Gigascience. (2017) 6:1–10. doi: 10.1093/gigascience/gix007
29. Kilianski A, Haas JL, Corriveau EJ, Liem AT, Willis KL, Kadavy DR, et al. Bacterial and viral identification and differentiation by amplicon sequencing on the MinION nanopore sequencer. Gigascience. (2015) 4:12. doi: 10.1186/s13742-015-0051-z
30. Butt SL, Taylor TL, Volkening JD, Dimitrov KM, Williams-Coplin D, Lahmers KK, et al. Rapid virulence prediction and identification of Newcastle disease virus genotypes using third-generation sequencing. Virol J. (2018) 15:179. doi: 10.1186/s12985-018-1077-5
31. Butt SL, Erwood EC, Zhang J, Sellers HS, Young K, Lahmers KK, et al. Real-time, MinION-based, amplicon sequencing for lineage typing of infectious bronchitis virus from upper respiratory samples. J Vet Diagn Invest. (2021) 33:179–90. doi: 10.1177/1040638720910107
32. Wang J, Moore NE, Deng Y-M, Eccles DA, Hall RJ. MinION nanopore sequencing of an influenza genome. Front Microbiol. (2015) 6:766. doi: 10.3389/fmicb.2015.00766
33. Spatz SJ, Garcia M, Riblet S, Ross TA, Volkening JD, Taylor TL, et al. MinION sequencing to genotype US strains of infectious laryngotracheitis virus. Avian Pathol. (2019) 48:255–69. doi: 10.1080/03079457.2019.1579298
34. Young KT, Stephens JQ, Poulson RL, Stallknecht DE, Dimitrov KM, Butt SL, et al. Putative Novel Avian Paramyxovirus (AMPV) and Reidentification of APMV-2 and APMV-6 to the Species Level Based on Wild Bird Surveillance (United States, 2016–2018). Appl Environ Microbiol. (2022) 88:e00466-22. doi: 10.1128/aem.00466-22
35. Kariithi HM, Welch CN, Ferreira HL, Pusch EA, Ateya LO, Binepal YS, et al. Genetic characterization and pathogenesis of the first H9N2 low pathogenic avian influenza viruses isolated from chickens in Kenyan live bird markets. Infect Genet Evol. (2020) 78:104074. doi: 10.1016/j.meegid.2019.104074
36. Callison SA, Hilt DA, Boynton TO, Sample BF, Robison R, Swayne DE, et al. Development and evaluation of a real-time Taqman RT-PCR assay for the detection of infectious bronchitis virus from infected chickens. J Virol Methods. (2006) 138:60–5. doi: 10.1016/j.jviromet.2006.07.018
37. Spackman E. Avian influenza virus detection and quantitation by real-time RT-PCR. Methods Mol Biol. (2014) 1161:105–18. doi: 10.1007/978-1-4939-0758-8_10
38. Raviv Z, Kleven SH. The development of diagnostic real-time TaqMan PCRs for the four pathogenic avian mycoplasmas. Avian Dis. (2009) 53:103–7. doi: 10.1637/8469-091508-Reg.1
39. Imai K, Tamura K, Tanigaki T, Takizawa M, Nakayama E, Taniguchi T, et al. Whole genome sequencing of influenza A and B viruses with the MinION sequencer in the clinical setting: a pilot study. Front Microbiol. (2018) 9:2748. doi: 10.3389/fmicb.2018.02748
40. Breitwieser FP, Baker DN, Salzberg SL. KrakenUniq: confident and fast metagenomics classification using unique k-mer counts. Genome Biol. (2018) 19:198. doi: 10.1186/s13059-018-1568-0
41. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. (1990) 215:403–10. doi: 10.1016/S0022-2836(05)80360-2
42. Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. (2015) 31:1674–6. doi: 10.1093/bioinformatics/btv033
43. Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. (2018) 34:3094–100. doi: 10.1093/bioinformatics/bty191
44. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv preprint arXiv:1303.3997 (2013).
45. Young KT, Lahmers KK, Sellers HS, Stallknecht DE, Poulson RL, Saliki JT, et al. Randomly primed, strand-switching, MinION-based sequencing for the detection and characterization of cultured RNA viruses. J Vet Diagn Invest. (2021) 33:202–15. doi: 10.1177/1040638720981019
46. Amarasinghe SL, Su S, Dong X, Zappia L, Ritchie ME, Gouil Q. Opportunities and challenges in long-read sequencing data analysis. Genome Biol. (2020) 21:1–16. doi: 10.1186/s13059-020-1935-5
Keywords: MinION, MiSeq, Newcastle disease virus, avian influenza virus, infectious bronchitis virus, mycoplasma spp., clinical samples, respiratory disease
Citation: Butt SL, Kariithi HM, Volkening JD, Taylor TL, Leyson C, Pantin-Jackwood M, Suarez DL, Stanton JB and Afonso CL (2022) Comparable outcomes from long and short read random sequencing of total RNA for detection of pathogens in chicken respiratory samples. Front. Vet. Sci. 9:1073919. doi: 10.3389/fvets.2022.1073919
Received: 19 October 2022; Accepted: 14 November 2022;
Published: 01 December 2022.
Edited by:
Ihab Habib, United Arab Emirates University, United Arab Emirates
Reviewed by:
Sunil Kumar Mor, University of Minnesota Twin Cities, United States
Irit Davidson, Kimron Veterinary Institute, Israel
Copyright © 2022 Butt, Kariithi, Volkening, Taylor, Leyson, Pantin-Jackwood, Suarez, Stanton and Afonso. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Claudio L. Afonso, claudio.afonso@base2bio.com