- 1Center for Environmental Sensing and Modelling, Singapore-MIT Alliance for Research and Technology Centre, Singapore, Singapore
- 2Department of Civil and Environmental Engineering, National University of Singapore, Singapore, Singapore
- 3Singapore Centre on Environmental Life Sciences Engineering, Nanyang Technological University, Singapore, Singapore
- 4School of Civil and Environmental Engineering, Nanyang Technological University, Singapore, Singapore
- 5Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
Water quality is an emergent property of a complex system comprised of interacting microbial populations and introduced microbial and chemical contaminants. Studies leveraging next-generation sequencing (NGS) technologies are providing new insights into the ecology of microbially mediated processes that influence fresh water quality such as algal blooms, contaminant biodegradation, and pathogen dissemination. In addition, sequencing methods targeting small subunit (SSU) rRNA hypervariable regions have allowed identification of signature microbial species that serve as bioindicators for sewage contamination in these environments. Beyond amplicon sequencing, metagenomic and metatranscriptomic analyses of microbial communities in fresh water environments reveal the genetic capabilities and interplay of waterborne microorganisms, shedding light on the mechanisms for production and biodegradation of toxins and other contaminants. This review discusses the challenges and benefits of applying NGS-based methods to water quality research and assessment. We will consider the suitability and biases inherent in the application of NGS as a screening tool for assessment of biological risks and discuss the potential and limitations for direct quantitative interpretation of NGS data. Secondly, we will examine case studies from recent literature where NGS based methods have been applied to topics in water quality assessment, including development of bioindicators for sewage pollution and microbial source tracking, characterizing the distribution of toxin and antibiotic resistance genes in water samples, and investigating mechanisms of biodegradation of harmful pollutants that threaten water quality. Finally, we provide a short review of emerging NGS platforms and their potential applications to the next generation of water quality assessment tools.
The Role and Precedent of “Proxies” in Water Quality Analysis
Surface freshwaters including lakes, rivers and streams, are important aquatic ecosystems and are a source of drinking water in many countries. In recent decades, increase in human population size and urbanization have exerted immense pressure on the use of these water resources for human recreation and consumption. Contamination by sewage, toxic chemicals, nutrients, and resultant harmful algal blooms can render water unfit for human consumption or recreational activities. Bio-monitoring using sentinel or indicator species in the form of microorganisms and aquatic macroinvertebrates are frequently used by water management authorities to infer water quality, ecosystem health status, and to protect public health from waterborne risks (Carew et al., 2013). The occurrence and abundance of indicator organisms serve as proxies, i.e., easily measured quantities that are correlated to often unknown agents that directly mediate waterborne risk such as pathogens, biotoxins and chemicals. Advances in molecular methods and next-generation sequencing (NGS) have ushered in new opportunities for water quality assessment through analysis of waterborne microbial communities for the development of indicators and sentinels, new markers for microbial source tracking, and observation of microbially mediated processes. However, as these new tools are developed, biases and uncertainties associated with nucleic-acid based methods and NGS must be considered.
For decades, water quality for drinking and recreational purposes has been largely assessed based on culture-based enumeration and detection of fecal indicator bacteria (FIB), e.g., total coliforms, Escherichia coli, or Enterococci: a practice that has been regarded as the “gold standard” in the assessment of microbial safety of water (Figueras and Borrego, 2010). FIB are at high concentration in human feces, thus their presence in freshwater serves as a proxy for associated pathogens, where risks of exposure and has been shown to correlate to incidence of disease in exposed populations (e.g., Haile et al., 1999; Zmirou et al., 2003; Colford et al., 2007; Harwood et al., 2014). However, these proxies are imperfect as they may be derived from non-human sources and/or may be subjected to ecological or environmental interactions that compromise their predictive power as proxies for pathogens. In addition to predicting contamination from sewage, microbial proxies are used to predict the origin or source of sewage since fecal material of human-origin poses the most significant risk to water quality due to its potential to transmit human pathogens. Research into “source tracking” has established at least two species within the genus Bacteroides as indicators of human-origin, which have been shown to associate with and perhaps be symbionts of the human gut (Bernhard and Field, 2000; Shanks et al., 2007, 2009, 2010; Yampara-Iquise et al., 2008). Finally, direct detection and enumeration of specific pathogens such as waterborne Leptospira, Campylobacter, Legionella, viruses (e.g., Norovirus) and parasites (e.g., Cryptosporidium) enables quantitative microbial risk assessment (QMRA; Hass et al., 2014), where environmental concentrations of pathogens are compared to models of infectivity to characterize the risk to exposed human populations.
Development and validation of methods for enumeration of indicators and/or pathogens presents major challenges. Initial methods for identification of waterborne indicators and pathogens relied on selective culturing and enumeration of presumptive isolates, followed by species confirmation using biochemical, serological or molecular genetic methods. Such efforts lead to the standardization of simple and routine procedures for quantification of FIB, which are relatively easy to culture (Dufour, 1984; Simpson et al., 2002). However, cultivation methods for detection of specific pathogens are often time-consuming, and may fail to detect some organisms due to fastidious growth or requirement for a host (in the case of intracellular pathogens; Evangelista and Coburn, 2010). In addition, some pathogens are known to exist in a dormant, but still infective state that cannot be quantified by standard cultivation-based methods, i.e., the viable but not-culturable (VBNC) state (Ramamurthy et al., 2014).
To circumvent problems associated with quantification of VBNC and otherwise difficult to culture bacteria, cultivation-independent molecular methods were developed for detection and quantification of specific bacteria (recently reviewed by Ramamurthy et al., 2014). Such methods target DNA extracted from environmental and clinical samples which is subsequently subjected to analysis for the presence or abundance of genes from indicator species or pathogens of interest. The small subunit ribosomal RNA gene (SSU rRNA), consisting of the 16S rRNA gene for bacteria and the 18S rRNA gene for eukarya, has emerged as one of the most frequently used target genes for molecular analysis. This is due to its ubiquity in all organisms and sequence structure which includes both highly conserved and variable/hypervariable regions to promote alignment of the DNA sequence across diverse organisms, and allow for more fine-scale taxonomic identification (e.g., Guo et al., 2013). Use of the polymerase chain reaction (PCR) and quantitative PCR (qPCR) targeting regions of the SSU rRNA and functional genes for microbial source tracking and direct detection and quantification of target indicator strains has been recently reviewed by Harwood et al. (2014).
Cultivation-independent analyses based on direct detection of nucleic acids or amplification of targeted genes using PCR circumvent some problems associated with culture-based methods, while raising additional challenges. First, detection and quantification of environmental DNA from individual species is often assumed to be derived from living organisms, however, naked or free DNA may also be detected in such methods, making the correlation between DNA copies and cell abundance imperfect. Second, DNA-based methods used for quantification of microbial risk agents may be confounded by the highly dynamic and diversified genomes of pathogens and prevalence of strain-specific virulence factors. Thus, quantification of pathogenic taxa based on occurrence of biomarker DNA such as the SSU rRNA may not correlate to public health risk if the strain detected lacks virulence genes. Nevertheless, PCR-based detection and quantification of organisms in environmental DNA has proven useful for enumeration of sewage indicators and one DNA-based method is approved for quantification of the fecal indicator Enterococcus by the US Environmental Protection Agency (USEPA, 2009, 2013) while additional DNA-based methods are currently under evaluation (Green et al., 2014). A third challenge is raised by the ability of nucleic-acid based analyses to detect as little as one DNA target molecule; thus, it is possible to detect trace levels of nucleic acids from microbial risk agents (e.g., pathogens or virulence factors). Risk assessment frameworks to define thresholds of “acceptable risk” will be necessary before incorporating quantification of microbial risk agents by DNA-based approaches into decision-making for water quality management.
Brief Review of NGS Technologies and Analysis Methods
Advances in NGS enabling massively parallel analysis of DNA sequence information from PCR amplicons, or environmental nucleic acids, ushers in a new era of proxy development for water quality assessment. In clinical research, massively parallel sequencing (MPS) has been demonstrated as a screening tool used to complement or circumvent conventional diagnostic methods (e.g., culturing, microscopy and Gram-staining) for the detection and identification of etiological agents in disease (Kumar et al., 2013; Salipante et al., 2013). While quantitative tests for water quality assessment (e.g., qPCR, culture-based FIB quantification kits) are appropriate for estimating exposure to biological risk agents or sewage contamination, application of NGS surveys can be a first step to focus on more specific exposure assessment of appropriate targets.
Surveys of waterborne microbial communities using NGS thus far, have relied upon targeted sequencing of the hypervariable regions of SSU rRNA gene (e.g., V1, V3, V4, V6 regions) and Large Subunit (LSU) rRNA gene (e.g., Guo et al., 2013). Apart from SSU and LSU rRNA genes, other genes with taxonomic signals such as nirS (denitrification) and nifH (nitrogen fixation) indicative of biochemical cycles (e.g., Farnelid et al., 2011; Bowen et al., 2013), as well as plastid SSU rRNA (Steven et al., 2012) have also been adopted for NGS-based microbial profiling.
In early studies of the relationship between waterborne microbial communities and water quality, 454 pyrosequencing (Roche) emerged as the preferred platform of choice (e.g., McLellan et al., 2010; Vandewalle et al., 2012) due to relatively long sequence read lengths (i.e., initial read lengths of 110 bp, now currently ∼1000 bp van Dijk et al., 2014a) and generally better optimized sequencing conditions and bioinformatic workflows (e.g., Sergeant et al., 2012). The Illumina (Solexa) platform was introduced to the market with read lengths of 35 bp with a focus on genome sequencing (van Dijk et al., 2014a). However, as the Illumina technology improved and read lengths achieved by merging paired-end reads began to rival pyrosequencing, it has gained in stature for use as a platform for analysis of environmental samples. Other NGS platforms including Ion Torrent and single molecule real-time sequencing (SMRT) such as the Pacific Biosciences have also been used in amplicon sequencing for microbial community profiling (e.g., Marshall et al., 2012; Yergeau et al., 2012), although these technologies have not been widely adopted. An emerging number of studies in recent years have used the Illumina MiSeq platform in shotgun amplicon sequencing of SSU rRNA (e.g., Newton et al., 2015), with several studies demonstrating improved performances (e.g., sequencing depth, coverage, detection sensitivity, false positive detection) compared to 454 pyrosequencing and Ion Torrent (Loman et al., 2012; Li et al., 2014; Sinclair et al., 2015).
Several open-source bioinformatic platforms including MOTHUR (Kozich et al., 2013) and QIIME (Caporaso et al., 2010), which are among some of the most commonly used software packages in the analyses of amplicon sequences, have been updated to improve on sequence analyses using paired-end sequence data generated from the Illumina platform. As Illumina MiSeq technology has become one of the most widely used sequencing platforms worldwide, and with the announcement by Roche to withdraw the GS FLX 454 pyrosequencing platform, several studies have embarked on further refining data analyses for Illumina platforms by improving on methods in library preparation (Kozich et al., 2013; Esling et al., 2015; Shishkin et al., 2015) and quality control of sequence reads (Kozich et al., 2013; Nelson et al., 2014; Schirmer et al., 2015). The success of using MPS of the SSU rRNA gene for routine monitoring and general sample comparative purposes, however, will hinge on efforts to streamline processes in samples processing, which at the current stage, can vary from one laboratory to another.
Small subunit rRNA gene amplicon sequences generated through MPS are generally clustered into operational taxonomic units (i.e., OTUs) based on nucleotide identity thresholds (e.g., 95–99%). In some cases, however, OTU clustering may fail to segregate highly identical sequence variants into ecologically- or environmentally- relevant groups. A method termed “oligotyping” has been developed to classify and group sequences based on sequence minimum entropy decomposition (MED), at the resolution of single nucleotide polymorphisms (SNPs), and has been shown to be particularly useful in tracing different microbial populations in sewage treatment facilities to their environmental origins (Eren et al., 2013, 2014). Community composition cataloged using NGS can serve as a baseline or reference for monitoring environmental perturbations or biodegradation. Furthermore, NGS-enabled biostatiscal analyses (e.g., principle coordinate analyses, non-metric multidimensional scaling or correspondence analysis) have been adopted to correlate the occurrence and distribution of microbial taxa, OTUs or oligotypes to environmental metadata, thereby allowing identification of oligotypes or OTUs that can serve as bioindicators for environmental quality (e.g.,Yergeau et al., 2012).
In order for NGS to become a useful tool for water quality monitoring purposes, long term sequence data collection and management will be crucial in establishing databases that are important for inter-laboratory data comparison and comparative metagenomics studies. Using a combination of NGS approaches (e.g., metagenomics, metatranscriptomics, single-cell genomics, and comparative genomics) in systematic studies of freshwater microbiomes can be expected to yield a wealth of information crucial in water quality assessment and management.
Can NGS be Quantitative?
Massively Parallel Sequencing of SSU rRNA Gene Amplicons
Similar to proxies based on cultivation or cultivation-independent enumeration of indicators and pathogens, MPS of targeted genes such as the SSU rRNA has challenges which will prevent proxies developed with this technology from being 100% accurate. Moreover, the role of NGS in water quality assessment is in its infancy and has not yet been integrated into an epidemiological framework to link trends observed with NGS studies to adverse effects in human populations. One requirement of microbial risk assessment is the ability to accurately quantify biological risks/agents within the framework of QMRA (Hass et al., 2014). While a number of molecular tests including qPCR and digital PCR are known to be highly quantitative, providing results in the form of absolute gene copy number, profiles of species composition generated from amplicon sequencing are generally regarded as being qualitative. Biases in PCR amplification due to secondary structure or GC content of the resulting amplicons, generation of false diversity from sequencing error or chimera formation, choice of primers targeting different SSU rRNA hypervariable regions (Kozarewa et al., 2009; Quail et al., 2012; Kozich et al., 2013; Nelson et al., 2014; van Dijk et al., 2014b; Schirmer et al., 2015), as well as the presence of multiple copies of the SSU rRNA gene in some bacterial species (Angly et al., 2014) all influence the relative abundance of taxa observed by PCR-based methods with more pronounced biases associated with increased PCR cycle numbers (Murray et al., 2015; Schirmer et al., 2015; Sinclair et al., 2015). Sequencing errors associated with reads generated from the Illumina MiSeq platform may also result from differences in library preparation methods (Schirmer et al., 2015). Downstream bioinformatic data processing including methods in chimera removal (Kennedy et al., 2014) and OTU clustering (e.g., UCLAST, CD-HIT, UPARSE, Nelson et al., 2014; Schmidt et al., 2014; Sinclair et al., 2015) can similarly contribute to biases in determining the relative abundance and diversity of microbial taxa. Studies of mock bacterial communities suggest that these variations can sometimes result in inflated OTU numbers and therefore skew estimates of species richness and evenness (Kennedy et al., 2014; Nelson et al., 2014).
Despite the concerns discussed above, several studies have demonstrated that MPS of the SSU rRNA gene does offer a good approximation of the microbial species composition and relative abundance in samples (Ong et al., 2013; Wang et al., 2013), though the accuracy is likely sample-dependent and may be influenced by factors such as methods in library preparation and primer choice, as discussed above. Due to inherent differences in sample types, and associated microbiota, it is likely that no single universal approach can be best applied to all samples types to achieve quantitative measurement (Schmidt et al., 2014; Murray et al., 2015).
Quantification of Gene Copy Number in Multi-Omics Datasets
Due to concern with biases in PCR amplification, several studies have carried out amplification-independent analysis of microbial communities using SSU rRNA genes extracted from metagenomic datasets. For example, Logares et al. (2014) recently assessed the diversity of marine plankton communities using three different NGS platforms (i.e., metagenomic shotgun sequencing by Illumina HiSeq and Roche 454, and amplicon sequencing of SSU rRNA gene by Roche 454). The diversity and composition of SSU rRNA genes observed in metagenomes prepared by the Illumina HiSeq platform provided higher and more even estimates of community diversity relative to the metagenomes or amplicons sequenced using the Roche 454 platform. In the same study, the relative abundance of microbial taxa observed by Illumina HiSeq was comparable to relative abundances observed by catalyzed reporter deposition fluorescence in situ hybridization (CARD-FISH) and flow cytometry (i.e., positive Pearson correlation and p < 0.01).
The choice of NGS platform may yield results with varying resolution and quantitative power due to differential sequence throughput and coverage. Frey et al. (2014) recently compared three sequencing platforms (i.e., Roche 454, Illumina MiSeq, and Ion Torrent) to characterize the relative abundance of viral and bacterial pathogens spiked into blood at known, serially diluted, concentrations (i.e., Dengue virus Types 1 and 2; Swine Influenza A, and Bacillus anthracis). The blood samples were lysed using commercial kits, and one portion of DNA sample was sequenced using the three independent NGS platforms, while the second portion was subjected to qPCR for the quantification of specific virus and bacterial targets. Sequence reads from NGS were mapped to the reference genomes of the tested organisms, and all three NGS platforms were found to produce results comparable to the qPCR assay for relative quantification. Illumina and Ion Torrent provided the highest sensitivity to detect sequences at the lowest dilutions due to their higher throughput than Roche 454 (Frey et al., 2014).
Several protocols have been developed to quantify absolute gene copy numbers or transcripts within a metagenome or metatranscriptome, respectively (Gifford et al., 2011; Satinsky et al., 2014a,b) by benchmarking to standards that were added to sample prior to total RNA/DNA extraction, library construction, and high throughput sequencing (Gifford et al., 2011). In several metatranscriptomics studies (Gifford et al., 2011; Satinsky et al., 2014a), in vitro RNA internal standards were prepared using commercially acquired plasmid DNA, linearized and restriction-enzyme digested, after which plasmid fragments were transcribed in vitro. Following this, the absolute copy number of the internal standard was quantified by spectrophotometry before addition into a sample prior to cell lysis and RNA extraction. Benchmarking of sequenced transcripts to the copy number of internal standard was used to estimate sequencing depth and absolute quantity of differential expressed gene or transcript-type. Similarly, DNA standards in known concentration (Satinsky et al., 2014b) or a biological agent (e.g., virus or bacterial species) in known quantity (e.g., titer or cell count) could be added into a sample prior to sample processing (i.e., cell lysis) and shotgun sequencing, so that the absolute quantity of metagenomics reads recovered from a sample can be quantified by benchmarking to the spiked-in internal control. These methods have been used to quantify gene and transcript abundance in microbial communities in the Amazon river plume (Satinsky et al., 2014a,b), bathypelagic marine bacterioplankton communities after Deep Horizon Oil Spill (Rivers et al., 2013) and costal water in the USA (Gifford et al., 2011).
The studies discussed above demonstrate the possibility of using NGS platforms for both quantification of the relative abundance of microbial genes or taxonomic groups in a sample or absolute quantification through the use of internal standards. Nevertheless, potential biases and uncertainties introduced by use of nucleic-acid based methods and NGS (especially shotgun amplicon sequencing) require in-depth consideration during data analyses and interpretation.
Application of NGS to Analysis of Water Quality
To date, integrated multi-omics approaches encompassing genomics, metagenomics, metatranscriptomics and MPS of targeted genes (e.g., SSU rRNA genes) have been used to unravel the functions of microbial communities in a variety of freshwater (reviewed by Fondi and Liò, 2015; Franzosa et al., 2015) and marine environments (reviewed by Bourlat et al., 2013). For microbial water quality assessment, these multi-omics analyses have been used in combinations to investigate the microbial compositions and their ecological functions (e.g., dissemination of antibiotic genes and pollutant degradation) in the urban water cycle, such as lakes and rivers, engineered waste tailings ponds, sewage, water distribution systems and finished waters, amongst others (Table 1). These studies yield insights into the activities mediated by microbial communities in aquatic systems and potential biological risk factors in the form of specific microbial populations and their genes for virulence, toxin biosynthesis, or antibiotic resistance (AR), which can be important in microbial water quality management.
In this section we review recent studies that have leveraged NGS to improve upon the current understanding of factors mediating water quality in mainly freshwater environments. In particular, we will discuss ongoing work to (i) identify indicators of human sewage and human fecal contamination for water quality assessment (see Discovery of New Indicators for Human Sewage Contamination), (ii) examine the fate and transport of human pathogens in water and wastewater systems (see Relationship between Fecal Indicator Bacteria and Pathogen-Like Sequences and Microbial Safety of Drinking Water), (iii) understand the ecological drivers of harmful algal bloom persistence and toxin production/degradation (see Toxin Production and Degradation in Cyanobacterial Blooms), (iv) observe the spread and distribution of AR in freshwater environments (see Tracking Antibiotic Resistance through Metagenomics), and (v) investigate mechanisms of natural and stimulated biodegradation of harmful pollutants that threaten water quality (see Understanding Biodegradation of Pollutants that Threaten Water Quality). While NGS-based studies are providing unprecedented insights into the structure and function of waterborne microbial communities, an open challenge remains to convert trends observed in these studies into actionable data for water quality managers.
Discovery of New Indicators for Human Sewage Contamination
Contamination of freshwater bodies intended for human consumption or recreational use by fecal materials continues to be a major public health concern in many countries (Figueras and Borrego, 2010). New approaches are being developed to establish evidence of fecal contamination in surface freshwaters through alternative DNA-based indicators. MPS of the SSU rRNA gene has been used to characterize the microbial composition in raw sewage entering wastewater treatment plants in several different countries. Bacterial taxa associated with sewage infrastructure could be differentiated from those present in human fecal materials and other environmental sources (McLellan et al., 2010; Unno et al., 2010; Shanks et al., 2013; Cai et al., 2014). Comparison to a database of microbial sequences from feces in the Human Microbiome Project1 suggested that only ∼10–15% of sewage microbiomes in these surveyed sites were of human-fecal origin (McLellan et al., 2010, 2013; Shanks et al., 2013; Newton et al., 2015). Microbiomes of human-fecal origin detected in all sewage treatment plants appeared to be largely congruent and were represented by Firmicutes (e.g., Lachnospiracea, Ruminococcaceae; McLellan et al., 2013; Shanks et al., 2013), Bacteroidetes (e.g., Bacteroidaceae, Porphyromonadaceae, Prevotellaceae; Newton et al., 2015) and to a lesser extent by other mostly anaerobic microbes, e.g., Bifidobacteriaceae, Coriobacteriaceae (McLellan et al., 2010; Shanks et al., 2013; Newton et al., 2015). Amplicon sequences (V4–V5) obtained from municipal sewage communities and human stool samples identified 27 human fecal oligotypes (predominantly Bacteroidaceae, Prevotellaceae, or Lachnospiraceae/Ruminococcaceae) that are commonly and abundantly present in most surveyed sewage treatment facilities across the USA (Newton et al., 2015). The relative abundance of sequences within the dominant families of Bacteroidaceae, Prevotellaceae, or Lachnospiraceae/Ruminococcaceae could be statistically correlated to the obesity rate of the surveyed cities; with higher obesity rates corresponding to higher representation of Bacteroidaceae in the sewage microbiomes. This finding is particularly interesting as high representation of Bacteroidaceae in the human gut microbiome has been previously linked to consumption of a diet high in fat (David et al., 2014).
In contrast, a major proportion of the microbiome associated with sewage appeared to be adapted to the sewage infrastructure (∼80%) and varied in diversity and abundance according to geographical location (i.e., latitude) and air temperature (Vandewalle et al., 2012; Shanks et al., 2013; Newton et al., 2015). The microbiomes associated with sewers were also predominantly unique in taxonomy compared to those associated with animal hosts, surface freshwaters and other environmental sources (McLellan et al., 2010, 2013; Vandewalle et al., 2012; Shanks et al., 2013), and had in general higher diversity and species richness compared to stool samples (Newton et al., 2015). Such microbial species found in either human feces or sewage, but either absent, or present in very low abundance, in surface freshwaters, are potentially good indicators for sewage contamination (Koskey et al., 2014; McLellan and Eren, 2014). For example, development of qPCR probes based on Lachnospiraceae sequences enriched in human sewage were used to evaluate sewage contamination in an urban freshwater harbor where occurrence of this new indicator group was highly correlated with a more established marker of human fecal contamination (i.e., HF183; Newton et al., 2011). Analysis of profiles of microbial SSU rRNA genes (V4/V6 regions) with a source estimation program employing Bayesian statistics (SourceTracker) allowed identification of human fecal and sewage signatures which correlated to the distribution of human-source markers Lachno2 and HF183 along the coastline of Lake Michigan (Newton et al., 2013). Use of bacterial taxonomic groups identified through NGS-based surveys as alternative indicators may be site-specific as several studies have shown that the microbial composition in different sewer systems can differ according to several instances: origin of the waste materials (Ye and Zhang, 2013), climatic variations due to latitudinal locations of treatment facilities (Shanks et al., 2013), or infiltration of rainwater and storm water inputs (McLellan et al., 2010).
Collectively, studies using MPS of the SSU rRNA gene have shown that wastewater treatment is, in general, capable of removing most microbial populations associated with human-feces (Ye and Zhang, 2011; Cai et al., 2014). Even so, a small proportion of potential pathogens may still be present in sewer effluents (Ye and Zhang, 2011; Cai et al., 2014). In areas without widespread sewage treatment, direct contamination of waters with human waste is still a problem. The cited studies have shown that assessment of waterbodies using NGS can reveal impacts from sewage and fecal contamination. In addition, with advancement in high throughput sequencing technology and bioinformatics, surveys of the sewage microbiome may 1 day be used to assess and monitor the overall health status of human populations.
Relationship between Fecal Indicator Bacteria and Pathogen-Like Sequences
While NGS has shown potential for tracking water impairment through discovery and detection of indicators of human fecal or sewage pollution, a more direct approach to evaluate waterborne biological risks is through evaluation of pathogen diversity and abundance. However, tracking individual waterborne pathogens is costly, methodologically challenging, and requires knowledge of which pathogens to target (Varela and Manaia, 2013). The use of NGS to screen for sequences with high identity to waterborne pathogens allows identification of potential risk agents and, depending on biases in sample preparation, may also provide semi-quantitative inference of their relative abundance in wastewater and other environments (McLellan et al., 2010; Cai and Zhang, 2013; Ibekwe et al., 2013; Cai et al., 2014; Lu et al., 2015). Identification of pathogen species was not possible in early studies that were based on analysis of the V6 region of the SSU rRNA gene due to short sequence length (∼60 bp) which failed to provide taxonomic resolution beyond the family or genus level. However, as sequence read lengths through NGS continues to increase, confident classification of NGS-generated sequences to the level of bacterial species has become attainable.
A recent study of bacterial pathogens in a wastewater treatment plant applied a combination of NGS and qPCR of genetic markers to track the occurrence of bacterial pathogens through stages of wastewater treatment process (Lu et al., 2015). This analysis revealed that raw sewage was enriched in potential pathogens most closely related to Arcobacter butzleri, Aeromonas hydrophila, and Klebsiella pneumonia. The Arcobacter genus represented over 43.5–97.37% of all pathogen-like sequences in the treatment plant (sewage influent, primary effluent, activated sludge, secondary effluent and final sand filter effluent) while the effluent contained only Arcobacter butzleri with other non-detected species presumed to have been removed mainly by biological processes during treatment. Quantification of genetic markers with qPCR confirmed the trends in the distribution of pathogenic species identified by sequencing in both raw and treated sewage (Lu et al., 2015). In a previous study of sewage treatment plants across nations (Canada, USA, China, and Singapore), 454 pyrosequencing revealed that the most abundant sequences related to pathogenic bacteria in raw sewage corresponded to the genera Aeromonas and Clostridium with species Aeromonas veronii, Aeromonas hydrophila, Clostridium perfringens, and Corynebacterium diphtheria (Ye and Zhang, 2011).
Several studies have employed NGS to characterize the distribution of potential pathogens in natural and human-impacted areas. Nine sites sampled in urban and agricultural watersheds along the Santa Ana River, CA, USA were screened for potential bacterial pathogens by pyrosequencing of the V1 and V2 hypervariable regions of the SSU rRNA gene (Ibekwe et al., 2013). Sequences related to human pathogens comprised a greater percent of the sampled microbial community in urban runoff and agricultural waters than in waters collected from sites with little to no human activity. Similarly, a study of an urban tropical area in Singapore used Illumina MiSeq (V3 and V4 hypervariable regions of the SSU rRNA gene) to evaluate the variation of bacterial communities as a function of land use and water quality (Nshimyimana et al., 2015). The relative abundance of pathogen-like sequences identified to genera and species varied with land use and with the abundance of FIB, where moderate positive correlation to measured levels of E. coli suggested higher incidence of potential pathogens in sites with higher measured levels of FIB.
Despite the capability for simultaneous detection of multiple human bacterial pathogens in environmental waters, NGS has not yet been integrated into a framework of QMRA, in part because, as discussed in Section 1, (1) amplification-based biodiversity screens are qualitative, although studies of mock communities indicate that these may provide tentative indications of relative abundance, (2) identification of pathogens to species-level has only recently been possible with longer reads afforded by NGS, and (3) species-level identification may not be sufficient to assess risk from pathogens with strain-specific virulence.
Recent studies have attempted to monitor human sewage to track outbreaks of disease-causing viruses. While published studies have not yet integrated NGS, such methods could complement conventional approaches such as PCR, qPCR, and Sanger sequencing for the identification and quantification of pathogens. This method combination was recently employed for analysis of eight viral strains in sewage in Gothenburg, Sweden and led to an early warning of a potential outbreak of Hepatitis A and norovirus (Hellmér et al., 2014) which were also detected in an ongoing outbreak in Scandinavia (Hellmér et al., 2014). Clinical research and diagnostic studies in Africa also combined three methods (Sanger sequencing, PCR, and qPCR) to understand the spread and distribution of the polio virus and to discern wild-type from vaccine-derived strains (Gumede et al., 2013). Future studies have the potential to apply NGS to screen for and identify pathogens that could subsequently be monitored by targeted diagnostic methods such as PCR and qPCR.
Reviewed studies demonstrate that NGS and related sequence-based methods are being used to track human bacterial pathogens and/or waterborne viral strains. Monitoring the distribution of viral strains in the environment shows promise to improve vaccination campaigns and provide early warnings of disease outbreaks in a given community. NGS profiling of bacterial pathogens in the natural and built environment provides general insights into how the distribution and relative representation of potential human pathogens varies with environmental conditions, anthropogenic impacts, and the implementation of water treatment technologies. Such information may be leveraged to guide engineers and watershed managers in selection of best practices to reduce human exposure to potential pathogens.
Microbial Safety of Drinking Water
Next-generation sequencing has been used to survey the microbial composition in drinking water distribution systems including source waters (Chao et al., 2013), end-point taps (Chao et al., 2013; Shi et al., 2013; Huang et al., 2014), and various stages of the drinking water treatment and distribution process (Gomez-Alvarez et al., 2012; Chao et al., 2013; Shi et al., 2013; Huang et al., 2014), as well as biofilm matrices associated with distribution pipelines (Figure 1A; Hong et al., 2010). While most microorganisms in treated drinking water are harmless, outbreaks of diseases linked to pathogens in drinking water may be due to compromised water treatment and contaminated source waters (MacKenzie et al., 1994; Howe et al., 2002). Pathogens that have evaded treatment and disinfection processes may persist in water distribution pipelines as biofilms, leading to dissemination to end users through the process of sloughing (Figueras and Borrego, 2010). A recent study noted an increase in microbial diversity in end-point drinking water relative to source water which was linked to dispersal of biofilm-associated microbes during passage through the water distribution pipeline (Huang et al., 2014). Consumption of untreated well-water poses an additional risk, where waterborne pathogens such as Legionella and Campylobacter are among some of the main well-waterborne disease agents in the USA (CDC, 2013). Other causative agents of waterborne diseases in both developed and developing countries include parasites such as Cryptosporidium parvum, Toxoplasma gondii, Cyclospora cayetanesis, Giardia lamblia and viruses such as norovirus (Ashbolt, 2015). Metagenomic surveys of water within distribution systems and end-point drinking waters have detected DNA indicative of several of these potential opportunistic pathogens (e.g., Legionella, Mycobacterium, Pseudomonas, and Leptospira) and virulence factors (e.g., AR and pathogenicity islands, Gomez-Alvarez et al., 2012; Shi et al., 2013; Huang et al., 2014). Several recent studies have used NGS to shed light on the fate of microbial populations, including pathogens, during various stages of the water treatment process.
FIGURE 1. Environments where studies of water quality have been advanced by NGS. (A) Fluorescence microscopy showing biofilm grown on a stainless steel surface in a laboratory potable water biofilm reactor for 14 days (image from Donlan, 2002). Drinking water is thought to contain a mixture of both planktonic and dissociated biofilm bacteria. Using NGS methods, a repertoire of microbial species including potential pathogens have been detected in water distribution systems, raising the concern of the effectiveness of water treatment on microbial water safety. (B) A water reservoir in Singapore impacted by a cyanobacterial bloom characterized by Penn et al. (2014). With global warming, cHAB is predicted to occur more regularly with likely severe consequences (e.g., toxin loading) that can diminish water quality. (C) An oil sands tailings pond in Alberta, Canada characterized by Tan et al. (2015). Inset (D) shows microcosms established to study degradation of unrecovered hydrocarbons deposited in oil sands tailings pond. NGS has been used to characterize the microbial communities in oil sands tailings ponds (http://www.hydrocarbonmetagenomics.com/) in order to better understand mechanisms of pollutant degradation.
Microbial communities in drinking water systems tend to be present at low abundance (e.g., <105 cells per ml, personal observation) hence studies of microbial communities in drinking water require collection of microbial biomass from large amounts of water, e.g., 100–2000 L (Gomez-Alvarez et al., 2012; Chao et al., 2013; Shi et al., 2013; Huang et al., 2014) where DNA extraction yields may still be too low for direct sequencing, requiring amplification (of genomes or amplicons) prior to sequencing (Gomez-Alvarez et al., 2012). Studies of water distribution systems reveal microbial communities dominated by the phyla Proteobacteria (i.e., Alpha-, Beta-, and Gammaproteobacteria), Firmicutes, Nitrospirae, and Actinobacteria (Gomez-Alvarez et al., 2012; Chao et al., 2013; Huang et al., 2014). Application of disinfectants (e.g., free chlorine or monochloramine) appear to have selective effects on the microbial assemblages in water distribution pipelines and end-point drinking water (Gomez-Alvarez et al., 2012; Chao et al., 2013; Huang et al., 2014). In water treated with monochloramine collected from a water distribution simulator in the USA, the microbial composition based on taxonomic assignment of 454 reads, was dominated by Actinobacteria (28%; e.g., Mycobacterium), Betaproteobacteria (25%; e.g., Acidovorax), and Alphaproteobacteria (23%), whereas free-chlorine treated water had a lower proportion of Actinobacteria (6%), and was dominated largely by Alphaproteobacteria (35%; e.g., Caulobacter, Rhodopseudomonas, Bradyrhizobium) and Cyanobacteria (Synechococcus; Gomez-Alvarez et al., 2012). Chlorination appeared to selectively deplete Gamma- and Betaproteobacteria, resulting in higher relative abundance of Alphaproteobacteria in drinking water (Chao et al., 2013; Huang et al., 2014). Relative to untreated water, treatment with chlorine was found to reduce waterborne microbial diversity (Huang et al., 2014). Importantly, chlorination was associated with the removal of most potential pathogens from the detected diversity (Huang et al., 2014) although genomic signatures of some potential pathogens (e.g., Pseudomonas aeruginosa and Leptospira interrogans) persisted in the tap water (Huang et al., 2014). Protective functions for detoxification and modulation of oxidative stress (Chao et al., 2013), as well as mobile genetic elements (MGEs; Shi et al., 2013) were also found to be more highly represented in metagenomes obtained from water following disinfection (Shi et al., 2013; Huang et al., 2014).
Toxin Production and Degradation in Cyanobacterial Blooms
Cyanobacterial harmful algal blooms (cHABs) are destructive to aquatic ecosystems, deteriorating surface water quality and rendering it unsafe for use by humans and livestock. cHABs are associated with overgrowth of cyanobacterial species many of which are capable of producing cyanotoxins, e.g., microcystin, cylindrospermopsin, anatoxin (Ferrão-Filho and Kozlowsky-Suzuki, 2011) and off-flavor/odorous compounds, e.g., geosmin or 2-methylisoborneol (Li et al., 2012). Rising global temperature as a result of climate change is expected to promote the frequency and severity of cHAB events in the future (Paerl and Huisman, 2009). NGS technologies are being applied to the systematic understanding of the ecology and control of cHAB using methods in metagenomics, comparative genomics, and metatranscriptomics (Li et al., 2011; Steffen et al., 2012, 2015; Penn et al., 2014).
A cyanobacterial bloom consists of both primary producers and heterotrophs including consumers and grazers, resulting in nutrient cycling and recycling. It is likely that different populations of microbial heterotrophs may influence the dynamics of a cHAB and may benefit from nutrients and metabolites released during a cHAB event (Edwards and Lawton, 2009; Mou et al., 2013), or consumption of dead cyanobacterial biomass after a bloom die-off (Xing et al., 2011). The interplay of this complex ecosystem shapes the bloom community structure and may control the adverse effects of the bloom (e.g., bloom persistence and the balance between production and biodegradation of toxins and off-flavor/odor compounds). Though this interplay has long been hypothesized (Christoffersen et al., 1990), detailed insights from community gene expression can link specific populations, or groups of populations, to specific activities via metagenomic and metatranscriptomic approaches. In contrast to oligotrophic lakes where phototrophic Proteobacteria and Actinobacteria appear to be the most active light harvesters (Yurkov and Beatty, 1998; Sharma et al., 2008), in a cHAB, which generally manifests during eutrophic conditions, solar energy is captured primarily by algae and cyanobacteria. Metagenomic studies conducted during bloom events in Lake Taihu (China), Lake Erie (N. America), and Grand Lake St. Marys (GLSM; OH, USA) revealed a high proportion of cyanobacterial sequences (25–88%) including common bloom members Chroococcales (e.g., Cyanothece, Synechococcus, Crocosphera; ca. 40–45%), Nostocales (ca. 10%), and Oscillatoriales (e.g., Lyngbya, Trichodesmium-like, Arthrospira, etc; ca. 17–38%; Steffen et al., 2012). Metatranscriptomics studies of other freshwater systems suggest that the most active cyanobacterial taxa were also from the same cyanobacterial orders, dominated by Microcystis in an eutrophic Singaporean reservoir (Penn et al., 2014; Figure 1B) and in Lake Erie with co-dominant taxa including Synechococcales and Gloeobacterales (Steffen et al., 2015). Several recent -omics studies showed that the non-cyanobacterial members of cHAB communities were dominated by Proteobacteria (>90% of non-cyanobacterial sequences) with a smaller portion of Bacteroidetes (Li et al., 2011; Steffen et al., 2012; Penn et al., 2014). Eukaryotic algal species (e.g., Streptophyta, Euglenids, Chlorophyta, Bacillariophyta) were also observed within blooms, and may co-dominate, as revealed using MPS of the plastid 23S rRNA gene, specific for cyanobacterial and eukaryotic algal species (Steven et al., 2012). In addition, viruses/phage, protozoa, and fungi are present and are increasingly recognized as key players in bloom persistence and decay (Gerphagnon et al., 2013; Xia et al., 2013; Ger et al., 2014).
Studies based on NGS have provided unprecedented insights into the evolution of cyanobacterial genomes, pointing to the role of nutrient-enrichment as a factor that can accelerate genome evolutionary rates and potential niche expansion of cyanobacterial species. Comparative genomic analyses reveal signatures of genome rearrangement through both homologous and non-homologous recombination (Frangeul et al., 2008; Humbert et al., 2013; Yang et al., 2015) with such variation reflected in natural bloom populations. Comparison of the genome of Microcystis aeruginosa strain NIES 843 to metagenomes from Lake Erie, Lake Taihu, and GLSM identified metagenomic islands (Mi’s) within the NIES 843 genome (defined as regions of ≥10 kb with low coverage in the metagenomes) that were not observed in these three sites and which contained transposase and MGEs (Steffen et al., 2012). Variation in toxin biosynthesis gene content among closely related cyanobacterial strains is well-documented (Mikalsen et al., 2003; Tanabe et al., 2004; Christiansen et al., 2008) and is linked to genomic dynamism mediated by homologous and non-homologous recombination. Transposase expression in strain NIES 843 has been shown to be upregulated by nutrient-enrichment, particularly in the presence of organic nitrogen (urea; Steffen et al., 2014). Thus, conditions of eutrophication may promote genome rearrangement within this group, contributing to the mosaic patterns in gene order, MGE content and the distribution of toxin genes observed in these strains. The dynamic genomes of cHAB strains may influence their fitness, promoting local adaptation and contributing to the widespread distribution of closely related strains (Mikalsen et al., 2003; Yoshida et al., 2008; Acinas et al., 2009).
It is of interest to water quality managers to know which waterborne microbial populations are responsible for the production and biodegradation of toxins and off-flavor/odorous compounds. Transcripts for genes in the biosynthesis pathways for multiple cyanotoxins (i.e., microcystin, aeruginosin, and cyanopeptolin), and structurally related compounds including a newly identified secondary metabolite gene cluster were detected throughout a 24 h sampling campaign in Singapore, suggesting that toxin gene expression was continuous in this system, despite dissolved toxin levels measured near the limit of detection (Penn et al., 2014). Biosynthesis and release of toxins may be balanced by their biodegradation. Biodegradation of microcystin by natural microbial communities from Lake Erie was studied through metagenomic analysis of mesocosms (Mou et al., 2013). The authors found Methylophilales and Burkholderiales were significantly enriched in microcystin-amended microcosm, while the gene encoding for the only known mechanism of microcystin biodegradation (mlr) was not, prompting the authors to speculate that new, undiscovered pathways for microcystin biodegradation were being utilized. As additional omics-enabled studies with a focus on water quality emerge, it is likely that new pathways for the production and biodegradation of such compounds will be discovered, paving the way for better strategies to control their expression and improve source water quality.
Tracking Antibiotic Resistance through Metagenomics
Antibiotic resistance in pathogenic bacteria is a growing public health threat (Berendonk et al., 2015). Antibiotic resistance determinants (ARDs) including antibiotic resistance genes (ARGs), and MGEs that catalyze the transfer of such genes, occur widely in environmental bacteria. ARD appear to be enriched in microbial communities including sediment microbiota near wastewater treatment plant effluent outfalls (Czekalski et al., 2014; Port et al., 2014), sediments of human-impacted estuaries (Chen et al., 2013) and in a variety of sample types (e.g., soil, water, sediments, human and animal fecal samples, sludge, wastewater) derived from heavily anthropogenic impacted environments (Li et al., 2015). Although environmental surveillance of AR is not currently part of water quality monitoring frameworks, there is an urgent need to better control the spread and evolution of AR and a better understanding of the occurrence and characteristics of environmental reservoirs of ARD will advance that goal (Bush et al., 2011).
To gain a better understanding of the types of ARGs in environmental bacteria and their co-localization with MGEs (e.g., transposons, plasmids, and integrons) complementary approaches of sequence-based metagenomics and functional metagenomics has overcome the impediments of earlier culturing and molecular techniques. With sequence-based metagenomics, total DNA from an environment is directly extracted and randomly sequenced. The sequenced reads are then interrogated against a reference database containing known ARG sequences to predict the resistance potential originating from a metagenome. Functional metagenomics involves cloning randomly sheared DNA fragments into an expression vector and transforming them into a host (e.g., E. coli) and selecting transformants which exhibit resistance to the selected antibiotic (Allen et al., 2009; Sommer et al., 2009). The advantages of this dual approach include (i) the ability to identify highly divergent genes from known ARGs; (ii) direct evidence of resistance phenotypes associated with expressed genes and (iii) no reference gene sequences required for gene identification (Pehrsson et al., 2013). Combining sequence based metagenomics and functional metagenomics has led to the discovery of novel ARGs in various microbial communities including soil (Riesenfeld et al., 2004; Allen et al., 2009; Torres-Cortés et al., 2011), freshwater lakes (Bengtsson-Palme et al., 2014), human gut microbiomes (Sommer et al., 2009; Hu et al., 2013), oral microbiome (Diaz-Torres et al., 2006), animal gut microbiomes (Kazimierczak et al., 2009) and activated sludge (Mori et al., 2008; Parsley et al., 2010).
The growing sophistication, accuracy and speed in downstream processing pipelines of NGS datasets provides a significant step forward in facilitating large-scale environmental studies to assess the emerging threat imposed by AR. With the inundation of metagenomic sequence data over the past decade from variety of environmental microbiomes (e.g., IMG, MG-RAST, Sequence Read Archive), metagenomic analysis tools have been developed to mine larger and more expansive reference sequence databases for ARG signatures. These tools include the Comprehensive Antibiotic Resistance Database (CARD2, McArthur et al., 2013), the Antibiotic Resistance Database (Liu and Pop, 2009), the beta-lactamase database (BLAD, Danishuddin et al., 2013), and ResFinder3. To determine the relationship between environmental and human-associated resistomes (i.e., antibiotic resistance genes and bacteria), Gibson et al. (2015) developed Resfams, a curated protein family database and associated profile hidden Markov models (HMMs), organized by ontology specifically applied to AR functions, with a subset of these AR proteins functionally verified using protein assays. This method circumvents the common approach of assigning AR functions using pairwise sequence alignment to AR databases, instead, HMM and consensus models were used for AR functional assignment, which significantly increases prediction sensitivity and specificity.
Several recent studies have employed NGS and bioinformatic tools, such as those described above, to characterize the AR profiles of microbial communities in aquatic environments. A network analysis was conducted to investigate the broad-spectrum profiles of ARGs and their co-occurrence patterns in 50 samples spanning water, soil, sediments, wastewater, sludge, and human and fecal samples. This study concluded that the abundant ARGs were associated with antibiotics commonly administered in human or veterinary medicine (i.e., aminoglycoside, bacitracin, beta-lactam, chloramphenicol, macrolide-lincosamide-streptogramin, quinolone, sulphonamide, and tetracycline) and abundances of these ARGs were up to three magnitudes higher in the most heavily anthropogenic-impacted environments (Li et al., 2015). Similarly, metagenomic profiles of ARGs in sediments of a heavily human impacted estuary [Pearl River Estuary (PRE) in China] revealed a higher diversity of both genotypes and resistance genes for sulphonamides, fluoroquinolones, and aminoglycosides relative to a pristine deep ocean bed in the South China Sea (SCS; Chen et al., 2013). In addition, this study showed parallel trends between the distribution of ARGs and MGEs, where MGE’s may function as vectors for dissemination of ARGs in the aquatic environment.
Antibiotic resistance is a public health threat of heightened concern and the central theme of current global monitoring efforts is limited to tracking antibiotic consumption and antibiotic resistant bacteria (ARBs) isolated from clinical and public health laboratories (Grundmann et al., 2011). While clinical settings may be the source of highest antibiotic and ARB loads, the natural environment has recently drawn attention as a reservoir of transferable ARGs potentially implicating environments highly contaminated with ARGs as a human health risk (Ashbolt et al., 2013). ARGs in particular are increasingly viewed as emerging pollutants and emphasis of their occurrence and distribution in natural environments are being considered in development of antibiotic surveillance frameworks (Codex Alimentarius Commission, 2011; Berendonk et al., 2015). Incorporating metagenomics into frameworks for monitoring the threat of AR in aquatic environments provides a novel approach for environmental health monitoring and pushes boundaries on improving current risk assessment models. Recently, a metagenomic-based approach was used to develop a multivariate index to quantify the AR potential in published metagenomes (Port et al., 2014). This index was based on the abundance of ARGs, MGEs, pathogenic potential and metal resistance genes (implicated in co-selection of ARGs). Congruent with studies examining the distribution of ARG, the multivariate index was shown to differentiate aquatic environments based on human-impact. This study exemplifies the utility of NGS to advance characterization of water quality within the context of AR, which will ultimately assist in risk evaluation and mitigation by public health authorities.
Understanding Biodegradation of Pollutants that Threaten Water Quality
Chemical pollution is one of the major causes of diminished water quality (Schwarzenbach et al., 2010). Strategies for pollutant removal through stimulation of indigenous microbial community for bioremediation can be effective to remove, or immobilize, toxic compounds (Major et al., 2002; Gieg et al., 2014; Koenig et al., 2014). The presence of pollutants can exert selective pressures that enrich for microbial populations that are capable of coping with associated stresses and/or are able to utilize chemical contaminants as a source for carbon, nutrients, or as an electron acceptor for respiration (Hemme et al., 2010; Smith et al., 2012; An et al., 2013a,b; Rivers et al., 2013; Tan et al., 2015). Recent work in the area of marine metagenomics has demonstrated that the enrichment of microbial genes in distinct environments reflects the key biogeochemical processes in these systems (Coleman and Chisholm, 2010; Ulloa et al., 2012; Kelly et al., 2013). Similarly, microbial communities and gene expression profiles in polluted waters reveal pathways for transformation of contaminants, thus serving as an indicator of in situ biodegradation processes and highlighting metabolic pathways to target and possibly stimulate for accelerated pollutant clean up (Kimes et al., 2013; Engel and Gupta, 2014; Lamendella et al., 2014; Mason et al., 2014).
Analysis of microbial populations and genes enriched in polluted sites using “-omics” enabled approaches has revealed key microbial processes involved in contaminant tolerance and transformation. For example, in an underground water system heavily polluted by heavy metals, nitric acid and organic solvents, the microbial community surveyed through metagenomics showed limited diversity consisting mainly of Beta- and Gammaproteobacteria, with streamlined metabolic capabilities including the capacity to utilize heavy metal ions for lithotrophy (Hemme et al., 2010). A similar link between microbial gene content and contaminant transformation is observed in systems contaminated with hydrocarbons. In the highly anaerobic environments of oil sands tailings ponds constructed to store waste tailings from bitumen extraction, metagenomic data identified a repertoire of facultative and strict anaerobes that were capable of utilizing the pollutant hydrocarbons as a carbon source (Figure 1C; An et al., 2013a; Tan et al., 2015). Similarly, NGS surveys of open oceans and coastal shorelines in Gulf of Mexico after Deep Horizon Oil spill in 2010 showed that within the water column, a microbial community consisting of mainly marine Gammaproteobacteria (e.g., Colwelliaceae, Oceanospirillaceae, Piscirickettsiaceae, Methylococcaceae) dominated the oil plumes, and were the key players in the aerobic degradation of oil and gas (Mason et al., 2012; Rivers et al., 2013). MPS of SSU rRNA genes from microbial communities originating from beach areas before and post-oil impact indicated that the community shifted from a baseline of predominantly enteric-type consortia associated with human-impact to marine-associated taxa (e.g., Oceanospirillales, Rhodospirillales, and Rhodobacterales) linked to remediation efforts (e.g., sand washing), including populations with potential oil degradation capability (Engel and Gupta, 2014). Oil plumes have been shown to have an impact on microbial communities in marine sediments and other beach areas, with NGS detecting genes and transcripts indicative of pollutant (monoaromatics and alkanes) degradation in marine sediments (Kimes et al., 2013; Mason et al., 2014) and areas along the shorelines (Lamendella et al., 2014).
Several studies have used microcosms and enrichment cultures inoculated from contaminated sites (i.e., polluted groundwater or tailings ponds containing toxic waste) to establish evidence of pollutant degradability, followed by NGS studies incorporating gene expression (i.e., metatranscriptomics) or comparative metagenomics approaches (Hug et al., 2012; Waller et al., 2012) in order to identify (novel) key processes relevant to pollutant degradation (e.g., Abu Laban et al., 2010; Hug et al., 2012; Luo et al., 2014; Figure 1D). NGS in tandem with other functional analyses (e.g., proteomics and transcriptomics) have been employed to successfully identify novel genes and microbes involved in anaerobic activation and degradation of recalcitrant compounds, e.g., carboxylation of benzene (Abu Laban et al., 2010; Luo et al., 2014), dechlorination of chlorinated ethene (Hug et al., 2012; Waller et al., 2012) and fumarate addition activation of hydrocarbons (Tan et al., 2015). Comparative metagenomics and genome-centric metagenomics have been used to tease out the functional roles of different members in complex community in bioreactors (Albertsen et al., 2013; Haroon et al., 2013). Microbially mediated processes, such as dechlorination of chlorinated compounds often requires the presence of a mixed community to provide supporting roles such as production of essential nutrients, e.g., H2, corrinoid, and methionine that are needed by primary degraders (e.g., Dehalococcoidetes) in substrate mineralization. Hug et al. (2012) compared the metagenomes of three enrichment cultures involved in the dechlorination of chlorinated ethene, and identified genes required for the production of nutrients essential for Dehalococcoidetes in dechlorination. Assigning these gene to taxa using a homology-approach; revealed a mutualistic relationship between a repertoire of microorganisms (e.g., Firmicutes, methanogens, and Deltaproteobacteria), which is not easily achieved using conventional microbiology methods involving isolation. The functions of key members involved in the degradation of pollutants could also be inferred through targeted genome enrichment using strategies such as stable-isotope probing (SIP), in which labeled recalcitrant compounds can be used to enrich for key degraders, followed by (meta)genome sequencing and analysis allowing investigation of the genetic capabilities or metabolic modeling of uncultivated pollutant degraders (e.g., Saidi-Mehrabad et al., 2013).
Future Technologies on the Horizon and Impact for Water Quality Assessment
Next-generation sequencing platforms for single molecule real-time sequencing technology (SMRT; e.g., Pacific Biosciences and Oxford Nanopore Technologies) are capable of producing sequences with read lengths exceeding 1000 bp, surpassing those generated using Illumina and Roche 454. In addition, real-time data acquisition make these technologies attractive targets for integration into sensors for monitoring environmental sequence data. To date, these technologies have not been widely adopted for surveys of highly diverse bacterial communities due to high rates of randomly distributed sequencing errors, which would lead to artificially inflated community diversity (Schloss et al., 2015). However, recent studies employing SMRT sequencing with improved sequencer hardware and chemistry have yielded improved accuracy and read lengths (i.e., >1400 bp) making community surveys with amplicon sequencing more feasible (Marshall et al., 2012; Mosher et al., 2014). To date, Pacific Bioscience (PacBio) sequencing has been used extensively for genome sequencing, where the high coverage and long sequence read length allows for generation of highly contiguous consensus sequences with low error rates, where microbial genomes can often be closed within a single run (Koren et al., 2013). PacBio sequencing is often used in parallel with other NGS platforms (e.g., Illumina, Roche 454, and SoLiD) to allow for scaffolding and phishing to produce finished/close genomes with high sequence quality (Koren et al., 2013). MinION, a newer SMRT based on Nanopore technology (Venkatesan and Bashir, 2011), has garnered a lot of interest due to the cost-effectiveness and pocket-size mobility. Hundreds of units were shipped to members of the MinION Access Program to test the device in a wide range of sequencing applications. Sequencing results produced by some of the laboratories showed that MinION could generate single read lengths of up to 5500 bp in a single run, though riddled with a high sequence error rate (∼30%; Ashton et al., 2014; Mikheyev and Tin, 2014; Madoui et al., 2015). Nanopore sequencing holds particular promise for online detection of waterborne nucleic acids due to its high potential portability and real-time data output, allowing future work to develop real-time sensing platforms for water quality monitoring. Indeed, as different NGS technologies advance toward enhanced sequence chemistry for improved sequence read length with higher throughput and reduced error rate, online real-time detection using NGS will breach the gaps for its use in real-time monitoring of genetic parameters for water quality.
The synergy between multiple NGS platforms and the increased accuracy and data processing speed of downstream bioinformatics, biostatistical and machine-learning pipelines continue to push the boundaries of genomic and metagenomic data analyses. Advancements in the field of multi-omics have enabled optimization of bioinformatics workflows (Kozich et al., 2013; Nelson et al., 2014); allowing acquisition of uncultivated microbial genomes from a complex metagenome (Huson et al., 2007; Dick et al., 2009; Hess et al., 2011; Albertsen et al., 2013; Haroon et al., 2013); deduction of microbial community composition based on single copy genes (Sunagawa et al., 2013); inference of metabolic interactions among microbial community members (Hanson et al., 2014); identification of phylogenetic markers and genes encoding important processes (Stark et al., 2010; Darling et al., 2014), and from a broader public health aspect, have augmented our knowledge of the population genomics and dissemination of virulent strains of waterborne pathogens such as Vibrio cholerae (Dutilh et al., 2014).
The influx of newly sequenced microbiomes associated with various natural and engineered water environments progressively expand sequence information available in public databases such as MG-RAST (Meyer et al., 2008) and IMG (Markowitz et al., 2014), providing a bigger data pool for mining and comparative metagenomic analyses. This is evident in ecological studies of freshwater microbiomes where taxonomic composition and functions of waterborne microbial communities and pathogens are dependent upon a priori databases. In addition, integrating physiological methods through use of microcosms, enrichment cultures and single cell genomics, coupled with NGS has allowed the discovery of new enzymes and genes in complex microbial communities that are important in water quality preservation (e.g., toxin degradation, nutrient cycling), which otherwise would be overlooked since many of these microbes have remained uncultivated.
Microbial water quality assessment through NGS-based molecular detection of FIB, pathogens, or virulence factors, as well as genes encoding biogeochemical processes such as pollutant biodegradation, have the potential to be translated into actionable data for water quality managers. However, crucial relationships between the occurrence, detection and quantification of nucleic acids in the environment through NGS-based approaches and potential impacts to human or environmental health must be established before profiles based on the distribution(s) of microbial genes are relied upon to replace currently used bioindicators. Current frameworks for water quality assessment heavily rely upon proxy measurements (e.g., quantification of culturable FIB) with a deep literature substantiating such approaches with epidemiological data. As the field of NGS-based water quality assessment matures, such further studies will be needed to establish whether proposed NGS-based indicators for pollution can improve upon the existing state of the art in water quality assessment.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
This research was funded by the National Research Foundation Singapore through the Singapore MIT Alliance for Research and Technology’s (SMART) Center for Environmental Sensing and Modeling (CENSAM) research program. The authors would like to thank Professor Emerita Julia Foght (University of Alberta) for providing Figures 1C,D in the main text.
Footnotes
References
Abu Laban, N., Selesi, D., Rattei, T., Tischler, P., and Meckenstock, R. U. (2010). Identification of enzymes involved in anaerobic benzene degradation by a strictly anaerobic iron-reducing enrichment culture. Environ. Microbiol. 12, 2783–2796. doi: 10.1111/j.1462-2920.2010.02248.x
Acinas, S. G., Haverkamp, T. H. A., Huisman, J., and Stal, L. J. (2009). Phenotypic and genetic diversification of Pseudanabaena spp. (cyanobacteria). ISME J. 3, 31–46. doi: 10.1038/ismej.2008.78
Albertsen, M., Hugenholtz, P., Skarshewski, A., Nielsen, K. L., Tyson, G. W., and Nielsen, P. H. (2013). Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes. Nat. Biotechnol. 31, 533–538. doi: 10.1038/nbt.2579
Allen, H. K., Moe, L. A., Rodbumrer, J., Gaarder, A., and Handelsman, J. (2009). Functional metagenomics reveals diverse beta-lactamases in a remote Alaskan soil. ISME J. 3, 243–251. doi: 10.1038/ismej.2008.86
An, D., Brown, D., Chatterjee, I., Dong, X., Ramos-Padron, E., Wilson, S., et al. (2013a). Microbial community and potential functional gene diversity involved in anaerobic hydrocarbon degradation and methanogenesis in an oil sands tailings pond. Genome 56, 612–618. doi: 10.1139/gen-2013-0083
An, D., Caffrey, S. M., Soh, J., Agrawal, A., Brown, D., Budwill, K., et al. (2013b). Metagenomics of hydrocarbon resource environments indicates aerobic taxa and genes to be unexpectedly common. Environ. Sci. Technol. 47, 10708–10717. doi: 10.1021/es4020184
Angly, F. E., Dennis, P. G., Skarshewski, A., Vanwonterghem, I., Hugenholtz, P., and Tyson, G. W. (2014). CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction. Microbiome 2:11. doi: 10.1186/2049-2618-2-11
Ashbolt, N. J. (2015). Microbial contamination of drinking water and human health from community water systems. Curr. Environ. Health Rep. 2, 95–106. doi: 10.1007/s40572-014-0037-5
Ashbolt, N. J., Amézquita, A., Backhaus, T., Borriello, P., Brandt, K. K., Collignon, P., et al. (2013). Human Health Risk Assessment (HHRA) for environmental development and transfer of antibiotic resistance. Environ. Health Perspect. 121, 993–1001. doi: 10.1289/ehp.1206316
Ashton, P. M., Nair, S., Dallman, T., Rubino, S., Rabsch, W., Mwaigwisya, S., et al. (2014). MinION nanopore sequencing identifies the position and structure of a bacterial antibiotic resistance island. Nat. Biotechnol. 33, 296–300. doi: 10.1038/nbt.3103
Bai, Y., Liu, R., Liang, J., and Qu, J. (2013). Integrated metagenomic and physiochemical analyses to evaluate the potential role of microbes in the sand filter of a drinking water treatment system. PLoS ONE 8:e61011. doi: 10.1371/journal.pone.0061011
Bengtsson-Palme, J., Boulund, F., Fick, J., Kristiansson, E., and Larsson, D. G. J. (2014). Shotgun metagenomics reveals a wide array of antibiotic resistance genes and mobile elements in a polluted lake in India. Front. Microbiol. 5:648. doi: 10.3389/fmicb.2014.00648
Berendonk, T. U., Manaia, C. M., Merlin, C., Fatta-Kassinos, D., Cytryn, E., Walsh, F., et al. (2015). Tackling antibiotic resistance: the environmental framework. Nat. Rev. Microbiol. 13, 310–317. doi: 10.1038/nrmicro3439
Bernhard, A. E., and Field, K. G. (2000). Identification of nonpoint sources of fecal pollution in coastal waters by using host-specific 16S ribosomal DNA genetic markers from fecal anaerobes. Appl. Environ. Microbiol. 66, 1587–1594. doi: 10.1128/AEM.66.4.1587-1594.2000
Bourlat, S. J., Borja, A., Gilbert, J., Taylor, M. I., Davies, N., Weisberg, S. B., et al. (2013). Genomics in marine monitoring: new opportunities or assessing marine health status. Mar. Pollut. Bull. 74, 19–31. doi: 10.1016/j.marpolbul.2013.05.042
Bowen, J. L., Byrnes, J. E. K., Weisman, D., and Colaneri, C. (2013). Functional gene pyrosequencing and network analysis: an approach to examine the response of denitrifying bacteria to increased nitrogen supply in salt marsh sediments. Front. Microbiol. 4:342. doi: 10.3389/fmicb.2013.00342
Bush, K., Courvalin, P., Dantas, G., Davies, J., Eisenstein, B., Huovinen, P., et al. (2011). Tackling antibiotic resistance. Nat. Rev. Microbiol. 9, 894–896. doi: 10.1038/nrmicro2693
Cai, L., Ju, F., and Zhang, T. (2014). Tracking human sewage microbiome in a municipal wastewater treatment plant. Appl. Microbiol. Biotechnol. 98, 3317–3326. doi: 10.1007/s00253-013-5402-z
Cai, L., and Zhang, T. (2013). Detecting human bacterial pathogens in wastewater treatment plants by a high-throughput shotgun sequencing technique. Environ. Sci. Technol. 47, 5433–5441. doi: 10.1021/es400275r
Caporaso, J. G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F. D., Costello, E. K., et al. (2010). QIIME allows analysis of high-throughput community sequencing data. Nat. Methods 7, 335–336. doi: 10.1038/nmeth.f.303
Carew, M. E., Pettigrove, V. J., Metzeling, L., and Hoffmann, A. A. (2013). Environmental monitoring using next generation sequencing: rapid identification of macroinvertebrate bioindicator species. Front. Zool. 10:45. doi: 10.1186/1742-9994-10-45
CDC (2013). Surveillance for Waterborne Disease Outbreaks Associated with Drinking Water and Other Nonrecreational Water- United States, 2009-2010. Available at: http://www.cdc.gov/mmwr/preview/mmwrhtml/mm6235a3.htm
Chao, Y., Ma, L., Yang, Y., Ju, F., Zhang, X.-X., Wu, W.-M., et al. (2013). Metagenomic analysis reveals significant changes of microbial compositions and protective functions during drinking water treatment. Sci. Rep. 3:3550. doi: 10.1038/srep03550
Chen, B., Yang, Y., Liang, X., Yu, K., Zhang, T., and Li, X. (2013). Metagenomic profiles of antibiotic resistance genes (ARGs) between human impacted estuary and deep ocean sediments. Environ. Sci. Technol. 47, 12753–12760. doi: 10.1021/es403818e
Christiansen, G., Molitor, C., Philmus, B., and Kurmayer, R. (2008). Nontoxic strains of cyanobacteria are the result of major gene deletion events induced by a transposable element. Mol. Biol. Evol. 25, 1695–1704. doi: 10.1093/molbev/msn120
Christoffersen, K., Riemann, B., Hansen, L. R., Klysner, A., and Sørensen, H. B. (1990). Qualitative importance of the microbial loop and plankton community structure in a eutrophic lake during a bloom of cyanobacteria. Microb. Ecol. 20, 253–272. doi: 10.1007/BF02543881
Codex Alimentarius Commission (2011). Guidelines for Risk Analysis of Foodborne Antimicrobial Resistance. Available at: http://www.fao.org/food/food-safety-quality/a-z-index/antimicrobial/en/
Coleman, M. L., and Chisholm, S. W. (2010). Ecosystem-specific selection pressures revealed through comparative population genomics. Proc. Natl. Acad. Sci. U.S.A. 107, 18634–18639. doi: 10.1073/pnas.1009480107
Colford, J. M., Wade, T. J., Schiff, K. C., Wright, C. C., Griffith, J. F., Sandhu, S. K., et al. (2007). Water quality indicators and the risk of illness at beaches with nonpoint sources of fecal contamination. Epidemiology 18, 27–35. doi: 10.1097/01.ede.0000249425.32990.b9
Czekalski, N., Gascón Díez, E., and Bürgmann, H. (2014). Wastewater as a point source of antibiotic-resistance genes in the sediment of a freshwater lake. ISME J. 8, 1381–1390. doi: 10.1038/ismej.2014.8
Danishuddin, M., Hassan Baig, M., Kaushal, L., and Khan, A. U. (2013). BLAD: a comprehensive database of widely circulated beta-lactamases. Bioinformatics 29, 2515–2516. doi: 10.1093/bioinformatics/btt417
Darling, A. E., Jospin, G., Lowe, E., Matsen, F. A., Bik, H. M., and Eisen, J. A. (2014). PhyloSift: phylogenetic analysis of genomes and metagenomes. PeerJ. 2:e243. doi: 10.7717/peerj.243
David, L. A., Maurice, C. F., Carmody, R. N., Gootenberg, D. B., Button, J. E., Wolfe, B. E., et al. (2014). Diet rapidly and reproducibly alters the human gut microbiome. Nature 505, 559–563. doi: 10.1038/nature12820
Diaz-Torres, M. L., Villedieu, A., Hunt, N., McNab, R., Spratt, D. A., Allan, E., et al. (2006). Determining the antibiotic resistance potential of the indigenous oral microbiota of humans using a metagenomic approach. FEMS Microbiol. Lett. 258, 257–262. doi: 10.1111/j.1574-6968.2006.00221.x
Dick, G. J., Andersson, A. F., Baker, B. J., Simmons, S. L., Thomas, B. C., Yelton, A. P., et al. (2009). Community-wide analysis of microbial genome sequence signatures. Genome Biol. 10:R85. doi: 10.1186/gb-2009-10-8-r85
Donlan, R. M. (2002). Biofilms: microbial life on surfaces. Emerg. Infect. Dis. 8, 881–890. doi: 10.3201/eid0809.020063
Dufour, A. P. (1984). Bacterial indicators of recreational water quality. Can. J. Public Health 75, 49–56.
Dutilh, B. E., Thompson, C. C., Vicente, A. C. P., Marin, M. A., Lee, C., Silva, G. G. Z., et al. (2014). Comparative genomics of 274 Vibrio cholerae genomes reveals mobile functions structuring three niche dimensions. BMC Genomics 15:654. doi: 10.1186/1471-2164-15-654
Edwards, C., and Lawton, L. A. (2009). Bioremediation of cyanotoxins. Adv. Appl. Microbiol. 67, 109–129. doi: 10.1016/S0065-2164(08)01004-6
Engel, A. S., and Gupta, A. A. (2014). Regime shift in sandy beach microbial communities following Deepwater Horizon oil spill remediation efforts. PLoS ONE 9:e102934. doi: 10.1371/journal.pone.0102934
Eren, A. M., Maignien, L., Sul, W. J., Murphy, L. G., Grim, S. L., Morrison, H. G., et al. (2013). Oligotyping: differentiating between closely related microbial taxa using 16S rRNA gene data. Methods Ecol. Evol. 4, 111–119. doi: 10.1111/2041-210X.12114
Eren, A. M., Morrison, H. G., Lescault, P. J., Reveillaud, J., Vineis, J. H., and Sogin, M. L. (2014). Minimum entropy decomposition: unsupervised oligotyping for sensitive partitioning of high-throughput marker gene sequences. ISME J. 9, 968–979. doi: 10.1038/ismej.2014.195
Esling, P., Lejzerowicz, F., and Pawlowski, J. (2015). Accurate multiplexing and filtering for high-throughput amplicon-sequencing. Nucleic Acids Res. 43, 2513–2524. doi: 10.1093/nar/gkv107
Evangelista, K. V., and Coburn, J. (2010). Leptospira as an emerging pathogen: a review of its biology, pathogenesis and host immune responses. Future Microbiol. 5, 1413–1425. doi: 10.2217/fmb.10.102
Farnelid, H., Andersson, A. F., Bertilsson, S., Al-Soud, W. A., Hansen, L. H., Sørensen, S., et al. (2011). Nitrogenase gene amplicons from global marine surface waters are dominated by genes of non-cyanobacteria. PLoS ONE 6:e19223. doi: 10.1371/journal.pone.0019223
Ferrão-Filho, A., da, S., and Kozlowsky-Suzuki, B. (2011). Cyanotoxins: bioaccumulation and effects on aquatic animals. Mar. Drugs 9, 2729–2772. doi: 10.3390/md9122729
Figueras, M. J., and Borrego, J. J. (2010). New perspectives in monitoring drinking water microbial quality. Int. J. Environ. Res. Public Health 7, 4179–4202. doi: 10.3390/ijerph7124179
Fondi, M., and Liò, P. (2015). Multi -omics and metabolic modelling pipelines: challenges and tools for systems microbiology. Microbiol. Res. 171, 52–64. doi: 10.1016/j.micres.2015.01.003
Frangeul, L., Quillardet, P., Castets, A.-M., Humbert, J.-F., Matthijs, H. C. P., Cortez, D., et al. (2008). Highly plastic genome of Microcystis aeruginosa PCC 7806, a ubiquitous toxic freshwater cyanobacterium. BMC Genomics 9:274. doi: 10.1186/1471-2164-9-274
Franzosa, E. A., Hsu, T., Sirota-Madi, A., Shafquat, A., Abu-Ali, G., Morgan, X. C., et al. (2015). Sequencing and beyond: integrating molecular “omics” for microbial community profiling. Nat. Rev. Microbiol. 13, 360–372. doi: 10.1038/nrmicro3451
Frey, K. G., Herrera-Galeano, J. E., Redden, C. L., Luu, T. V., Servetas, S. L., Mateczun, A. J., et al. (2014). Comparison of three next-generation sequencing platforms for metagenomic sequencing and identification of pathogens in blood. BMC Genomics 15:96. doi: 10.1186/1471-2164-15-96
Ger, K. A., Hansson, L.-A., and Lürling, M. (2014). Understanding cyanobacteria-zooplankton interactions in a more eutrophic world. Freshw. Biol. 59, 1783–1798. doi: 10.1111/fwb.12393
Gerphagnon, M., Latour, D., Colombet, J., and Sime-Ngando, T. (2013). Fungal parasitism: life cycle, dynamics and impact on cyanobacterial blooms. PLoS ONE 8:e60894. doi: 10.1371/journal.pone.0060894
Gibson, M. K., Forsberg, K. J., and Dantas, G. (2015). Improved annotation of antibiotic resistance determinants reveals microbial resistomes cluster by ecology. ISME J. 9, 207–216. doi: 10.1038/ismej.2014.106
Gieg, L. M., Fowler, S. J., and Berdugo-Clavijo, C. (2014). Syntrophic biodegradation of hydrocarbon contaminants. Curr. Opin. Biotechnol. 27, 21–29. doi: 10.1016/j.copbio.2013.09.002
Gifford, S. M., Sharma, S., Rinta-Kanto, J. M., and Moran, M. A. (2011). Quantitative analysis of a deeply sequenced marine microbial metatranscriptome. ISME J. 5, 461–472. doi: 10.1038/ismej.2010.141
Gomez-Alvarez, V., Revetta, R. P., and Santo Domingo, J. W. (2012). Metagenomic analyses of drinking water receiving different disinfection treatments. Appl. Environ. Microbiol. 78, 6095–6102. doi: 10.1128/AEM.01018-12
Green, H. C., Haugland, R. A., Varma, M., Millen, H. T., Borchardt, M. A., Field, K. G., et al. (2014). Improved HF183 quantitative real-time PCR assay for characterization of human fecal pollution in ambient surface water samples. Appl. Environ. Microbiol. 80, 3086–3094. doi: 10.1128/AEM.04137-13
Grundmann, H., Klugman, K. P., Walsh, T., Ramon-Pardo, P., Sigauque, B., Khan, W., et al. (2011). A framework for global surveillance of antibiotic resistance. Drug Resist. Updat. 14, 79–87. doi: 10.1016/j.drup.2011.02.007
Gumede, N., Lentsoane, O., Burns, C. C., Pallansch, M., de Gourville, E., Yogolelo, R., et al. (2013). Emergence of vaccine-derived polioviruses, democratic republic of congo, 2004–2011. Emerg. Infect. Dis. 19, 1583–1589. doi: 10.3201/eid1910.130028
Guo, F., Ju, F., Cai, L., and Zhang, T. (2013). Taxonomic precision of different hypervariable regions of 16S rRNA gene and annotation methods for functional bacterial groups in biological wastewater treatment. PLoS ONE 8:e76185. doi: 10.1371/journal.pone.0076185
Haile, R. W., Witte, J. S., Gold, M., Cressey, R., McGee, C., Millikan, R. C., et al. (1999). The health effects of swimming in ocean water contaminated by storm drain runoff. Epidemiology 10, 355–363. doi: 10.1097/00001648-199907000-00004
Hanson, N. W., Konwar, K. M., Hawley, A. K., Altman, T., Karp, P. D., and Hallam, S. J. (2014). Metabolic pathways for the whole community. BMC Genomics 15:619. doi: 10.1186/1471-2164-15-619
Haroon, M. F., Hu, S., Shi, Y., Imelfort, M., Keller, J., Hugenholtz, P., et al. (2013). Anaerobic oxidation of methane coupled to nitrate reduction in a novel archaeal lineage. Nature 500, 567–570. doi: 10.1038/nature12375
Harwood, V. J., Staley, C., Badgley, B. D., Borges, K., and Korajkic, A. (2014). Microbial source tracking markers for detection of fecal contamination in environmental waters: relationships between pathogens and human health outcomes. FEMS Microbiol. Rev. 38, 1–40. doi: 10.1111/1574-6976.12031
Hass, C., Rose, J., and Gerba, C. (2014). Quantitative Microbial Risk Assessment. Available at: http://as.wiley.com/WileyCDA/WileyTitle/productCd-1118145291.html [Accessed July 10, 2015].
Hellmér, M., Paxéus, N., Magnius, L., Enache, L., Arnholm, B., Johansson, A., et al. (2014). Detection of pathogenic viruses in sewage provided early warnings of hepatitis A virus and norovirus outbreaks. Appl. Environ. Microbiol. 80, 6771–6781. doi: 10.1128/AEM.01981-14
Hemme, C. L., Deng, Y., Gentry, T. J., Fields, M. W., Wu, L., Barua, S., et al. (2010). Metagenomic insights into evolution of a heavy metal-contaminated groundwater microbial community. ISME J. 4, 660–672. doi: 10.1038/ismej.2009.154
Hess, M., Sczyrba, A., Egan, R., Kim, T.-W., Chokhawala, H., Schroth, G., et al. (2011). Metagenomic discovery of biomass-degrading genes and genomes from cow rumen. Science 331, 463–467. doi: 10.1126/science.1200387
Hong, P.-Y., Hwang, C., Ling, F., Andersen, G. L., LeChevallier, M. W., and Liu, W.-T. (2010). Pyrosequencing analysis of bacterial biofilm communities in water meters of a drinking water distribution system. Appl. Environ. Microbiol. 76, 5631–5635. doi: 10.1128/AEM.00281-10
Howe, A. D., Forster, S., Morton, S., Marshall, R., Osborn, K. S., Wright, P., et al. (2002). Cryptosporidium oocysts in a water supply associated with a cryptosporidiosis outbreak. Emerg. Infect. Dis. 8, 619–624. doi: 10.3201/eid0806.010271
Hu, Y., Yang, X., Qin, J., Lu, N., Cheng, G., Wu, N., et al. (2013). Metagenome-wide analysis of antibiotic resistance genes in a large cohort of human gut microbiota. Nat. Commun. 4:2151. doi: 10.1038/ncomms3151
Huang, K., Zhang, X.-X., Shi, P., Wu, B., and Ren, H. (2014). A comprehensive insight into bacterial virulence in drinking water using 454 pyrosequencing and Illumina high-throughput sequencing. Ecotoxicol. Environ. Saf. 109, 15–21. doi: 10.1016/j.ecoenv.2014.07.029
Hug, L. A., Beiko, R. G., Rowe, A. R., Richardson, R. E., and Edwards, E. A. (2012). Comparative metagenomics of three Dehalococcoides-containing enrichment cultures: the role of the non-dechlorinating community. BMC Genomics 13:327. doi: 10.1186/1471-2164-13-327
Humbert, J.-F., Barbe, V., Latifi, A., Gugger, M., Calteau, A., Coursin, T., et al. (2013). A tribute to disorder in the genome of the bloom-forming freshwater cyanobacterium Microcystis aeruginosa. PLoS ONE 8:e70747. doi: 10.1371/journal.pone.0070747
Huson, D. H., Auch, A. F., Qi, J., and Schuster, S. C. (2007). MEGAN analysis of metagenomic data. Genome Res. 17, 377–386. doi: 10.1101/gr.5969107
Ibekwe, A. M., Leddy, M., and Murinda, S. E. (2013). Potential human pathogenic bacteria in a mixed urban watershed as revealed by pyrosequencing. PLoS ONE 8:e79490. doi: 10.1371/journal.pone.0079490
Kazimierczak, K. A., Scott, K. P., Kelly, D., and Aminov, R. I. (2009). Tetracycline resistome of the organic pig gut. Appl. Environ. Microbiol. 75, 1717–1722. doi: 10.1128/AEM.02206-08
Kelly, L., Ding, H., Huang, K. H., Osburne, M. S., and Chisholm, S. W. (2013). Genetic diversity in cultured and wild marine cyanomyoviruses reveals phosphorus stress as a strong selective agent. ISME J. 7, 1827–1841. doi: 10.1038/ismej.2013.58
Kennedy, K., Hall, M. W., Lynch, M. D. J., Moreno-Hagelsieb, G., and Neufeld, J. D. (2014). Evaluating bias of illumina-based bacterial 16S rRNA gene profiles. Appl. Environ. Microbiol. 80, 5717–5722. doi: 10.1128/AEM.01451-14
Kimes, N. E., Callaghan, A. V., Aktas, D. F., Smith, W. L., Sunner, J., Golding, B., et al. (2013). Metagenomic analysis and metabolite profiling of deep–sea sediments from the Gulf of Mexico following the Deepwater Horizon oil spill. Front. Microbiol. 4:50. doi: 10.3389/fmicb.2013.00050
Koenig, J., Lee, M., and Manefield, M. (2014). Aliphatic organochlorine degradation in subsurface environments. Rev. Environ. Sci. Biotechnol. 14, 49–71. doi: 10.1007/s11157-014-9345-3
Koren, S., Harhay, G. P., Smith, T. P. L., Bono, J. L., Harhay, D. M., Mcvey, S. D., et al. (2013). Reducing assembly complexity of microbial genomes with single-molecule sequencing. Genome Biol. 14:R101. doi: 10.1186/gb-2013-14-9-r101
Koskey, A. M., Fisher, J. C., Eren, A. M., Ponce-Terashima, R., Reis, M. G., Blanton, R. E., et al. (2014). Blautia and Prevotella sequences distinguish human and animal fecal pollution in Brazil surface waters. Environ. Microbiol. Rep. 6, 696–704. doi: 10.1111/1758-2229.12189
Kozarewa, I., Ning, Z., Quail, M. A., Sanders, M. J., Berriman, M., and Turner, D. J. (2009). Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of (G+C)-biased genomes. Nat. Methods 6, 291–295. doi: 10.1038/nmeth.1311
Kozich, J. J., Westcott, S. L., Baxter, N. T., Highlander, S. K., and Schloss, P. D. (2013). Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Appl. Environ. Microbiol. 79, 5112–5120. doi: 10.1128/AEM.01043-3
Kumar, N. V., Menon, T., Pathipati, P., and Cherian, K. M. (2013). 16S rRNA sequencing as a diagnostic tool in the identification of culture-negative endocarditis in surgically treated patients. J. Heart Valve Dis. 22, 846–849.
Laban, N. A., Dao, A., and Foght, J. (2015). DNA stable-isotope probing of oil sands tailings pond enrichment cultures reveals different key players for toluene degradation under methanogenic and sulfidogenic conditions. FEMS Microbiol. Ecol. 91, pii: fiv039. doi: 10.1093/femsec/fiv039
Lamendella, R., Strutt, S., Borglin, S., Chakraborty, R., Tas, N., Mason, O. U., et al. (2014). Assessment of the Deepwater Horizon oil spill impact on Gulf coast microbial communities. Front. Microbiol. 5:130. doi: 10.3389/fmicb.2014.00130
Li, B., Yang, Y., Ma, L., Ju, F., Guo, F., Tiedje, J. M., et al. (2015). Metagenomic and network analysis reveal wide distribution and co-occurrence of environmental antibiotic resistance genes. ISME J. doi: 10.1038/ismej.2015.59 [Epub ahead of print].
Li, J. Z., Chapman, B., Charlebois, P., Hofmann, O., Weiner, B., Porter, A. J., et al. (2014). Comparison of illumina and 454 deep sequencing in participants failing raltegravir-based antiretroviral therapy. PLoS ONE 9:e90485. doi: 10.1371/journal.pone.0090485
Li, N., Zhang, L., Li, F., Wang, Y., Zhu, Y., Kang, H., et al. (2011). Metagenome of microorganisms associated with the toxic Cyanobacteria Microcystis aeruginosa analyzed using the 454 sequencing platform. Chinese J. Oceanol. Limnol. 29, 505–513. doi: 10.1007/s00343-011-0056-0
Li, Z., Hobson, P., An, W., Burch, M. D., House, J., and Yang, M. (2012). Earthy odor compounds production and loss in three cyanobacterial cultures. Water Res. 46, 5165–5173. doi: 10.1016/j.watres.2012.06.008
Liu, B., and Pop, M. (2009). ARDB–antibiotic resistance genes database. Nucleic Acids Res. 37, D443–D447. doi: 10.1093/nar/gkn656
Logares, R., Sunagawa, S., Salazar, G., Cornejo-Castillo, F. M., Ferrera, I., Sarmento, H., et al. (2014). Metagenomic 16S rDNA Illumina tags are a powerful alternative to amplicon sequencing to explore diversity and structure of microbial communities. Environ. Microbiol. 16, 2659–2671. doi: 10.1111/1462-2920.12250
Loman, N. J., Misra, R. V., Dallman, T. J., Constantinidou, C., Gharbia, S. E., Wain, J., et al. (2012). Performance comparison of benchtop high-throughput sequencing platforms. Nat. Biotechnol. 30, 434–439. doi: 10.1038/nbt.2198
Lu, X., Zhang, X.-X., Wang, Z., Huang, K., Wang, Y., Liang, W., et al. (2015). Bacterial pathogens and community composition in advanced sewage treatment systems revealed by metagenomics analysis based on high-throughput sequencing. PLoS ONE 10:e0125549. doi: 10.1371/journal.pone.0125549
Luo, F., Gitiafroz, R., Devine, C. E., Gong, Y., Hug, L. A., Raskin, L., et al. (2014). Metatranscriptome of an anaerobic benzene-degrading, nitrate-reducing enrichment culture reveals involvement of carboxylation in benzene ring activation. Appl. Environ. Microbiol. 80, 4095–4107. doi: 10.1128/AEM.00717-14
MacKenzie, W., Hoxie, N., Proctor, M., Ms, G., Blair, K., Peterson, D., et al. (1994). A massive outbreak in milwaukee of Cryptosporidium infection transmitted through the public water supply. N. Engl. J. Med. 331, 161–167. doi: 10.1056/NEJM199407213310304
Madoui, M.-A., Engelen, S., Cruaud, C., Belser, C., Bertrand, L., Alberti, A., et al. (2015). Genome assembly using Nanopore-guided long and error-free DNA reads. BMC Genomics 16:327. doi: 10.1186/s12864-015-1519-z
Major, D. W., McMaster, M. L., Cox, E. E., Edwards, E. A., Dworatzek, S. M., Hendrickson, E. R., et al. (2002). Field demonstration of successful bioaugmentation to achieve dechlorination of tetrachloroethene to ethene. Environ. Sci. Technol. 36, 5106–5116. doi: 10.1021/es0255711
Markowitz, V. M., Chen, I.-M. A., Palaniappan, K., Chu, K., Szeto, E., Pillay, M., et al. (2014). IMG 4 version of the integrated microbial genomes comparative analysis system. Nucleic Acids Res. 42, D560–D567. doi: 10.1093/nar/gkt963
Marshall, C. W., Ross, D. E., Fichot, E. B., Norman, R. S., and May, H. D. (2012). Electrosynthesis of commodity chemicals by an autotrophic microbial community. Appl. Environ. Microbiol. 78, 8412–8420. doi: 10.1128/AEM.02401-2
Mason, O. U., Hazen, T. C., Borglin, S., Chain, P. S. G., Dubinsky, E. A., Fortney, J. L., et al. (2012). Metagenome, metatranscriptome and single-cell sequencing reveal microbial response to Deepwater Horizon oil spill. ISME J. 6, 1715–1727. doi: 10.1038/ismej.2012.59
Mason, O. U., Scott, N. M., Gonzalez, A., Robbins-Pianka, A., Bælum, J., Kimbrel, J., et al. (2014). Metagenomics reveals sediment microbial community response to Deepwater Horizon oil spill. ISME J. 8, 1464–1475. doi: 10.1038/ismej.2013.254
McArthur, A. G., Waglechner, N., Nizam, F., Yan, A., Azad, M. A., Baylay, A. J., et al. (2013). The comprehensive antibiotic resistance database. Antimicrob. Agents Chemother. 57, 3348–3357. doi: 10.1128/AAC.00419-13
McLellan, S. L., and Eren, A. M. (2014). Discovering new indicators of fecal pollution. Trends Microbiol. 22, 697–706. doi: 10.1016/j.tim.2014.08.002
McLellan, S. L., Huse, S. M., Mueller-Spitz, S. R., Andreishcheva, E. N., and Sogin, M. L. (2010). Diversity and population structure of sewage-derived microorganisms in wastewater treatment plant influent. Environ. Microbiol. 12, 378–392. doi: 10.1111/j.1462-2920.2009.02075.x
McLellan, S. L., Newton, R. J., Vandewalle, J. L., Shanks, O. C., Huse, S. M., Eren, A. M., et al. (2013). Sewage reflects the distribution of human faecal Lachnospiraceae. Environ. Microbiol. 15, 2213–2227. doi: 10.1111/1462-2920.12092
Meyer, F., Paarmann, D., D’Souza, M., Olson, R., Glass, E. M., Kubal, M., et al. (2008). The metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics 9:386. doi: 10.1186/1471-2105-9-386
Mikalsen, B., Boison, G., Skulberg, O. M., Fastner, J., Davies, W., Gabrielsen, T. M., et al. (2003). Natural variation in the microcystin synthetase operon mcyABC and impact on microcystin production in Microcystis strains. J. Bacteriol. 185, 2774–2785. doi: 10.1128/JB.185.9.2774-2785.2003
Mikheyev, A. S., and Tin, M. M. Y. (2014). A first look at the Oxford Nanopore MinION sequencer. Mol. Ecol. Resour. 14, 1097–1102. doi: 10.1111/1755-0998.12324
Mori, T., Mizuta, S., Suenaga, H., and Miyazaki, K. (2008). Metagenomic screening for bleomycin resistance genes. Appl. Environ. Microbiol. 74, 6803–6805. doi: 10.1128/AEM.00873-08
Mosher, J. J., Bowman, B., Bernberg, E. L., Shevchenko, O., Kan, J., Korlach, J., et al. (2014). Improved performance of the PacBio SMRT technology for 16S rDNA sequencing. J. Microbiol. Methods 104, 59–60. doi: 10.1016/j.mimet.2014.06.012
Mou, X., Lu, X., Jacob, J., Sun, S., and Heath, R. (2013). Metagenomic identification of bacterioplankton taxa and pathways involved in microcystin degradation in lake erie. PLoS ONE 8:e61890. doi: 10.1371/journal.pone.0061890
Murray, D. C., Coghlan, M. L., and Bunce, M. (2015). From benchtop to desktop: important considerations when designing amplicon sequencing workflows. PLoS ONE 10:e0124671. doi: 10.1371/journal.pone.0124671
Nelson, M. C., Morrison, H. G., Benjamino, J., Grim, S. L., and Graf, J. (2014). Analysis, optimization and verification of Illumina-generated 16S rRNA gene amplicon surveys. PLoS ONE 9:e94249. doi: 10.1371/journal.pone.0094249
Newton, R. J., Bootsma, M. J., Morrison, H. G., Sogin, M. L., and McLellan, S. L. (2013). A microbial signature approach to identify fecal pollution in the waters off an urbanized coast of Lake Michigan. Microb. Ecol. 65, 1011–1023. doi: 10.1007/s00248-013-0200-9
Newton, R. J., McLellan, S. L., Dila, D. K., Vineis, J. H., Morrison, H. G., Eren, A. M., et al. (2015). Sewage reflects the microbiomes of human populations. MBio 6:e02574. doi: 10.1128/mBio.02574-4
Newton, R. J., Vandewalle, J. L., Borchardt, M. A., Gorelick, M. H., and McLellan, S. L. (2011). Lachnospiraceae and Bacteroidales alternative fecal indicators reveal chronic human sewage contamination in an urban harbor. Appl. Environ. Microbiol. 77, 6972–6981. doi: 10.1128/AEM.05480-11
Nshimyimana, J. P., Freedman, A. J. E., Shanahan, P., Chua, L. C. H., and Thompson, J. R. (2015). “Variation of Bacterial Communities and Pathogen Taxa as a Function of Land Use and Water Quality in an Urban Tropical Catchment of Singapore” in Proceedings of the 115th General Meeting of American Society for Microbiology, New Orleans.
Ong, S. H., Kukkillaya, V. U., Wilm, A., Lay, C., Ho, E. X. P., Low, L., et al. (2013). Species identification and profiling of complex microbial communities using shotgun Illumina sequencing of 16S rRNA amplicon sequences. PLoS ONE 8:e60811. doi: 10.1371/journal.pone.0060811
Paerl, H. W., and Huisman, J. (2009). Climate change: a catalyst for global expansion of harmful cyanobacterial blooms. Environ. Microbiol. Rep. 1, 27–37. doi: 10.1111/j.1758-2229.2008.00004.x
Parsley, L. C., Consuegra, E. J., Kakirde, K. S., Land, A. M., Harper, W. F., and Liles, M. R. (2010). Identification of diverse antimicrobial resistance determinants carried on bacterial, plasmid, or viral metagenomes from an activated sludge microbial assemblage. Appl. Environ. Microbiol. 76, 3753–3757. doi: 10.1128/AEM.03080-09
Pehrsson, E. C., Forsberg, K. J., Gibson, M. K., Ahmadi, S., and Dantas, G. (2013). Novel resistance functions uncovered using functional metagenomic investigations of resistance reservoirs. Front. Microbiol. 4:145. doi: 10.3389/fmicb.2013.00145
Penn, K., Wang, J., Fernando, S. C., and Thompson, J. R. (2014). Secondary metabolite gene expression and interplay of bacterial functions in a tropical freshwater cyanobacterial bloom. ISME J. 8, 1866–1878. doi: 10.1038/ismej.2014.27
Port, J. A., Cullen, A. C., Wallace, J. C., Smith, M. N., and Faustman, E. M. (2014). Metagenomic frameworks for monitoring antibiotic resistance in aquatic environments. Environ. Health Perspect. 122, 222–228. doi: 10.1289/ehp.1307009
Quail, M. A., Otto, T. D., Gu, Y., Harris, S. R., Skelly, T. F., McQuillan, J. A., et al. (2012). Optimal enzymes for amplifying sequencing libraries. Nat. Methods 9, 10–11. doi: 10.1038/nmeth.1814
Ramamurthy, T., Ghosh, A., Pazhani, G. P., and Shinoda, S. (2014). Current perspectives on viable but non-culturable (VBNC) pathogenic bacteria. Front. Public Health 2:103. doi: 10.3389/fpubh.2014.00103
Riesenfeld, C. S., Goodman, R. M., and Handelsman, J. (2004). Uncultured soil bacteria are a reservoir of new antibiotic resistance genes. Environ. Microbiol. 6, 981–989. doi: 10.1111/j.1462-2920.2004.00664.x
Rivers, A. R., Sharma, S., Tringe, S. G., Martin, J., Joye, S. B., and Moran, M. A. (2013). Transcriptional response of bathypelagic marine bacterioplankton to the Deepwater Horizon oil spill. ISME J. 7, 2315–2329. doi: 10.1038/ismej.2013.129
Saidi-Mehrabad, A., He, Z., Tamas, I., Sharp, C. E., Brady, A. L., Rochman, F. F., et al. (2013). Methanotrophic bacteria in oilsands tailings ponds of northern Alberta. ISME J. 7, 908–921. doi: 10.1038/ismej.2012.163
Salipante, S. J., Sengupta, D. J., Rosenthal, C., Costa, G., Spangler, J., Sims, E. H., et al. (2013). Rapid 16S rRNA next-generation sequencing of polymicrobial clinical samples for diagnosis of complex bacterial infections. PLoS ONE 8:e65226. doi: 10.1371/journal.pone.0065226
Satinsky, B. M., Crump, B. C., Smith, C. B., Sharma, S., Zielinski, B. L., Doherty, M., et al. (2014a). Microspatial gene expression patterns in the Amazon River Plume. Proc. Natl. Acad. Sci. U.S.A. 111, 11085–11090. doi: 10.1073/pnas.1402782111
Satinsky, B. M., Zielinski, B. L., Doherty, M., Smith, C. B., Sharma, S., Paul, J. H., et al. (2014b). The Amazon continuum dataset: quantitative metagenomic and metatranscriptomic inventories of the Amazon River plume, June 2010. Microbiome 2:17. doi: 10.1186/2049-2618-2-17
Schirmer, M., Ijaz, U. Z., D’Amore, R., Hall, N., Sloan, W. T., and Quince, C. (2015). Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform. Nucleic Acids Res. 43:e37. doi: 10.1093/nar/gku1341
Schloss, P. D., Westcott, S .L., Jenior, M. L., and Highlander, S. K. (2015). Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system. PeerJ PrePrints 3:e778v1. doi: 10.7287/peerj.preprints.778v1
Schmidt, T. S. B., Matias Rodrigues, J. F., and von Mering, C. (2014). Limits to robustness and reproducibility in the demarcation of operational taxonomic units. Environ. Microbiol. 17, 1689–1706. doi: 10.1111/1462-2920.12610
Schwarzenbach, R. P., Egli, T., Hofstetter, T. B., von Gunten, U., and Wehrli, B. (2010). Global water pollution and human health. Annu. Rev. Environ. Resour. 35, 109–136. doi: 10.1146/annurev-environ-100809-125342
Sergeant, M. J., Constantinidou, C., Cogan, T., Penn, C. W., and Pallen, M. J. (2012). High-throughput sequencing of 16S rRNA gene amplicons: effects of extraction procedure, primer length and annealing temperature. PLoS ONE 7:e38094. doi: 10.1371/journal.pone.0038094
Shanks, O. C., Domingo, J. W. S., Lu, J., Kelty, C. A., and Graham, J. E. (2007). Identification of bacterial DNA markers for the detection of human fecal pollution in water. Appl. Environ. Microbiol. 73, 2416–2422. doi: 10.1128/AEM.02474-06
Shanks, O. C., Kelty, C. A., Sivaganesan, M., Varma, M., and Haugland, R. A. (2009). Quantitative PCR for genetic markers of human fecal pollution. Appl. Environ. Microbiol. 75, 5507–5513. doi: 10.1128/AEM.00305-09
Shanks, O. C., Newton, R. J., Kelty, C. A., Huse, S. M., Sogin, M. L., and McLellan, S. L. (2013). Comparison of the microbial community structures of untreated wastewaters from different geographic locales. Appl. Environ. Microbiol. 79, 2906–2913. doi: 10.1128/AEM.03448-12
Shanks, O. C., White, K., Kelty, C. A., Hayes, S., Sivaganesan, M., Jenkins, M., et al. (2010). Performance assessment PCR-based assays targeting bacteroidales genetic markers of bovine fecal pollution. Appl. Environ. Microbiol. 76, 1359–1366. doi: 10.1128/AEM.02033-09
Sharma, A. K., Zhaxybayeva, O., Papke, R. T., and Doolittle, W. F. (2008). Actinorhodopsins: proteorhodopsin-like gene sequences found predominantly in non-marine environments. Environ. Microbiol. 10, 1039–1056. doi: 10.1111/j.1462-2920.2007.01525.x
Shi, P., Jia, S., Zhang, X.-X., Zhang, T., Cheng, S., and Li, A. (2013). Metagenomic insights into chlorination effects on microbial antibiotic resistance in drinking water. Water Res. 47, 111–120. doi: 10.1016/j.watres.2012.09.046
Shishkin, A. A., Giannoukos, G., Kucukural, A., Ciulla, D., Busby, M., Surka, C., et al. (2015). Simultaneous generation of many RNA-seq libraries in a single reaction. Nat. Methods Advance Online 12, 323–325. doi: 10.1038/nmeth.3313
Simpson, J. M., Santo Domingo, J. W., and Reasoner, D. J. (2002). Microbial source tracking: state of the science. Environ. Sci. Technol. 36, 5279–5288. doi: 10.1021/es026000b
Sinclair, L., Osman, O. A., Bertilsson, S., and Eiler, A. (2015). Microbial community composition and diversity via 16S rRNA gene amplicons: evaluating the Illumina platform. PLoS ONE 10:e0116955. doi: 10.1371/journal.pone.0116955
Smith, R. J., Jeffries, T. C., Roudnew, B., Fitch, A. J., Seymour, J. R., Delpin, M. W., et al. (2012). Metagenomic comparison of microbial communities inhabiting confined and unconfined aquifer ecosystems. Environ. Microbiol. 14, 240–253. doi: 10.1111/j.1462-2920.2011.02614.x
Sommer, M. O. A., Dantas, G., and Church, G. M. (2009). Functional characterization of the antibiotic resistance reservoir in the human microflora. Science 325, 1128–1131. doi: 10.1126/science.1176950
Stark, M., Berger, S. A., Stamatakis, A., and von Mering, C. (2010). MLTreeMap–accurate maximum likelihood placement of environmental DNA sequences into taxonomic and functional reference phylogenies. BMC Genomics 11:461. doi: 10.1186/1471-2164-11-461
Steffen, M. M., Belisle, B. S., Watson, S. B., Boyer, G. L., Bourbonniere, R. A., and Wilhelm, S. W. (2015). Metatranscriptomic evidence for co-occurring top-down and bottom-up controls on toxic cyanobacterial communities. Appl. Environ. Microbiol. 81, 3268–3276. doi: 10.1128/AEM.04101-14
Steffen, M. M., Dearth, S. P., Dill, B. D., Li, Z., Larsen, K. M., Campagna, S. R., et al. (2014). Nutrients drive transcriptional changes that maintain metabolic homeostasis but alter genome architecture in Microcystis. ISME J. 8, 2080–2092. doi: 10.1038/ismej.2014.78
Steffen, M. M., Li, Z., Effler, T. C., Hauser, L. J., Boyer, G. L., and Wilhelm, S. W. (2012). Comparative metagenomics of toxic freshwater cyanobacteria bloom communities on two continents. PLoS ONE 7:e44002. doi: 10.1371/journal.pone.0044002
Steven, B., McCann, S., and Ward, N. L. (2012). Pyrosequencing of plastid 23S rRNA genes reveals diverse and dynamic cyanobacterial and algal populations in two eutrophic lakes. FEMS Microbiol. Ecol. 82, 607–615. doi: 10.1111/j.1574-6941.2012.01429.x
Sunagawa, S., Mende, D. R., Zeller, G., Izquierdo-Carrasco, F., Berger, S. A., Kultima, J. R., et al. (2013). Metagenomic species profiling using universal phylogenetic marker genes. Nat. Methods 10, 1196–1199. doi: 10.1038/nmeth.2693
Tan, B., Jane Fowler, S., Laban, N. A., Dong, X., Sensen, C. W., Foght, J., et al. (2015). Comparative analysis of metagenomes from three methanogenic hydrocarbon degrading enrichment cultures with 41 environmental samples. ISME J. 9, 2028–2045. doi: 10.1038/ismej.2015.22
Tanabe, Y., Kaya, K., and Watanabe, M. M. (2004). Evidence for recombination in the microcystin synthetase (mcy) genes of toxic cyanobacteria Microcystis spp. J. Mol. Evol. 58, 633–641. doi: 10.1007/s00239-004-2583-1
Torres-Cortés, G., Millán, V., Ramírez-Saad, H. C., Nisa-Martínez, R., Toro, N., and Martínez-Abarca, F. (2011). Characterization of novel antibiotic resistance genes identified by functional metagenomics on soil samples. Environ. Microbiol. 13, 1101–1114. doi: 10.1111/j.1462-2920.2010.02422.x
Ulloa, O., Canfield, D. E., DeLong, E. F., Letelier, R. M., and Stewart, F. J. (2012). Microbial oceanography of anoxic oxygen minimum zones. Proc. Natl. Acad. Sci. U.S.A. 109, 15996–16003. doi: 10.1073/pnas.1205009109
Unno, T., Jang, J., Han, D., Kim, J. H., Sadowsky, M. J., Kim, O.-S., et al. (2010). Use of barcoded pyrosequencing and shared OTUs to determine sources of fecal bacteria in watersheds Environ. Sci. Technol. 44, 7777–7782. doi: 10.1021/es101500z
USEPA (2013). Method 1609: Enterococci in Water by TaqMan®Quantitative Polymerase Chain Reaction (qPCR) with Internal Amplification Control (IAC) Assay. Washington, DC: US Environmental Protection Agency (EPA-820-R-13-005).
USEPA (2009). Review of Published Studies to Characterize Relative Risks from Different Sources of Fecal Contamination in Recreational Waters. Available at: http://www.epa.gov/waterscience/criteria/recreation/pdf/fecalcontamrecreationalwaters.pdf
Vandewalle, J. L., Goetz, G. W., Huse, S. M., Morrison, H. G., Sogin, M. L., Hoffmann, R. G., et al. (2012). Acinetobacter, Aeromonas and Trichococcus populations dominate the microbial community within urban sewer infrastructure. Environ. Microbiol. 14, 2538–2552. doi: 10.1111/j.1462-2920.2012.02757.x
van Dijk, E. L., Auger, H., Jaszczyszyn, Y., and Thermes, C. (2014a). Ten years of next-generation sequencing technology. Trends Genet. 30, 418–426. doi: 10.1016/j.tig.2014.07.001
van Dijk, E. L., Jaszczyszyn, Y., and Thermes, C. (2014b). Library preparation methods for next-generation sequencing: tone down the bias. Exp. Cell Res. 322, 12–20. doi: 10.1016/j.yexcr.2014.01.008
Varela, A. R., and Manaia, C. M. (2013). Human health implications of clinically relevant bacteria in wastewater habitats. Environ. Sci. Pollut. Res. Int. 20, 3550–3569. doi: 10.1007/s11356-013-1594-0
Venkatesan, B. M., and Bashir, R. (2011). Nanopore sensors for nucleic acid analysis. Nat. Nanotechnol. 6, 615–624. doi: 10.1038/nnano.2011.129
Waller, A. S., Hug, L. A., Mo, K., Radford, D. R., Maxwell, K. L., and Edwards, E. A. (2012). Transcriptional analysis of a Dehalococcoides-containing microbial consortium reveals prophage activation. Appl. Environ. Microbiol. 78, 1178–1186. doi: 10.1128/AEM.06416-11
Wang, H., Du, P., Li, J., Zhang, Y., Zhang, W., Han, N., et al. (2013). Comparative analysis of microbiome between accurately identified 16S rDNA and quantified bacteria in simulated samples. J. Med. Microbiol. 63, 433–440. doi: 10.1099/jmm.0.060616-0
Xia, H., Li, T., Deng, F., and Hu, Z. (2013). Freshwater cyanophages. Virol. Sin. 28, 253–259. doi: 10.1007/s12250-013-3370-1
Xing, P., Guo, L., Tian, W., and Wu, Q. L. (2011). Novel Clostridium populations involved in the anaerobic degradation of Microcystis blooms. ISME J. 5, 792–800. doi: 10.1038/ismej.2010.176
Yampara-Iquise, H., Zheng, G., Jones, J. E., and Carson, C. A. (2008). Use of a Bacteroides thetaiotaomicron-specific alpha-1-6, mannanase quantitative PCR to detect human faecal pollution in water. J. Appl. Microbiol. 105, 1686–1693. doi: 10.1111/j.1365-2672.2008.03895.x
Yang, C., Lin, F., Li, Q., Li, T., and Zhao, J. (2015). Comparative genomics reveals diversified CRISPR-Cas systems of globally distributed Microcystis aeruginosa, a freshwater bloom-forming cyanobacterium. Front. Microbiol. 6:394. doi: 10.3389/fmicb.2015.00394
Ye, L., and Zhang, T. (2011). Pathogenic bacteria in sewage treatment plants as revealed by 454 pyrosequencing. Environ. Sci. Technol. 45, 7173–7179. doi: 10.1021/es201045e
Ye, L., and Zhang, T. (2013). Bacterial communities in different sections of a municipal wastewater treatment plant revealed by 16S rDNA 454 pyrosequencing. Appl. Microbiol. Biotechnol. 97, 2681–2690. doi: 10.1007/s00253-012-4082-4
Yergeau, E., Lawrence, J. R., Sanschagrin, S., Waiser, M. J., Korber, D. R., and Greer, C. W. (2012). Next-generation sequencing of microbial communities in the Athabasca River and its tributaries in relation to oil sands mining activities. Appl. Environ. Microbiol. 78, 7626–7637. doi: 10.1128/AEM.02036-12
Yoshida, M., Yoshida, T., Kashima, A., Takashima, Y., Hosoda, N., Nagasaki, K., et al. (2008). Ecological dynamics of the toxic bloom-forming cyanobacterium Microcystis aeruginosa and its cyanophages in freshwater. Appl. Environ. Microbiol. 74, 3269–3273. doi: 10.1128/AEM.02240-7
Yurkov, V. V., and Beatty, J. T. (1998). Aerobic anoxygenic phototrophic bacteria. Microbiol. Mol. Biol. Rev. 62, 695–724.
Zmirou, D., Pena, L., Ledrans, M., and Letertre, A. (2003). Risks associated with the microbiological quality of bodies of fresh and marine water used for recreational purposes: summary estimates based on published epidemiological studies. Arch. Environ. Health 58, 703–711. doi: 10.3200/AEOH.58.11.703-711
Keywords: next-generation sequencing, water quality, fecal indicator, antibiotic resistance, harmful algal bloom, sewage, biodegradation
Citation: Tan B, Ng C, Nshimyimana JP, Loh LL, Gin KY-H and Thompson JR (2015) Next-generation sequencing (NGS) for assessment of microbial water quality: current progress, challenges, and future opportunities. Front. Microbiol. 6:1027. doi: 10.3389/fmicb.2015.01027
Received: 13 July 2015; Accepted: 10 September 2015;
Published: 25 September 2015.
Edited by:
Mark Vincent Brown, University of New South Wales, AustraliaReviewed by:
Lisa Moore, University of Southern Maine, USARebecca Gast, Woods Hole Oceanographic Institution, USA
Copyright © 2015 Tan, Ng, Nshimyimana, Loh, Gin and Thompson. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Janelle R. Thompson, Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Building 48, Cambridge, MA 02139, USA, jthompson@mit.edu