- 1Department of Biology, University of Texas at Arlington, Arlington, TX, United States
- 2Department of Biological and Environmental Sciences, Georgia College and State University, Milledgeville, GA, United States
- 3Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO, United States
- 4Department of Environmental and Occupational Health, Colorado School of Public Health, University of Colorado, Anschutz, Aurora, CO, United States
Schistosomiasis is a neglected tropical disease caused by multiple parasitic Schistosoma species, and which impacts over 200 million people globally, mainly in low- and middle-income countries. Genomic surveillance to detect evidence for natural selection in schistosome populations represents an emerging and promising approach to identify and interpret schistosome responses to ongoing control efforts or other environmental factors. Here we review how genomic variation is used to detect selection, how these approaches have been applied to schistosomes, and how future studies to detect selection may be improved. We discuss the theory of genomic analyses to detect selection, identify experimental designs for such analyses, and review studies that have applied these approaches to schistosomes. We then consider the biological characteristics of schistosomes that are expected to respond to selection, particularly those that may be impacted by control programs. Examples include drug resistance, host specificity, and life history traits, and we review our current understanding of specific genes that underlie them in schistosomes. We also discuss how inherent features of schistosome reproduction and demography pose substantial challenges for effective identification of these traits and their genomic bases. We conclude by discussing how genomic surveillance for selection should be designed to improve understanding of schistosome biology, and how the parasite changes in response to selection.
Introduction
Schistosomiasis is a neglected tropical disease caused by schistosome parasites that affects over 200 million people annually, with most infections impacting the health and socio-economic development of populations in Africa, South America, and Asia (1, 2). Schistosomes have two distinct reproductive phases, the first of which includes the pairing of adult male and female worms in definitive mammalian hosts to produce eggs through sexual reproduction, and a second asexual phase which occurs once eggs hatch into miracidia and infect intermediate snail hosts. In the snail host, miracidia become reproducing sporocysts that are eventually emitted as free-swimming cercaria that infect the mammalian host (3). Three Schistosoma species are primarily responsible for human infection, and depending on the species, will infect either the urogenital (S. haematobium) or intestinal (S. mansoni, S. japonicum) tract. Symptoms range from fibrosis of the liver and bladder to anemia and cancer, all with an underlying pathology stemming from immunological responses to egg deposition from sexually mature worms (1, 4). The World Health Organization (WHO) has adopted resolutions calling for regional elimination of schistosomiasis through a multi-pronged approach that includes mass drug administration (MDA) using the anthelminthic drug praziquantel (PZQ), reduction of snail intermediate host populations, and improvements to hydrological and sanitation infrastructure (5).
In this context, there are growing concerns that control and elimination practices are creating selective pressures on schistosome populations that may alter schistosome biology, sometimes in ways that could negatively impact current and future control success (6). Moreover, there is substantial evidence that schistosomes respond opportunistically to large environmental change such as dam construction (7, 8). Given these concerns, monitoring how schistosomes respond to diverse environmental contexts represents a valuable and increasingly feasible priority as a component of control strategies (9–11). For example, to surveil, and ideally prevent, the development of drug resistant populations, it is necessary to identify and understand selection pressures and parasite responses when drugs are applied. Other diverse selection-driven responses may include shifts in definitive host preferences, shifts in reproductive behavior, or any other shifts in life history that lead to greater fitness (Figure 1).
Figure 1. Overview of the implementation of genomic scans of selection. Population sampling is carried out under different scenarios of drug administration (or other control regimes), changes in host affinity, environmental conditions, or simply across a temporal series of population sampling. Genome scans identify targets of selection and inform further study of the genomic basis of relevant traits. Targeted surveillance will then help to assess the presence of alleles and their impact on these traits, and be used to monitor for selection in parasite populations. In genome scan figures (lower graphs) we highlight a hypothetical genomic scan, with red lines indicating loci with extreme values (using FST here as an example); dashed lines from particular regions of the genome represent zoomed-in areas of potential interest that could contain genes with known links to epidemiologically-relevant traits (with PZQ resistance and host preference shown as examples). We note that our hypothetical examples of genomic scans use FST, but such scans could be conducted with a diversity of statistics, which we discuss in the text.
In this review we discuss strategies to integrate broad (genome-wide) and targeted (locus-specific) approaches to genomic surveillance for selection across natural populations of Schistosoma species. We also update earlier reviews (9), which have discussed the roles of genetic studies for understanding the impacts of control measures on schistosomes, to incorporate modern approaches that leverage whole genome sequencing (WGS) coupled with targeted schistosome sample collection to understand and interpret genetic variation of pathogen populations to guide control programs (12–15). In a recent review, we discussed the ways in which genomic data can be integrated in epidemiological and ecological studies to clarify schistosomiasis transmission pathways (15). In this review, we discuss the theory underlying population genomic scans, review how recent studies have leveraged population-level sampling of genomic variation to conduct genome-wide scans to identify signatures of selection and genes underlying key traits in schistosomes and discuss how future studies can be designed to improve their power and effectiveness. We also review current progress linking epidemiologically relevant traits in schistosomes (e.g., drug resistance, host preference, reproductive output) to their underlying genetic basis, and discuss how future studies and infrastructure may enhance the potential impacts of genomic surveillance for evidence of selection acting on such traits.
Importance of genomic surveillance for tracking selection in schistosome populations
The world's recent experience with SARS-CoV-2 reinforces the fundamental reality that all natural systems evolve. In the context of infectious diseases and parasites, evolution must be accounted for, monitored, and ideally integrated into control strategies. Understanding how neutral processes of genomic divergence (aka. genetic drift) and natural selection together shape the genomic diversity of pathogen populations is necessary to evaluate how biological systems respond and adapt to their environment, which can ultimately lead to valuable and well-informed predictions to guide strategies for control. For example, control measures may lead to an increase in the frequency of genomic variants that impact drug tolerance, parasite reproductive output, or host specificity. Genomic surveillance to monitor evidence of selection on loci related to these traits may therefore provide early warnings that would enable strategic re-design of control efforts. Furthermore, comparisons of genomic variation between populations, lineages, or temporal population samples can be used to identify regions of the genome that underlie selected traits. Here we argue that longitudinal sampling of natural schistosome populations is a high priority to establish critical references for continued surveillance, with the added benefits that they also may be used to detect evidence of natural selection, population connectivity, and to monitor variation at specific genomic loci associated with traits of epidemiological relevance (e.g., host preference or drug resistance).
Genomic resources and new approaches enable Schistosoma surveillance
All species of Schistosoma are diploid and contain 7 autosomes as well as ZW sex chromosomes. Their genomes range from ~390–410 Mbp and tend to be comprised of ~50% readily identifiable repetitive elements (16, 17). Reference genomes for Schistosoma species that are responsible for most human schistosomiasis cases [(17–20); S. mansoni, S. haematobium, and S. japonicum] now have high-quality chromosome-level reference assemblies (17, 21, 22). These genomes share relatively high degrees of synteny and conserved chromosome numbers (21), and are assembled in relatively large contiguous scaffolds, with scaffold N50 values (defined as the size of the shortest scaffold included in the set of long scaffolds that comprise 50% of the genome assembly) ranging from 4.8 to 48 Mbp. Despite the overall high quality of these genome assemblies, the quality and contiguity of the sex chromosomes in these assemblies are lower, and most references have only included the Z and not the W chromosome. In ZW sex chromosome systems, such as schistosomes, males are the homogametic sex (ZZ) and females are the heterogametic sex (ZW). Most reference genomes are generated from the homogametic sex because this provides equal representations of autosomes and sex chromosomes, and because of the inherent complications of assembling sex chromosomes (e.g., high repeat element content, high heterochromatic content, and relative copy number relative to autosomes) (23–25).
The availability of these reference genomes enables the accurate mapping and interpretation of whole genome sequencing (WGS) data, and thus provides the means to understand how processes such as natural selection, genetic drift, hybridization, and recombination shape genetic diversity in specific schistosome populations. WGS from individual miracidia and cercaria, the readily available schistosome life stages, is challenging due to the low quantity of DNA available from these physically small samples [(15, 26); see also discussion in Rey et al. (6)]. Recent advances that use whole genome amplification (WGA) ameliorate this challenge, and thus substantially increase the feasibility of genomic surveillance by enabling WGS of individual parasites (12, 27, 28). Low-input DNA genomic library preparation approaches that accomplish amplification through PCR have also shown promising results using schistosome eggs or larval stage samples (29).
A number of studies that have used WGS to sequence DNA from miracidia seem to have little host contamination (12, 27, 28), suggesting minimal off-target sequencing of host DNA using filtration and washing protocols to separate miracidia from host stool, as is used for the miracidial hatching test in China (30). However, other studies (13) have reported higher rates of contamination (based on higher fractions of genome sequencing reads that do not map to schistosome reference genomes), suggesting contamination may be more of an issue in some cases. These advances in techniques coupled with decreasing costs of sample preparation and sequencing. For example, at time of publication WGS library prep and Illumina sequencing for a single sample at 20× coverage is ~$300, based on our experience in the context of high parallelization. These recently reduced costs now make it feasible to generate WGS data for many individual samples and to survey patterns within and across populations. Thus, reference genomes and WGS approaches for resequencing genomes from individuals have laid the foundation for a new generation of genomic surveillance programs capable of resolving genetic variation at extremely high precision, and as we discuss below are sufficient to evaluate evidence for selection.
The theory of genome scans for selection and the delineation of underlying genes
One of the most powerful approaches for genomic surveillance of infectious organisms is to evaluate population genetic variation and “scan” how it changes along the genome. Such genomic scans can dissect evidence that different evolutionary processes have shaped variation differently among loci (31–34), which can track the spread of specific genetic variants (35), monitor genomic changes within and between populations (36), and potentially predict future outbreaks and control efficacy. For a region of the genome, variation can be quantified using population genetic statistics such as the Fixation Index [FST; (29)], nucleotide diversity [π; (30)], and Tajima's D (37). Haplotype-based population statistics were used in scans of schistosome genomic variation (22), including the integrated haplotype score (iHS), which measures extended haplotype homozygosity (EHH) surrounding a given SNP, and compares inferred ancestral and derived alleles (38, 39). Additionally, other haplotype methods such as Identity-By-Descent (IBD) approaches, have been used to infer shared genomic segments between individuals in pathogen populations inherited from a recent common-ancestor, and can be used to show multiple levels of relationships depending on shared segment lengths (40). Genomic scans of schistosome populations (22) have also used cross-population composite likelihood ratio tests [XP-CLR; (31)], which quantify multilocus allele frequency differentiation between populations and account for genetic drift, to test for evidence of selective sweeps. Typically, a battery of such measures are applied [e.g., FST, EHH, XP-CLR; (22, 41, 42)], and the background distribution computed across all genomic regions provides an expectation that can be used to highlight outlier loci with extreme characteristics. For example, abnormally high FST differentiaton between regions that differ significantly from the rest of the genome might be of note, as they may represent regions that have experienced strong selection leading to divergence between populations at a particular locus. Genomic regions that harbor outliers may be strong causal candidates for affecting a trait of interest; for example, gene regions affecting drug resistance may be discovered when comparing two or more populations subject to differential drug treatment.
The logic of genome scans is relatively straightforward: population genetics theory predicts that different evolutionary processes will leave particular signatures of variation within and between genomes, and so distinct processes can be inferred if the appropriate signature is observed. Most of the genome is assumed to evolve under random genetic drift, such that the genomic background (e.g., distribution of FST among genomic regions) is primarily shaped by demographic processes. A central controlling demographic parameter that impacts patterns of genomic diversity and differentiation is the effective population size (Ne), which summarizes the long-term population size effect even when the number of reproducing individuals within a population varies over time. For example, population size bottlenecks (brief periods of greatly reduced population size; Figure 2) tend to reduce genomic diversity, leading to lower average π across all regions compared to a population that maintained large population sizes. Populations that evolve separately will also diverge over time, so that recently separated populations have low differentiation (i.e., lower average FST when compared to more well-defined lineages that diverged further in the past (which will have comparatively higher average FST). In the case of schistosomes, large-scale control efforts that successfully eradicate large numbers of reproductive adults may have such brief bottlenecks, or ideally even dramatic long-term reduction in local Ne. Bottlenecks may also be caused by founder effects that occur when a few individuals expand and colonize new environments or hosts. Indeed, previous empirical estimates of Ne across multiple S. mansoni and S. japonicum populations have shown evidence of multiple population bottlenecks within the last 100,000 years. Some inferred bottlenecks appear to be correlated with the timing of human migration into those regions following glacial cycles, and other more recent bottlenecks are potentially linked to modern control efforts (14, 22). These studies have also highlighted differences in Ne among populations within species of schistosomes. For example, S. mansoni populations in the Lake Victoria and Lake Albert regions of Africa appear to have experienced recent increases in Ne, whereas populations in both West Africa and in regions of the New World show evidence of continuous declines in Ne (14).
Figure 2. The power and pitfalls of genome scans. Population size affects genetic drift, creating noise in the form of randomly divergent regions of high FST that disguise true signatures of selection. Simulated distributions of FST are shown (with FST at individual loci represented as dots) under two neutral population scenarios indicated by the ellipses on top: (A) a moderately large population that split into two similarly sized populations; and (B) a similarly sized ancestral population that split into two much smaller populations. In both cases, FST values for fixed length genomic regions are colored as indicated by the legend to aid in visual interpretation, and frequency histograms of neutral FST values are summarized to the right. The red circles are examples of FST values that might be expected for selected loci and are placed with the simulation results to indicate that they would be easy to distinguish in scenario (A) but difficult to distinguish in scenario (B) due to the large number of high FST variants generated under genetic drift alone—indicating high false positive rates regardless of what high FST threshold might be chosen. In these simulations, Ne is scaled by the mutation rate μ, with θ = 4Neμ values of 0.1 for the larger populations and θ = 4Neμ values of 0.005 for the smaller populations. These hypothetical examples demonstrate the importance of integrating multiple statistics and approaches to discern evidence of selection to confirm or refute evidence for selection at particular loci, while also illustrating the pitfalls of using single statistical criteria (like high FST alone) to make such inferences.
Strong natural selection is also predicted to leave striking signatures along the genome, often highly localized perturbations that defy the expectations of drift [(43); Figure 2]. Positive directional selection associated with adaptive traits, also known as a selective sweep, reduces genetic variation in regions of the genome near selected loci (43). This occurs when new mutations or low-frequency alleles are favored, driving them to high frequency and eliminating variation at other nearby regions of the genome that are genetically linked. These allelic sweeps therefore result in regions of the genome with low diversity that are also more differentiated from related populations than other regions that did not experience directional selection. This is detected by observing exceptionally low π and high FST compared to the genome-wide background.
Many aspects of schistosome treatment and control may drive selective pressure on the parasites to adapt or die. For the drug treatment example, genetic variants that enable the organisms to resist the drug are more likely to be passed on to the next generation, and over time this will produce the classic signature of a selective sweep. Although the mechanisms may be more complex, life history traits such as timing of cercaria production, or life span of adult worms, for example, may also favor various alleles to sweep through a population. However, our power to detect a selective sweep will depend on the strength of selection that is driving the sweep. Cross-over events during meiosis (recombination events) tend to unlink nearby genomic regions from the selected allele, and there will be more opportunity for recombination events if the sweep takes longer when selection is weaker (or if selection happened longer in the past). Thus, the ability to recognize the signatures of selective sweeps is reduced as time accrues since they occurred, and is also decreased if selection is weaker (44–46).
Factors that limit power of genome scans
For genomic surveillance of selection on infectious organisms, most genomic scan studies are designed to compare two populations that differ in some key treatment (e.g., presence or absence of MDA) or trait of interest (e.g., host preference). The approach is powerful, but it should be appreciated that some differences among loci may be due to drift alone. Drift is a noisy (stochastic) process, and the genetic linkage that helps identify a selected region also leads to regional levels of linked differentiation due to drift, which makes detection of selection challenging (Figure 2). Such random variation in genetic differentiation among loci will have little to do with the condition or trait of interest. In this context, it is worth appreciating that genome scans can easily be performed on populations without an explicit trait of interest in mind. In that case, they will identify regions of the genome that most strongly differentiate these populations, in many cases regardless of whether selection, drift, or some other process is responsible.
Another important caveat when performing genomic scans is that some divergence between the populations examined may be caused by variation in traits that are not of direct interest. For example, if two populations with different drug treatments are also from distinct environmental conditions, such as different altitudes or aridity, loci may have been selected in response to those environmental differences and not the drug treatment of interest. A basic comparative genomic scan is most powerful for two populations in highly similar environments that are as closely related as possible so that they have diverged as little as possible, thus having few biological differences other than the trait or treatment of interest. This reduces background noise and allows any selective effect due to the trait of interest to definitively stand out.
Genome scans also assume that the trait of interest has a genetic basis, that alleles underlying variation in this trait were under relatively recent and relatively strong selection, and that a reasonable amount of genetic variation had built up in the population prior to selection. If there is little standing variation at alleles that affect the trait, there is little allelic diversity to be reduced by selection and a low ability to differentiate the effects of selection between the two populations. The effect of drift interacts with low levels of standing variation because drift effects are larger in small populations, and population bottlenecks due to control efforts will lower Ne, increase drift, and fix unselected alleles between populations.
It is also worth appreciating that selection can take multiple forms that affect diversity patterns (e.g., balancing selection, diversifying selection, negative selection on nearly neutral variants, multiallelic selection), and the interaction of these different forms of selection with drift can complicate or confound interpretation of results. This also means that some forms of selection (e.g., balancing selection) will not be identified by approaches designed to detect other forms of selection, such as scans for selective sweeps (e.g., XP-CLR). Balancing selection may result from highly variable (temporally or spatially) control efforts, and would lead to elevated π, potentially elevated heterozygosity, yet low FST between populations; this result is nearly opposite that expected for a selective sweep. Diversifying selection, usually characterized by repeated selection for new alleles at the same locus, might be expected for parasite proteins that are antigenic and selected to repeatedly evade host immunity. Diversifying selection is expected to result in variable (but often low) FST and thus may not be detected in scans designed to detect selective sweeps. Polyallelic selection, in which multiple equivalent but divergent alleles are selected at the same time, can also complicate interpretation (47). Due to these challenges, scans of selection should layer multiple methods, inferences, and summary statistics to test whether specific patterns and processes of selection have occurred.
One way to obtain expectations of the effects of selection and drift in various scenarios is to use simulation-based inferences (48–54). Simulations can also provide calibrated expectations about false positive rates and appropriate statistical thresholds for identifying outlier loci in genomic scans. The efficacy of selection is positively correlated with population size (i.e., adaptation is more efficient in larger populations), and simulations can identify genome-wide background expectations under drift. For example, in a small population we expect many loci to have maximal FST value (Figure 2B), even in the absence of selection. Simulations can also be beneficial to understand how rapidly adaptation may have occurred in response to control measures (55) and to predict the conditions under which drug resistance may be expected to lead to parasite elimination or to lead to the evolution of drug resistance (56).
Example workflow for population genomic analyses of selection
In Figure 3 we provide an example workflow for inferring evidence of selection by analyzing population genomic data with existing programs and strategies that can be used in various steps. Standard data analysis of high throughput sequencing data typically starts with processing of raw genome resequencing reads to remove low-quality reads and trim low-quality portions of reads using programs such as Trimmomatic (57), FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/), fastp (58), and Scythe (59). Processed reads are then mapped to a reference genome using the mapping algorithm bwa, BowTie2 or SOAP (60–62). Sequence variants relative to the reference genome are then called using various approaches such as BCFtools, FreeBayes, or GATK (63–65). Among these, GATK is considered an industry standard because it includes well-established “best-practice” guidelines for multiple variant types (e.g., SNPs and structural variants), which helps maintain transparency and consistency across analyses and studies. It is notable, however, that GATK “best practices” for inferring genomic variants have been designed to interpret human genomic variation, and that it remains an important priority for future work to consider if other modified best-practice approaches may be more appropriate for schistosomes and other parasites. For example, it is possible that expectations related to high levels of inbreeding likely in many schistosome populations may warrant revision of best-practice approaches for minimums of sequencing depth or other parameters associated with inferring genomic variants.
Figure 3. Example work-flow for analysis of whole genome resequencing data for analysis of selection. The example work flow begins with raw genome shotgun sequencing reads that are processed for quality (A), processed reads are then mapped to the reference genome of the respective species (B), and variants within individuals are called (C). Estimates of population structure can then be estimated (D). Within population genomic statistics (E) and between population genomic statistics (F) can then be estimated in the form of genome-wide scans of these parameters to develop integrated inferences for evidence of patterns of selection in schistosome genomes. All data shown is hypothetical and not necessarily representative of schistosome populations, and are shown only as a graphical representation for illustration purposes. Relevant programs for each step are listed to the right of each panel, and these are cited and discussed in the main text.
Once genomic variants are inferred, basic inferences of population structure can be estimated using clustering methods to infer population ancestry [e.g., STRUCTURE/ADMIXTURE; (66, 67)], and through discriminant analysis of principal components [e.g., DAPC; (68)], PCA, along with tree-based methods implemented in programs like RAxML (68) and IQTree (69). These initial assessments are important because they provide a first-pass approximation of population structure within the data, which is essential for downstream experimental design considerations, such as analyzing subpopulations separately or filtering highly related individuals to avoid potential stratification issues. Further, population clustering analyses (e.g., PCA, DAPC, STRUCTURE) can also provide insight into levels of connectivity and structure among parasites between hosts, which can provide insight into parasite transmission patterns relevant for informing control measures [see (15) for a detailed review of inferring transmission pathways using genomic data].
There is no single straightforward way to infer evidence of selection. Instead, genomic inferences of selection often require the application of multiple types of genomic scans and integration of these to effectively identify evidence for selection, and to differentiate various modes of selection. The choice of relevant cutoffs for parameters used in genomic scans is often difficult to be certain about; some researchers use confidence intervals, although other more appropriate approaches [e.g., GPPFst; (50)] can be used to predict the null distribution of these parameters under drift alone (without selection), which hold promise for more powerful inferences by calibrating expectations by explicitly considering the demographic history of populations. Because many methods for detecting selection are designed to identify directional selection, such approaches are poorly suited to identify evidence of other forms of selection, such as balancing selection. It is therefore useful to incorporate genomic scans for different modes of selection implemented in programs such as BaILerMix and Betascan (70, 71). Secondary analyses using haplotype-based measures provide additional opportunities to identify evidence for directional selection by detection of selective sweeps using programs like rehh (72), which uses methods like extended haplotype homozygosity (EHH). While these methods are important for understanding patterns within empirical data, comparing these measures against simulation-based inferences using programs like SLiM (52) or msprime (73) can also help more appropriately interpret statistics and their expectations under drift vs. selection, thereby improving the accuracy of inferences of selection.
Prior Schistosome genomic scans for signatures of selection
Multiple studies have examined genomic variation across Schistosoma species or populations to test for signatures of selection and thereby predict which loci underlie evidence for selection on divergent traits, or specific traits under study. Crellen et al. (14) used transcontinental sampling within S. mansoni to identify evidence of selection between populations by applying codon-based measures of selection in protein-coding regions (i.e., dN/dS) and other approaches that examine ratios of polymorphisms and fixed differences (e.g., Hudson-Kreitman-Aguade and McDonald-Kreitman tests). They identified suites of genes inferred to be under selection, and a subset of these loci were associated with functional categories of genes involved in host-parasite interactions and genes required for mammalian-host infection. Li et al. (74) PCR-amplified and sequenced multiple candidate genes across populations of S. japonicum from China, comparing S. japonicum from lake vs. mountainous regions, and used population genetic summary statistics (e.g., Tajima's D, π, Fu's, FST) to identify evidence of divergent selection on genes that encode tegument-associated antigens (74). These proteins are thought to be involved in intermediate snail host immunity, and evidence for selection acting on these genes was presumed to be linked to divergence between lineages of Oncomelania snail hosts in mountainous and lowland regions.
Other studies inferring patterns of selection used WGS data to compare genomic variation across either temporal or experimental treatments. Berger et al. (13) sampled populations of S. mansoni from school-aged children in Uganda to identify loci putatively under selection following drug administration by using a combination of haplotype measures (i.e., iHS and XP-EHH), relative and absolute population differentiation (DXY and FST), and π and accounted for the impacts of linkage disequilibrium and close-order relatedness among samples. They identified significant variation between populations across multiple genomic regions, with the highest occurring on chromosome two that contained loci coding for sodium/potassium/calcium ion exchanger proteins.
Some studies followed up genomic scans with secondary experiments involving gene expression analyses or knockdown/interference studies. Le Clec'h et al. (75), for example, identified a candidate locus that may underlie variation in drug response by performing genome-wide association (GWAS) analyses between laboratory-bred strains of S. mansoni that showed significant differences in survival between high and low doses of PZQ. They identified several variants (single nucleotide polymorphisms and structural variants) that were associated with PZQ response and functionally validated these by comparing gene expression differences between lab and natural populations. Additionally, they identified natural populations possessing these alleles that impact drug-response by analyzing previously published exome sequence data from S. mansoni populations in Africa, South America, and the Middle East. Luo et al. (22) generated a new chromosome-level reference genome for S. japonicum and used this to interpret additional WGS data to infer selection patterns in natural populations. They used multiple genome wide scans to identify regions with signatures of selective sweeps and focused on candidate loci that are involved in both intermediate and definitive host-affinity pathways. They functionally validated one candidate gene for definitive host preference (GATAD2A) using RNAi methods and demonstrated how decreased expression of this gene leads to abnormal development of reproductive organs in sexually mature worms and shifts in differential mammalian host infection outcomes. In a separate genome scan between S. japonicum populations they identified another locus (Lmln) with strong signatures of selection potentially related to intermediate host preference and showed that this gene is expressed in schistosome life stages important for intermediate host infection. Further supporting these inferences for the role of Lmln in S. japonicum, a separate study also linked Lmln to infection stability for intermediate snail hosts in S. mansoni (76).
These studies collectively establish precedence for leveraging genomic scans of selection to identify candidate loci for important traits in schistosomes. However, many of these early studies also suffered from reduced statistical power due to relatively limited sampling of populations, potential issues stemming from population stratification and the highly inbred nature of sampled populations, and to some extent from the existence of population structure between compared populations (44). Indeed, many of these analyses identified hundreds of potential candidate genes, which presumably include large fractions of spurious candidates, making it logistically difficult to investigate them all experimentally (38, 44).
Experimental designs for more powerful genome scans to identify selection
Because of the limitations of previous pioneering studies, it is relevant to consider how population genomic analyses can be designed to most powerfully leverage their ability to identify important patterns of selection linked to control efforts and use these inferences to further identify genes underlying relevant traits. To effectively account for drift when conducting genomic surveillance of schistosomes, we need accurate and reliable estimates of complex demographic histories that likely include changes in effective population size (i.e., bottlenecks, founder effects), substructure, migration, and other demographic factors, which is not always an easy task (77). Demographic estimates that are fit directly to the study populations (i.e., specific Ne estimates for each population) can be used to calibrate the null expectations of drift in a population-specific manner [e.g., (32, 78–81)]. Bayesian predictive methods (e.g., approximate Bayesian computation and posterior predictive methods) explicitly incorporate demographic estimates while accounting for uncertainty in these estimates to simulate genomic data under drift, which can be compared to the observed data (49, 81–83). Some methods, like the XP-CLR approach, are designed to inherently account for drift while identifying selective sweeps, and are also more powerful for identifying more ancient selective sweeps (48). Further advances in incorporating the impacts of drift through demographic modeling and simulation also hold great promise and may improve predictive approaches with more realistic models designed to incorporate the true demographic history of populations. Recently, a number of machine learning applications of artificial neural networks [i.e., artificial intelligence (AI)] have been developed to predict targets of selection with high accuracy and robustness, which may prove fruitful for genomic surveillance of schistosomes (84), including demonstrations for predicting selection in malaria parasites with genome scans (85). Indeed, AI approaches are rapidly gaining traction in a variety of population genetic applications (86), including prediction of selection (87), migration (86), recombination (88), demography (81), population origins (89), and other factors relevant to schistosome biology and control.
The biology of schistosomes requires particular attention when considering experimental design for genomic scan approaches to detect and interpret patterns of selection. Schistosomes are overall genetically diverse, but they can be far more closely related in small geographic areas such as in areas of re-emergence or hotspots (6, 12, 27, 90). This makes sense biologically because of the large number of eggs a single worm pair can produce, combined with the clonal nature and large numbers of cercaria produced from a single snail intermediate host. Even in an area in which schistosomes have been nearly eliminated, a single introduction event into snail habitat from an infected definitive host, for example, can lead to rapid expansion of highly-related schistosome infections in a geographically proximate population of definitive hosts. From a genetic standpoint, this can lead to inbreeding and population bottlenecks in such schistosome populations that make it more difficult to identify genomic regions that are the principal targets of selection from those regions not under selection (e.g., Figure 2). Elevated levels of genetic relatedness within populations (6, 12, 27, 90) defy default assumptions of many population genetic inference approaches and substantially reduce the power of scans for selection. To overcome this and increase the power and accuracy of future genomic scan studies, experimental designs for collection and sequencing of samples for studying selection should strategically incorporate larger sample sizes that minimize inclusion of closely related individuals, yet also minimizes the divergence and population structure between populations compared. Alternatively, with larger sample sizes, genetic structure can be identified and accounted for by conducting independent analyses within sub-populations. Because relationships among miracidia are not well-known ahead of time, initial sampling should be large enough that highly related samples can be removed for some analyses, and still be sufficient to accurately estimate population allele frequencies. One example of a strategy that would minimize these pitfalls is to obtain a time series of population genomic sampling (still with a large sample of individual parasites and excluding close relatives). Because inferences of selection fundamentally depend upon accurate estimation of population allele frequencies, greater sampling of unrelated individuals, compared to prior studies, is a central goal for future more powerful and accurate inferences of selection.
Review of candidate genes for targeted genomic surveillance in schistosomes
Prospects for targeted surveillance using genome scans for selection
Through genomic surveillance of genetic variation at candidate loci known to underlie key epidemiologically-relevant traits (e.g., drug susceptibility, host specificity, parasite reproductive output), researchers can track the geographic distribution of specific genomic variants at loci with putatively known functional ramifications, providing them with insight to guide control strategies. This approach, for example, has been effective to understand the population dynamics and spread of drug-resistance alleles in Plasmodium spp. and insecticide-resistant genotypes in the Anopheles spp. vectors across Central Africa (91–93). These approaches largely depend upon knowledge of the functional and phenotypic ramifications of genes underlying key life history traits, where variation among loci may contribute to potential variation in drug response or other factors that impact parasite fitness (91, 94–96). Recently, however, researchers have begun to identify candidate loci in schistosomes associated with traits of interest that would enable more direct interpretations of genetic variation from schistosome populations via genomic surveillance (14, 97). Currently, there are three major categories of candidate genes that underlie critical traits relevant to the control of human schistosomiasis: (i) drug-resistant loci, (ii) host-shift related loci, and (iii) loci related to parasite life history traits. Here we discuss these candidate loci and the potential to develop additional links between functionally relevant genes and schistosome biology that would identify more candidate loci. While these candidate loci provide some ability to interpret patterns of selection to parasite traits they may be associated with, we argue that future studies of selection may substantially expand and refine this set, and even the identification of selection without a priori understanding of the function of loci it effects represents important information for understanding impacts of control efforts.
Genes linked to drug resistance in schistosomes
Most cases of schistosomiasis are treated with praziquantel (PZQ), with oxamaniquine (OXA) being used historically in limited populations (98). While evidence for the reduced efficacy of PZQ has been documented in natural populations (99–105); see also Cioli et al. (106) for a review on this, the potential for PZQ-resistance to emerge has been demonstrated multiple times in laboratory settings (75, 106). These laboratory-based studies found evidence that schistosomes can evolve variation in response to drug treatment after only a few generations of strong selection, but tend to reduce the fitness of the parasites in other ways (75, 106). This indicates that such traits may evolve rapidly in nature if selection is strong and consistent enough, or the organisms find compensatory mechanisms around the detrimental effects of initial drug resistance strategies. A previous study also provided new evidence for a specific genetic locus that may underlie variation in drug response in S. mansoni (75). Also, the diversity of distinct mechanisms (and distinct loci) that may contribute to drug response or resistance remains an open question (107, 108).
Indeed, drug resistance or tolerance alleles may evolve through novel allelic variants (i.e., de novo mutations), or may already be present as standing variation in low frequencies in natural populations. Because existing variants could expand rapidly under strong selection from control efforts, monitoring populations for variation in loci linked to drug resistance is particularly important. Previous research identified the underlying locus and variant linked to OXA resistance in S. mansoni, a loss of function mutation within a sulfotransferase (107). Additional studies showed that that OXA-resistant alleles are widespread in natural S. mansoni populations, and appear to have existed prior to the use of this drug (97); these findings also refuted prior inferences that resistance had evolved in the 1970's, which subsequently led to the discontinuation of OXA as a first line treatment (107). Other laboratory studies showed substantial differences in the effectiveness of OXA across Schistosoma species, being most effective for S. mansoni but less effective for species such as S. haematobium and S. japonicum (108).
In contrast to progress understanding mechanisms of OXA resistance, the precise molecular targets and mechanisms of action of PZQ are not entirely clear (107), although a suite of candidate loci and molecular mechanisms associated with PZQ activity and resistance have been proposed (75, 105, 109, 110). Candidate genes related to PZQ susceptibility or resistance that have been proposed fall into several categories: (i) genes associated broadly with drug resistance [e.g., cytochrome p450s, ABC multidrug transporters (109, 111–114)], (ii) genes plausibly linked to PZQ-specific effects on schistosomes, including genes affecting calcium transport and muscle function [e.g., CamKII, calcium channel subunits, p2X receptors, regulatory myosin light chain; (111, 112, 115–117)], and adenosine uptake [e.g., adenosine receptor ador1, SLC29 nucleoside transporters (116, 118)], (iii) genes related to reproduction (e.g., p14 eggshell protein gene, CPEB1), and (iv) genes related to growth and development [e.g., TGFβ, EGFR pathways; (119, 120)]. A set of previous studies showed compelling evidence that PZQ resistance in S. mansoni could be mediated by the gene TPRMPZQ, which encodes an ion channel that binds PZQ (75, 120). Variation in this gene and an adjacent genomic region correlates with PZQ resistance in lab strains, and PZQ resistance may be related to TPRMPZQ gene expression (75).
Expanded monitoring of genomic variation at these candidate loci, and further identification of genes selected following intense control efforts, are key priorities to improve our understanding of the mode of action and specific targets of PZQ. Genotypes that persist or increase in frequency following drug treatments may provide early warnings for drug resistance and improve our understanding of the relevant loci and molecular mechanisms.
Genes associated with host specificity and host shifts
Schistosomes likely adapt to different host environments, and selection for genomic variation related to host-specificity is a critical target for genomic surveillance for selection. S. japonicum can infect multiple mammalian hosts (121) and the relative importance of different hosts varies by regional context (122, 123). Recent evidence also suggests that hybridization and introgression of different schistosome species [recently reviewed in Rey et al. (6)] may act as a source for the introduction of allelic variants that impact host specificity (124). For example, previous studies have documented the occurrence of hybridization between ruminant- and human-infecting species of schistosomes (S. haematobium/curassoni and S. haematobium/S. bovis) infecting children in Senegal, which may have the potential to facilitate shifts in host affinities (125). These studies collectively highlight the importance of genomic surveillance that may, for example, be capable of detecting evidence of selection on genomic variants, derived from hybridization events that introduce new alleles, which may enable rapid shifts in host preference.
A recent study used population and functional genomic approaches to identify candidate loci involved in both intermediate and definitive host-affinity, and found strong signals of selection across endemic populations of S. japonicum (22). One candidate gene, GATAD2A, was shown to be integral in the development of reproductive organs in sexually mature schistosomes, indicating that variation in this locus may dictate definitive host pathogenicity. Additionally, an outlier locus Lmln was shown to be significantly upregulated in the miracidia, the life stage that infects the intermediate snail host (22). This study also identified many other candidate genes potentially involved in host preference, suggesting that these traits may be highly polygenic, and that the potential is great for future studies to further refine candidate genes associated with host preference. Beyond studies of host preference, understanding the relationships between human-host outcomes and parasite genotypes may help identify parasite genotypes that correlate with host pathologies or symptom severity. Such studies, however, would be complicated by multiple infections of schistosomes per host, which in some cases includes a diversity of genotypes per host (13).
Loci associated with schistosome life history and control efforts: Largely the unknown
Many other life history features of schistosomes presumably have a genetic basis and would be important to monitor through surveillance for evidence of selection. The mechanisms producing such traits may be largely unknown, and the challenge for the field is to identify loci that underlie these traits. Schistosomes may alter life history traits such as reproductive output, longevity, and cercarial emergence timing to better suit constraints imposed by control measures or environmental conditions. For example, one potential interest for genomic surveillance includes loci associated with reproductive output, but our understanding of candidate loci for these traits is limited (119, 126). However, a previous study identified multiple genes that show sex-specific expression differences, with many loci underlying overall reproductive output from female schistosome parasites (127). Another study identified the gene encoding the enzyme β-alanyl-tryptamine that is produced by mature male schistosome parasites and is responsible for inducing female reproductive development (128). Cercarial emergence time is another relevant life history trait that is likely related to host shifts but has not been linked to candidate loci. Cercariae tend to emerge from snails during periods that optimize contact with specific mammalian hosts (129, 130), and there is evidence that this trait varies across populations and species (131–133). Continued progress understanding the genomic loci that underlie life history traits would be valuable for targeted surveillance and may be discovered through well-designed population genomic comparisons to detect evidence of selection associated with variation in these traits.
Beyond identifying loci underlying a priori traits of interest, monitoring for and identifying any genomic signatures of selection in schistosome populations could provide new perspectives on parasite biology, some of which may be otherwise difficult to predict. Even if we do not currently understand the phenotypic or functional relevance of genes under selection, selection reflects adaptive changes to parasite biology that improve their survival and fitness. Identification of selection in naïve a posteriori analyses would still provide the community with loci to prioritize for experimental studies and may lead to unexpected insight into the factors that promote schistosome survival and persistence, which is ultimately central to the design of effective control and elimination programs.
Conclusions and future directions
We argue that population genomic studies to identify evidence of selection on schistosome populations can play a transformational role in the development of strategies to reduce the global burden of human schistosomiasis. The promise of this approach is supported by the substantial insights provided by recent studies that use these approaches. Genomic data sampled at regular intervals globally can be used in a comparative context to understand shifts in parasite population dynamics, patterns of selection, and responses to control efforts, and greater efforts to collect and share these data would expedite progress. Also, further improvements in experimental design for genomic studies of selection, including more strategic sampling of Schistosoma populations over time, under different types of control measures, among definitive host species, and targeting independently replicated selection events, would provide greater power to estimate the specific strength and direction of selection at specific genes or loci (134). Lastly, continued efforts to link selected patterns of schistosome genomic variation to key life history traits of epidemiological relevance (e.g., drug resistance, cercarial emergence times, parasite reproductive output) are a priority for increasing the interpretive potential and ultimate value of genomic surveillance for selection.
Reductions in the cost of genomic sequencing, and increasing amounts of information that links epidemiologically-relevant traits in schistosomes to their genomic bases, has led to the increasing value and feasibility of genomic surveillance for informing control efforts [see Lund et al. (15) for recent review on this topic, including tradeoffs associated with genome-scale data collection]. We argue that genomic surveillance, both for any evidence of natural selection genome-wide, or for evidence of new variants at well-characterized loci linked to relevant traits, provide complementary information for guiding public health officials on impacts of control measures and their efficacy. Central to the success of such surveillance programs, it is critical that monitoring programs begin conducting regular longitudinal sampling of populations (at least sample collection and storage, if not sequencing) that would provide the necessary foundation for comparative analyses. Progress across all these areas would further advance the capacity for genomic surveillance for selection to impact strategic control effort development and facilitate elimination goals globally.
Author contributions
DP and TC contributed to the conceptualization, planning, writing, and editing of this review. ZN contributed fundamental writing and figures and edited the review. RA contributed a figure and contributed to writing. KW, AL, and EC contributed some writing and editing. All authors contributed to the article and approved the submitted version.
Funding
Support was provided by a National Institutes of Health (NIH) grant 1RO1AI134673 to EC, TC, and DP.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Colley DG, Bustinduy AL, Secor WE, King CH. Human schistosomiasis. Lancet. (2014) 383:2253–64. doi: 10.1016/S0140-6736(13)61949-2
2. Hotez PJ, Fenwick A, Savioli L, Molyneux DH. Rescuing the bottom billion through control of neglected tropical diseases. Lancet. (2009) 373:1570–5. doi: 10.1016/S0140-6736(09)60233-6
3. Nelwan ML. Schistosomiasis: life cycle, diagnosis, and control. Curr Ther Res. (2019) 91:5–9. doi: 10.1016/j.curtheres.2019.06.001
4. Burke M, Jones M, Gobert G, Li Y, Ellis M, McManus D. Immunopathogenesis of human schistosomiasis. Parasite Immunol. (2009) 31:163–76. doi: 10.1111/j.1365-3024.2009.01098.x
5. World Health Organization. WHO Guideline on Control and Elimination of Human Schistosomiasis. Geneva: World Health Organization (2022).
6. Rey O, Webster BL, Huyse T, Rollinson D, Van den Broeck F, Kincaid-Smith J, et al. Population genetics of African schistosoma species. Infect Genet Evol. (2021) 89:104727. doi: 10.1016/j.meegid.2021.104727
7. Adekiya TA, Aruleba RT, Oyinloye BE, Okosun KO, Kappo AP. The effect of climate change and the snail-schistosome cycle in transmission and bio-control of schistosomiasis in Sub-Saharan Africa. Int J Environ Res Public Health. (2020) 17:181. doi: 10.3390/ijerph17010181
8. Steinmann P, Keiser J, Bos R, Tanner M, Utzinger J. Schistosomiasis and water resources development: systematic review, meta-analysis, and estimates of people at risk. Lancet Infect Dis. (2006) 6:411–25. doi: 10.1016/S1473-3099(06)70521-7
9. Lamberton PH, Crellen T, Cotton JA, Webster JP. Modelling the effects of mass drug administration on the molecular epidemiology of schistosomes. Adv Parasitol. (2015) 87:293–327. doi: 10.1016/bs.apar.2014.12.006
10. Turner HC, Truscott JE, Bettis AA, Farrell SH, Deol AK, Whitton JM, et al. Evaluating the variation in the projected benefit of community-wide mass treatment for schistosomiasis: implications for future economic evaluations. Parasit Vectors. (2017) 10:1–15. doi: 10.1186/s13071-017-2141-5
11. Gurarie D, King C. Heterogeneous model of schistosomiasis transmission and long-term control: the combined influence of spatial variation and age-dependent factors on optimal allocation of drug therapy. Parasitology. (2005) 130:49–65. doi: 10.1017/S0031182004006341
12. Nikolakis ZL, Hales NR, Perry BW, Schield DR, Timm LE, Liu Y, et al. Patterns of relatedness and genetic diversity inferred from whole genome sequencing of archival blood fluke miracidia (Schistosoma japonicum). PLoS Negl Trop Dis. (2021) 15:e0009020. doi: 10.1371/journal.pntd.0009020
13. Berger DJ, Crellen T, Lamberton PHL, Allan F, Tracey A, Noonan JD, et al. Whole-genome sequencing of Schistosoma mansoni reveals extensive diversity with limited selection despite mass drug administration. Nat Commun. (2021) 12:4776. doi: 10.1038/s41467-021-24958-0
14. Crellen T, Allan F, David S, Durrant C, Huckvale T, Holroyd N, et al. Whole genome resequencing of the human parasite Schistosoma mansoni reveals population history and effects of selection. Sci Rep. (2016) 6:20954. doi: 10.1038/srep20954
15. Lund AJ, Wade KJ, Nikolakis ZL, Ivey KN, Perry BW, Pike HN, et al. Integrating genomic and epidemiologic data to accelerate progress toward schistosomiasis elimination. eLife. (2022) 11:e79320. doi: 10.31219/osf.io/rt8wn
16. Han ZG, Brindley PJ, Wang SY, Chen Z. Schistosoma genomics: new perspectives on schistosome biology and host-parasite interaction. Ann Rev Genomics Hum Genet. (2009) 10:211–40. doi: 10.1146/annurev-genom-082908-150036
17. Berriman M, Haas BJ, LoVerde PT, Wilson RA, Dillon GP, Cerqueira GC, et al. The genome of the blood fluke Schistosoma mansoni. Nature. (2009) 460:352–8. doi: 10.1038/nature08160
18. Luo F, Yin M, Mo X, Sun C, Wu Q, Zhu B, et al. An improved genome assembly of the fluke Schistosoma japonicum. PLoS Negl Trop Dis. (2019) 13:e0007612. doi: 10.1371/journal.pntd.0007612
19. Zhou Y, Zheng H, Chen X, Zhang L, Wang K, Guo J, et al. The Schistosoma japonicum genome reveals features of host-parasite interplay. Nature. (2009) 460:345–51. doi: 10.1038/nature08140
20. Young ND, Jex AR, Li B, Liu S, Yang L, Xiong Z, et al. Whole-genome sequence of Schistosoma haematobium. Nat Genet. (2012) 44:221–5. doi: 10.1038/ng.1065
21. Stroehlein AJ, Korhonen PK, Lee VV, Ralph SA, Mentink-Kane M, You H, et al. Chromosome-level genome of Schistosoma haematobium underpins genome-wide explorations of molecular variation. PLoS Pathog. (2022) 18:e1010288. doi: 10.1371/journal.ppat.1010288
22. Luo F, Yang W, Yin M, Mo X, Pang Y, Sun C, et al. A chromosome-level genome of the human blood fluke Schistosoma japonicum identifies the genomic basis of host-switching. Cell Rep. (2022) 39:110638. doi: 10.1016/j.celrep.2022.110638
23. Picard MA, Cosseau C, Ferre S, Quack T, Grevelding CG, Coute Y, et al. Evolution of gene dosage on the Z-chromosome of schistosome parasites. eLife. (2018) 7:e35684. doi: 10.7554/eLife.35684
24. Buddenborg SK, Tracey A, Berger DJ, Lu Z, Doyle S, Fu B, et al. Assembled chromosomes of the blood fluke Schistosoma mansoni provide insight into the evolution of its ZW sex-determination system. bioRxiv. (2021) [preprint]. doi: 10.1101/2021.08.13.456314
25. Elkrewi M, Moldovan MA, Picard MA, Vicoso B. Schistosome W-linked genes inform temporal dynamics of sex chromosome evolution and suggest candidate for sex determination. Mol Biol Evol. (2021) 38:5345–58. doi: 10.1093/molbev/msab178
26. Doyle SR, Cotton JA. Genome-wide approaches to investigate anthelmintic resistance. Trends Parasitol. (2019) 35:289–301. doi: 10.1016/j.pt.2019.01.004
27. Shortt JA, Timm LE, Hales NR, Nikolakis ZL, Schield DR, Perry BW, et al. Population genomic analyses of schistosome parasites highlight critical challenges facing endgame elimination efforts. Sci Rep. (2021) 11:6884. doi: 10.1038/s41598-021-86287-y
28. Shortt JA, Card DC, Schield DR, Liu Y, Zhong B, Castoe TA, et al. Whole genome amplification and reduced-representation genome sequencing of Schistosoma japonicum Miracidia. PLoS Negl Trop Dis. (2017) 11:e0005292. doi: 10.1371/journal.pntd.0005292
29. Doyle SR, Sankaranarayanan G, Allan F, Berger D, Jimenez Castro PD, Collins JB, et al. Evaluation of DNA extraction methods on individual helminth egg and larval stages for whole-genome sequencing. Front Genet. (2019) 10:826. doi: 10.3389/fgene.2019.00826
30. Xiao N, Remais JV, Brindley PJ, Qiu D-C, Carlton EJ, Li R-Z, et al. Approaches to genotyping individual miracidia of Schistosoma japonicum. Parasitol Res. (2013) 112:3991–9. doi: 10.1007/s00436-013-3587-9
31. Oleksyk TK, Smith MW, O'Brien SJ. Genome-wide scans for footprints of natural selection. Philos Trans R Soc B Biol Sci. (2010) 365:185–205. doi: 10.1098/rstb.2009.0219
32. Haasl RJ, Payseur BA. Fifteen years of genomewide scans for selection: trends, lessons and unaddressed genetic sources of complication. Mol Ecol. (2016) 25:5–23. doi: 10.1111/mec.13339
33. Bustamante CD, Fledel-Alon A, Williamson S, Nielsen R, Todd Hubisz M, Glanowski S, et al. Natural selection on protein-coding genes in the human genome. Nature. (2005) 437:1153–7. doi: 10.1038/nature04240
34. International HapMap Consortium. A second generation human haplotype map of over 3.1 million SNPs. Nature. (2007) 449:851–61. doi: 10.1038/nature06258
35. Hamilton WL, Amato R, van der Pluijm RW, Jacob CG, Quang HH, Thuy-Nhien NT, et al. Evolution and expansion of multidrug-resistant malaria in southeast Asia: a genomic epidemiology study. Lancet Infect Dis. (2019) 19:943–51. doi: 10.1016/S1473-3099(19)30392-5
36. Clarkson CS, Miles A, Harding NJ, O'Reilly AO, Weetman D, Kwiatkowski D, et al. The genetic architecture of target-site resistance to pyrethroid insecticides in the African malaria vectors Anopheles gambiae and Anopheles coluzzii. Mol Ecol. (2021) 30:5303–17. doi: 10.1111/mec.15845
37. Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. (1989) 123:585–95. doi: 10.1093/genetics/123.3.585
38. Sabeti PC, Reich DE, Higgins JM, Levine HZ, Richter DJ, Schaffner SF, et al. Detecting recent positive selection in the human genome from haplotype structure. Nature. (2002) 419:832–7. doi: 10.1038/nature01140
39. The International HapMap Consortium, Sabeti PC, Varilly P, Fry B, Lohmueller J, Hostetter E, et al. Genome-wide detection and characterization of positive selection in human populations. Nature. (2007) 449:913–8. doi: 10.1038/nature06250
40. Taylor AR, Echeverry DF, Anderson TJ, Neafsey DE, Buckee CO. Identity-by-descent with uncertainty characterises connectivity of Plasmodium falciparum populations on the Colombian-Pacific coast. PLoS Genet. (2020) 16:e1009101. doi: 10.1371/journal.pgen.1009101
41. Poelstra JW, Vijay N, Bossu CM, Lantz H, Ryll B, Müller I, et al. The genomic landscape underlying phenotypic integrity in the face of gene flow in crows. Science. (2014) 344:1410–4. doi: 10.1126/science.1253226
42. Nielsen R, Hellmann I, Hubisz M, Bustamante C, Clark AG. Recent and ongoing selection in the human genome. Nat Rev Genet. (2007) 8:857–68. doi: 10.1038/nrg2187
43. Kaplan NL, Hudson RR, Langley CH. The“ hitchhiking effect” revisited. Genetics. (1989) 123:887–99. doi: 10.1093/genetics/123.4.887
44. Vitti JJ, Grossman SR, Sabeti PC. Detecting natural selection in genomic data. Ann Rev Genet. (2013) 47:97–120. doi: 10.1146/annurev-genet-111212-133526
45. Przeworski M. The signature of positive selection at randomly chosen loci. Genetics. (2002) 160:1179–89. doi: 10.1093/genetics/160.3.1179
46. Kim Y, Stephan W. Detecting a local signature of genetic hitchhiking along a recombining chromosome. Genetics. (2002) 160:765–77. doi: 10.1093/genetics/160.2.765
47. Nosil P, Funk DJ, Ortiz-Barrientos D. Divergent selection and heterogeneous genomic divergence. Mol Ecol. (2009) 18:375–402. doi: 10.1111/j.1365-294X.2008.03946.x
48. Chen H, Patterson N, Reich D. Population differentiation as a test for selective sweeps. Genome Res. (2010) 20:393–402. doi: 10.1101/gr.100545.109
49. Adams RH, Schield DR, Card DC, Blackmon H, Castoe TA. GppFst: genomic posterior predictive simulations of FST and dXY for identifying outlier loci from population genomic data. Bioinformatics. (2017) 33:1414–5. doi: 10.1093/bioinformatics/btw795
50. Messer PW. SLiM: simulating evolution with selection and linkage. Genetics. (2013) 194:1037–9. doi: 10.1534/genetics.113.152181
51. Stephan W. Signatures of positive selection: from selective sweeps at individual loci to subtle allele frequency changes in polygenic adaptation. Mol Ecol. (2016) 25:79–88. doi: 10.1111/mec.13288
52. Haller BC, Messer PW. SLiM 3: forward genetic simulations beyond the Wright–Fisher model. Mol Biol Evol. (2019) 36:632–7. doi: 10.1093/molbev/msy228
53. Hendry JA, Kwiatkowski D, McVean G. Elucidating relationships between P. falciparum prevalence and measures of genetic diversity with a combined genetic-epidemiological model of malaria. PLoS Comput Biol. (2021) 17:e1009287. doi: 10.1371/journal.pcbi.1009287
54. Adrion JR, Cole CB, Dukler N, Galloway JG, Gladstein AL, Gower G, et al. A community-maintained standard library of population genetic models. eLife. (2020) 9:e54967. doi: 10.7554/eLife.54967
55. Daniels RF, Schaffner SF, Wenger EA, Proctor JL, Chang H-H, Wong W, et al. Modeling malaria genomics reveals transmission decline and rebound in Senegal. Proc Natl Acad Sci USA. (2015) 112:7067–72. doi: 10.1073/pnas.1505691112
56. Van den Broeck F, Vanoverbeeke J, Polman K, Huyse T. A Darwinian outlook on schistosomiasis elimination. BioRxiv. (2020) [preprint]. doi: 10.1101/2020.10.28.358523
57. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics. (2014) 30:2114–20. doi: 10.1093/bioinformatics/btu170
58. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. (2018) 34:i884–90. doi: 10.1093/bioinformatics/bty560
59. Buffalo V, Scythe-a Bayesian Adapter Trimmer. (2011). Available online at: http://github.com/vsbuffalo/scythe (accessed September 19, 2022).
60. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. (2009) 25:1754–60. doi: 10.1093/bioinformatics/btp324
61. Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nat Methods. (2012) 9:357–9. doi: 10.1038/nmeth.1923
62. Li R, Li Y, Kristiansen K, Wang J. SOAP: short oligonucleotide alignment program. Bioinformatics. (2008) 24:713–4. doi: 10.1093/bioinformatics/btn025
63. Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. (2011) 27:2987–93. doi: 10.1093/bioinformatics/btr509
64. Garrison E, Marth G. Haplotype-based variant detection from short-read sequencing. arXiv. (2012) arXiv:12073907.
65. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. (2011) 43:491–8. doi: 10.1038/ng.806
66. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. (2000) 155:945–59. doi: 10.1093/genetics/155.2.945
67. Jombart T, Devillard S, Balloux F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genetics. (2010) 11:94. doi: 10.1186/1471-2156-11-94
68. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. (2014) 30:1312–3. doi: 10.1093/bioinformatics/btu033
69. Nguyen L-T, Schmidt HA, Von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. (2015) 32:268–74. doi: 10.1093/molbev/msu300
70. Cheng X, DeGiorgio M. Flexible mixture model approaches that accommodate footprint size variability for robust detection of balancing selection. Mol Biol Evol. (2020) 37:3267–91. doi: 10.1093/molbev/msaa134
71. Siewert KM, Voight BF. Detecting long-term balancing selection using allele frequency correlation. Mol Biol Evol. (2017) 34:2996–3005. doi: 10.1093/molbev/msx209
72. Gautier M, Vitalis R. rehh: an R package to detect footprints of selection in genome-wide SNP data from haplotype structure. Bioinformatics. (2012) 28:1176–7. doi: 10.1093/bioinformatics/bts115
73. Baumdicker F, Bisschop G, Goldstein D, Gower G, Ragsdale AP, Tsambos G, et al. Efficient ancestry and mutation simulation with msprime 1.0. Genetics. (2022) 220:iyab229. doi: 10.1093/genetics/iyab229
74. Li Y, Yin M, Wu Q, McManus DP, Blair D, Li H, et al. Genetic diversity and selection of three nuclear genes in Schistosoma japonicum populations. Parasit Vectors. (2017) 10:87. doi: 10.1186/s13071-017-2033-8
75. Le Clec'h W, Chevalier FD, Mattos ACA, Strickland A, Diaz R, Mcdew-White M, et al. Genetic analysis of praziquantel response in schistosome parasites implicates a transient receptor potential channel. Sci Transl Med. (2021) 13:eabj9114. doi: 10.1126/scitranslmed.abj9114
76. Hambrook JR, Kabore AL, Pila EA, Hanington PC. A metalloprotease produced by larval Schistosoma mansoni facilitates infection establishment and maintenance in the snail host by interfering with immune cell function. PLoS Pathog. (2018) 14:e1007393. doi: 10.1371/journal.ppat.1007393
77. Excoffier L, Hofer T, Foll M. Detecting loci under selection in a hierarchically structured population. Heredity. (2009) 103:285–98. doi: 10.1038/hdy.2009.74
78. Peng B, Amos CI. Forward-time simulation of realistic samples for genome-wide association studies. BMC Bioinformatics. (2010) 11:442. doi: 10.1186/1471-2105-11-442
79. Tang K, Thornton KR, Stoneking M. A new approach for using genome scans to detect recent positive selection in the human genome. PLoS Biol. (2007) 5:e171. doi: 10.1371/journal.pbio.0050171
80. Li J, Li H, Jakobsson M, Li S, Sjödin P, Lascoux M. Joint analysis of demography and selection in population genetics: where do we stand and where could we go? Mol Ecol. (2012) 21:28–44. doi: 10.1111/j.1365-294X.2011.05308.x
81. Sanchez T, Cury J, Charpiat G, Jay F. Deep learning for population size history inference: design, comparison and combination with approximate Bayesian computation. Mol Ecol Resourc. (2021) 21:2645–60. doi: 10.1111/1755-0998.13224
82. Laval G, Patin E, Boutillier P, Quintana-Murci L. Sporadic occurrence of recent selective sweeps from standing variation in humans as revealed by an approximate Bayesian computation approach. Genetics. (2021) 219:iyab161. doi: 10.1093/genetics/iyab161
83. Nakagome S, Hudson RR, Di Rienzo A. Inferring the model and onset of natural selection under varying population size from the site frequency spectrum and haplotype structure. Philos Trans R Soc B Biol Sci. (2019) 286:20182541. doi: 10.1098/rspb.2018.2541
84. Torada L, Lorenzon L, Beddis A, Isildak U, Pattini L, Mathieson S, et al. ImaGene: a convolutional neural network to quantify natural selection from genomic data. BMC Bioinformatics. (2019) 20:337. doi: 10.1186/s12859-019-2927-x
85. Deelder W, Benavente ED, Phelan J, Manko E, Campino S, Palla L, et al. Using deep learning to identify recent positive selection in malaria parasite sequence data. Malaria J. (2021) 20:1–9. doi: 10.1186/s12936-021-03788-x
86. Flagel L, Brandvain Y, Schrider DR. The unreasonable effectiveness of convolutional neural networks in population genetic inference. Mol Biol Evol. (2019) 36:220–38. doi: 10.1093/molbev/msy224
87. Dehasque M, Ávila-Arcos MC, Díez-del-Molino D, Fumagalli M, Guschanski K, et al. Inference of natural selection from ancient DNA. Evol Lett. (2020) 4:94–108. doi: 10.1002/evl3.165
88. Adrion JR, Galloway JG, Kern AD. Predicting the landscape of recombination using deep learning. Mol Biol Evol. (2020) 37:1790–808. doi: 10.1093/molbev/msaa038
89. Battey CJ, Ralph PL, Kern AD. Predicting geographic location from genetic variation with deep neural networks. eLife. (2020) 9:e54507. doi: 10.7554/eLife.54507
90. Vianney TJ, Berger DJ, Doyle SR, Sankaranarayanan G, Serubanja J, Nakawungu PK, et al. Genome-wide analysis of Schistosoma mansoni reveals limited population structure and possible praziquantel drug selection pressure within Ugandan hot-spot communities. PLoS Negl Trop Dis. (2022) 16:e0010188. doi: 10.1371/journal.pntd.0010188
91. Neafsey DE, Taylor AR, MacInnis BL. Advances and opportunities in malaria population genomics. Nat Rev Genet. (2021) 22:502–17. doi: 10.1038/s41576-021-00349-5
92. The Anopheles Gambiae 1000 Genomes Consortium. Genome variation and population structure among 1142 mosquitoes of the African malaria vector species Anopheles gambiae and Anopheles coluzzii. Genome Res. (2020) 30:1533–46. doi: 10.1101/gr.262790.120
93. Hupalo DN, Luo Z, Melnikov A, Sutton PL, Rogov P, Escalante A, et al. Population genomics studies identify signatures of global dispersal and drug resistance in Plasmodium vivax. Nat Genet. (2016) 48:953–8. doi: 10.1038/ng.3588
94. Volkman SK, Neafsey DE, Schaffner SF, Park DJ, Wirth DF. Harnessing genomics and genome biology to understand malaria biology. Nat Rev Genet. (2012) 13:315–28. doi: 10.1038/nrg3187
95. Inzaule SC, Tessema SK, Kebede Y, Ogwell Ouma AE, Nkengasong JN. Genomic-informed pathogen surveillance in Africa: opportunities and challenges. Lancet Infect Dis. (2021) 21:e281–9. doi: 10.1016/S1473-3099(20)30939-7
96. Fenwick A, Webster JP. Schistosomiasis: challenges for control, treatment and drug resistance.Curr Opin Infect Dis. (2006) 19:577–82. doi: 10.1097/01.qco.0000247591.13671.6a
97. Chevalier FD, Le Clec'h W, McDew-White M, Menon V, Guzman MA, Holloway SP, et al. Oxamniquine resistance alleles are widespread in old world Schistosoma mansoni and predate drug deployment. PLoS Pathog. (2019) 15:e1007881. doi: 10.1371/journal.ppat.1007881
98. Fallon PG, Doenhoff MJ. Drug-resistant schistosomiasis: resistance to praziquantel and oxamniquine induced in Schistosoma mansoni in mice is drug specific. Am J Trop Med Hyg. (1994) 51:83–8. doi: 10.4269/ajtmh.1994.51.83
99. Greenberg RM. New approaches for understanding mechanisms of drug resistance in schistosomes. Parasitology. (2013) 140:1534–46. doi: 10.1017/S0031182013000231
100. Zhong H, Ren Y, Qin F, Li X, Hou L, Gu S, et al. Schistosoma japonicum translationally controlled tumor protein, which is associated with the development of female worms, as a target for control of schistosomiasis. Int J Parasitol. (2022) 52:569–79. doi: 10.1016/j.ijpara.2022.01.005
101. Webster BL, Diaw OT, Seye MM, Faye DS, Stothard JR, Sousa-Figueiredo JC, et al. Praziquantel treatment of school children from single and mixed infection foci of intestinal and urogenital schistosomiasis along the Senegal River Basin: monitoring treatment success and re-infection patterns. Acta Trop. (2013) 128:292–302. doi: 10.1016/j.actatropica.2012.09.010
102. Ismail M, Botros S, Metwally A, William S, Farghally A, Tao L-F, et al. Resistance to praziquantel: direct evidence from Schistosoma mansoni isolated from Egyptian villagers. Am J Trop Med Hyg. (1999) 60:932–5. doi: 10.4269/ajtmh.1999.60.932
103. Melman SD, Steinauer ML, Cunningham C, Kubatko LS, Mwangi IN, Wynn NB, et al. Reduced susceptibility to praziquantel among naturally occurring Kenyan isolates of Schistosoma mansoni. PLoS Negl Trop Dis. (2009) 3:e504. doi: 10.1371/journal.pntd.0000504
104. Crellen T, Walker M, Lamberton PH, Kabatereine NB, Tukahebwa EM, Cotton JA, et al. Reduced efficacy of praziquantel against Schistosoma mansoni is associated with multiple rounds of mass drug administration. Clin Infect Dis. (2016) 63:1151–9. doi: 10.1093/cid/ciw506
105. Wang W, Wang L, Liang Y-S. Susceptibility or resistance of praziquantel in human schistosomiasis: a review. Parasitol Res. (2012) 111:1871–7. doi: 10.1007/s00436-012-3151-z
106. Cioli D, Botros SS, Wheatcroft-Francklow K, Mbaye A, Southgate V, Tchuenté L-AT, et al. Determination of ED50 values for praziquantel in praziquantel-resistant and -susceptible Schistosoma mansoni isolates. Int J Parasitol. (2004) 34:979–87. doi: 10.1016/j.ijpara.2004.05.001
107. Valentim CLL, Cioli D, Chevalier FD, Cao X, Taylor AB, Holloway SP, et al. Genetic and molecular basis of drug resistance and species-specific drug action in schistosome parasites. Science. (2013) 342:1385–9. doi: 10.1126/science.1243106
108. Rugel AR, Guzman MA, Taylor AB, Chevalier FD, Tarpley RS, McHardy SF, et al. Why does oxamniquine kill Schistosoma mansoni and not S. haematobium and S. japonicum? Int J Parasitol Drugs Drug Resist. (2020) 13:8–15. doi: 10.1016/j.ijpddr.2020.04.001
109. Greenberg RM. ABC multidrug transporters in schistosomes and other parasitic flatworms. Parasitol Int. (2013) 62:647–53. doi: 10.1016/j.parint.2013.02.006
110. You H, McManus DP, Hu W, Smout MJ, Brindley PJ, Gobert GN. Transcriptional responses of in vivo praziquantel exposure in schistosomes identifies a functional role for calcium signalling pathway member CamKII. PLoS Pathog. (2013) 9:e1003254. doi: 10.1371/journal.ppat.1003254
111. Greenberg RM. Schistosome ABC multidrug transporters: from pharmacology to physiology. Int J Parasitol Drugs Drug Resist. (2014) 4:301–9. doi: 10.1016/j.ijpddr.2014.09.007
112. Kasinathan RS, Goronga T, Messerli SM, Webb TR, Greenberg RM. Modulation of a Schistosoma mansoni multidrug transporter by the antischistosomal drug praziquantel. FASEB J. (2010) 24:128–35. doi: 10.1096/fj.09-137091
113. Kasinathan RS, Sharma LK, Cunningham C, Webb TR, Greenberg RM. Inhibition or knockdown of ABC transporters enhances susceptibility of adult and juvenile schistosomes to Praziquantel. PLoS Negl Trop Dis. (2014) 8:e3265. doi: 10.1371/journal.pntd.0003265
114. Pinto-Almeida A, Mendes T, Armada A, Belo S, Carrilho E, Viveiros M, et al. The role of efflux pumps in Schistosoma mansoni praziquantel resistant phenotype. PLoS ONE. (2015) 10:e0140147. doi: 10.1371/journal.pone.0140147
115. Chan JD, Zarowiecki M, Marchant JS. Ca2+ channels and praziquantel: a view from the free world. Parasitol Int. (2013) 62:619–28. doi: 10.1016/j.parint.2012.12.001
116. Angelucci F, Basso A, Bellelli A, Brunori M, Mattoccia LP, Valle C. The anti-schistosomal drug praziquantel is an adenosine antagonist. Parasitology. (2007) 134:1215–21. doi: 10.1017/S0031182007002600
117. Gnanasekar M, Salunkhe AM, Mallia AK, He YX, Kalyanasundaram R. Praziquantel affects the regulatory myosin light chain of Schistosoma mansoni. Antimicrobiol Agents Chemother. (2009) 53:1054–60. doi: 10.1128/AAC.01222-08
118. Hobert O. The neuronal genome of Caenorhabditis elegans. WormBook. (2013) 1–106. doi: 10.1895/wormbook.1.161.1
119. LoVerde PT, Andrade LF, Oliveira G. Signal transduction regulates schistosome reproductive biology. Curr Opin Microbiol. (2009) 12:422–8. doi: 10.1016/j.mib.2009.06.005
120. Park S-K, Friedrich L, Yahya NA, Rohr CM, Chulkov EG, Maillard D, et al. Mechanism of praziquantel action at a parasitic flatworm ion channel. Sci Transl Med. (2021) 13:eabj5832. doi: 10.1126/scitranslmed.abj5832
121. He Y-X, Salafsky B, Ramaswamy K. Host–parasite relationships of Schistosoma japonicum in mammalian hosts. Trends Parasitol. (2001) 17:320–4. doi: 10.1016/S1471-4922(01)01904-3
122. Riley S, Carabin H, Bélisle P, Joseph L, Tallo V, Balolong E, et al. Multi-host transmission dynamics of Schistosoma japonicum in Samar province, the Philippines. PLoS Med. (2008) 5:e18. doi: 10.1371/journal.pmed.0050018
123. Carlton EJ, Bates MN, Zhong B, Seto EY, Spear RC. Evaluation of mammalian and intermediate host surveillance methods for detecting schistosomiasis reemergence in southwest China. PLoS Negl Trop Dis. (2011) 5:e987. doi: 10.1371/journal.pntd.0000987
124. Borlase A, Rudge JW, Leger E, Diouf ND, Fall CB, Diop SD, et al. From spillover to persistence: hybridization and schistosomiasis transmission dynamics at the human-animal interface. Proc Natl Acad Sci USA. (2021) 118:e2110711118. doi: 10.1073/pnas.2110711118
125. Webster BL, Diaw OT, Seye MM, Webster JP, Rollinson D. Introgressive hybridization of Schistosoma haematobium group species in Senegal: species barrier break down between ruminant and human schistosomes. PLoS Negl Trop Dis. (2013) 7:e2110. doi: 10.1371/journal.pntd.0002110
126. Steinauer ML, Blouin MS, Criscione CD. Applying evolutionary genetics to schistosome epidemiology. Infect Genet Evol. (2010) 10:433–3. doi: 10.1016/j.meegid.2010.02.007
127. Cogswell AA, Kommer VP, Williams DL. Transcriptional analysis of a unique set of genes involved in Schistosoma mansoni female reproductive biology. PLoS Negl Trop Dis. (2012) 6:e1907. doi: 10.1371/journal.pntd.0001907
128. Chen R, Wang J, Gradinaru I, Vu HS, Geboers S, Naidoo J, et al. A male-derived nonribosomal peptide pheromone controls female schistosome development. Cell. (2022) 185:1506–20.e17. doi: 10.1016/j.cell.2022.03.017
129. Asch HL. Rhythmic emergence of Schistosoma mansoni cercariae from Biomphalaria glabrata: control by illumination. Exp Parasitol. (1972) 31:350–5. doi: 10.1016/0014-4894(72)90096-3
130. Nojima H, Sato A. Schistosoma mansoni and Schistosoma haematobium: emergence of schistosome cercariae from snails with darkness and illumination. Exp Parasitol. (1982) 53:189–98. doi: 10.1016/0014-4894(82)90060-1
131. Pages JR, Theron A. Schistosoma intercalatum from cameroon and zaire: chronobiological differentiation of cercarial emergence. J Parasitol. (1990) 76:743–5. doi: 10.2307/3282996
132. Combes C, Fournier A, Moné H, Théron A. Behaviours in trematode cercariae that enhance parasite transmission: patterns and processes. Parasitology. (1994) 109:S3–13. doi: 10.1017/S0031182000085048
133. Martinez-Bakker M, Helm B. The influence of biological rhythms on host–parasite interactions. Trends Ecol Evol. (2015) 30:314–26. doi: 10.1016/j.tree.2015.03.012
134. Ferrer-Admetlla A, Leuenberger C, Jensen JD, Wegmann D. An approximate markov model for the Wright-Fisher diffusion and its application to time series data. Genetics. (2016) 203:831–46. doi: 10.1534/genetics.115.184598
Keywords: natural selection, selection, schistosomes, population genetics, genomic scans, control efforts
Citation: Nikolakis ZL, Adams RH, Wade KJ, Lund AJ, Carlton EJ, Castoe TA and Pollock DD (2022) Prospects for genomic surveillance for selection in schistosome parasites. Front. Epidemiol. 2:932021. doi: 10.3389/fepid.2022.932021
Received: 29 April 2022; Accepted: 12 September 2022;
Published: 29 September 2022.
Edited by:
Jonathan J. Juliano, University of North Carolina at Chapel Hill, United StatesReviewed by:
Thomas Crellen, University of Glasgow, United KingdomAlyssa E. Barry, Deakin University, Australia
Copyright © 2022 Nikolakis, Adams, Wade, Lund, Carlton, Castoe and Pollock. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: David D. Pollock, david.pollock@cuanschutz.edu
†These authors share senior authorship