- 1Department of Pediatrics, University of Pittsburgh Medical Center Children's Hospital of Pittsburgh, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- 2Department of Pediatrics, University of California, San Francisco, San Francisco, CA, United States
- 3Department of Human Genetics, University of Pittsburgh Graduate School of Public Health, Pittsburgh, PA, United States
- 4Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, United States
- 5Artificial Intelligence Lab, Illumina Inc, Foster City, CA, United States
Newborn screening (NBS) is a population-based program with a goal of reducing the burden of disease for conditions with significant clinical impact on neonates. Screening tests were originally developed and implemented one at a time, but newer methods have allowed the use of multiplex technologies to expand additions more rapidly to standard panels. Recent improvements in next-generation sequencing are also evolving rapidly from first focusing on individual genes, then panels, and finally all genes as encompassed by whole exome and genome sequencing. The intersection of these two technologies brings the revolutionary possibility of identifying all genetic disorders in newborns, allowing implementation of therapies at the optimum time regardless of symptoms. This article reviews the history of newborn screening and early studies examining the use of whole genome and exome sequencing as a screening tool. Lessons learned from these studies are discussed, along with technical, ethical, and societal challenges to broad implementation.
Introduction
Medical screening is an important part of health care and public health initiatives. The purpose of a medical screening test is to identify a medical condition early, ideally in the pre-symptomatic phase, so that appropriate treatment can be initiated to decrease morbidity and mortality. Screening is particularly indicated for medical conditions in which early treatment is more effective than treatment in later stages of the condition. Population screening adds a requirement of broader societal benefit in addition to benefit for the individual. Newborn screening (NBS) was cited by the United States Centers for Disease Control and Prevention as one of the most impactful public health initiatives of the twentieth century and in the twenty first century has undergone significant expansion through improved techniques of high-throughput biochemical analysis of targeted analytes, enzymatic activities, and specific molecular defects. It now stands on the verge of incorporating increasingly large-scale molecular sequencing to detect a growing number of disorders. This article will provide a history of NBS, specifically focusing on the elements pertinent to implementation of whole exome and whole genome sequencing (WES and WGS, respectively), review efforts to date on the use of sequencing for NBS, and discuss the challenges for the future for larger incorporation in NBS programs.
History of Newborn Screening
Inborn Errors of Metabolism
In 1935, phenylpyruvic acid was reported in the urine of a subset of individuals with intellectual disability and later termed phenylketonuria (PKU) (1). Subsequently, the possibility of treatment for PKU with restriction of dietary phenylalanine was postulated, and in 1954, the first report was published of a child with PKU treated with a diet restricted in phenylalanine who showed significant clinical improvement and elimination of phenylpyruvate in the urine (2). While the detection of urine phenylpyruvate could be used to diagnose PKU, it was not until Robert Guthrie developed a bacterial inhibition assay in 1958 that early, rapid, and accurate testing of phenylalanine levels for diagnosis and management of PKU became a reality (3). The final piece fell into place in 1961 when a filter paper method for collecting samples as a dried blood spot (DBS) was developed that is used in NBS to this day (4). Screening for PKU began in New York in 1961, and in 1963, Massachusetts became the first state to mandate NBS for PKU (3, 5). The introduction of NBS for PKU was clearly a turning point for the disease, as it allowed normal development in identified babies who otherwise would have suffered devastating neurodevelopmental symptoms. Over time, it was determined that NBS for PKU not only identifies classic PKU due to phenylalanine hydroxylase deficiency but also non-PKU hyperphenylalaninemia and tetrahydrobiopterin deficiency, all of which are characterized by elevation of a single metabolite, phenylalanine.
The success of NBS for PKU allowing early identification and intervention that prevented intellectual disability led to the hope that expansion of NBS would translate to similar improvement for other appropriate conditions (6, 7).
The Wilson and Jungner Principles
Given a growing interest in expanding NBS to more disorders, a meeting sponsored by the World Health Organization (WHO) in 1968 led to the publication of the Wilson and Jungner guidelines outlining 10 principles to guide development and implementation of a screening test and provided the framework for the further development of NBS programs (Table 1) (8). Necessary characteristics of a screening test include the availability of an economical method of identification of important health problems for which treatment is available, as well as the infrastructure to provide such treatment. These guidelines clearly went beyond the principle that a disease should be added to NBS based on technical ability and formed the basis of NBS expansion for several decades.
Expansion of NBS Beyond Inborn Errors of Metabolism
While inborn errors of metabolism (IEM) were the initial focus of NBS, the scope of NBS quickly began to expand. In 1973, a method for identifying hemoglobinopathies from DBS was developed, and the first NBS for hemoglobinopathies began in New York in 1975 (10). Yet another milestone in NBS was achieved in 1987 when a process for extracting DNA from DBS was developed, and its use as a second-tier test to confirm the diagnosis of sickle cell disease after an initial positive NBS with isoelectric focusing (IEF) or high-performance liquid chromatography (HPLC) was reported (11, 12). Second-tier DNA testing is necessary to identify sickle cell disease, as multiple hemoglobinopathies are identified with IEF and HPLC (10).
Newborn screening for congenital hypothyroidism was first introduced in 1974 and represents another triumph in NBS akin to that of PKU, allowing early identification, treatment, and prevention of intellectual disability using a new technology, immunodetection (13). Shortly thereafter, NBS for a second endocrinopathy, congenital adrenal hyperplasia (CAH), was introduced using an immunoassay to identify the most common type of CAH, 21-hydroxylase deficiency (14). Newborn screening for CAH prevented neonatal death in the salt-wasting forms through early diagnosis and appropriate treatment, though CAH screening via immunoassay alone had a high false positive rate, particularly in premature infants (14).
Newborn screening for cystic fibrosis (CF) was introduced in 1988 and, similar to the two-tiered approach for hemoglobinopathy screening, utilized DNA extraction from DBS and assessment for common pathogenic variants in the CFTR gene after a positive first-tier screen, in this case with immunoreactive trypsinogen (IRT) as the first step in screening (15).
New Technologies and Expanded Newborn Screening Targets
Tandem Mass Spectrometry
The early successes of NBS were all based on identifying one disease at a time, though as noted for PKU and as is true for other disorders, such as galactosemia, sometimes more than one disease can be diagnosed. This paradigm required the development and implementation of a new test for every proposed disorder added to screening. The next major advance in NBS came in the 1990's with the introduction of tandem mass spectrometry (MS/MS), which allows rapid simultaneous screening for multiple IEM using a single test (16–18). The concept of multiple markers for multiple disorders not only broadened the scope of initial screening tests but also significantly increased the differential diagnosis of a positive newborn screen (Figure 1). However, with this advancement in technology also came additional challenges. The technology was beyond that in use by most public NBS programs. Moreover, while MS/MS identified IEM that conformed to Wilson and Jungner principles for NBS disorders, it also identified others that do not meet these criteria (8, 16). For example, medium-chain acyl-CoA dehydrogenase (MCAD) deficiency is a disorder of fatty acid oxidation first identified in the late 1970s in a subset of children presenting with Reye syndrome-like features and abnormal urinary metabolites (19, 20). Prior to newborn screening for MCAD deficiency, affected individuals presented with acute crises characterized by hypoglycemia, lethargy, vomiting, seizures, hepatomegaly, and liver dysfunction, progressing to coma and death with a reported mortality rate of 20–30% in individuals diagnosed on a clinical basis (21, 22). In those who survived, an additional 40% showed evidence of significant neurological damage as a sequela of hypoglycemic crises (21, 23). The specific enzymatic defect was identified in 1982, and NBS for MCAD deficiency became a reality in 1990 when MS/MS on DBS became available (19, 20). The treatment for MCAD deficiency is avoidance of prolonged fasting and administration of IV dextrose if oral intake cannot be maintained due to intercurrent illness. After NBS for MCAD deficiency was introduced, morbidity and mortality decreased significantly, with a recent study reporting a mortality rate of 3.5% (20, 24). Clearly, the advent of NBS by MS/MS has saved lives and has a substantial beneficial effect on the natural history of MCAD deficiency. However, MS/MS also identifies short-chain acyl-CoA dehydrogenase deficiency (SCAD), another defect of fatty acid oxidation, which is now considered an asymptomatic biochemical condition and does not meet the Wilson and Jungner criteria (25). Other examples of disorders with significant clinical impact such as isovaleric acidemia were also balanced by disorders ultimately shown to have a low risk of clinical symptoms (3-methylcrotonyl-CoA carboxylase deficiency). Furthermore, other identified diseases were less amendable to early intervention (for example, mitochondrial trifunctional protein deficiency), though screening led to earlier diagnosis and avoidance of an extended diagnostic odyssey, providing an important benefit to families and allowing for genetic counseling for the couple and other family members.
Figure 1. Newborn screening paradigms. Newborn screening progressed from one marker for each disease (left) to multiplex testing with multiple markers identified from a single test (right) broadening the scope of initial screening tests and significantly increasing the differential diagnosis of a positive newborn screen.
Tiered Testing Strategies
Tiered testing strategies, including utilization of MS/MS, have led to improved positive predictive value of abnormal screens (9, 26). For example, steroid profiling via MS/MS as a second-tier test after a positive NBS for CAH has been shown to decrease the false positive rate of the initial test (15). Screening for congenital hypothyroidism with thyroxine (T4) is highly sensitive but not particularly specific; the addition of thyrotropin (TSH) as a second-tier test in those neonates with low T4 improves the specificity and decreases the false positive rate (9, 27). Other examples of tiered testing strategies include DNA testing on DBS after positive screens for CF, hemoglobinopathies, and MCAD deficiency (9, 10, 15). It is important to distinguish tiered screening testing strategies from diagnostic testing. A positive NBS, with or without second-tier testing, requires follow-up diagnostic testing. In some cases, the technology may be the same used in the initial screening test (e.g., MS/MS for fatty acid oxidation disorders); however, confirmatory diagnostic testing is still indicated as the cut-off values for screening tests are different than those of diagnostic testing (9). In addition, confirmatory diagnostic testing often utilizes additional methodologies not included in the initial NBS test (e.g., urine organic acid analysis, enzyme analysis, and/or DNA sequencing).
The Recommended Uniform Screening Panel
As more states began utilizing MS/MS for NBS, it became possible to screen for a larger number of disorders, and the variability in NBS among states increased (9). Recognizing the increasing complexity of the NBS landscape, in the late 1990's, the US Health Resource and Services Administration (HRSA) requested that the American Academy of Pediatrics (AAP) review the current state of affairs of NBS in the United States and provide recommendations for improvement. The AAP Newborn Screening Task Force concluded that there was a need for consistency and equity among NBS programs in the United States (28). In response, HRSA charged the American College of Medical Genetics and Genomics (ACMG) with developing a national framework for NBS and, specifically, with the development of a uniform panel of conditions as well as model policies, procedures, minimum standards, methods for expansion of NBS programs, as well as quality and oversight on a national level (9).
The ACMG developed a scoring system with which to evaluate each condition to be included on a recommended uniform screening panel, based on the following categories (9):
1. The clinical characteristics of the condition;
2. The analytical characteristics of the test; and
3. Diagnosis, follow-up, treatment, and management of the condition.
Each category included specific criteria upon which the tests were scored, outlined in Table 1. Based on the sum of scores from each category, a condition was then assigned to one of three groups, namely, the core panel, the secondary targets (differential diagnosis of core panel disorders), and conditions not appropriate for NBS (9). The ACMG also recommended standardization of reporting language, reporting standards, as well as improved oversight, long-term data collection, quality improvement, follow-up, and funding (9). With the inclusion of secondary targets, the ACMG broadened the original Wilson and Jungner criteria definition of benefit to include not only direct benefit for the individual being screened in the form of a specific treatment but also benefit to the individual screened, the family, and society derived from identification and management of disorders for which a specific treatment may not exist (9, 29, 30).
In 2008, the Advisory Committee on Heritable Disorders in Newborns and Children (ACHDNC), established by the Department of Health and Human Services (DHHS) in 2003, was given the responsibility to complete regular evidence-based systematic review of and recommendations for conditions to be included on the Recommended Universal Newborn Screening Panel (RUSP), which is based on the 2006 ACMG expert report outlining principles and processes for uniform NBS as well as the original Wilson and Jungner criteria (8, 9, 29, 31). Newborn screening in all states is mandatory, with limited ability for parents to opt out of testing (32). The current core and secondary targets can be found on the ACHDNC website at Recommended Uniform Screening Panel | Official web site of the U.S. Health Resources & Services Administration (hrsa.gov) (33).
Point-of-Care NBS
Additional methods for NBS fall outside the DBS paradigm for testing. The first point-of-care NBS began with neonatal early hearing detection and intervention screening in Hawaii in 1990 and is now part of the RUSP (34). Identification of critical congenital heart disease by pulse oximetry screening, another point-of-care NBS test, was added in 2011 (35).
Primary DNA NBS
Newborn screening for severe combined immunodeficiency (SCID) began in the United States in 2008, was added to the RUSP in 2010, and ushered in a new era of primary DNA screening (36). DNA extracted from DBS had been used for second-tier testing for sickle cell disease (SCD) utilizing PCR and allele-specific oligonucleotide hybridization, as SCD is due to a single specific beta-globin gene point mutation (12). NBS for SCID utilizes DNA extracted from DBS as the primary screen; quantitative PCR is used to assess DNA copy number (10, 15, 36). Severe combined immunodeficiency is characterized by decreased amounts of T- and/or B-cells and leads to the inability of the body to effectively fight infections (36). Without appropriate treatment, such as hematopoietic stem cell transplant or enzyme replacement, death due to overwhelming infections inevitably occurs. Severe combined immunodeficiency can be effectively screened for with the identification of decreased T-cell receptor excision circles (TRECs), which are small pieces of leftover DNA formed through DNA recombination in the process of T-cell maturation (36). Low numbers of TRECs identified on NBS indicate inadequate T-cells and trigger secondary testing with flow cytometry to identify the specific deficiency, as NBS for SCID also identifies other non-SCID T-cell deficiencies (36).
A more recent example of NBS utilizing DNA extracted from DBS as a primary target is SMN1-related spinal muscular atrophy (SMA), which was added to the RUSP in 2018 (31, 37). SMN1-related SMA is a progressive disorder characterized by muscle weakness due to loss of anterior horn cells and has a broad phenotypic spectrum ranging from a severe neonatal presentation with weakness, hypotonia, respiratory failure, and death in early infancy to an adult-onset presentation of muscle weakness without respiratory insufficiency and with a normal lifespan (31). Treatment for SMN1-related SMA has recently become available and slows or prevents the progression of the disorder once therapy is initiated, thus making it an ideal candidate for NBS (31). Screening identifies a common gene deletion, and as for SCID, the primary screen is a quantitative PCR technique; combining SCID and SMA in one newborn screen assay has been proposed (37). For both of these conditions, DNA is used for the primary screen but neither employs DNA sequencing. However, as a result of the introduction of SCID and SMA newborn screening, NBS programs now routinely extract DNA from DBS, allowing the introduction of other DNA-based assays, such as DNA sequencing. This and improvements that allow high-throughput DNA sequencing with a rapid turnaround time have led to an increased interest in the use of DNA sequencing as a primary NBS test.
Whole Exome and Whole Genome Sequencing in Disease Diagnosis
The production of the map of the human genome promised a new era in medicine, one in which genetic and genomic information would be used routinely in health care (38). With an available map, advanced DNA sequencing techniques that are high throughput and low cost offered the promise of the personalized genome, that is, sequencing of an individual's genome, in order to provide a personal health benefit (39). In the two decades since the completion of the human genome, the utility of DNA sequencing for health benefit has been demonstrated multiple times and is now routine in some areas of medicine. The first use of individual genome information for disease diagnosis occurred in 2009 at the University of Wisconsin for a child with severe inflammatory bowel disease. Sequencing of his entire genome and DNA variant analysis yielded an answer, immunodeficiency due to a pathogenic variant in the XIAP gene (40). This test costs $75,000 and required 4 months of analysis (41). It resulted in life-saving treatment; based on this genetic diagnosis, the child was successfully treated for his debilitating disease with a cord blood transplant. Additional confirmation of the power of next-generation sequencing (NGS) technology came from a proof-of-principle paper in which exome sequencing identified the correct genetic diagnosis for four individuals with the rare disorder Freeman–Sheldon syndrome (42). Shortly thereafter, next-generation sequencing and DNA variant analysis of four individuals with Miller syndrome, which lacked a causative disease gene at that time, resulted in the identification of that gene, DHODH (43). Exome sequencing remains a method for disease diagnosis and for disease gene discovery (44, 45).
Whole Exome and Whole Genome Sequencing in Disease Screening
Newborn Sequencing in Genomic Medicine and Public Health
The power of exome sequencing and computational analysis to identify new disease genes and diagnose individuals with rare disorders suggested other uses for this technology including pre-conception carrier testing and NBS (46). The ability to extract DNA from dried blood spots (11) and the development of high-throughput technologies for DNA sequencing have led to the use of next-generation sequencing in NBS in follow-up testing after an abnormal enzyme or analyte primary screen and have held out the promise of the use of this as a primary screen, especially for those early-onset treatable disorders that lack a current NBS test (11, 47–52). Recognizing the need for research in the area of DNA sequencing and newborn health, the NIH funded the Newborn Sequencing in Genomic Medicine and Public Health (NSIGHT) network in the early 2010's (53). The three key research questions to be addressed were as follows: (1) For disorders currently screened in newborns, how can genomic sequencing replicate or augment known newborn screening results? Can sequencing replace current modalities? (2) What knowledge could genomic sequencing provide about conditions not currently screened for in newborns? (3) What additional clinical information could be learned from genomic sequencing relevant to the clinical care of newborns (53)? The four funded projects addressed these questions through separate study designs and patient populations (53). Two projects included cases with abnormal newborn screen results. In the North Carolina Newborn Exome Sequencing for Universal Screening (NC NEXUS) project, healthy newborns (61 subjects) and infants and children <5 years of age with known abnormal newborn screening results (17 subjects) or with hearing loss (28 subjects) were enrolled for exome sequencing. Analysis was blinded to phenotype, and analyzed genes included those with childhood-onset medically actionable disorders (NBS-NGS, 466 genes, all subjects) and those with diagnostic findings (affected subjects, additional indication-based genes analyzed). This project included randomization to an arm in which parents could decide whether to learn additional information from the genomic analysis including low or no actionability childhood-onset conditions, high actionability adult-onset conditions, and carrier status for recessive disorders (54). The NBSeq project performed exome sequencing retrospectively on DNA obtained from dried blood spots of cases with known IEM diagnosed through conventional newborn screening in California. These were de-identified samples and included cases with false positive newborn screens. The role of exome sequencing as a primary or secondary test for IEM was assessed (55). The two additional NSIGHT projects addressed the utility of WGS in sick newborns and the role of WES in sick and healthy newborns (BabySeq) and did not have a primary newborn screening aim (53).
The NC NEXUS project found that NBS-NGS was 88% sensitive for cases with an abnormal newborn screen for an IEM and 18% sensitive for the hearing loss cohort (54). Four individuals had abnormal NBS-NGS results not identified by other methods, including female heterozygote status for OTC deficiency in a child with PKU and heterozygous status for a known pathogenic variant in LDLR causing autosomal dominant familial hypercholesterolemia in a child with a known family history of this. Two children in the hearing loss cohort had additional clinically relevant findings, one with a variant in DSC2 causing autosomal dominant arrhythmogenic right ventricular dysplasia and one with two variants in the gene associated with factor XI deficiency (54). These findings addressed the second aim of the NSIGHT consortium.
The NBSeq project found that DNA could not substitute as a primary newborn screen for disorders currently screened for by analyte, due to insufficient sensitivity (88%) and specificity (94%), but that DNA testing could be beneficial in follow-up testing in reducing follow-up of false positive results and in identifying disease diagnosis, which address the first aim of the NSIGHT consortium and demonstrate that DNA sequencing cannot substitute for current conventional screening but can augment it (55).
In the BabySeq project, which did not have a primary NBS component, three cases were found through WES to have newborn screening-related disorders missed on conventional screening. They were partial biotinidase deficiency (not associated with symptoms in most cases), non-classical congenital adrenal hyperplasia, and post-lingual KCNQ4 hearing loss (56). These findings addressed the third aim of the NSIGHT consortium, by demonstrating that non-classical forms of some disorders can be identified, though there is no current information regarding sensitivity for these.
The NSIGHT projects have demonstrated the current capabilities and the gaps to be addressed to improve the performance of NGS in NBS. Importantly, despite technological advances, the Ethics and Policy Board of NSIGHT recommended against genomic sequencing of all babies at birth and called for a nuanced use of genomic technologies, taking into account the contexts of screening vs. diagnosis and the contexts of clinical care, public health, and direct to consumer testing (57).
Practical Aspects of Genomic Testing for NBS
A crucial issue in genomic medicine is the large number of DNA variants in each individual. Depending on the context, narrowing the number of genes to be evaluated can be critical for improving test interpretation. Therefore, targeted testing has been proposed in carrier screening and NBS. This can also affect coverage of the individual genes assessed. In 2015, Naylor and colleagues reported the performance of two targeted next-generation sequencing (TNGS) platforms in two clinical contexts. They used targeted DNA sequencing either through WES with an in silico gene filter for 126 genes or through a next-generation sequencing panel, NBDx, which included a DNA capture step for exons of the 126-NBDx gene panel. These TNGS platforms were used both after an abnormal newborn screen and for ill infants in the neonatal intensive care unit (NICU). The authors compared these two platforms in the retrospective evaluation of 36 individuals with known IEM from the Amish and Mennonite communities. While the NBDx gene panel had advantages over WES, an important result from this study in a homogeneous population is that only 27 of 36 disorders were correctly identified in the experimental cohort in the absence of clinical information (50). This paper not only demonstrated the technical feasibility of obtaining DNA for NGS from dried blood spots and of developing a rapid and cost-effective test applicable to newborns with abnormal NBS and ill infants in the NICU, but also highlighted the limitation of DNA sequence information alone for disease diagnosis and the importance of clinical information to aid DNA variant interpretation. The authors suggest the possible use of NGS as a primary NBS in the future.
In 2017, the utility of NBS by WGS, rather than WES, was shown taking advantage of two DNA sequencing studies in newborns in the Inova Health System in Virginia (58). This study used two different WGS platforms on which 163 genes were analyzed. In contrast to the WES retrospective proof-of-principle study in a small number of affected individuals from the Amish and Mennonite communities, this study was conducted in an ancestrally diverse population of almost 1,700 newborns. Importantly, and unrealistically for a NBS public health program, parents were also sequenced, allowing phasing of variants for autosomal recessive disorders. Variants were classified using the ACMG criteria of pathogenic, likely pathogenic, benign, likely benign, and variant of uncertain significance (59). Only pathogenic and likely pathogenic variants were called. Importantly, exon coverage differed by gene and sequencing technology. While WGS did identify two cases missed on Virginia state NBS (hemoglobin SC disease and Duarte variant galactosemia, the latter a non-disease), the conclusion of this paper was that conventional NBS could not be replaced by WGS as NBS identified 4/5 affected neonates in the cohort and WGS identified only 2/5. This team suggested periodically reassessing NBS and WGS and using a larger cohort in later studies. The benefits of WGS in NBS were lower false positives than conventional NBS, resolution of inconclusive NBS results, distinguishing the correct disorder in cases of ambiguity with NBS results, and decreased numbers of follow-up samples required for preterm infants.
Next-generation DNA sequencing has been assessed as a primary newborn screen in Korea, where the use of a 307-gene panel for 159 disorders, including 60 neonatal IEM, was assessed. The study also addressed turnaround time (TAT), cost, and variant interpretation. They sequenced 103 subjects, 81 affected individuals and 22 controls. Remarkably, for the affected individuals, only 12% of causal variants were annotated in databases. Crucially, for the 307 genes, each subject had 8.6 variants, for 3.4 diseases; thus, manual curation and clinical information were required for each subject. Eighty-eight percent of the variants were non-disease causing (60). Variant interpretation was identified as a critical limiting factor in the use of TNGS as a primary screen in this work.
The UK is investing significantly in genomic sequencing, and the role of NGS in NBS in their National Health Service has been reported (52). They demonstrated that screening for five genes (ACADM, PAH, TSHR, CFTR, and HBB) in the National Health Service in the UK is feasible. The genes chosen for the study corresponded to disorders already on the NBS panel, specifically MCAD deficiency, PKU, congenital hypothyroidism, and sickle cell disease, respectively. They achieved a TAT of 4 days and could process 1,000 samples per week (a typical NBS lab processes 50,000 samples per year) but suggested that cost might be prohibitive as the cost of the current NBS test is 25 pounds, and the NGS test would cost 62 pounds or more. A consideration of adding diseases based on merit and not technologies was also recommended.
These papers and those of the NSIGHT network, discussed here and outlined in Table 2, demonstrate that while technically feasible, perhaps even with sufficient TAT to be appropriate for NBS, the high cost and limitations of variant detection (intronic variants, regulatory regions, copy number variants, structural variants, and trinucleotide repeats), variant interpretation and of annotated disease databases remain a significant barrier to the use of DNA as a primary screen. The contrast between the success of DNA in diagnosis and the difficulties in the use of DNA in screening highlights that the lack of a phenotype to guide variant interpretation remains the chief limitation of WES and WGS in NBS. In the next sections, we will examine technical aspects of NGS interpretation, especially as it relates to NBS.
Current Technologies to Detect Genetic Variants
To effectively identify disease from DNA, the relevant disease-causing genetic variants should be both technologically detectable and clinically interpretable (Figure 2, Table 3). Historically, genetic tests have been limited in scope to a single or few genes associated with specific genetic conditions and performed using conventional technologies like Sanger sequencing of PCR-amplified coding regions of the gene(s) of interest. In recent years, high-throughput (next-generation) sequencing has greatly expanded the scope of genetic loci that can be technologically assayed in a single individual's genome. In particular, as noted in the previous section, WES is now a commonly used tool in diagnosis of various rare genetic disorders (61–66). However, challenges remain in applying WES or WGS to NBS, including the fact that capture and read coverage may be non-uniform and some disease-causing variants may be missed due to poor coverage. Recent simulations indicate that read mapping and variant calling in some NBS genes could be affected by homologous genomic regions (CYP21A2, SMN1, CBS, and CORO1A) (67). Whole genome sequencing expands the scope of detectable variants beyond coding regions. Besides revealing the ~98% of the non-coding genomic regions not visible to WES, WGS can provide a better view of the coding regions as well as more uniform coverage. Indeed, in some IEM, WGS revealed large deletions in IEM genes not observed by WES alone (55).
Figure 2. A typical genomic analysis pipeline in the context of newborn screening (NBS). Among all the variants observed in the newborn DNA sequence, only those occurring in previously identified NBS genes are considered further. Within NBS genes, rare variants are prioritized over common variants. A combination of curated pathogenic variant databases and computational prediction tools is utilized to assign variant pathogenicity and screen for individuals who carry such variants. For NBS, the pipeline will need to demonstrate both a high sensitivity (screen positive almost all newborns with disease) and a high specificity (screen negative almost all newborns without disease).
Still, there are regions of the genome that are difficult to sequence using both conventional WES and WGS that rely on short-read sequencing (e.g., large insertions and deletions, tandem-repeat expansion, and complex chromosomal structural aberrations). Recent advances in long-read sequencing technologies could reveal such classes of variations to expand the repertoire for rare disease diagnosis. Even though such technologies are costly and data analysis methods are in early stages, there have been promising recent studies that have leveraged long-read sequencing to pathogenic variants in rare disease not detectable by conventional WES/WGS (68).
Variant Interpretation and Current Challenges
Assuming the causative genetic variants are confidently detected, another major challenge for integrating genomics in NBS will be the clinical interpretation of genetic variants. To implicate pathogenic variants, the field leverages both curation of expert knowledge as well as prediction from computational tools.
Over the last few years, extensive clinical guidelines for variant interpretation have been developed, converging to ordinal five-tier “pathogenic” to “benign” labels for genetic variants (59). However, these guidelines currently do not accommodate the possibility that the same variant may need to be considered differently for specific diseases under different clinical contexts. In the context of NBS, similar to global collaborative efforts for harmonization of cutoff values in MS/MS data, collaborative efforts are needed to standardize and share variant classification and evidence across laboratories (69). In a recent study evaluating concordance of variant classification across nine laboratories (eight CLIA-accredited and one research laboratory), 54% of variants reached complete concordance, whereas 11% had a discordance that could affect clinical recommendation (70). After subsequent review of the discordant variants, the concordance increased to 84%. Even after review and data sharing, a significant proportion of existing variants remains of uncertain significance, labeled VUS, where additional evidence in the form of functional or computational studies are needed to resolve pathogenicity.
Several computational tools have been developed to interpret human genetic variants and predict their consequence on clinical phenotypes. Most of these tools aim to distinguish disease-causing pathogenic variants from those that are benign based on various existing information. Most tools predominantly leverage evolutionary conservation information, while others additionally incorporate physiochemical properties and protein structural data (71–74). Besides directly predicting clinical pathogenicity, specialized tools also exist for predicting various intermediate biochemical phenotypes including protein stability, RNA splicing, subcellular protein localization, protein interactions, and mechanism of action (75–79). Recently, meta-predictors have also emerged, which combine scores from several prediction tools using machine learning methods and report an integrated pathogenicity score (80, 81).
While tools can leverage the information about the knowledge of gene and protein function for coding variants, computational predictions of pathogenicity for noncoding variants can be even more challenging. Several tools have been developed to annotate non-coding regulatory regions distal (enhancers) or local (in promoters, untranslated regions or introns) to genes based on methods that integrate information from histone marks, transcription binding, gene expression, phylogenetic analyses, and chromatin accessibility (82–85). Other tools have emerged in this area that leverage machine learning techniques to integrate such annotations into pathogenicity scores (86–89). Besides single-nucleotide changes, several computational tools have also emerged to detect pathogenic structural variants from genomic data (90–92).
Despite over a hundred computational tools currently available for variant interpretation [recent catalog in ref (93)], challenges remain. Most computational tools train their algorithms in a supervised fashion across large databases of previously characterized human disease-associated and neutral variants. These tools typically use similar underlying training datasets, model features, and design principles, which leads to confounding issues of circularity and overfitting (94). Most tools assume that the properties that determine a variant's deleteriousness are generalizable across all genes. Yet, to make more accurate predictions of variant impact in a particular gene of interest, computational tools of the future may need to incorporate properties that capture the biological context specific to that gene. A recent work focused on computational tools specialized in predictions for a gene or gene family of interest has found that incorporating the context of an individual gene, biological pathway, and disease can improve quality of predictions (95, 96). The main challenge for gene-specific computational approaches, however, is the lack of sizable variant datasets for training on individual genes, particularly for rare diseases. A potential solution is emerging in the form of high-throughput functional assays that measure the variety of molecular and cellular consequences of all possible variants, in particular disease-relevant loci (97–99).
Another limitation of current variant interpretation tools is that they ignore the diplotypic context of an individual's genetic variant during both training and inference. The impact of a single pathogenic variant on the eventual clinical phenotypes is often modulated by other variants in the individual's genome, sometimes even within the same gene (100, 101). For example, in recessive disorders, the combination of pathogenic variants in both the paternal and maternal copies of a gene determines the clinical phenotypes and disease severity. To improve clinical utility of DNA analysis pipelines, the next generation of variant interpretation tools will need to incorporate the full diplotypic context in an individual's genome to predict both the likelihood as well as severity of diseases. For now, pathogenicity assertions about genetic variants using computational tools alone are not sufficient, and human review is still required for proper variant interpretation and return of genetic results.
Sensitivity/Specificity Tradeoffs in Diagnosis Vs. Screening
Even though most genomic analysis pipelines for rare disease use similar sets of parameters, the optimal design and thresholds depend on the clinical context and application (102). Unlike diagnostic settings where DNA analysis is guided by phenotypic data to identify genetic variants that could explain an individual's clinical features, NBS is performed on asymptomatic newborns with no a priori phenotypes. The trade-offs of sensitivity and specificity can be different in these two contexts. Typically, diagnostic DNA pipelines report true positive rates (sensitivity) of around 25–60%, but the true negative rates (specificity) are often not reported. For population-scale NBS, both the sensitivity and specificity requirements are much stricter. Because most of the population will be unaffected in rare diseases typically screened for in the newborn period, even a 1% false positive rate for a screening test can translate to a large burden of false positive cases that require follow-up. Therefore, systematic exploration of parameters in analysis pipelines is necessary to achieve a balance between sensitivity and specificity that is best suited for screening (103).
As noted above, the NBSeq project, a retrospective study evaluating WES as a primary NBS test in an 8.5-year population-scale cohort for 48 IEM in California, achieved 88% overall sensitivity with a specificity of 98.4% (55). In comparison, current MS/MS analyte-based screening has a sensitivity and specificity of 99.0 and 99.8%, respectively, in the same cohort (69, 104–106). WES was therefore concluded to be insufficiently sensitive or specific as a general primary NBS test for IEM. The NC NEXUS study was performed in a smaller cohort and found similar results (54).
The appeal of genomic sequencing as a promising single test for all genetic diseases in the future should be reconciled with the possibility that the analytical performance of sequencing as a screening test may vary widely across different individual disorders. Such differences could arise from a range of factors, including limited prior genetic and clinical data, particularly in very rare conditions, as well as incomplete biological characterization of some genetic diseases. When grouped by prevalence, indeed the most common IEM (>2.5 per 10,000) had higher WES sensitivity of 91% and 78%, respectively, compared to the rarest (<0.04 per 10,000) (55).
An additional complexity arises from the fact that current databases of disease-associated genetic variants are largely Eurocentric. The larger proportion of previously uncharacterized variants in underrepresented populations could result in poorer performance of a sequencing-based screening test in such populations. Indeed, previous studies have demonstrated that variant misclassification in understudied populations can lead to genetic misdiagnoses with potential for exacerbating health disparities (107).
Besides analytical performance, bioinformatics analysis screening poses implementation challenges that may differ from a diagnostic scenario. Whereas, genomic analysis in diagnostic settings typically involves ad hoc rule-based pipelines with expert review of individual cases along with additional genomic data from other family members, NBS has to be performed at a population-level scale, requiring the analysis pipelines to be largely automated and streamlined.
Ethical and Societal Considerations
Finally, even if all of the technical issues surrounding the application of next-generation sequencing to NBS are solved, social and ethical barriers to universal implementation will remain. Based largely on the ACMG criteria, many (perhaps most) genetic disorders would not qualify as candidates for NBS since early detection will not lead to specific therapy that will change the course of the disease and thus provide a broad societal benefit. This limitation, of course, does not address the potential non-specific benefits of early identification such as implementation of early intervention programs, appropriate prospective disease monitoring, and even family planning, as well as circumventing the prolonged diagnostic odyssey that individuals with rare diseases often face. The cost, not only of screening but also of follow-up, for affected individuals by government-funded programs must also be considered. In essence, WES/WGS screening would mandate a transfer of considerable health care resources from later-onset diagnostic expenses to neonatal screening and follow-up efforts. Additional ethical concerns also abound, most prominently the loss of the right to self-determination for the neonate identified with a later onset of disease.
Conclusions
The evolution of NBS from testing one disease at a time to potentially identifying all possible diseases brings both great promise and challenges to modern health care systems. It is possible, perhaps even likely, that the overall cost to society will be reduced as NGS becomes more economical. However, the decision to adopt such a program must address a much more expansive set of issues that are likely to limit implementation in the near future, including identification of late-onset diseases for which no immediate therapy is available, improvement of turnaround time, and development of larger and multi-ethnic data sets of curated pathogenic variants that allow movement beyond variants of unknown significance. Societal concerns of genetic discrimination, especially as it relates to health insurance, must also be resolved. Rather, the use of NGS technology is more likely to continue to find its optimum use in diagnostic and follow-up settings, though moving to a primary role rather than as a tool of last resort as is now common. Instead, NGS panels and/or analysis focused on disorders with a significant neonatal, or even broader pediatric, footprint may play a role as a bridge to true WES/WGS NBS. Regardless, one of the key goals of traditional newborn screening, better care at lower cost, will ultimately be realized, translating next-generation sequencing to next-generation care.
Author Contributions
AW, RG, JV, and AA contributed to conception and design of the article and wrote sections of the manuscript. AW compiled Table 1. RG compiled Table 2. JV designed Figure 1. AA designed Figure 2. All authors contributed to manuscript revision, read, and approved the submitted version.
Conflict of Interest
AA is an employee of Illumina, Inc.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
1. Jervis GA. Phenylpyruvic oligophrenia: introductory study of fifty cases of mental deficiency associated with excretion of phenylpyruvic acid. Arch Neurol Psychiatry. (1937) 38:944–63. doi: 10.1001/archneurpsyc.1937.02260230042003
2. Bickel H, Gerrard J, Hickmans EM. The influence of phenylalanine intake on the chemistry and behaviour of a phenyl-ketonuric child. Acta Paediatr. (1954) 43:64–77. doi: 10.1111/j.1651-2227.1954.tb04000.x
3. Guthrie R. The introduction of newborn screening for phenylketonuria. A personal history. Eur J Pediatr. (1996) 155(Suppl. 1):S4–5. doi: 10.1007/PL00014247
4. Guthrie R, Susi A. A simple phenylalanine method for detecting phenylketonuria in large populations of newborn infants. Pediatrics. (1963) 32:338–43.
6. Alexander D. The national institute of child health and human development and phenylketonuria. Pediatrics. (2003) 112(6 Pt 2):1514–5.
7. Brosco JP, Mattingly M, Sanders LM. Impact of specific medical interventions on reducing the prevalence of mental retardation. Arch Pediatr Adolesc Med. (2006) 160:302–9. doi: 10.1001/archpedi.160.3.302
8. Wilson JMG, Jungner, Gunnar & World Health Organization. Principles and Practice of Screening for Disease. Public Health Papers. (1968) 34.
9. Watson M, Mann M, Lloyd-Puryear M, Rinaldo P, Rodney Howell R. Newborn screening: toward a uniform screening panel and system. Genet Med. (2006) 8(Suppl. 1):1S−252S. doi: 10.1097/01.gim.0000223467.60151.02
10. Benson JM, Therrell BL Jr. History and current status of newborn screening for hemoglobinopathies. Semin Perinatol. (2010) 34:134–44. doi: 10.1053/j.semperi.2009.12.006
11. McCabe ER, Huang SZ, Seltzer WK, Law ML. DNA microextraction from dried blood spots on filter paper blotters: potential applications to newborn screening. Hum Genet. (1987) 75:213–6. doi: 10.1007/BF00281061
12. Jinks DC, Minter M, Tarver DA, Vanderford M, Hejtmancik JF, McCabe ER. Molecular genetic diagnosis of sickle cell disease using dried blood specimens on blotters used for newborn screening. Hum Genet. (1989) 81:363–6. doi: 10.1007/BF00283692
13. LaFranchi SH, Murphey WH, Foley TP Jr, Larsen PR, Buist NR. Neonatal hypothyroidism detected by the northwest regional screening program. Pediatrics. (1979) 63:180–91.
14. Tajima T, Fukushi M. Neonatal mass screening for 21-hydroxylase deficiency. Clin Pediatr Endocrinol. (2016) 25:1–8. doi: 10.1297/cpe.25.1
15. Mak DY, Sykes J, Stephenson AL, Lands LC. The benefits of newborn screening for cystic fibrosis: the Canadian experience. J Cyst Fibros. (2016) 15:302–8. doi: 10.1016/j.jcf.2016.04.001
16. CDC. Using tandem mass spectrometry for metabolic disease screening among newborns. A report of a work group. MMWR Recomm Rep. (2001) 50:1–34. doi: 10.1037/e548322006-001
17. Chace DH, Adam BW, Smith SJ, Alexander JR, Hillman SL, Hannon WH. Validation of accuracy-based amino acid reference materials in dried-blood spots by tandem mass spectrometry for newborn screening assays. Clin Chem. (1999) 45:1269–77. doi: 10.1093/clinchem/45.8.1269
18. Millington DS, Kodo N, Norwood DL, Roe CR. Tandem mass spectrometry: a new method for acylcarnitine profiling with potential for neonatal screening for inborn errors of metabolism. J Inherit Metab Dis. (1990) 13:321–4. doi: 10.1007/BF01799385
19. Kolvraa S, Gregersen N, Christensen E, Hobolth N. In vitro fibroblast studies in a patient with C6-C10-dicarboxylic aciduria: evidence for a defect in general acyl-CoA dehydrogenase. Clin Chim Acta. (1982) 126:53–67. doi: 10.1016/0009-8981(82)90361-8
20. Rhead WJ. Newborn screening for medium-chain acyl-CoA dehydrogenase deficiency: a global perspective. J Inherit Metab Dis. (2006) 29:370–7. doi: 10.1007/s10545-006-0292-1
21. Iafolla AK, Thompson RJ, Jr., Roe CR. Medium-chain acyl-coenzyme a dehydrogenase deficiency: clinical course in 120 affected children. J Pediatr. (1994) 124:409–15. doi: 10.1016/S0022-3476(94)70363-9
22. Liebl B, Nennstiel-Ratzel U, Roscher A, von Kries R. Data required for the evaluation of newborn screening programmes. Eur J Pediatr. (2003) 162(Suppl. 1):S57–61. doi: 10.1007/s00431-003-1354-0
23. Andresen BS, Dobrowolski SF, O'Reilly L, Muenzer J, McCandless SE, Frazier DM, et al. Medium-chain acyl-CoA dehydrogenase (MCAD) mutations identified by MS/MS-based prospective screening of newborns differ from those observed in patients with clinical symptoms: identification and characterization of a new, prevalent mutation that results in mild MCAD deficiency. Am J Hum Genet. (2001) 68:1408–18. doi: 10.1086/320602
24. Feuchtbaum L, Yang J, Currier R. Follow-up status during the first 5 years of life for metabolic disorders on the federal recommended uniform screening panel. Genet Med. (2018) 20:831–9. doi: 10.1038/gim.2017.199
25. Wolfe L, Jethva R, Oglesbee D, Vockley J. Short-chain Acyl-CoA dehydrogenase deficiency. In: Adam MP, Ardinger HH, Pagon RA, Wallace SE, Bean LJH, Stephens K, et al. editors. GeneReviews (R). Seattle, WA: University of Washington (1993).
26. Matern D, Tortorelli S, Oglesbee D, Gavrilov D, Rinaldo P. Reduction of the false-positive rate in newborn screening by implementation of MS/MS-based second-tier tests: the mayo clinic experience (2004–2007). J Inherit Metab Dis. (2007) 30:585–92. doi: 10.1007/s10545-007-0691-y
27. Rose SR, Brown RS, Foley T, Kaplowitz PB, Kaye CI, Sundararajan S, et al. Update of newborn screening and therapy for congenital hypothyroidism. Pediatrics. (2006) 117:2290–303. doi: 10.1542/peds.2006-0915
28. Serving the family from birth to the medical home. Newborn screening: a blueprint for the future - a call for a national agenda on state newborn screening programs. Pediatrics. (2000) 106(2 Pt 2):389–422.
29. Trotter TL, Fleischman AR, Howell RR, Lloyd-Puryear M, Secretary's Advisory Committee on Heritable Disorders in Newborns and Children. Secretary's Advisory Committee on Heritable Disorders in Newborns and Children response to the President's Council on Bioethics report: the changing moral focus of newborn screening. Genet Med. (2011) 13:301–4. doi: 10.1097/GIM.0b013e318210655d
30. Bioethics PsCo. The Changing Moral Focus of Newborn Screening [White Paper]. The President's Council on Bioethics: Washington DC (2008).
31. Children ACoHDiNa. Report to Congress 2018. A Report to Congress Published by the Advisory Committee on Heritable Disorders in Newborns and Children, a Federal Advisory Committee of the Health Resources & Services Administration. Available online at: https://www.hrsa.gov/advisory-committees/heritable-disorders/recommendations-reports/reports/index.html
32. McCandless SE, Wright EJ. Mandatory newborn screening in the United States: history, current status, and existential challenges. Birth Defects Res. (2020) 112:350–66. doi: 10.1002/bdr2.1653
33. Children ACoHDiNa. Recommended Uniform Screening Panel. Available online at: https://www.hrsa.gov/advisory-committees/heritable-disorders/rusp/index.html (accessed April 21, 2021).
34. White KR, Forsman I, Eichwald J, Munoz K. The evolution of early hearing detection and intervention programs in the United States. Semin Perinatol. (2010) 34:170–9. doi: 10.1053/j.semperi.2009.12.009
35. Oster ME, Aucott SW, Glidewell J, Hackell J, Kochilas L, Martin GR, et al. Lessons learned from newborn screening for critical congenital heart defects. Pediatrics. (2016) 137:e20154573. doi: 10.1542/peds.2015-4573
36. Kwan A, Abraham RS, Currier R, Brower A, Andruszewski K, Abbott JK, et al. Newborn screening for severe combined immunodeficiency in 11 screening programs in the United States. JAMA. (2014) 312:729–38. doi: 10.1001/jama.2014.9132
37. Kucera KS, Taylor JL, Robles VR, Clinard K, Migliore B, Boyea BL, et al. A voluntary statewide newborn screening pilot for spinal muscular atrophy: results from early check. Int J Neonatal Screen. (2021) 7:20. doi: 10.3390/ijns7010020
38. Guttmacher AE, Collins FS. Genomic medicine–a primer. N Engl J Med. (2002) 347:1512–20. doi: 10.1056/NEJMra012240
39. Shendure J, Mitra RD, Varma C, Church GM. Advanced sequencing technologies: methods and goals. Nat Rev Genet. (2004) 5:335–44. doi: 10.1038/nrg1325
40. Worthey EA, Mayer AN, Syverson GD, Helbling D, Bonacci BB, Decker B, et al. Making a definitive diagnosis: successful clinical application of whole exome sequencing in a child with intractable inflammatory bowel disease. Genet Med. (2011) 13:255–62. doi: 10.1097/GIM.0b013e3182088158
42. Ng SB, Turner EH, Robertson PD, Flygare SD, Bigham AW, Lee C, et al. Targeted capture and massively parallel sequencing of 12 human exomes. Nature. (2009) 461:272–6. doi: 10.1038/nature08250
43. Ng SB, Buckingham KJ, Lee C, Bigham AW, Tabor HK, Dent KM, et al. Exome sequencing identifies the cause of a mendelian disorder. Nat Genet. (2010) 42:30–5. doi: 10.1038/ng.499
44. Collins FS, Doudna JA, Lander ES, Rotimi CN. Human molecular genetics and genomics - important advances and exciting possibilities. N Engl J Med. (2021) 384:1–4. doi: 10.1056/NEJMp2030694
45. Cooper GM, Shendure J. Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data. Nat Rev Genet. (2011) 12:628–40. doi: 10.1038/nrg3046
46. Bell CJ, Dinwiddie DL, Miller NA, Hateley SL, Ganusova EE, Mudge J, et al. Carrier testing for severe childhood recessive diseases by next-generation sequencing. Sci Transl Med. (2011) 3:65ra4. doi: 10.1126/scitranslmed.3001756
47. Ko JM, Park KS, Kang Y, Nam SH, Kim Y, Park I, et al. A new integrated newborn screening workflow can provide a shortcut to differential diagnosis and confirmation of inherited metabolic diseases. Yonsei Med J. (2018) 59:652–61. doi: 10.3349/ymj.2018.59.5.652
48. Park KJ, Park S, Lee E, Park JH, Park JH, Park HD, et al. A population-based genomic study of inherited metabolic diseases detected through newborn screening. Ann Lab Med. (2016) 36:561–72. doi: 10.3343/alm.2016.36.6.561
49. Smon A, Repic Lampret B, Groselj U, Zerjav Tansek M, Kovac J, Perko D, et al. Next generation sequencing as a follow-up test in an expanded newborn screening programme. Clin Biochem. (2018) 52:48–55. doi: 10.1016/j.clinbiochem.2017.10.016
50. Bhattacharjee A, Sokolsky T, Wyman SK, Reese MG, Puffenberger E, Strauss K, et al. Development of DNA confirmatory and high-risk diagnostic testing for newborns using targeted next-generation DNA sequencing. Genet Med. (2015) 17:337–47. doi: 10.1038/gim.2014.117
51. Tarini BA, Goldenberg AJ. Ethical issues with newborn screening in the genomics era. Annu Rev Genomics Hum Genet. (2012) 13:381–93. doi: 10.1146/annurev-genom-090711-163741
52. van Campen JC, Sollars ESA, Thomas RC, Bartlett CM, Milano A, Parker MD, et al. Next generation sequencing in newborn screening in the United Kingdom national health service. Int J Neonatal Screen. (2019) 5:40. doi: 10.3390/ijns5040040
53. Berg JS, Agrawal PB, Bailey DB, Jr., Beggs AH, Brenner SE, et al. Newborn sequencing in genomic medicine and public health. Pediatrics. (2017) 139:e20162252. doi: 10.1542/peds.2016-2252
54. Roman TS, Crowley SB, Roche MI, Foreman AKM, O'Daniel JM, Seifert BA, et al. Genomic sequencing for newborn screening: results of the NC NEXUS project. Am J Hum Genet. (2020) 107:596–611. doi: 10.1016/j.ajhg.2020.08.001
55. Adhikari AN, Gallagher RC, Wang Y, Currier RJ, Amatuni G, Bassaganyas L, et al. The role of exome sequencing in newborn screening for inborn errors of metabolism. Nat Med. (2020) 26:1392–7. doi: 10.1038/s41591-020-0966-5
56. Ceyhan-Birsoy O, Murry JB, Machini K, Lebo MS, Yu TW, Fayer S, et al. Interpretation of genomic sequencing results in healthy and ill newborns: results from the babySeq project. Am J Hum Genet. (2019) 104:76–93. doi: 10.1016/j.ajhg.2018.11.016
57. Johnston J, Lantos JD, Goldenberg A, Chen F, Parens E, Koenig BA, et al. Sequencing newborns: a call for nuanced use of genomic technologies. Hastings Cent Rep. (2018) 48(Suppl. 2):S2–6. doi: 10.1002/hast.874
58. Bodian DL, Klein E, Iyer RK, Wong WS, Kothiyal P, Stauffer D, et al. Utility of whole-genome sequencing for detection of newborn screening disorders in a population cohort of 1,696 neonates. Genet Med. (2016) 18:221–30. doi: 10.1038/gim.2015.111
59. Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American college of medical genetics and genomics and the association for molecular pathology. Genet Med. (2015) 17:405–24. doi: 10.1038/gim.2015.30
60. Cho Y, Lee CH, Jeong EG, Kim MH, Hong JH, Ko Y, et al. Prevalence of rare genetic variations and their implications in NGS-data interpretation. Sci Rep. (2017) 7:9810. doi: 10.1038/s41598-017-09247-5
61. Bamshad MJ, Ng SB, Bigham AW, Tabor HK, Emond MJ, Nickerson DA, et al. Exome sequencing as a tool for Mendelian disease gene discovery. Nat Rev Genet. (2011) 12:745–55. doi: 10.1038/nrg3031
62. Farnaes L, Hildreth A, Sweeney NM, Clark MM, Chowdhury S, Nahas S, et al. Rapid whole-genome sequencing decreases infant morbidity and cost of hospitalization. NPJ Genom Med. (2018) 3:10. doi: 10.1038/s41525-018-0049-4
63. Biesecker LG, Green RC. Diagnostic clinical genome and exome sequencing. N Engl J Med. (2014) 371:1170. doi: 10.1056/NEJMc1408914
64. Adams DR, Eng CM. Next-Generation sequencing to diagnose suspected genetic disorders. N Engl J Med. (2018) 379:1353–62. doi: 10.1056/NEJMra1711801
65. French CE, Delon I, Dolling H, Sanchis-Juan A, Shamardina O, Mégy K, et al. Whole genome sequencing reveals that genetic conditions are frequent in intensively ill children. Intensive Care Med. (2019) 45:627–36. doi: 10.1007/s00134-019-05552-x
66. Friedman JM, Bombard Y, Cornel MC, Fernandez CV, Junker AK, Plon SE, et al. Genome-wide sequencing in acutely ill infants: genomic medicine's critical application? Genet Med. (2018) 21:498–504. doi: 10.1038/s41436-018-0055-z
67. Trier C, Fournous G, Strand JM, Stray-Pedersen A, Pettersen RD, Rowe AD. Next-generation sequencing of newborn screening genes: the accuracy of short-read mapping. NPJ Genom Med. (2020) 5:36. doi: 10.1038/s41525-020-00142-z
68. Mitsuhashi S, Matsumoto N. Long-read sequencing for rare human genetic diseases. J Hum Genet. (2020) 65:11–9. doi: 10.1038/s10038-019-0671-8
69. McHugh D, Cameron CA, Abdenur JE, Abdulrahman M, Adair O, Al Nuaimi SA, et al. Clinical validation of cutoff target ranges in newborn screening of metabolic disorders by tandem mass spectrometry: a worldwide collaborative project. Genet Med. (2011) 13:230–54. doi: 10.1097/GIM.0b013e31820d5e67
70. Amendola LM, Muenzen K, Biesecker LG, Bowling KM, Cooper GM, Dorschner MO, et al. Variant classification concordance using the aCMG-AMP variant interpretation guidelines across nine genomic implementation research studies. Am J Hum Genet. (2020) 107:932–41. doi: 10.1016/j.ajhg.2020.09.011
71. Reva B, Antipin Y, Sander C. Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic acids research. (2011) 39:e118. doi: 10.1093/nar/gkr407
72. Sim NL, Kumar P, Hu J, Henikoff S, Schneider G, Ng PC. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. (2012) 40:W452–7. doi: 10.1093/nar/gks539
73. Adzhubei I, Jordan DM, Sunyaev SR. Predicting functional effect of human missense mutations using polyPhen-2. Curr Protoc Hum Genet. (2013) Chapter 7:Unit7.20. doi: 10.1002/0471142905.hg0720s76
74. Carter H, Douville C, Stenson PD, Cooper DN, Karchin R. Identifying Mendelian disease genes with the variant effect scoring tool. BMC Genomics. (2013) 14(Suppl. 3):S3. doi: 10.1186/1471-2164-14-S3-S3
75. Quan L, Lv Q, Zhang Y. STRUM: structure-based prediction of protein stability changes upon single-point mutation. Bioinformatics. (2016) 32:2936–46. doi: 10.1093/bioinformatics/btw361
76. Yeo G, Burge CB. Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals. J Comput Biol. (2004) 11:377–94. doi: 10.1089/1066527041410418
77. Laurila K, Vihinen M. PROlocalizer: integrated web service for protein subcellular localization prediction. Amino Acids. (2011) 40:975–80. doi: 10.1007/s00726-010-0724-y
78. Betts MJ, Lu Q, Jiang Y, Drusko A, Wichmann O, Utz M, et al. Mechismo: predicting the mechanistic impact of mutations and modifications on molecular interactions. Nucleic Acids Res. (2015) 43:e10. doi: 10.1093/nar/gku1094
79. Li B, Krishnan VG, Mort ME, Xin F, Kamati KK, Cooper DN, et al. Automated inference of molecular mechanisms of disease from amino acid substitutions. Bioinformatics. (2009) 25:2744–50. doi: 10.1093/bioinformatics/btp528
80. Dong C, Wei P, Jian X, Gibbs R, Boerwinkle E, Wang K, et al. Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies. Hum Mol Genet. (2015) 24:2125–37. doi: 10.1093/hmg/ddu733
81. Ioannidis NM, Rothstein JH, Pejaver V, Middha S, McDonnell SK, Baheti S, et al. REVEL: an ensemble method for predicting the pathogenicity of rare missense variants. Am J Hum Genet. (2016) 99:877–85. doi: 10.1016/j.ajhg.2016.08.016
82. Corradin O, Saiakhova A, Akhtar-Zaidi B, Myeroff L, Willis J, Cowper-Sal lari R, et al. Combinatorial effects of multiple enhancer variants in linkage disequilibrium dictate levels of gene expression to confer susceptibility to common traits. Genome Res. (2014) 24:1–13. doi: 10.1101/gr.164079.113
83. Chen J, Tian W. Explaining the disease phenotype of intergenic sNP through predicted long range regulation. Nucleic Acids Res. (2016) 44:8641–54. doi: 10.1093/nar/gkw519
84. Whalen S, Truty RM, Pollard KS. Enhancer-promoter interactions are encoded by complex genomic signatures on looping chromatin. Nat Genet. (2016) 48:488–96. doi: 10.1038/ng.3539
85. Javierre BM, Burren OS, Wilder SP, Kreuzhuber R, Hill SM, Sewitz S, et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell. (2016) 167:1369–84.e19. doi: 10.1016/j.cell.2016.09.037
86. Zhou J, Troyanskaya OG. Predicting effects of noncoding variants with deep learning-based sequence model. Nat Methods. (2015) 12:931–4. doi: 10.1038/nmeth.3547
87. Huang YF, Gulko B, Siepel A. Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data. Nat Genet. (2017) 49:618–24. doi: 10.1038/ng.3810
88. Ritchie GR, Dunham I, Zeggini E, Flicek P. Functional annotation of noncoding sequence variants. Nat Methods. (2014) 11:294–6. doi: 10.1038/nmeth.2832
89. Ionita-Laza I, McCallum K, Xu B, Buxbaum JD. A spectral approach integrating functional genomic annotations for coding and noncoding variants. Nat Genet. (2016) 48:214–20. doi: 10.1038/ng.3477
90. Layer RM, Chiang C, Quinlan AR, Hall IM. LUMPY: a probabilistic framework for structural variant discovery. Genome Biol. (2014) 15:R84. doi: 10.1186/gb-2014-15-6-r84
91. Handsaker RE, Van Doren V, Berman JR, Genovese G, Kashin S, Boettger LM, et al. Large multiallelic copy number variations in humans. Nat Genet. (2015) 47:296–303. doi: 10.1038/ng.3200
92. Abyzov A, Urban AE, Snyder M, Gerstein M. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. (2011) 21:974–84. doi: 10.1101/gr.114876.110
93. Hu Z, Yu C, Furutsuki M, Andreoletti G, Ly M, Hoskins R, et al. VIPdb, a genetic variant impact predictor database. Hum Mutat. (2019) 40:1202–14. doi: 10.1002/humu.23858
94. Grimm DG, Azencott C-A, Aicheler F, Gieraths U, MacArthur DG, Samocha KE, et al. The evaluation of tools used to predict the impact of missense variants is hindered by two types of circularity. Hum Mutat. (2015) 36:513–23. doi: 10.1002/humu.22768
95. Masica DL, Sosnay PR, Cutting GR, Karchin R. Phenotype-optimized sequence ensembles substantially improve prediction of disease-causing mutation in cystic fibrosis. Hum Mutat. (2012) 33:1267–74. doi: 10.1002/humu.22110
96. Torkamani A, Schork NJ. Accurate prediction of deleterious protein kinase polymorphisms. Bioinformatics. (2007) 23:2918–25. doi: 10.1093/bioinformatics/btm437
97. Fowler DM, Fields S. Deep mutational scanning: a new style of protein science. Nat Methods. (2014) 11:801–7. doi: 10.1038/nmeth.3027
98. Starita LM, Young DL, Islam M, Kitzman JO, Gullingsrud J, Hause RJ, et al. Massively parallel functional analysis of BRCA1 rING domain variants. Genetics. (2015) 200:413–22. doi: 10.1534/genetics.115.175802
99. Starita LM, Ahituv N, Dunham MJ, Kitzman JO, Roth FP, Seelig G, et al. Variant interpretation: functional assays to the rescue. Am J Hum Genet. (2017) 101:315–25. doi: 10.1016/j.ajhg.2017.07.014
100. Cooper DN, Krawczak M, Polychronakos C, Tyler-Smith C, Kehrer-Sawatzki H. Where genotype is not predictive of phenotype: towards an understanding of the molecular basis of reduced penetrance in human inherited disease. Hum Genet. (2013) 132:1077–130. doi: 10.1007/s00439-013-1331-2
101. Katsanis N. The continuum of causality in human genetic disorders. Genome Biol. (2016) 17:233. doi: 10.1186/s13059-016-1107-9
102. Wright CF, FitzPatrick DR, Firth HV. Paediatric genomics: diagnosing rare disease in children. Nat Rev Genet. (2018) 19:325. doi: 10.1038/nrg.2018.12
103. Wang Y, Adhikari AN, Sunderam U, Kvale MN, Currier RJ, Gallagher RC, et al. Perturbation robustness analyses reveal important parameters in variant interpretation pipelines. bioRxiv [Preprint]. (2020). doi: 10.1101/2020.06.29.173815
104. Wilcken B, Wiley V, Hammond J, Carpenter K. Screening newborns for inborn errors of metabolism by tandem mass spectrometry. N Engl J Med. (2003) 348:2304–12. doi: 10.1056/NEJMoa025225
105. Mak CM, Lee HC, Chan AY, Lam CW. Inborn errors of metabolism and expanded newborn screening: review and update. Crit Rev Clin Lab Sci. (2013) 50:142–62. doi: 10.3109/10408363.2013.847896
106. Hall PL, Marquardt G, McHugh DM, Currier RJ, Tang H, Stoway SD, et al. Postanalytical tools improve performance of newborn screening by tandem mass spectrometry. Genet Med. (2014) 16:889–95. doi: 10.1038/gim.2014.62
Keywords: newborn screening, whole genome sequencing, whole exome sequencing, next-generation sequencing, recommended uniform screening panel
Citation: Woerner AC, Gallagher RC, Vockley J and Adhikari AN (2021) The Use of Whole Genome and Exome Sequencing for Newborn Screening: Challenges and Opportunities for Population Health. Front. Pediatr. 9:663752. doi: 10.3389/fped.2021.663752
Received: 03 February 2021; Accepted: 07 June 2021;
Published: 19 July 2021.
Edited by:
Mohamed A. Elmonem, Cairo University, EgyptReviewed by:
Silvia Salvatore, University of Insubria, ItalyLaura Vogel Milko, University of North Carolina at Chapel Hill, United States
Copyright © 2021 Woerner, Gallagher, Vockley and Adhikari. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Audrey C. Woerner, YXVkcmV5LndvZXJuZXImI3gwMDA0MDtjaHAuZWR1