From haystack to high precision: advanced sequencing methods to unraveling circulating tumor DNA mutations

Silva, Tamires Ferreira da; Azevedo, Juscelino Carvalho de; Teixeira, Eliel Barbosa; Casseb, Samir Mansour Moraes; Moreira, Fabiano Cordeiro; Assumpção, Paulo Pimentel de; Santos, Sidney Emanuel Batista dos; Calcagno, Danielle Queiroz

doi:10.3389/fmolb.2024.1423470

REVIEW article

Front. Mol. Biosci., 06 August 2024

Sec. Molecular Diagnostics and Therapeutics

Volume 11 - 2024 | https://doi.org/10.3389/fmolb.2024.1423470

1. Programa de Residência Multiprofissional em Saúde (Oncologia), Hospital Universitário João de Barros Barreto, Universidade Federal do Pará, Belém, Brazil
2. Núcleo de Pesquisas em Oncologia, Universidade Federal do Pará, Belém, Brazil

Abstract

Identifying mutations in cancer-associated genes to guide patient treatments is essential for precision medicine. Circulating tumor DNA (ctDNA) offers valuable insights for early cancer detection, treatment assessment, and surveillance. However, a key issue in ctDNA analysis from the bloodstream is the choice of a technique with adequate sensitivity to identify low-frequency molecular changes. Next-generation sequencing (NGS) technology, evolving from parallel to long-read capabilities, enhances ctDNA mutation analysis. In the present review, we describe different NGS approaches for identifying ctDNA mutations and discuss challenges related to standardized methodologies, cost, specificity, clinical context, and bioinformatics expertise for optimal NGS application.

Background

Cancer is a multifaceted and constantly evolving disease, which has a progression of genetically distinct clones that guide its course (Lomakin et al., 2022). In the era of precision medicine, the identification of mutations within cancer-associated genes assumes paramount significance, as it serves as a compass guiding the therapeutic journey for patients (Malone et al., 2020).

As a groundbreaking stride, liquid biopsies have risen as a complementary approach to traditional tissue biopsies, offering molecular insights into tumors that can revolutionize early cancer detection, patient stratification, treatment efficacy assessment, and post-treatment vigilance. Unlike tissue biopsies, this minimally invasive approach stands out for its increased uniformity, mitigating sampling bias across diverse tumor regions (Martins et al., 2021). Central to this methodology are mainly circulating tumor DNA (ctDNA) and circulating tumor cells (CTCs) (Jiang et al., 2021).

In particular, ctDNA corresponds to DNA fragments of about 160–200 base pairs (bp) that contain tumor-specific mutations, which potentially represent the real-time status of the tumor genome (Chen and Zhao, 2019; Noguchi et al., 2020; Yu et al., 2022). Consequently, the assessment of ctDNA at specific time points, such as during clinical management and the detection of minimal residual disease (MRD), has emerged as a pivotal factor in prognostication for a multitude of cancer types, including breast cancer, colorectal cancer and leukemia (Parikh et al., 2021; Fürstenau et al., 2022; Turner et al., 2023).

ctDNA represents about 0.01% of cell-free DNA (cfDNA); such a low fraction makes it challenging to obtain enough quality material for detection, especially at the early stages of tumor development (Huerta et al., 2021). The analysis methodology must be chosen according to individual tumor features, and the sensitivity required to identify ctDNA mutations is inversely related to the tumor stage: the earlier the stage, the more sensitive the technique must be (Elazezy and Joosse, 2018; Oliveira et al., 2020; Sanz-Garcia et al., 2022) (Figure 1).
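The sampling limit implied by such low fractions can be illustrated with a rough calculation. The sketch below is not taken from the cited studies: it assumes that one haploid human genome weighs about 3.3 pg, that mutant molecules are sampled according to a Poisson distribution, and it uses a hypothetical 10 ng cfDNA input; all function names are illustrative.

```python
import math

# Assumption: one haploid human genome weighs about 3.3 pg,
# so 1 ng of cfDNA holds roughly 303 genome equivalents (GE).
PG_PER_HAPLOID_GENOME = 3.3


def genome_equivalents(input_ng: float) -> float:
    """Approximate number of haploid genome copies in a cfDNA input."""
    return input_ng * 1000.0 / PG_PER_HAPLOID_GENOME


def expected_mutant_copies(input_ng: float, tumor_fraction: float) -> float:
    """Expected number of mutant molecules covering one locus."""
    return genome_equivalents(input_ng) * tumor_fraction


def prob_at_least_one_mutant(input_ng: float, tumor_fraction: float) -> float:
    """Poisson probability that the sample holds at least one mutant molecule."""
    return 1.0 - math.exp(-expected_mutant_copies(input_ng, tumor_fraction))


if __name__ == "__main__":
    # Hypothetical 10 ng input at tumor fractions of 0.01%, 0.1% and 1%.
    for fraction in (0.0001, 0.001, 0.01):
        p = prob_at_least_one_mutant(10.0, fraction)
        print(f"tumor fraction {fraction:.2%}: P(>=1 mutant copy) = {p:.2f}")
```

Under these assumptions, a 10 ng input at a 0.01% tumor fraction is expected to contain only about 0.3 mutant copies of a given locus, so the mutation may be physically absent from the tube regardless of how sensitive the downstream assay is.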

FIGURE 1

In 2016, the U.S. Food and Drug Administration (FDA) and the European Medicines Agency approved the first ctDNA-based test to prescribe EGFR inhibitors in patients with non-small cell lung cancer (NSCLC) - Cobas EGFR mutation test v2 (Kwapisz, 2017; U.S Food and Drug Administration, 2022; U.S Food and Drug Administration, 2023). This ctDNA EGFR mutation testing leads to cost reductions and enables more effective treatment, resulting in a positive economic impact. Table 1 shows other current ctDNA tests approved for application in the clinical management of different cancer types.

TABLE 1

Year	Test name	Technology	Company	Biomarker	Molecular alteration	Cancer
2016	cobas® EGFR Mutation Test v2	real-time PCR	Roche Molecular Systems, Inc	EGFR	42 EGFR mutations in exons 18, 19, 20, and 21	NSCLC
2019	Therascreen PIK3CA RGQ PCR Kit	real-time PCR	QIAGEN GmbH	PIK3CA	11 mutations in exons 7, 9, and 20	Breast Cancer
2020	FoundationOne® Liquid CDx	NGS	Foundation Medicine, Inc	PIK3CA	PIK3CA mutations C420R, E542K, E545A, E545D [1635G>T only], E545G, E545K, Q546E, Q546R; and H1047L, H1047R, and H1047Y	Breast Cancer
				BRCA1, BRCA2, ATM	BRCA1, BRCA2, ATM alterations	Prostate Cancer
				BRCA1, BRCA2	BRCA1, BRCA2 alterations	Ovarian Cancer
				MET, EGFR, ALK	ALK, EGFR, MET	NSCLC
2022	Agilent Resolution ctDx FIRST assay	NGS	Resolution Bioscience, Inc	KRAS	KRAS G12C	NSCLC
2022	Agilent Resolution ctDx FIRST assay	NGS	Resolution Bioscience, Inc	EGFR	Single nucleotide variants (SNVs) and deletions	NSCLC
2023	Guardant 360 CDx	NGS	Guardant Health	ESR1	ESR1 missense mutations between codons 310–547	Breast Cancer
2023	FoundationOne® Liquid CDx	NGS	Foundation Medicine, Inc	BRAF	BRAF V600E alteration	Colorectal Cancer

FDA-approved tests for identifying mutations in liquid biopsy.

Adapted from the U.S. Food and Drug Administration: https://www.fda.gov/medical-devices/in-vitro-diagnostics/list-cleared-or-approved-companion-diagnostic-devices-in-vitro-and-imaging-tools and https://www.accessdata.fda.gov/cdrh_docs/pdf19/P190032S010A.pdf

PCR, polymerase chain reaction; NGS, next-generation sequencing; NSCLC, non-small cell lung cancer.

Advances in next-generation sequencing (NGS) technology and a large demand for ctDNA mutation analysis to support clinical studies have facilitated the emergence of sequencing assays covering cancer-related genes (Yu et al., 2022). Because ctDNA is rare, detecting its mutations can be challenging even though NGS has made the analysis more feasible, since NGS can present error rates of 0.1%–1% depending on the platform used (Glenn, 2011).
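A back-of-the-envelope comparison shows why platform error rates of this size matter for rare variants. The sketch below is illustrative only: it assumes that sequencing errors are independent and spread evenly over the three possible wrong bases, and the depth and variant allele fraction (VAF) used in the example are hypothetical.

```python
def expected_alt_reads(depth: int, vaf: float, error_rate: float) -> tuple[float, float]:
    """Return (true_variant_reads, error_reads) expected at one position.

    Assumption: errors are independent and evenly distributed over the
    three wrong bases, so one specific alternate allele receives
    error_rate / 3 of the non-variant reads by chance.
    """
    true_reads = depth * vaf
    error_reads = depth * (1.0 - vaf) * error_rate / 3.0
    return true_reads, error_reads


def signal_to_noise(depth: int, vaf: float, error_rate: float) -> float:
    """Ratio of expected true variant reads to expected error reads."""
    true_reads, error_reads = expected_alt_reads(depth, vaf, error_rate)
    if error_reads == 0:
        return float("inf")
    return true_reads / error_reads


if __name__ == "__main__":
    # Hypothetical example: 10,000x depth, VAF of 0.01%, error rate of 0.1%.
    true_reads, error_reads = expected_alt_reads(10_000, 0.0001, 0.001)
    print(f"true variant reads: {true_reads:.1f}, error reads: {error_reads:.1f}")
```

With these illustrative numbers, about one true variant read competes with roughly three reads carrying the same base by error alone, which is the motivation for the error-correction strategies discussed in the following sections.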

Currently, sequencing technologies have two distinct approaches with different methods and applications. The non-targeted sequencing often provides an overview of the entire genome and captures coding and non-coding regions. Also, it enables new genetic discovery without previous knowledge (Bagger et al., 2024). Conversely, targeted sequencing focuses on specific genes or regions of interest previously known, which participate in biological processes and diseases (Figure 2) (Singh, 2022).

FIGURE 2

Recently, long-read sequencers, known as third-generation sequencing (TGS), have emerged to surpass NGS technologies. This approach allows the reading of single DNA molecules in real time without the need for prior PCR amplification steps, offering high precision and speed. Furthermore, TGS is capable of detecting epigenetic modifications, and its rapid results make it attractive for disease diagnosis, particularly in precision oncology (Ling et al., 2023; Scarano et al., 2024).

In the present review, we describe NGS and TGS approaches and discuss standardized methodologies and challenges for the identification of ctDNA mutations. Additionally, we explore cost-effectiveness, specificity, clinical utility, and bioinformatic implications for optimal NGS application in ctDNA analysis from cancer patients.

Next-generation sequencing

The NGS technology has revolutionized the field of genomics by enabling rapid and affordable large-scale DNA and RNA sequencing. This methodology is based on analyzing several millions of short DNA fragments in parallel, followed by either sequence alignment to a reference genome or de novo sequence assembly (Lin et al., 2021). Therefore, this technology can be useful for real-time monitoring of tumor progression through detection with high accuracy of genetic status from primary and metastatic tumors (Hess et al., 2020).

Library preparation is a critical step that precedes sequencing and varies according to study type and available financial resources. This process ensures that the genetic material is suitable for high-throughput sequencing platforms and may include separation of large fragments, recovery of small fragments through probes, repair of DNA ends, adapter ligation, and addition of the platform-specific adapters supplied with the sequencing kit (Liang et al., 2020; Bohers et al., 2021). A technological advance within library preparation is molecular barcoding, in which random sequences are inserted prior to PCR amplification so that original DNA molecules can be counted without amplification bias and with increased sensitivity (Bohers et al., 2021; Szadkowska et al., 2022).
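The counting principle behind molecular barcoding can be sketched in a few lines. This is a simplified illustration, not the procedure of any specific kit: it assumes that each read has already been reduced to a (position, barcode) pair and that barcodes are compared by exact match, ignoring sequencing errors within the barcode itself. The function name is hypothetical.

```python
from collections import defaultdict


def count_original_molecules(reads):
    """Count distinct barcodes per genomic position.

    `reads` is an iterable of (position, barcode) tuples. PCR duplicates
    share both values, so each distinct pair is treated as one original
    DNA molecule (exact barcode matching is an assumption of this sketch).
    """
    molecules = defaultdict(set)
    for position, barcode in reads:
        molecules[position].add(barcode)
    return {position: len(barcodes) for position, barcodes in molecules.items()}


if __name__ == "__main__":
    reads = [
        (100, "AAGT"), (100, "AAGT"), (100, "CCTA"),  # two molecules, one duplicated
        (200, "AAGT"), (200, "AAGT"), (200, "AAGT"),  # one molecule amplified three times
    ]
    print(count_original_molecules(reads))
```

Counting barcodes rather than reads means that a molecule amplified many times contributes once, which is what removes the amplification bias.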

In ctDNA, the identification of mutations is challenging due to its representation of a small fraction of cfDNA and the need for high levels of plasma DNA for analysis (Dang and Park, 2022). However, the various NGS tools offer potential applicability, specificity, sensitivity and low input, making them invaluable in ctDNA research (Elazezy and Joosse, 2018). This includes non-targeted (Diefenbach et al., 2019; Ganesamoorthy et al., 2022) and targeted approaches (Phallen et al., 2017; Elazezy and Joosse, 2018; Gale et al., 2018; Peng et al., 2019; Zhao et al., 2020; Kato et al., 2021; Hallermayr et al., 2022) (Table 2).

TABLE 2

Technology	Approach	Method	Sensitivity (%)	Specificity (%)	Input (ng)	Applications	Alteration	Reference
NGS	Non-targeted	WGS	5–10	99.85	1–30	Cancer localization and origin, early detection (early and late stage), for research use	Structural and non-coding variations: genome-wide copy number aberrations, methylation profiles and fragmentation patterns	Ganesamoorthy et al. (2022)
	Non-targeted	WES	5	96	5	Cancer detection, monitoring of resistant clones in metastasis, for research use	Exploring unknown mutations	Diefenbach et al. (2019)
	Targeted	Safe-SeqS/UMI-based	0.01–0.05	98.9	3	Cancer detection and monitoring, classification, targetable alterations, for research use	Known point mutations and copy number variation	Elazezy and Joosse (2018)
		Tam-Seq	2	99.9997	0.9–20	Cancer detection and monitoring, classification, targetable alterations, for research use	Known point mutations	Gale et al. (2018)
		CancerSEEK	69–98	99	0.11–119	Early cancer detection	Nonsense mutations, insertions or deletions, synonymous mutations and intronic mutations	Cohen et al. (2018)
		eTam-Seq	0.2	99.9997	6.6–53	Cancer detection and monitoring, classification, targetable alterations, for research use	Low-frequency mutations, short indels	Gale et al. (2018)
		CAPP-Seq	0.02	99.99	32	Molecular profiling, treatment monitoring, ctDNA MRD	Known point mutations, copy number variation and rearrangements	Kato et al. (2021)
		Ig-HTS	10⁻⁶	98.3	500	Minimal residual disease in hematologic malignancy and cancer monitoring	Not mentioned	Rezazadeh et al. (2024)
		TEC-Seq	0.01–0.05	99.99	2.9–49.5	Molecular profiling, treatment monitoring, ctDNA MRD	Point mutations, small insertions, and deletions	Phallen et al. (2017)
		Single primer extension (SPE)	0.05–1	94	1–50	Cancer detection and monitoring, classification, targetable alterations, for research use	Point mutations	Zhao et al. (2020)
		SPE-duplex UMI	0.1–0.2	95	40	Cancer detection and monitoring, classification, targetable alterations, for research use	Single-nucleotide variants and indels	Peng et al. (2019)
		Duplex Sequencing	0.001–0.1	96.91	64	Cancer detection and monitoring, classification, targetable alterations, for research use	Known and unknown mutations, indels, CNV, chromosomal rearrangements (capture)	Hallermayr et al. (2022)
TGS	Single-molecule real-time (SMRT)		Not mentioned	Not mentioned	Not mentioned	Reading of repetitive elements and allele phasing in long fragments	Not mentioned	Choy et al. (2022)
TGS	Nanopore	CyclomicsSeq	Not mentioned	Not mentioned	1500	Real-time monitoring of tumors	Nonsense and missense mutations, deletions	Marcozzi et al. (2021)

NGS- and TGS-based sequencing methods used for ctDNA analysis.

WGS, Whole-genome sequencing; WES, Whole-exome sequencing; Safe-SeqS, Safe-Sequencing System; UMI, unique molecular identifier; Tam-Seq, Tagged-amplicon deep sequencing; eTam-Seq, enhanced Tam-Seq; CAPP-Seq, Cancer Personalized Profiling by Deep Sequencing; TEC-Seq, Targeted error correction sequencing; Ig-HTS, Immunoglobulin high-throughput sequencing; SPE, single primer extension.

Non-targeted NGS technologies

In the realm of non-targeted sequencing, the focus broadens to include the entire genome or exome using methods such as whole-genome sequencing (WGS) and whole-exome sequencing (WES), allowing for the simultaneous identification of multiple mutations (Elazezy and Joosse, 2018; Chen and Zhao, 2019; Esteva-Socias et al., 2020). In ctDNA analysis, these methodologies can be applied to discover new molecular alterations, recognize new drug targets, and screen for drug resistance clones (Bohers et al., 2021).

In particular, WGS technologies are better suited to identifying structural and non-coding variations in ctDNA and hold promise for the diagnosis of rare diseases (Bos et al., 2020; Marshall et al., 2020; Sun et al., 2021; Ibañez et al., 2022). The goal of the technique is to detect mutations, chromosomal alterations, genetic rearrangements, and somatic copy number alterations (Daya and Mahfouz, 2018).

According to Zviran et al. (2020), the WGS approach allowed dynamic tracking of tumor burden and detection of single nucleotide variations in postoperative residual disease in colorectal cancer, with sensitivity ± SE = 90% ± 0.069%, specificity ± SE = 98% ± 0.006%, and AUC ± SE = 0.97 ± 0.025. In addition, the study showed an association with shorter recurrence-free survival for 36.8% (7/19) of post-operative ctDNA-positive patients (P = 0.03).

Recently, a study used ultra-low-pass whole-genome sequencing (ULP-WGS), an emerging tool for ctDNA analysis, in hepatocellular carcinoma (HC) patients. This technique is cheaper than WGS and requires a total ctDNA input of 2.5 ng, but its very low coverage (<0.05) can leave gaps in the sequencing. The results showed that 30.1% (22/73) of HC patients had detectable ctDNA levels. Furthermore, a pattern of chromosomal changes was found, such as the loss of 5q (36.3%) and 16q (40.9%), and positive ctDNA was associated with worse prognosis and served as a biomarker of tumor aggressiveness (Sogbe et al., 2024).

In contrast, WES is limited to coding regions (Sabatier et al., 2022). It is generally used to detect mutations and other genetic variants associated with diseases (Glotov et al., 2023). In a comparative study, WES was applied to paired ctDNA and tumor biopsies from 15 patients with breast cancer, sarcoma, gastrointestinal cancer or melanoma. A ctDNA fraction <16.4% was insufficient for detecting tumor-specific variants, yielding a median of 3 variants, whereas a ctDNA fraction >30% allowed the detection of 95 non-synonymous variants. Furthermore, ctDNA captured tumor heterogeneity, containing 22 variants shared between the melanoma (primary tumor) and liver (metastatic) sites and 12 additional variants unique to a single tumor site. WES of ctDNA also identified frequently mutated genes concordant with tissue: ESR1, KRAS, PIK3CA, PIK3R1, FAT1 and MED12 for breast cancer; APC, CASP8, GRIN2A, MYH9, TP53, ASXL1, CDH11 and KRAS for gastrointestinal cancer; and PSIP1, RSPO2 and SF3B1 for melanoma (Leenanitikul et al., 2023).

Nevertheless, WES is adequate for detecting mutations in patients with advanced tumors and increased ctDNA fractions (Bohers et al., 2021). Diefenbach et al. (2019) showed that ctDNA WES can be used to profile mutations and capture clinically relevant alterations in metastatic melanoma, such as BRAF and NRAS driver gene mutations, in 6/10 patients when applying a mutant allele frequency (MAF) cutoff of at least 10%.

Notably, WES is more cost-effective than WGS because it scrutinizes exons exclusively. However, both WGS and WES demand substantial DNA input and high sequencing throughput to yield high-quality data. These techniques therefore remain expensive, which makes their clinical application challenging. Additionally, these methods exhibit limited sensitivity, rendering them less suitable for early-stage cancer detection (Ganesamoorthy et al., 2022).

Targeted NGS-based methods

Targeted strategies allow the detection of a single or a few tumor-specific mutations in ctDNA through pre-selected panels of previously described genes, such as BRAF, KRAS, TP53, PIK3CA, APC and EGFR (Elazezy and Joosse, 2018; Mallampati et al., 2019; Liu et al., 2020; Kato et al., 2021; Jiménez-Rodríguez et al., 2022). These techniques could be useful in clinical management for monitoring MRD, early detection of relapse or screening for resistant mutations (Bohers et al., 2021; Lin et al., 2021; Sanz-Garcia et al., 2022).

Generally, customized panels are constructed based on mutations captured during tissue sequencing and applied to detect tumor-specific mutations in plasma (Sanz-Garcia et al., 2022). However, the design of NGS panels for clinical implementation is not standardized across laboratories, which commonly either use pre-designed panels from suppliers or create their own. Developing a targeted panel from scratch is challenging, as investments in operational infrastructure and bioinformatics are required (Shi et al., 2022).

Amplicon

Targeted NGS technologies require enrichment by amplicon or hybrid capture (Figure 3) (Lin et al., 2021; Sanz-Garcia et al., 2022). Amplicon sequencing, a targeted NGS method that analyzes genetic variation in specific genomic regions, is a multiplex PCR-based method that uses oligonucleotides to target and capture regions of interest. PCR generates DNA sequences known as amplicons, which can be multiplexed by adding a barcode or index to each sample for identification. Beforehand, the samples must be converted into libraries by adding adapters and enriching targets through PCR amplification. The adapters allow the formation of indexed amplicons and their attachment to the flow cell for sequencing (Hung et al., 2018). Several amplicon-based methods are described in the literature.

FIGURE 3

FIGURE 4

Safe-sequencing system (Safe-SeqS)

Safe-SeqS is an amplicon method that tags DNA molecules with molecular barcodes before PCR to increase sequencing sensitivity. Each barcode acts as a unique identifier (UID), and a group of fragments sharing the same UID is considered mutant if more than 95% of them carry the same mutation. Barcode error correction increases sensitivity to 0.05% and identifies rare mutations (Tuaeva et al., 2019; Bohers et al., 2021). Tie et al. (2021) designed a Safe-SeqS assay to evaluate a previously detected mutation with a higher allele frequency in 54 patients with resectable colorectal liver metastases (CRLM) and assessed the prognostic impact of postoperative ctDNA in these patients. ctDNA was detectable in 85% (46/54) of patients at baseline (T0), with a median MAF for positive ctDNA of 1.86% (IQR, 0.44%–8.2%), and in 24% (12/49) of patients after surgery (TP), with a median MAF of 0.09% (IQR, 0.02%–1.3%).
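The UID rule described above can be expressed as a small consensus routine. This is a minimal sketch rather than the published Safe-SeqS pipeline: it assumes reads at a single position have been reduced to (UID, base) pairs, applies the "more than 95%" agreement threshold from the text, and adds a minimum family size of two reads as an illustrative assumption. Function names are hypothetical.

```python
from collections import Counter, defaultdict


def call_uid_families(reads, min_fraction=0.95, min_family_size=2):
    """Return {uid: consensus base or None} for one genomic position.

    `reads` is an iterable of (uid, base) tuples. A family gets a call
    only if more than `min_fraction` of its reads show the same base;
    otherwise it is left uncalled (None). `min_family_size` is an
    assumption of this sketch, not a value from the text.
    """
    families = defaultdict(list)
    for uid, base in reads:
        families[uid].append(base)

    calls = {}
    for uid, bases in families.items():
        if len(bases) < min_family_size:
            calls[uid] = None
            continue
        base, count = Counter(bases).most_common(1)[0]
        calls[uid] = base if count / len(bases) > min_fraction else None
    return calls


def mutant_family_fraction(calls, reference_base):
    """Fraction of called families whose consensus differs from the reference."""
    called = [base for base in calls.values() if base is not None]
    if not called:
        return 0.0
    return sum(base != reference_base for base in called) / len(called)
```

A family in which one read out of ten disagrees is left uncalled under this rule, which is how isolated PCR or sequencing errors are kept from being counted as mutations.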

Nowadays, Safe-SeqS is referred to as unique molecular identifier (UMI)-based sequencing. The new nomenclature highlights the use of UMIs to track and correct errors during the process, giving greater accuracy in the detection of rare mutations and in the quantification of nucleic acids (Salk et al., 2018). UMI-based sequencing was used to investigate somatic mutations in the ctDNA of patients with lung squamous cell carcinoma (LUSC); mutations were detected in 80.8% (20/26) of patients, and a maximum allele fraction (maxAF) >5% was associated with shorter overall survival than a maxAF ≤5% (P = 0.020). The most frequently mutated gene was TP53, with 73.0% (19/26), and the classic lung cancer driver alterations PIK3CA (n = 3), EGFR amplification (n = 2), EGFR exon 19 deletion (n = 1), KRAS Q61R (n = 1), and MET amplification (n = 1) were detected (Liu et al., 2020).

Tagged-amplicon deep sequencing (Tam-Seq)

Tam-Seq uses an enrichment matrix with primers and barcodes to construct an amplicon library, which goes through targeted pre-amplification and selective amplification in single-plex reactions, followed by a PCR step that adds adapters and barcodes for sample identification (Zhao et al., 2020). The technique showed high sensitivity (0.01%–2.0%) and specificity (>97%) for detecting mutations in circulating DNA. As a ctDNA analysis method, it allows an ultra-low detection limit and broad patient coverage, shows digital PCR-like sensitivity for hotspot alleles, and can simultaneously interrogate thousands of additional genomic positions without affecting its sensitivity or specificity (Noguchi et al., 2020). The technique requires knowledge of recurrent cancer mutations available in databases and uses a selector (biotinylated oligonucleotide probes) to target large segments of the studied regions (Bohers et al., 2021).

In 2018, Gale et al. described enhanced Tam-Seq (eTam-Seq), an expanded assay that targets hotspots and entire coding regions of 35 genes for common cancer types. It is based on a primer design that allows amplification of highly fragmented DNA, and its library preparation does not use microfluidics. The technique aims to identify single nucleotide variants (SNVs), short insertions/deletions (indels) and copy number variants (CNVs). Validation indicated a high per-base specificity of 99.9997% (95% CI: 99.9989%–99.9999%) and a sensitivity of 100% (90% CI: 99.01%–100%) in low input samples at 2%–2.5% AF, 99.17% (90% CI: 97.40%–99.85%) in medium input samples at 1%–1.3% AF and 95.45% (90% CI: 93.09%–97.18%) in high input samples at 0.25%–0.33% AF (Gale et al., 2018).

Hybrid capture

Hybrid capture, also known as hybridization-based sequencing, uses long, biotinylated probes or baits complementary to the region of interest. The method involves physical or enzymatic fragmentation of DNA, followed by enzymatic repair of the ends of the molecules and ligation of platform-specific adapters. These adapters usually contain index bases that form a sequence unique to the sample, that is, the sample barcode (Bohers et al., 2021). Unlike amplicon sequencing, this method does not require PCR primer design. Thus, it is less likely to miss mutations and is said to perform better in terms of sequence complexity. Its capacity for mutation detection makes it well suited to cancer research, and its sequence complexity and scalability make it suitable for WES (Wu et al., 2022).

When choosing panels for the hybridization method, cfDNA fragmentation must be taken into account, as it may result in heterogeneous coverage between target exons (Lin et al., 2021; Shen et al., 2021). This enrichment step prevents the loss of variants of interest located at the edges of fragments, because probe binding to the target region is sufficient to capture the variant. However, such fragments may still fail to amplify during NGS library preparation if they lack a binding sequence for the primers (Mallampati et al., 2019). Several hybrid capture-based technologies have been described.
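The difference between probe-based capture and primer-based amplification for variants near fragment ends can be illustrated with simple interval arithmetic. The sketch below is a toy model under stated assumptions: coordinates are half-open intervals on one reference, a primer pair recovers a fragment only if both binding sites lie entirely inside it, and a probe recovers a fragment if they overlap by at least 30 bases (an arbitrary illustrative threshold). All names and coordinates are hypothetical.

```python
def covers_variant(fragment, position):
    """True if the half-open fragment interval contains the variant position."""
    start, end = fragment
    return start <= position < end


def amplicon_recovers(fragment, fwd_primer, rev_primer):
    """True if both primer-binding sites lie entirely within the fragment."""
    start, end = fragment
    return start <= fwd_primer[0] and rev_primer[1] <= end


def capture_recovers(fragment, probe, min_overlap=30):
    """True if fragment and probe overlap by at least `min_overlap` bases.

    The 30-base default is an assumption made for illustration only.
    """
    overlap = min(fragment[1], probe[1]) - max(fragment[0], probe[0])
    return overlap >= min_overlap


if __name__ == "__main__":
    variant = 150
    fwd_primer, rev_primer = (100, 120), (180, 200)
    probe = (90, 210)
    fragment = (140, 300)  # variant sits 10 bases from the fragment start

    print("covers variant:", covers_variant(fragment, variant))
    print("amplicon:", amplicon_recovers(fragment, fwd_primer, rev_primer))
    print("capture:", capture_recovers(fragment, probe))
```

In this toy case the fragment carries the variant but lacks the forward primer site, so only the probe-based model recovers it.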

Cancer personalized profiling by deep sequencing (CAPP-Seq)

CAPP-Seq was developed to detect several types of alterations simultaneously: SNVs, rearrangements, insertions/deletions, and copy number changes (Elazezy and Joosse, 2018). CAPP-Seq has also been enhanced with integrated digital error suppression (iDES), which combines CAPP-Seq with duplex barcode sequencing and a computational algorithm that removes stereotyped errors associated with the CAPP-Seq hybridization step (Peng et al., 2019). According to Kato et al. (2021), CAPP-Seq applied to ctDNA mutation analysis allowed the identification of mechanisms of resistance to osimertinib in EGFR T790M-positive NSCLC patients. The assay detected EGFR-activating mutations in 70% (14/20) of patients, and these results were associated with a larger tumor volume, measured as the sum of the largest diameters of the target lesions (P = 0.04). Among patients with EGFR-activating mutations, mutations were observed in PIK3CA (3/14, 21%), KRAS (2/14, 14%) and/or BRAF (3/14, 21%), along with copy number gains in EGFR (9/14, 64%), ERBB2 (4/14, 29%) or MET (4/14, 29%). These alterations were more common in patients with innate resistance (8, 57%) than in patients with acquired resistance (6, 43%) (Kato et al., 2021).

Other technologies

Some approaches combine different technologies to optimize results, and some methods do not fit the amplicon enrichment or hybrid capture categories.

Immunoglobulin high-throughput sequencing (Ig-HTS)

Ig-HTS is an ultra-deep genomic DNA sequencing method developed for minimal residual disease in hematologic malignancies. It uses multiplex PCR arrays to identify a tumor-specific clonotype from the rearranged gene regions of the IgH, IgK, and IgL receptors. This technology enables cancer monitoring by quantifying ctDNA with a sensitivity of 10⁻⁶ (Bohers et al., 2021). In 2022, Rezazadeh et al. demonstrated that Ig-HTS, in the form of the Food and Drug Administration-approved tool clonoSEQ (Adaptive Biotechnologies), allows surveillance imaging to be minimized in patients with B-cell lymphomas through ctDNA analysis: the result of the MRD assay was predictive of relapse before imaging in 92% of patients (11/12) (Rezazadeh et al., 2024).

Targeted error correction sequencing (TEC-Seq)

TEC-Seq combines targeted sequencing with error correction approaches. It has a sensitivity of 94.7%, can detect mutations in early-stage solid cancers, and can distinguish true mutations from false-positive variants (Phallen et al., 2017; Bohers et al., 2021). Serrano et al. (2019) employed TEC-Seq for serial monitoring of ctDNA from patients with gastrointestinal stromal tumors to evaluate the combination of sunitinib and regorafenib as a new add-on drug treatment regimen. In this study, somatic mutations, including point mutations, small insertions, and deletions, were analyzed. The approach detected primary mutations in 89% (8/9) and secondary mutations in 78% (7/9) of patients (Serrano et al., 2019).

Single primer extension (SPE)

SPE is a method developed by QIAGEN that redefines amplicon enrichment and sequencing (QIAseq SPE technology for Illumina: Redefining amplicon sequencing - QIAGEN, 2018). The method is based on the extension of a single gene-specific primer by DNA polymerase to amplify each genomic region with uniform coverage, allowing the detection of single nucleotide polymorphisms (SNPs) and specific mutations with high accuracy. Initially, the primer is hybridized to the DNA template strand in the target region, in a workflow that also includes end-repair and adapter-ligation steps. Then, the primer is extended from the 3′ end, and each genomic region is targeted by only one region-specific primer plus a universal adapter primer that binds to sequences introduced through the adapters. These adapters are linked to primers and carry a molecular barcode, the unique molecular index (UMI), which uniquely tags each molecule in the sample library; the method reaches a sensitivity of 0.5%–1% (Bentley et al., 2008; Peng et al., 2019; Zhao et al., 2020). In SPE, the use of UMIs reduces the impact of amplification errors and increases the sensitivity of variant detection, providing error correction and higher accuracy during sequencing. Additionally, SPE can be enhanced through duplex UMI adapters (SPE-duplex UMI), multiplex PCR-based enrichment and sequencing, which increases sensitivity to 0.1%–0.2% (Peng et al., 2019).

Recently, Jiménez-Rodríguez et al. (2022) used this technology to analyze ctDNA from breast cancer (BC) patients, developing a sequencing panel composed of exonic regions of 33 genes and applying it to 75 plasma samples. In the study, 21.31% (13/61) of tumor mutations were found in both plasma and the corresponding tumors, and the most frequently mutated genes were TP53 (53.84%) and PIK3CA (23.07%). In addition, the assay presented a sensitivity of 0.03% and a specificity of 86.36%.

Duplex sequencing

Duplex sequencing aims to increase accuracy and reduce sequencing errors through double-strand consensus analysis. The technique begins with the fragmentation of DNA into smaller pieces and the addition of specific adapters. The fragmented DNA is encapsulated in emulsion droplets where PCR amplification occurs, generating single-strand reads. The single-strand reads are then paired to form duplex reads, and the two strands are compared to eliminate random errors, which are identified by the lack of correspondence between the single-strand reads (Mallampati et al., 2019; Bohers et al., 2021; Shields et al., 2022). Mallampati et al. (2019) demonstrated this approach for monitoring disease progression in patients with stage IV colorectal cancer. In this research, a 78.81 kb CRC23 panel was created, covering 85% of mutated targets and the exon regions of the TP53, APC, KRAS, NRAS, BRAF, PIK3CA and ERBB2 genes, as well as hotspot coding exons of 16 other genes. A detection limit of 0.3% variant frequency was observed, along with a diagnostic accuracy of 96.15% (95% CI, 94.28%–97.55%), a sensitivity of 87.23% (95% CI, 74.26%–95.17%) and a specificity of 96.91% (95% CI, 95.11%–98.19%).
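The strand-comparison step can be sketched as follows. This is a simplified illustration, not the published duplex sequencing software: it assumes that a consensus sequence has already been built for each strand, that the bottom-strand consensus has been reverse-complemented into the same orientation as the top strand, and that both cover the same reference window. Function names are hypothetical.

```python
def duplex_consensus(top_strand: str, bottom_strand: str, unknown: str = "N") -> str:
    """Keep a base only where both single-strand consensus sequences agree.

    Positions where the strands disagree are masked with `unknown`,
    since a real mutation should be present on both strands.
    """
    if len(top_strand) != len(bottom_strand):
        raise ValueError("strand consensus sequences must have equal length")
    return "".join(
        a if a == b else unknown for a, b in zip(top_strand, bottom_strand)
    )


def duplex_variants(reference: str, top_strand: str, bottom_strand: str):
    """List (index, ref_base, alt_base) for variants supported by both strands."""
    consensus = duplex_consensus(top_strand, bottom_strand)
    return [
        (index, ref, alt)
        for index, (ref, alt) in enumerate(zip(reference, consensus))
        if alt != "N" and alt != ref
    ]


if __name__ == "__main__":
    reference = "ACGTACGT"
    top = "ACGTTCGA"     # differs from the reference at indices 4 and 7
    bottom = "ACGTTCGT"  # differs from the reference at index 4 only
    print(duplex_consensus(top, bottom))
    print(duplex_variants(reference, top, bottom))
```

Only the change seen on both strands survives; the change seen on a single strand is masked as a probable error.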

Although the targeted strategy makes cancer monitoring extremely sensitive, these approaches require prior genetic knowledge of the tumor. This may not be useful in characterizing new molecular alterations that occur during tumor treatment (Elazezy and Joosse, 2018; Sanz-Garcia et al., 2022).

Third generation of sequencing

In addition to NGS, the advent of third-generation sequencing (TGS) has provided new capabilities, including real-time reading, long-fragment reading, portability, and ease of use, which are fundamental to understanding cancer genetics. Currently, PacBio Sequencing (Menlo Park, CA, United States) and Oxford Nanopore Technologies (ONT, Oxford, United Kingdom) are the two TGS technology platforms (Amarasinghe et al., 2020; Scarano et al., 2024).

Single-molecule real-time (SMRT) sequencing (Pacific Biosciences, California) is based on reads generated on SMRT chips, which are composed of a metal film containing zero-mode waveguides (ZMWs), special nanophotonic visualization chambers. Within these chambers in the flow cell, signals are captured from phospholinked dNTPs labeled with fluorophores: as DNA polymerase incorporates each dNTP, a fluorescence pulse is released and identified by a laser at a specific wavelength in real time (Treffer and Deckert, 2010). SMRT technology enables the reading of repetitive elements and allele phasing in long fragments (Ardui et al., 2018). In ctDNA analysis, SMRT sequencing was used to evaluate the properties and methylation patterns of long DNA molecules, since analyses usually focus on short fragments. The assay detected fragments up to 13.6 kb in length in samples from 13 patients with hepatocellular carcinoma. Additionally, non-tumor cfDNA was generally longer than tumor cfDNA: molecules longer than 600 bp accounted for 55.1% of plasma DNA molecules carrying mutant alleles and 64.8% of those carrying wild-type alleles, and molecules longer than 1 kb accounted for 43.4% and 56.4%, respectively. Furthermore, complete reads were obtained for 85.79% (IQR: 83.11%–88.69%) of the fragments. Another important finding was the detection of long cfDNA fragments containing a mutant allele, which may change cfDNA analyses by motivating the inclusion of long molecules (Choy et al., 2022).

Nanopore sequencing (Oxford Nanopore Technologies) reads, in real time, the changes in electrical current that occur as a DNA molecule passes through a biosensor embedded in an electrically resistant membrane. The nanopores are arranged in the flow cell in micro-scaffolds and can be categorized as solid-state or biological. Each nanopore is associated with an electrode connected to a channel in the sensor chip, where the electrical current is measured. When a passing molecule disrupts the current, it produces a characteristic signal, the so-called “squiggle”, which is decoded into the corresponding nucleotides. The method supports long-read sequencing and allows the direct analysis of DNA or RNA fragments without prior amplification (Wang et al., 2021; Scarano et al., 2024). This TGS technology was employed to analyze genomic and fragmentomic data from liquid biopsies, comprising 8 urine samples from bladder cancer patients and 22 plasma samples from lung cancer patients. ONT sequencing on the MinION revealed structural properties of cfDNA and recovered somatic copy number aberrations (SCNAs) within 24 h, with a median of 800,183 reads and ∼0.1X coverage. Although cfDNA is described in the literature as short, fragmented molecules (167 bp), this study showed increased recovery of long cfDNA (>300 bp) in plasma from lung cancer patients: 54.1% of fragments were larger than 300 bp with ONT sequencing, compared with 5.3% with short-read sequencing (van der Pol et al., 2023).

CyclomicsSeq combines the circularization and concatemerization of DNA molecules with an optimized DNA backbone sequence and Oxford Nanopore sequencing, and was created for real-time tumor monitoring based on ctDNA levels. The protocol is amplicon-based and has four steps: circularization of the insert with the backbone (a DNA adapter), rolling circle amplification (RCA), long-read sequencing, and data processing. Detecting ctDNA with this technology allows mutations to be identified from somatic variants. Real-time monitoring can be done by tracking mutations in the TP53 gene: in a trial of patients with human papillomavirus (HPV)-negative head and neck squamous cell cancer, a TP53 mutation was observed at a frequency of 0.02%. To evaluate CyclomicsSeq for mutation detection in liquid biopsy, the trial also analyzed the single nucleotide error false positive rate (snFP rate), which had a median <6 × 10⁻⁴ across all TP53 exons (Marcozzi et al., 2021).
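The error-suppression idea behind reading the same insert many times in a concatemer can be illustrated with a minimal Python sketch. It assumes that the repeated copies of one insert have already been split out of the long read and aligned to equal length, and it applies a simple per-position majority vote. The function name and the toy sequences are hypothetical, and this is a simplification for illustration, not the CyclomicsSeq pipeline itself.

```python
from collections import Counter


def consensus_from_repeats(repeats):
    """Return a per-position majority-vote consensus of pre-aligned copies."""
    if not repeats:
        raise ValueError("at least one repeat copy is required")
    length = len(repeats[0])
    if any(len(copy) != length for copy in repeats):
        raise ValueError("copies must be pre-aligned to the same length")
    consensus = []
    for column in zip(*repeats):
        # most_common(1) yields the most frequent base in this column;
        # ties are resolved by first occurrence, which is arbitrary here.
        base, _count = Counter(column).most_common(1)[0]
        consensus.append(base)
    return "".join(consensus)


# Five hypothetical copies of one insert; two copies carry one random error each.
copies = ["ACGTAC", "ACGTAC", "ACCTAC", "ACGTAC", "ACGTTC"]
print(consensus_from_repeats(copies))  # prints ACGTAC
```

Because random errors rarely hit the same position in most copies, they are outvoted, whereas a true variant present in the original molecule appears in every copy and survives the vote.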

Although TGS can generate long reads and detect complex structural variants, its use in ctDNA analysis still faces challenges. ctDNA fragments are rare in cfDNA, and long reads can introduce false base substitutions and indels (Ardui et al., 2018; Marcozzi et al., 2021; Scarano et al., 2024). These errors can hinder the accurate detection of relevant mutations and thereby interfere with the clinical management of cancer patients.

Sequencing data analysis

Sequencing data analysis is a critical process for ctDNA evaluation and consists of three main steps: quality analysis, alignment, and variant calling (Figure 4) (Wadapurkar and Vyas, 2018). First, quality control of the reads is crucial for the bioinformatics analysis, since high-throughput NGS generates a massive volume of data and quality control improves confidence in those data. Programs such as FastQC provide a comprehensive per-base analysis that helps verify that the sequences are accurate and not compromised by issues arising during the sequencing run (Andrews, 2010; Trivedi et al., 2014; Mahamdallie et al., 2018). Moreover, reads can be contaminated by other sequences, such as primers or adapters introduced during library preparation. Several tools can therefore be used to remove low-quality bases and adapter sequences, such as Cutadapt, fastp, and Trimmomatic (Bolger et al., 2014; Chen et al., 2018; Martins et al., 2021).
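As a rough illustration of what quality trimming involves, the Python sketch below decodes a FASTQ quality string and removes low-quality bases from the 3′ end of a read. It assumes the common Phred+33 encoding and a hypothetical cutoff of Q20; the function names are illustrative, and dedicated trimmers use more elaborate strategies (for example sliding windows and adapter matching) than this simplified version.

```python
def phred_scores(quality_line, offset=33):
    """Decode a FASTQ quality string into Phred scores (Phred+33 assumed)."""
    return [ord(ch) - offset for ch in quality_line]


def trim_3prime(seq, quality_line, min_q=20):
    """Drop trailing bases whose Phred score is below min_q."""
    scores = phred_scores(quality_line)
    end = len(scores)
    while end > 0 and scores[end - 1] < min_q:
        end -= 1
    return seq[:end], quality_line[:end]


# Toy read: the last two bases have very low quality ('#' = Q2, '!' = Q0).
seq, qual = "ACGTACGT", "IIIIII#!"
print(trim_3prime(seq, qual))  # prints ('ACGTAC', 'IIIIII')
```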

Depending on the provenance of the data and the size of the fragments, several aligners can be useful for ctDNA, including BWA and Bowtie2 (Li and Durbin, 2009; Langmead and Salzberg, 2012). In targeted sequencing, alignment compares the generated sequences against a reference genome, or against a customized file containing only the regions of interest of the study, to determine their degree of similarity. Moreover, the same version of the reference genome should be used throughout the analysis to avoid later discrepancies (Reinert et al., 2015; Dilliott et al., 2018; Kang et al., 2020).

The last step identifies variants that differ from the reference used. FreeBayes, VarScan, BCFtools, VarDict, and VariantDx are among the tools typically used to find SNPs and indels during variant calling in ctDNA analysis (Liu et al., 2013; Kang et al., 2020). VarDict is an ultra-sensitive variant caller that has already been used to identify ctDNA variants in cancer samples (Lai et al., 2016; Leal et al., 2020). Finally, the variants found go through annotation, which consists of querying existing databases.
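The core quantity behind variant calling in ctDNA is the variant allele fraction (VAF) at each position. The Python sketch below computes it from per-base read counts at a single position and keeps alternative alleles that pass two thresholds. The counts, the function name, and the thresholds (`min_alt_reads`, `min_vaf`) are hypothetical, and the callers named above rely on statistical models that are considerably more sophisticated than this simple filter.

```python
def call_candidates(ref_base, base_counts, min_alt_reads=5, min_vaf=0.001):
    """Return {alt_base: VAF} for alternative alleles passing both thresholds."""
    depth = sum(base_counts.values())
    if depth == 0:
        return {}
    candidates = {}
    for base, count in base_counts.items():
        if base == ref_base:
            continue
        vaf = count / depth
        if count >= min_alt_reads and vaf >= min_vaf:
            candidates[base] = vaf
    return candidates


# Hypothetical pileup at one position with 30,000 reads and reference base C.
counts = {"C": 29950, "T": 45, "A": 5}
print(call_candidates("C", counts))  # T passes (VAF 0.0015); A falls below min_vaf
```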

A sufficient number of reads is essential for correct mapping, for identifying genetic alterations, and for ruling out technical errors, especially with data from instruments prone to base-substitution errors. Targeted sequencing provides this depth, which helps identify the low-abundance variants characteristic of ctDNA. High coverage (>30,000×) is therefore expected in this type of experiment.
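Why depth matters at low allele fractions can be seen with a simple binomial calculation. The Python sketch below estimates the probability of observing at least a given number of mutant reads at a given depth and VAF. It assumes that reads are independent draws and ignores sequencing errors, duplicates, and molecular barcodes; the 0.1% VAF and the five-read requirement are hypothetical values chosen only for illustration.

```python
from math import comb


def prob_at_least(k, depth, vaf):
    """P(X >= k) for X ~ Binomial(depth, vaf), i.e. at least k mutant reads."""
    below = sum(
        comb(depth, i) * vaf**i * (1 - vaf) ** (depth - i) for i in range(k)
    )
    return 1.0 - below


# Probability of seeing >= 5 mutant reads for a variant at 0.1% VAF.
for depth in (1_000, 10_000, 30_000):
    print(depth, round(prob_at_least(5, depth, 0.001), 4))
```

Under these assumptions, a variant at 0.1% VAF is expected to contribute about one mutant read at 1,000× but about 30 at 30,000×, so the chance of collecting enough supporting reads rises from well under 1% to nearly certain.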

In addition, variant detection in ctDNA samples can be challenging because ctDNA makes up a low fraction of total cfDNA and because PCR artifacts arise during library preparation. Kockan et al. (2017) therefore introduced SiNVICT, a tool for detecting SNVs and short indels in ctDNA at very low variant allele percentages with high accuracy and sensitivity. The approach includes pre-processing, SNV/indel calling, and post-processing steps. SiNVICT also allows samples collected at different time points to be analyzed and the temporal clonal evolution of tumors to be evaluated, which could be useful for detecting resistance mutations and selecting therapy (Kockan et al., 2017).
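The idea of following variants across time points can be sketched generically in Python. The example below flags variants that are below a detection threshold at baseline but exceed it at a later time point, as a newly emerging resistance mutation might. This is not SiNVICT's algorithm or interface; the function name, variant labels, VAF values, and the 0.5% threshold are all hypothetical.

```python
def emerging_variants(timepoints, min_vaf=0.005):
    """Map each variant absent at baseline to the first index where it appears.

    `timepoints` is a list of {variant_label: VAF} dicts in chronological order.
    """
    baseline = timepoints[0]
    emerging = {}
    for idx, sample in enumerate(timepoints[1:], start=1):
        for variant, vaf in sample.items():
            absent_at_baseline = baseline.get(variant, 0.0) < min_vaf
            if vaf >= min_vaf and absent_at_baseline and variant not in emerging:
                emerging[variant] = idx
    return emerging


# Hypothetical VAFs from three serial plasma samples of one patient.
series = [
    {"variant_A": 0.020},
    {"variant_A": 0.010, "variant_B": 0.002},
    {"variant_A": 0.004, "variant_B": 0.012},
]
print(emerging_variants(series))  # prints {'variant_B': 2}
```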

Conclusion and future perspectives

Currently, ctDNA analysis represents a crucial approach to guide cancer diagnosis, management and monitoring, but the clinical implementation of ctDNA is still limited (Oliveira et al., 2020). NGS has shown great potential for advancing clinical practices through the development of a diverse panel for identifying ctDNA mutations in different cancer types, but finding the optimal approach remains a challenge (Table 3). Studies based on non-targeted NGS have the highest cost but are necessary for the construction of mutational panels, especially in cases of tumors lacking biomarkers (Hess et al., 2020; Christodoulou et al., 2023). With these studies, it is expected that new techniques will be developed to detect ctDNA mutations even at low frequencies in the bloodstream.

TABLE 3

Sequencing technology	Classification	Method	Principle	Advantages	Disadvantages
NGS	Non-targeted	WGS	Determining the complete DNA sequence from a genome captures exons (coding) and introns (non-coding) regions, providing a comprehensive view of the genetic information	Provides a genome-wide view, capturing all genetic variations without requiring prior knowledge of regions of interest	Presents high cost and generates large amounts of data, requiring substantial computational resources for analysis
	Non-targeted	WES	Performs only sequencing of the coding regions of the genome	It is cost-effective and efficient in identifying clinically relevant mutations	Does not provide information on non-coding regions and it also requires comprehensive bioinformatics tools for analysis
	Targeted	Amplicon	Analyze genetic sequences by amplifying specific regions of the genome before sequencing	Exhibits high sensitivity, is customizable according to the needs of the study, has high performance, and has a shorter response time	Only provides information about the selected regions; the design of primers for regions with high genetic variability can be complex, and errors arising from the amplification steps can lead to false-positive results
		Hybrid-capture	Uses biotinylated oligonucleotide probes to hybridize and enrich the regions of interest before sequencing	It has high coverage and specificity, can be targeted to various genomic regions, and has no amplification bias	The workflow is more complex, expensive, and time-consuming due to the steps in the protocol. Errors in hybridization can lead to inadequate capture and false results
TGS		SMRT	Based on SMRT (Single Molecule, Real-Time) chips, fluorophore-labeled nucleotides are added to DNA polymerase, and when incorporated into the DNA strand, fluorescent light is recorded at a specific wavelength	Long DNA sequence reads allow identification of structural rearrangements and mutations that may be difficult to detect with short-read methods	Limitation on coverage and processing time
TGS	Nanopore	CyclomicsSeq	Performs amplification and repeated cyclic reading of circular DNA molecules to achieve accurate detection of low-frequency variants	Presents high precision and sensitivity for detecting low-frequency mutations, and random errors are reduced due to the cyclic reading of the fragments	It has a high cost and technical complexity for its execution, in addition to having a lower yield compared to NGS and requiring sophisticated bioinformatics tools to analyze the results

Sequencing technologies available for ctDNA analysis, with their principles, advantages, and disadvantages.

NGS, Next-Generation Sequencing; TGS, Third-Generation Sequencing; WGS, Whole-Genome Sequencing; WES, Whole-Exome Sequencing; SMRT, Single Molecule Real-Time.

FoundationOne® Liquid CDx remains one of the FDA-approved, NGS panel-based tests most widely used in clinical oncology practice, used with both tissue biopsies and ctDNA in NSCLC, breast, prostate, ovarian, and colorectal cancer (Newman et al., 2016; Shahnoor et al., 2023). This test allows comprehensive genomic profiling that guides more effective therapy and predicts patient prognosis (Woodhouse et al., 2020).

Another technology that is quite promising for clinical practice is CancerSEEK, an amplicon-based method that uses multiplex PCR in the enrichment step. It was developed in 2018 as a blood test for early cancer detection through quantifying the levels of circulating proteins and cfDNA (Cohen et al., 2018; Duffy et al., 2021; Dao et al., 2023).

CancerSEEK can detect 8 types of non-metastatic cancer (ovarian, liver, stomach, pancreas, esophagus, colorectal, lung, and breast) using a panel of 16 genes (NRAS, CTNNB1, PIK3CA, FBXW7, APC, EGFR, BRAF, CDKN2A, PTEN, FGFR2, HRAS, KRAS, AKT1, TP53, PPP2R1A, GNAS) covered by 61 amplicons, each querying an average of 33 base pairs. When applied to 1,005 patients, the assay showed sensitivities of 69%–98% for 5 types of cancer (ovarian, liver, stomach, pancreas, and esophagus) and a specificity >99%, with only 0.86% (7/812) of healthy controls testing positive. In addition, the maximum ctDNA detection capacity of the assay varied with tumor type (60% for liver cancer and 100% for ovarian cancer), and DNA concentrations in plasma ranged from 0.11 to 119 ng/mL. The test identified rare mutations: nonsense, insertions or deletions, canonical splice site mutations, synonymous mutations, except at exon ends and intronic mutations, except at splice sites. For read analysis, CancerSEEK uses reference sequences and custom scripts in Python, SQL and C# (In Silico Solutions, Falls Church, VA) (Cohen et al., 2018).
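The relationship between the reported false-positive count and the specificity can be checked with a short calculation. The Python sketch below uses the 7 positive results among 812 healthy controls quoted above; the function name is illustrative.

```python
def specificity(true_negatives, false_positives):
    """Fraction of healthy individuals correctly classified as negative."""
    return true_negatives / (true_negatives + false_positives)


controls, positives = 812, 7
print(f"{positives / controls:.2%}")                            # prints 0.86%
print(f"{specificity(controls - positives, positives):.2%}")    # prints 99.14%
```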

Although the CancerSEEK test has been recognized as a Breakthrough Device by the U.S. Food and Drug Administration for the detection of genetic mutations and proteins associated with pancreatic and ovarian cancers, it still needs to be validated in large-scale screening studies for commercialization (Duffy et al., 2021).

Therefore, it is expected that more targeted NGS-based technologies will be developed to increase the sensitivity of ctDNA detection. Additionally, as NGS-based experimental designs become more affordable and popular, there is an escalating demand for software capable of collating, manipulating, and visually presenting quality control (QC) logs and reports, especially when dealing with a substantial number of samples. Multiple factors, including cost, yield, specificity, cancer type, disease stage, clinical application, and bioinformatics analysis, also need to be considered.

Statements

Author contributions

TS: Writing–review and editing, Writing–original draft. JA: Writing–original draft, Supervision, Writing–review and editing. ET: Writing–review and editing. SC: Writing–review and editing. FM: Writing–review and editing. PP: Writing–review and editing. SS: Writing–review and editing. DC: Writing–review and editing, Writing–original draft.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This study was supported by Universidade Federal do Pará and the Brazilian funding agencies Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES; to ET, TS, and JA) and Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq; to DC, 315643/2023-4).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1
Amarasinghe S. L., Su S., Dong X., Zappia L., Ritchie M. E., Gouil Q. (2020). Opportunities and challenges in long-read sequencing data analysis. Genome Biol. 21, 30. doi:10.1186/s13059-020-1935-5
2
Andrews S. (2010). Babraham bioinformatics - FastQC A quality control tool for high throughput sequence data. Available at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (Accessed June 14, 2024).
3
Ardui S., Ameur A., Vermeesch J. R., Hestand M. S. (2018). Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics. Nucleic Acids Res. 46, 2159–2168. doi:10.1093/nar/gky066
4
Bagger F. O., Borgwardt L., Jespersen A. S., Hansen A. R., Bertelsen B., Kodama M., et al. (2024). Whole genome sequencing in clinical practice. BMC Med. Genomics 17, 39. doi:10.1186/s12920-024-01795-w
5
Bentley D. R., Balasubramanian S., Swerdlow H. P., Smith G. P., Milton J., Brown C. G., et al. (2008). Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456, 53–59. doi:10.1038/nature07517
6
Bohers E., Viailly P.-J., Jardin F. (2021). cfDNA sequencing: technological approaches and bioinformatic issues. Pharmaceuticals 14, 596. doi:10.3390/ph14060596
7
Bolger A. M., Lohse M., Usadel B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120. doi:10.1093/bioinformatics/btu170
8
Bos M. K., Angus L., Nasserinejad K., Jager A., Jansen M. P. H. M., Martens J. W. M., et al. (2020). Whole exome sequencing of cell-free DNA – a systematic review and Bayesian individual patient data meta-analysis. Cancer Treat. Rev. 83, 101951. doi:10.1016/j.ctrv.2019.101951
9
Chen M., Zhao H. (2019). Next-generation sequencing in liquid biopsy: cancer screening and early detection. Hum. Genomics 13, 34. doi:10.1186/s40246-019-0220-8
10
Chen S., Zhou Y., Chen Y., Gu J. (2018). fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890. doi:10.1093/bioinformatics/bty560
11
Choy L. Y. L., Peng W., Jiang P., Cheng S. H., Yu S. C. Y., Shang H., et al. (2022). Single-molecule sequencing enables long cell-free DNA detection and direct methylation analysis for cancer patients. Clin. Chem. 68, 1151–1163. doi:10.1093/clinchem/hvac086
12
Christodoulou E., Yellapantula V., O’Halloran K., Xu L., Berry J. L., Cotter J. A., et al. (2023). Combined low-pass whole genome and targeted sequencing in liquid biopsies for pediatric solid tumors. Npj Precis. Oncol. 7, 21. doi:10.1038/s41698-023-00357-0
13
Cohen J. D., Li L., Wang Y., Thoburn C., Afsari B., Danilova L., et al. (2018). Detection and localization of surgically resectable cancers with a multi-analyte blood test. Science 359, 926–930. doi:10.1126/science.aar3247
14
Dang D. K., Park B. H. (2022). Circulating tumor DNA: current challenges for clinical utility. J. Clin. Investig. 132, e154941. doi:10.1172/JCI154941
15
Dao J., Conway P. J., Subramani B., Meyyappan D., Russell S., Mahadevan D. (2023). Using cfDNA and ctDNA as oncologic markers: a path to clinical validation. Int. J. Mol. Sci. 24, 13219. doi:10.3390/ijms241713219
16
Daya S. A., Mahfouz R. (2018). Circulating tumor DNA, liquid biopsy, and next generation sequencing: a comprehensive technical and clinical applications review. Available at: https://www.sciencedirect.com/science/article/pii/S2214540018301439?via%3Dihub (Accessed June 2, 2024).
17
Diefenbach R. J., Lee J. H., Strbenac D., Yang J. Y. H., Menzies A. M., Carlino M. S., et al. (2019). Analysis of the whole-exome sequencing of tumor and circulating tumor DNA in metastatic melanoma. Cancers 11, 1905. doi:10.3390/cancers11121905
18
Dilliott A. A., Farhan S. M. K., Ghani M., Sato C., Liang E., Zhang M., et al. (2018). Targeted next-generation sequencing and bioinformatics pipeline to evaluate genetic determinants of constitutional disease. J. Vis. Exp., 57266. doi:10.3791/57266
19
Duffy M. J., Diamandis E. P., Crown J. (2021). Circulating tumor DNA (ctDNA) as a pan-cancer screening test: is it finally on the horizon? Clin. Chem. Lab. Med. CCLM 59, 1353–1361. doi:10.1515/cclm-2021-0171
20
Elazezy M., Joosse S. A. (2018). Techniques of using circulating tumor DNA as a liquid biopsy component in cancer management. Comput. Struct. Biotechnol. J. 16, 370–378. doi:10.1016/j.csbj.2018.10.002
21
Esteva-Socias M., Enver-Sumaya M., Gómez-Bellvert C., Guillot M., Azkárate A., Marsé R., et al. (2020). Detection of the EGFR G719S mutation in non-small cell lung cancer using droplet digital PCR. Front. Med. 7, 594900. doi:10.3389/fmed.2020.594900
22
Fürstenau M., Weiss J., Giza A., Franzen F., Robrecht S., Fink A.-M., et al. (2022). Circulating tumor DNA–based MRD assessment in patients with CLL treated with obinutuzumab, acalabrutinib, and venetoclax. Clin. Cancer Res. 28, 4203–4211. doi:10.1158/1078-0432.CCR-22-0433
23
Gale D., Lawson A. R. J., Howarth K., Madi M., Durham B., Smalley S., et al. (2018). Development of a highly sensitive liquid biopsy platform to detect clinically-relevant cancer mutations at low allele fractions in cell-free DNA. PLOS ONE 13, e0194630. doi:10.1371/journal.pone.0194630
24
Ganesamoorthy D., Robertson A. J., Chen W., Hall M. B., Cao M. D., Ferguson K., et al. (2022). Whole genome deep sequencing analysis of cell-free DNA in samples with low tumour content. BMC Cancer 22, 85. doi:10.1186/s12885-021-09160-1
25
Glenn T. C. (2011). Field guide to next-generation DNA sequencers. Mol. Ecol. Resour. 11, 759–769. doi:10.1111/j.1755-0998.2011.03024.x
26
Glotov O. S., Chernov A. N., Glotov A. S. (2023). Human exome sequencing and prospects for predictive medicine: analysis of international data and own experience. J. Pers. Med. 13, 1236. doi:10.3390/jpm13081236
27
Hallermayr A., Neuhann T. M., Steinke-Lange V., Scharf F., Laner A., Ewald R., et al. (2022). Highly sensitive liquid biopsy Duplex sequencing complements tissue biopsy to enhance detection of clinically relevant genetic variants. Front. Oncol. 12, 1014592. doi:10.3389/fonc.2022.1014592
28
Hess J. F., Kohl T. A., Kotrová M., Rönsch K., Paprotka T., Mohr V., et al. (2020). Library preparation for next generation sequencing: a review of automation strategies. Biotechnol. Adv. 41, 107537. doi:10.1016/j.biotechadv.2020.107537
29
Huerta M., Roselló S., Sabater L., Ferrer A., Tarazona N., Roda D., et al. (2021). Circulating tumor DNA detection by digital-droplet PCR in pancreatic ductal adenocarcinoma: a systematic review. Cancers 13, 994. doi:10.3390/cancers13050994
30
Hung S. S., Meissner B., Chavez E. A., Ben-Neriah S., Ennishi D., Jones M. R., et al. (2018). Assessment of capture and amplicon-based approaches for the development of a targeted next-generation sequencing pipeline to personalize lymphoma management. J. Mol. Diagn. 20, 203–214. doi:10.1016/j.jmoldx.2017.11.010
31
Ibañez K., Polke J., Hagelstrom R. T., Dolzhenko E., Pasko D., Thomas E. R. A., et al. (2022). Whole genome sequencing for the diagnosis of neurological repeat expansion disorders in the UK: a retrospective diagnostic accuracy and prospective clinical validation study. Lancet Neurol. 21, 234–245. doi:10.1016/S1474-4422(21)00462-2
32
Jiang M., Jin S., Han J., Li T., Shi J., Zhong Q., et al. (2021). Detection and clinical significance of circulating tumor cells in colorectal cancer. Biomark. Res. 9, 85. doi:10.1186/s40364-021-00326-4
33
Jiménez-Rodríguez B., Alba-Bernal A., López-López E., Quirós-Ortega M. E., Carbajosa G., Garrido-Aranda A., et al. (2022). Development of a novel NGS methodology for ultrasensitive circulating tumor DNA detection as a tool for early-stage breast cancer diagnosis. Int. J. Mol. Sci. 24, 146. doi:10.3390/ijms24010146
34
Kang J.-K., Heo S., Kim H.-P., Song S.-H., Yun H., Han S.-W., et al. (2020). Liquid biopsy-based tumor profiling for metastatic colorectal cancer patients with ultra-deep targeted sequencing. PLOS ONE 15, e0232754. doi:10.1371/journal.pone.0232754
35
Kato R., Hayashi H., Sakai K., Suzuki S., Haratani K., Takahama T., et al. (2021). CAPP-seq analysis of circulating tumor DNA from patients with EGFR T790M–positive lung cancer after osimertinib. Int. J. Clin. Oncol. 26, 1628–1639. doi:10.1007/s10147-021-01947-3
36
Kockan C., Hach F., Sarrafi I., Bell R. H., McConeghy B., Beja K., et al. (2017). SiNVICT: ultra-sensitive detection of single nucleotide variants and indels in circulating tumour DNA. Bioinformatics 33, 26–34. doi:10.1093/bioinformatics/btw536
37
Kwapisz D. (2017). The first liquid biopsy test approved. Is it a new era of mutation testing for non-small cell lung cancer? Ann. Transl. Med. 5, 46. doi:10.21037/atm.2017.01.32
38
Lai Z., Markovets A., Ahdesmaki M., Chapman B., Hofmann O., McEwen R., et al. (2016). VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research. Nucleic Acids Res. 44, e108. doi:10.1093/nar/gkw227
39
Langmead B., Salzberg S. L. (2012). Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359. doi:10.1038/nmeth.1923
40
Leal A., van Grieken N. C. T., Palsgrove D. N., Phallen J., Medina J. E., Hruban C., et al. (2020). White blood cell and cell-free DNA analyses for detection of residual disease in gastric cancer. Nat. Commun. 11, 525. doi:10.1038/s41467-020-14310-3
41
Leenanitikul J., Chanchaem P., Mankhong S., Denariyakoon S., Fongchaiya V., Arayataweegool A., et al. (2023). Concordance between whole exome sequencing of circulating tumor DNA and tumor tissue. PLOS ONE 18, e0292879. doi:10.1371/journal.pone.0292879
42
Li H., Durbin R. (2009). Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760. doi:10.1093/bioinformatics/btp324
43
Liang J., Zhao W., Lu C., Liu D., Li P., Ye X., et al. (2020). Next-generation sequencing analysis of ctDNA for the detection of glioma and metastatic brain tumors in adults. Front. Neurol. 11, 544. doi:10.3389/fneur.2020.00544
44
Lin C., Liu X., Zheng B., Ke R., Tzeng C.-M. (2021). Liquid biopsy, ctDNA diagnosis through NGS. Life 11, 890. doi:10.3390/life11090890
45
Ling X., Wang C., Li L., Pan L., Huang C., Zhang C., et al. (2023). Third-generation sequencing for genetic disease. Clin. Chim. Acta Int. J. Clin. Chem. 551, 117624. doi:10.1016/j.cca.2023.117624
46
Liu X., Han S., Wang Z., Gelernter J., Yang B.-Z. (2013). Variant callers for next-generation sequencing data: a comparison study. PLOS ONE 8, e75619. doi:10.1371/journal.pone.0075619
47
Liu Y., Feng Y., Hou T., Lizaso A., Xu F., Xing P., et al. (2020). Investigation on the potential of circulating tumor DNA methylation patterns as prognostic biomarkers for lung squamous cell carcinoma. Transl. Lung Cancer Res. 9, 2356–2366. doi:10.21037/tlcr-20-1070
48
Lomakin A., Svedlund J., Strell C., Gataric M., Shmatko A., Rukhovich G., et al. (2022). Spatial genomics maps the structure, nature and evolution of cancer clones. Nature 611, 594–602. doi:10.1038/s41586-022-05425-2
49
Mahamdallie S., Ruark E., Yost S., Münz M., Renwick A., Poyastro-Pearson E., et al. (2018). The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing data quality assurance. Wellcome Open Res. 3, 37. doi:10.12688/wellcomeopenres.14307.1
50
Mallampati S., Zalles S., Duose D. Y., Hu P. C., Medeiros L. J., Wistuba I. I., et al. (2019). Development and application of duplex sequencing strategy for cell-free DNA–based longitudinal monitoring of stage IV colorectal cancer. J. Mol. Diagn. 21, 994–1009. doi:10.1016/j.jmoldx.2019.06.008
51
Malone E. R., Oliva M., Sabatini P. J. B., Stockley T. L., Siu L. L. (2020). Molecular profiling for precision cancer therapies. Genome Med. 12, 8. doi:10.1186/s13073-019-0703-1
52
Marcozzi A., Jager M., Elferink M., Straver R., van Ginkel J. H., Peltenburg B., et al. (2021). Accurate detection of circulating tumor DNA using nanopore consensus sequencing. Npj Genomic Med. 6, 106. doi:10.1038/s41525-021-00272-y
53
Marshall C. R., Chowdhury S., Taft R. J., Lebo M. S., Buchan J. G., Harrison S. M., et al. (2020). Best practices for the analytical validation of clinical whole-genome sequencing intended for the diagnosis of germline disease. Npj Genomic Med. 5, 47. doi:10.1038/s41525-020-00154-9
54
Martins I., Ribeiro I. P., Jorge J., Gonçalves A. C., Sarmento-Ribeiro A. B., Melo J. B., et al. (2021). Liquid biopsies: applications for cancer diagnosis and monitoring. Genes 12, 349. doi:10.3390/genes12030349
55
Noguchi T., Sakai K., Iwahashi N., Matsuda K., Matsukawa H., Yahata T., et al. (2020). Changes in the gene mutation profiles of circulating tumor DNA detected using CAPP-Seq in neoadjuvant chemotherapy-treated advanced ovarian cancer. Oncol. Lett. 19, 2713–2720. doi:10.3892/ol.2020.11356
56
Oliveira K. C. S., Ramos I. B., Silva J. M. C., Barra W. F., Riggins G. J., Palande V., et al. (2020). Current perspectives on circulating tumor DNA, precision medicine, and personalized clinical management of cancer. Mol. Cancer Res. 18, 517–528. doi:10.1158/1541-7786.MCR-19-0768
57
Parikh A. R., Van Seventer E. E., Siravegna G., Hartwig A. V., Jaimovich A., He Y., et al. (2021). Minimal residual disease detection using a plasma-only circulating tumor DNA assay in patients with colorectal cancer. Clin. Cancer Res. 27, 5586–5594. doi:10.1158/1078-0432.CCR-21-0410
58
Peng Q., Xu C., Kim D., Lewis M., DiCarlo J., Wang Y. (2019). Targeted single primer enrichment sequencing with single end duplex-UMI. Sci. Rep. 9, 4810. doi:10.1038/s41598-019-41215-z
59
Phallen J., Sausen M., Adleff V., Leal A., Hruban C., White J., et al. (2017). Direct detection of early-stage cancers using circulating tumor DNA. Sci. Transl. Med. 9, eaan2415. doi:10.1126/scitranslmed.aan2415
60
QIAseq SPE technology for Illumina (2018). Redefining amplicon sequencing - QIAGEN. Available at: https://www.qiagen.com/us/resources/resourcedetail?id=b3363886-aaed-4e0d-8d4b-3291b28593c5&lang=en (Accessed July 5, 2024).
61
Reinert K., Langmead B., Weese D., Evers D. J. (2015). Alignment of next-generation sequencing reads. Annu. Rev. Genomics Hum. Genet. 16, 133–151. doi:10.1146/annurev-genom-090413-025358
62
Rezazadeh A., Pruett J., Detzner A., Edwin N., Hamadani M., Shah N. N., et al. (2024). Immunoglobulin high throughput sequencing (Ig-HTS) minimal residual disease (MRD) analysis is an effective surveillance tool in patients with mantle cell lymphoma. Clin. Lymphoma Myeloma Leuk. 24, 254–259. doi:10.1016/j.clml.2023.12.006
63
Sabatier R., Garnier S., Guille A., Carbuccia N., Pakradouni J., Adelaide J., et al. (2022). Whole-genome/exome analysis of circulating tumor DNA and comparison to tumor genomics from patients with heavily pre-treated ovarian cancer: subset analysis of the PERMED-01 trial. Front. Oncol. 12, 946257. doi:10.3389/fonc.2022.946257
64
Salk J. J., Schmitt M. W., Loeb L. A. (2018). Enhancing the accuracy of next-generation sequencing for detecting rare and subclonal mutations. Nat. Rev. Genet. 19, 269–285. doi:10.1038/nrg.2017.117
65
Sanz-Garcia E., Zhao E., Bratman S. V., Siu L. L. (2022). Monitoring and adapting cancer treatment using circulating tumor DNA kinetics: current research, opportunities, and challenges. Sci. Adv. 8, eabi8618. doi:10.1126/sciadv.abi8618
66
Scarano C., Veneruso I., De Simone R. R., Di Bonito G., Secondino A., D’Argenio V. (2024). The third-generation sequencing challenge: novel insights for the omic sciences. Biomolecules 14, 568. doi:10.3390/biom14050568
67
Serrano C., Leal A., Kuang Y., Morgan J. A., Barysauskas C. M., Phallen J., et al. (2019). Phase I study of rapid alternation of sunitinib and regorafenib for the treatment of tyrosine kinase inhibitor refractory gastrointestinal stromal tumors. Clin. Cancer Res. 25, 7287–7293. doi:10.1158/1078-0432.CCR-19-2150
68
Shen W., Shan B., Liang S., Zhang J., Yu Y., Zhang Y., et al. (2021). Hybrid capture-based genomic profiling of circulating tumor DNA from patients with advanced ovarian cancer. Pathol. Oncol. Res. 27, 581534. doi:10.3389/pore.2021.581534
69
Shi Z., Lopez J., Kalliney W., Sutton B., Simpson J., Maggert K., et al. (2022). Development and evaluation of ActSeq: a targeted next-generation sequencing panel for clinical oncology use. PLOS ONE 17, e0266914. doi:10.1371/journal.pone.0266914
70
Shields M. D., Chen K., Dutcher G., Patel I., Pellini B. (2022). Making the rounds: exploring the role of circulating tumor DNA (ctDNA) in non-small cell lung cancer. Int. J. Mol. Sci. 23, 9006. doi:10.3390/ijms23169006
71
SinghR. R. (2022). Target enrichment approaches for next-generation sequencing applications in oncology. Diagnostics12, 1539. 10.3390/diagnostics12071539
- CrossRef
- Google Scholar
72
SogbeM.BilbaoI.MarcheseF. P.ZazpeJ.De VitoA.PozueloM.et al (2024). Prognostic value of ultra-low-pass whole-genome sequencing of circulating tumor DNA in hepatocellular carcinoma under systemic treatment. Clin. Mol. Hepatol.30, 177–190. 10.3350/cmh.2023.0426
- CrossRef
- Google Scholar
73
Sun, Y., Liu, F., Fan, C., Wang, Y., Song, L., Fang, Z., et al. (2021). Characterizing sensitivity and coverage of clinical WGS as a diagnostic test for genetic disorders. BMC Med. Genomics 14, 102. doi:10.1186/s12920-021-00948-5
74
Szadkowska, P., Roura, A.-J., Wojtas, B., Wojnicki, K., Licholai, S., Waller, T., et al. (2022). Improvements in quality control and library preparation for targeted sequencing allowed detection of potentially pathogenic alterations in circulating cell-free DNA derived from plasma of brain tumor patients. Cancers 14, 3902. doi:10.3390/cancers14163902
75
Tie, J., Wang, Y., Cohen, J., Li, L., Hong, W., Christie, M., et al. (2021). Circulating tumor DNA dynamics and recurrence risk in patients undergoing curative intent resection of colorectal cancer liver metastases: a prospective cohort study. PLOS Med. 18, e1003620. doi:10.1371/journal.pmed.1003620
76
Treffer, R., and Deckert, V. (2010). Recent advances in single-molecule sequencing. Curr. Opin. Biotechnol. 21, 4–11. doi:10.1016/j.copbio.2010.02.009
77
Trivedi, U. H., Cézard, T., Bridgett, S., Montazam, A., Nichols, J., Blaxter, M., et al. (2014). Quality control of next-generation sequencing data without a reference. Front. Genet. 5, 111. doi:10.3389/fgene.2014.00111
78
Tuaeva, F., Porozov, N., Trukhan, K., Nosyrev, A. E., Kovatsi, L., Spandidos, D. A., et al. (2019). Translational application of circulating DNA in oncology: review of the last decades achievements. Cells 8, 1251. doi:10.3390/cells8101251
79
Turner, N. C., Swift, C., Jenkins, B., Kilburn, L., Coakley, M., Beaney, M., et al. (2023). Results of the c-TRAK TN trial: a clinical trial utilising ctDNA mutation tracking to detect molecular residual disease and trigger intervention in patients with moderate- and high-risk early-stage triple-negative breast cancer. Ann. Oncol. 34, 200–211. doi:10.1016/j.annonc.2022.11.005
80
U.S. Food and Drug Administration (2022). List of cleared or approved companion diagnostic devices (in vitro and imaging tools). Available at: https://www.fda.gov/medical-devices/in-vitro-diagnostics/list-cleared-or-approved-companion-diagnostic-devices-in-vitro-and-imaging-tools.
81
U.S. Food and Drug Administration (2023). List of cleared or approved companion diagnostic devices (in vitro and imaging tools). Available at: https://www.fda.gov/medical-devices/recently-approved-devices/foundationone-liquid-cdx-f1-liquid-cdx-p190032s010 (Accessed January 10, 2024).
82
van der Pol, Y., Tantyo, N. A., Evander, N., Hentschel, A. E., Wever, B. M., Ramaker, J., et al. (2023). Real-time analysis of the cancer genome and fragmentome from plasma and urine cell-free DNA using nanopore sequencing. EMBO Mol. Med. 15, e17282. doi:10.15252/emmm.202217282
83
Wadapurkar, R. M., and Vyas, R. (2018). Computational analysis of next generation sequencing data and its applications in clinical oncology. Inf. Med. Unlocked 11, 75–82. doi:10.1016/j.imu.2018.05.003
84
Wang, Y., Zhao, Y., Bollas, A., Wang, Y., and Au, K. F. (2021). Nanopore sequencing technology, bioinformatics and applications. Nat. Biotechnol. 39, 1348–1365. doi:10.1038/s41587-021-01108-x
85
Wu, X.-B., Hou, S.-L., Zhang, Q.-H., Jia, N., Hou, M., and Shui, W. (2022). Circulating tumor DNA characteristics based on next generation sequencing and its correlation with clinical parameters in patients with lymphoma. Front. Oncol. 12, 901547. doi:10.3389/fonc.2022.901547
86
Yu, L., Lopez, G., Rassa, J., Wang, Y., Basavanhally, T., Browne, A., et al. (2022). Direct comparison of circulating tumor DNA sequencing assays with targeted large gene panels. PLOS ONE 17, e0266889. doi:10.1371/journal.pone.0266889
87
Zhao, C., Pan, Y., Wang, Y., Li, Y., Han, W., Lu, L., et al. (2020). A novel cell-free single-molecule unique primer extension resequencing (cf-SUPER) technology for bladder cancer non-invasive detection in urine. Transl. Androl. Urol. 9, 1222–1231. doi:10.21037/tau-19-774
88
Zviran, A., Schulman, R. C., Shah, M., Hill, S. T. K., Deochand, S., Khamnei, C. C., et al. (2020). Genome-wide cell-free DNA mutational integration enables ultra-sensitive cancer monitoring. Nat. Med. 26, 1114–1124. doi:10.1038/s41591-020-0915-3

Summary

Keywords

precision medicine, ctDNA mutation, non-targeted next-generation sequencing, targeted next-generation sequencing, bioinformatics

Citation

Silva TF, Azevedo Jr JC, Teixeira EB, Casseb SMM, Moreira FC, Assumpção PP, Santos SEB and Calcagno DQ (2024) From haystack to high precision: advanced sequencing methods to unraveling circulating tumor DNA mutations. Front. Mol. Biosci. 11:1423470. doi: 10.3389/fmolb.2024.1423470

Received

25 April 2024

Accepted

11 July 2024

Published

06 August 2024

Volume

11 - 2024

Edited by

Carmela De Marco, Magna Græcia University of Catanzaro, Italy

Reviewed by

Claudia Veneziano, Magna Græcia University, Italy

Yunfan Fan, Bristol Myers Squibb, United States

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tamires Ferreira da Silva, ferreiradasilvatamires81@gmail.com; Danielle Queiroz Calcagno, danicalcagno@gmail.com

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
