- 1AO Vector-Best, Novosibirsk, Russia
- 2Department of the Structure and Function of Chromosomes, Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
- 3Federal State Budget Scientific Institution "Federal Research Center of Fundamental and Translational Medicine", Novosibirsk, Russia
- 4Department of Obstetrics and Gynecology, Novosibirsk State University, Novosibirsk, Russia
- 5Institute of Oncology and Neurosurgery, E. Meshalkin National Medical Research Center, Novosibirsk, Russia
- 6Department of Cervical Pathology, Saint-Petersburg City Clinic №17, Saint-Petersburg, Russia
- 7Subcellular Technology Lab, N.N. Petrov National Medical Research Center of Oncology, Saint Petersburg, Russia
Background: Cervical screening, aimed at detecting precancerous lesions and preventing cancer, is based on cytology and HPV testing. Both methods have limitations, the main ones being the variable diagnostic sensitivity of cytology and the moderate specificity of HPV testing. Various molecular biomarkers are proposed in recent years to improve cervical cancer management, including a number of mRNAs encoded by human genes involved in carcinogenesis. Many scientific papers have shown that the expression patterns of cellular mRNAs reflect the severity of the lesion, and their analysis in cervical smears may outperform HPV testing in terms of diagnostic specificity. However, such analysis has not yet been implemented in broad clinical practice. Our aim was to devise an assay detecting severe cervical lesions (≥HSIL) via analysis of cellular mRNA expression in cytological smears.
Methods: Through logistic regression analysis of a reverse-transcription quantitative PCR (RT-qPCR) dataset generated from analysis of six mRNAs in 167 cervical smears with various cytological diagnoses, we generated a family of linear classifiers based on paired mRNA concentration ratios. Each classifier outputs a dimensionless decision function (DF) value that increases with lesion severity. Additionally, in the same specimens, the HPV genotyping, viral load assessment, diagnosis of cervicovaginal microbiome imbalance and profiling of some relevant mRNAs and miRNAs were performed by qPCR-based methods.
Results: The best classifiers were obtained with pairs of mRNAs whose expression changes in opposite directions during lesion progression. With this approach based on a five-mRNA combination (CDKN2A, MAL, TMPRSS4, CRNN, and ECM1), we generated a classifier having ROC AUC 0.935, diagnostic sensitivity 89.7%, and specificity 87.6% for ≥HSIL detection. Based on this classifier, a two-tube RT-qPCR based assay was developed and it confirmed the preliminary characteristics on 120 cervical smears from the test sample. DF values weakly correlated with HPV loads and cervicovaginal microbiome imbalance, thus being independent markers of ≥HSIL risk.
Conclusion: Thus, we propose a high-throughput method for detecting ≥HSIL cervical lesions by RT-qPCR analysis of several cellular mRNAs. The method is suitable for the analysis of cervical cytological smears prepared by a routine method. Further clinical validation is necessary to clarify its clinical potential.
1 Introduction
Cervical cancer is preventable if detected at early stages of progression and treated timely. Nonetheless, it remains one of leading causes of morbidity and mortality among women. This problem can largely be attributed to shortcomings of disease prevention methods. Vaccination against high-risk human papillomavirus (HR-HPV) types (which are the main etiological factor for most of cervical lesions) covers only the most common types of HPV and is not generally available in most countries. Cervical screening, intended to detect the disease at early stages and to predict risks, is based on methods that have considerable drawbacks. In particular, cytological examination of cervical scrapings (cytology), which is the main technique for primary screening, has moderate and variable diagnostic sensitivity (~30% to 90% for high-grade squamous intraepithelial lesion or later stage, ≥HSIL) and low reproducibility of repeated tests (1–5), and consequently approximately half of severe lesions go undetected. The above characteristics are influenced by many factors: qualifications and experience of the cytologist, the organization of the laboratory, the technique and quality of sampling, and fixation and staining of cytological material (6). Besides, diagnoses based on cellular morphology are subjective and sometimes even experienced specialists disagree with each other (7, 8). HPV testing—combined with cytology or used instead of it in primary screening—has widely ranging diagnostic specificity (~40% to 98% for ≥HSIL) (9–13) due to the high percentage of transient infections (~90% are eliminated within 2 years) (14), varied prevalence of papillomavirus infections in different populations and age groups (5), and different types of employed assays. These issues lead to overdiagnosis and numerous unnecessary referrals of women for colposcopy. Moreover, such methods are unable to detect HPV-negative cases of severe cervical lesions progressing along an HPV-independent pathway as well as cases associated with atypical HPV genotypes (together they can account for up to 15% of cases, and if we take adenocarcinomas separately, then up to 38%) (15, 16). Furthermore, as HR-HPV vaccination programs are implemented, the contribution of such cases to the proportion of false negative screening results will increase.
In this regard, a search is underway for improving the diagnosis of cervical lesions by using additional molecular biomarkers. Promising solutions include the use of biomarkers of various types, such as proteins; methylation of viral or cellular DNA; cellular mRNAs, microRNAs (miRNAs, miRs), circular RNAs, or long noncoding RNAs; and cervical microbiome composition (17–23). Some of them may be useful for early detection of the disease and/or for triaging HPV-positive patients by weighing the risk of cervical pathology, and others can serve as supporting tools for better prognosis or for molecular typing of a lesion or choosing the type of treatment and monitoring its effectiveness. However, it is important to note that using combinations of different types of molecular biomarkers in a single pipeline can be problematic for routine analysis due to individual workflow requirements, starting from the preanalytical stage. Compared with a variety of cytomorphological features, which show various degrees of poor interobserver concordance (3), molecular biomarker analysis may be more objective and reproducible (24). In addition, molecular testing can be performed using high-throughput methods such as PCR, which can be useful in the face of a shortage of experienced cytologists.
This and our previous articles (25, 26) are devoted to the development of a molecular test for cervical screening that is based on the analysis of cellular biomarker RNAs in cytological smears. The approach is designed to subdivide patients into two groups: i) low-grade squamous intraepithelial lesions or earlier stages, ≤LSIL (i.e., low risk of ≥HSIL), and ii) ≥HSIL (increased risk of ≥HSIL). The division is based on fundamentally different clinical guidelines for LSILs and HSILs: while some patients require only follow-up, others require immediate in-depth examination and treatment. This is because cervical precancerous lesions have different prognoses: probabilities of regression or progression. Therefore, our aim was to design a tool that would be useful for clinicians both for identifying severe cervical lesions and for risk stratification in cases of borderline controversial conditions. Important criteria for the creation of the test (making it suitable for routine screening) were high diagnostic performance and throughput, ease of operation, and an acceptable cost. Therefore, in this work, we selected a small set of up- and down-regulated cervical-cancer–related mRNAs and attempted to use the approach (involving pairs of oppositely deregulated biomarkers) that we have previously successfully used for miRNA normalization (25). The use of paired combinations of biomarkers with different directions of expression change (instead of their ‘classical’ normalization to housekeeping genes) can minimize the number of analyzed targets in the test and maximize its diagnostic characteristics because only diagnostically useful RNAs are chosen.
2 Materials and methods
2.1 Clinical material
Air-dried cytological smears of cervical epithelium were prepared by routine Papanicolaou staining. The material was collected from patients undergoing cytological and histological examination and treatment in oncogynecology departments of three medical centers in Russia (N.N. Petrov National Medical Research Center of Oncology, St. Petersburg; Medsanchast-168, Novosibirsk; and Novosibirsk Regional Oncology Dispensary). The cytological smears were classified according to the Bethesda system: negative for intraepithelial lesion or malignancy (NILM), LSIL, HSIL, or cervical cancer (CC). All HSIL and CC diagnoses were verified histologically. Samples with ASCUS diagnoses were intentionally excluded from the study in order to minimize their influence on the selection of biomarkers and training the algorithm for their interpretation. This is due to the fact that, in our opinion, this group of diagnoses is most susceptible to the influence of the human factor and subjectivity. The cytological smears were obtained from patients aged 24 to 81 years: NILM (n = 61, mean age 38), LSIL (n = 28, mean age 35), HSIL (n = 42, mean age 40), and CC (n = 36, mean age 44). Information about the samples is presented as a flow diagram in Figure 1. This study was approved by the Ethics Committee of federal government-funded institution N.N. Petrov National Medical Research Center of Oncology (extract 11/31 from protocol №3 from 16.02.2023; Saint Petersburg, Russia). Written informed consent was provided by all patients involved in the study, and the clinical data were depersonalized.
Figure 1. The flow diagram of the study population. HMBS and PGK1 analyses were performed to assess the amount of DNA and mRNA in samples (see subsection 2.3).
The HeLa cell line was also used, to assay the expression of mRNAs included in the prototype test and to compare it with expression levels in cytological smears that were ≥HSIL. The cell line was obtained from the cell culture collection at the State Research Center of Virology and Biotechnology VECTOR (Koltsovo, Russia).
2.2 Criteria for inclusion and exclusion of patients
Criteria for inclusion of patients:
1. Patients aged 21-75 years;
2. The interval for taking a sample is 5-23 days of the menstrual cycle;
3. Availability of written, personally signed, and dated voluntary informed consent to participate in the study;
4. The presence of a cytological conclusion; histological confirmation for ≥HSIL cases.
Criteria for exclusion of patients:
1. Refusal of the patient to participate in the study;
2. Pregnancy and breastfeeding period;
3. Taking a smear from the cervix within less than 1 month before the start of the study;
4. Lack of histological confirmation of ≥HSIL cytological findings;
5. Cytological conclusion other than NILM, LSIL, HSIL, CC;
6. Prior therapy for precancerous lesions and cervical cancer, endometrial cancer;
7. Any other cancer history.
2.3 Nucleic acid extraction
Total-nucleic-acid extraction from cytological smears and from the HeLa cell line was performed as described elsewhere (26). To assess the integrity of human DNA and mRNA in the samples, the copy number of a conserved region of the HMBS gene and the mRNA of housekeeping gene PGK1 were analyzed by quantitative PCR (qPCR) and reverse-transcription quantitative PCR (RT-qPCR), respectively (primer sequences are given in Supplementary Table S1). Samples with an insufficient amount of human DNA (Cq for HMBS ≥ 35) or of human RNA (Cq for PGK1 mRNA ≥ 35) were excluded from further analyses.
2.4 Selection of biomarker mRNAs
In this paper, mRNAs known to be downregulated in cervical lesions were selected, to be combined into pairs with upregulated mRNAs chosen based on our previous work (26). To do this, a systematic literature search was conducted in PubMed. We selected mRNA-coding genes for whose downregulation a statistically significant association has been reported (in relation to cervical disease severity) in at least three publications. Preference was given to studies with larger sample sizes in which clinical relevance of biomarkers was confirmed by RT-qPCR-based methods. The initial list consisted of 20 mRNA-coding genes: ALOX12, CDH1, CFD, CRCT1, CRISP3, CRNN, CWH43, DSG1, EDN3, ESR1, HOPX, KRT1, KRT10, KRT13, KRT4, KRTDAP, MAL, PER1, PPP1R3C, and SPINK5. Of these, the six most common among studies (≥ 5 publications) were selected that i) have a sufficiently high level of initial expression in cells of the normal epithelium (according to proteinatlas.org) and ii) are not downregulated in endometriosis and endometrial cancer (to exclude false positive results if such cells accidentally get into a cytological smear): CDH1, PER1, MAL, CRNN, CRISP3, and SPINK5. Their expression was assessed by RT-qPCR in a small set of cervical specimens cytologically corresponding to NILM or CC. For four of the six mRNAs (MAL, CRNN, CRISP3, and SPINK5), considerable CC-associated downregulation was confirmed, and they were chosen for further analysis along with several biomarker mRNAs from our previous work (CDKN2A, TMPRSS4 [= TSP4], ECM1, and TOP2A).
2.5 Molecular analyses and data normalization
All amplification procedures were performed using a CFX96 thermal cycler (Bio-Rad Laboratories, Hercules, CA, USA). Identification, genotyping, and quantification of HR-HPV viral DNA were performed with the RealBest DNA HPV HR genotype quantitative (Cat. № D-8478, AO Vector-Best), according to the manufacturer’s instructions. The kit is designed to detect and quantify 12 HR-HPV types (HPV 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, and 59). Four cytological smears with signs of neoplastic lesions, identified as HPV-negative, were additionally tested for HPV DNA genotypes 26, 53, 66, 68, 73, and 82 using the RealBest DNA HPV 26/53/66 Kit and RealBest DNA HPV 68/73/82 Kit (Cat. № D-8449 and D-8451, AO Vector-Best). The viral DNA loads were estimated by the 2-ΔCq method (27), using Cq values from amplification of the HMBS gene (as a marker reflecting human DNA amount) as a normalizing factor. To determine the relative enrichment of lactobacilli (LBs) and the proportion of typical opportunistic species (aerobic and anaerobic) in the total bacterial population, the RealBest BioFlor Kit (Cat. № D-4225, AO Vector-Best) was used.
For the selected six mRNAs, oligonucleotides for RT-qPCR were designed; their sequences are given in Supplementary Table S1. The RT-qPCR conditions were identical to those described elsewhere (26); each clinical sample was analyzed as a single replicate. Samples with Cq > 40 were considered expression-negative. The concentration of each biomarker mRNA was assessed semiquantitatively via normalization to housekeeping PGK1 mRNA by the 2-ΔCq method.
During the training of the linear classifiers, for each mRNA pair (mRNAX; mRNAY), a normalized ΔCq value was utilized, calculated according to the formula:
where C(mRNAX) is the concentration of gene X’s mRNA, C(mRNAY) is the concentration of gene Y’s mRNA, and Cq(mRNAX) and Cq(mRNAY) are quantification PCR cycles for mRNAs of genes X and Y, respectively.
2.6 Statistical analysis, construction of classifiers, and assessment of diagnostic performance
Data were analyzed using the SciPy library of the Python programming language (28, 29). All possible paired mRNA combinations were generated to train classifiers based on them. The significance of detected differences between groups of patients was assessed by the Mann–Whitney U test in the STATISTICA v10.0 software (StatSoft, Tulsa, OK, USA). At a p-value of < 0.05, differences were considered statistically significant. To avoid the type I error, the Bonferroni correction was applied. To evaluate diagnostic characteristics (ability to detect ≥HSIL) of individual mRNAs, of biomarker pairs, and of linear classifiers, the area under the receiver-operating characteristic (ROC) curve (AUC) and diagnostic sensitivity and specificity were calculated in SPSS Statistics v23 (IBM, Armonk, New York, USA), with clinical reports of cytological findings as a reference (in cases where a histological diagnosis determined for cytological smears differed from their cytological diagnosis, the former was employed as a reference). For the best classifiers, sensitivity and specificity were calculated in five combinations for five diagnostic strategies that we tentatively selected: 1) the basic version (sensitivity ≈ specificity), 2) increased sensitivity (> 90%) with acceptable specificity (≥ 80%), 3) highly sensitive version (sensitivity 95–99%), 4) increased specificity (> 90%) with acceptable sensitivity (≥ 80%), and 5) highly specific version (specificity 95–99%). Classifiers were trained based on ΔCq values for the best biomarker pairs (the lowest p-values and the best combination of diagnostic characteristics). Classification procedures were performed by means of the Python Scikit-learn library as described before (26). K-fold method was used for cross-validation. The result of applying a classifier to each cytological smear is a value of a decision function (DF) calculated using the formula:
where ΔCqi are the above-mentioned variables, wi are coefficients of the variables (feature weights), n is the number of biomarker pairs, and w0 is an arbitrary term (the selected cutoff). DF values > 0 indicate an increased risk of ≥HSIL, which means a high probability of HSIL or CC. The distance from 0 denotes relative risks. Spearman’s correlation coefficient was calculated to assess the statistical dependence between features/data sets.
2.7 Extended analysis of samples for detailed examination of discordant results
The results were considered discrepant [“false positive” (FP) or “false negative” (FN) in terms of molecular classification] in cases when the “risk of ≥HSIL” based on the DF value for a cytological smear contradicted its clinical report of cytological findings. For a more detailed examination of such cases, an extended analysis was performed on 20 discrepant results of classifier 1 and on all other 147 samples with the help of additional biomarkers of different types from our previous papers (25, 26, 30): mRNAs (ASF1B, KRT7, CD82, CDH3, TOP2A, and SPRR3), miRNAs (miR-1246, miR-145, miR-196b, miR-34a, miR-20a, miR-21, miR-96, and miR-375), HPV viral load and genotypes (12 HR-HPV types: 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, and 59 and six additional types: 26, 53, 66, 68, 73, and 82), and a relative percentage of LBs. For RNAs, normalized values were employed: for mRNAs, normalization to housekeeping gene PGK1 by the 2-ΔCq method; and for miRNAs, normalization to miR-375 by the ΔCq method. The rationale for the method of normalization of miRNAs can be found in ref (25). Based on the results of the analysis, a single heat map was constructed for all biomarkers in Microsoft Excel 2019 (Microsoft, USA).
3 Results
3.1 Diagnostic utility of selected mRNAs for the detection of ≥HSIL in cytological smears
Discriminatory power of selected mRNAs for the detection of ≥HSIL was evaluated as follows. ROC analysis—for four downregulated mRNAs (MAL, CRNN, CRISP3, and SPINK5) chosen in this study and for two upregulated (TOP2A and CDKN2A) and two downregulated (ECM1 and TMPRSS4) mRNAs selected earlier (26) normalized to the housekeeping mRNA (PGK1)—was performed on the data obtained from the entire study population. Two classification schemes were tested for the groups formed based on cytological diagnoses: NILM versus CC and ≤LSIL versus ≥HSIL. The results are presented in Table 1; Figure 2 also illustrates the distribution of relative mRNA amounts in groups of cytological smears corresponding to different diagnoses.
Table 1. ROC AUC and p-values for each PGK1-normalized mRNA, regarding the discrimination of different diagnoses.
Figure 2. Box-whisker plots for normalized concentrations of selected mRNAs in subcategories of cervical specimens: (A) in different cytological diagnoses (Bethesda: NILM, LSIL, HSIL, or CC); (B) in combined subgroups: ≤LSIL and ≥HSIL. Blue color: downregulated mRNAs, and pink: upregulated mRNAs. 2-ΔCq values are linearly transformed to bring all charts to a single scale. The charts show average values (crosses), upper and lower quartiles (boxes), median values (horizontal lines inside boxes), ranges without outliers (whiskers), and outliers (tiny circles). P-values are given above the plots.
Upregulated mRNA CDKN2A and downregulated mRNAs MAL and CRNN performed best at discriminating NILM from CC (ROC AUC > 0.85) and group ≤LSIL from ≥HSIL (ROC AUC > 0.75).
3.2 Diagnostic utility of mRNA pairs for the detection of ≥HSIL in cytological smears
To evaluate discriminatory power of different mRNA pairs for the detection of ≥HSIL, ROC analysis for ΔCq values calculated for all possible mRNA pairs (n = 36) was performed on the data obtained from the entire study population. The results are summarized in Figure 3, including box plots illustrating the distribution of ΔCq values in groups of cytological smears corresponding to various diagnoses (all ROC AUC and p-values are given in the Supplementary Table S2).
Figure 3. (A) ROC AUC values for 36 mRNA pairs in terms of discrimination of NILM from CC (x-axis) and ≤LSIL from ≥HSIL (y-axis). Colors indicate different types of mRNA pairs, combinations of up-, down-regulated, and housekeeping (hk) mRNAs: red, up-up (UU); dark blue, down-down (DD); orange, up-hk (UH); light blue, down-hk (DH); and pink, down-up (DU). (B) Box-whisker plots for ΔCq values of the best mRNA pairs in different cytological diagnoses (Bethesda: NILM, LSIL, HSIL, or CC). ΔCq values are linearly transformed to bring all charts to a single scale. The charts show average values (crosses), upper and lower quartiles (boxes), median values (horizontal lines inside boxes), ranges without outliers (whiskers), and outliers (tiny circles). P-values are given above the plots.
The pairs combining an upregulated mRNA and a downregulated mRNA performed best at discriminating NILM from CC, and five of them (MAL-CDKN2A, CRNN-CDKN2A, TMPRSS4-CDKN2A, CRISP3-CDKN2A, and ECM1-CDKN2A) also performed best at discriminating ≤LSIL from ≥HSIL as compared with the best upregulated biomarker CDKN2A normalized to hk-mRNA PGK1.
3.3 Diagnostic utility of different types of linear classifiers based on selected mRNA pairs for the detection of ≥HSIL in cytological smears
The diagnostic characteristics (ROC AUC, diagnostic sensitivity and specificity) of linear classifiers based on 2–6 mRNA pairs selected at the previous stage were calculated for the entire study population and compared. In Table 2, diagnostic characteristics of the six selected classifiers are given, as compared to the classifier from our previous work (P) (26), for which these characteristics were calculated too. The inclusion of more than four mRNA pairs into a classifier or the addition of miRNA pairs validated in ref (26). in no case considerably improved the diagnostic characteristics.
Table 2. Diagnostic characteristics (ROC AUC, sensitivity and specificity) of the best six linear classifiers and of the classifier from our previous work (P) (26) calculated for the entire study population.
Because it is possible to obtain different combinations of sensitivity and specificity for the same test by changing the cutoffs, we additionally determined which of them could be obtained for classifiers 1–6 from Table 2 by adjusting the cutoff for specific diagnostic strategies (described in subsection 2.6), see Table 3.
Table 3. Combinations of sensitivity (Sens) and specificity (Spec) for detection of ≥HSIL by classifiers 1–6 from Table 2 at different cutoffs selected for four additional diagnostic strategies (described in 2.5).
Thus, classifier 1 was the best option for four of the five diagnostic strategies, considering that it allows the use of its shortened versions: classifier 2 (ECM1 excluded), which had comparable diagnostic performance, and classifier 3 (ECM1 and CRNN excluded), which was the best in the “Sens > 90%, Spec ≥ 80” strategy.
3.4 Extended analysis of samples for detailed examination of discordant results
The results are presented as a heat map (Figure 4), including data on all examined biomarkers in 167 cytological smears, as well as data on classifier 1 from the current work and classifier P from the previous paper (26).
Figure 4. The heat map illustrating data for different types of biomarkers (HR-HPV genotypes and loads, percentage of LBs, mRNAs and miRNAs levels) for 167 cytological smears and calculated DF values of classifiers 1 and P. The samples on the map are subdivided into six groups: “false negative” (FN) and “false positive” (FP) discrepant results; NILM and LSIL identified by classifiers in group ≤LSIL (low risk of ≥HSIL); HSIL and CC identified by classifiers in group ≥HSIL (increased risk of ≥HSIL). HPV genotypes are listed in order from highest to lowest viral load. Quantitative data on each biomarker were linearly transformed to a scale from 0 (baseline expression: blue) to 100 (highly aberrant expression: red). P50 (white) denotes the 50th percentile (P50 values vary for different biomarkers). Data on downregulated mRNAs and LBs were inverted (multiplied by −1 and then added 100). mRNAs included in classifier 1 are highlighted in pink.
Based on the results of the analysis of additional molecular biomarkers, several types of samples were identified:
1. baseline expression (blue) for most RNAs;
2. baseline expression for most of upregulated mRNAs and miRNAs and highly aberrant expression (red) for downregulated mRNAs;
3. moderately aberrant (white) and highly aberrant expression for most RNAs;
4. highly aberrant expression for most RNAs.
In the NILM group, type 1 samples were predominant, and samples of types 2 and 3 were also present. In the LSIL group, samples of the same three types were present, but the percentage of types 2 and 3 was higher. In groups HSIL and CC, samples of type 4 predominated, and there were a few samples of types 2 and 3. In groups FN and FP, type 3 samples were predominant, with type 1 samples also present in FN, type 2 in both FN and FP, and type 4 in group FP.
In total, 111 specimens from 167 (66.5%) were HR-HPV positive [NILM, 16 of 60 (26.6%); LSIL, 16 of 22 (72.7%); HSIL, 31 of 32 (96.9%); CC, 30 of 33 (90.9%); FN 11 of 13 (84.6%); FP, 7 of 7 (100%)]. In 43 of 111 (38.7%) cases multigenotype (≥2 genotypes) HPV infections were found [NILM, 7 of 16 (43.7%); LSIL, 5 of 16 (31.2%); HSIL, 10 of 31 (32.2%); CC, 11 of 30 (36.6%); FN, 4 of 11 (36.4%); FP, 6 of 7 (85.7%)]. The representation of different HR-HPV genotypes in these groups is shown in Figure 4 and Supplementary Table S3. The percentage of HR-HPV-positive samples containing genotype 16 increased with the severity of cervical lesion [NILM, 3 of 16 (18.7%); LSIL, 9 of 16 (56.2%); HSIL, 21 of 31 (67.7%); CC, 24 of 30 (80%)]; and also was high in the FP group, 6 of 7 (85.7%). In addition, the NILM group showed a higher percentage of genotype 51 (68.7% of HR-HPV-positive samples) compared to other groups, where it ranged from 0 to 14%. Otherwise, no obvious associations or trends related to HPV genotypes or multigenotypic infections were found in the entire sample. Elevated viral loads (> P50) were observed in most samples from groups LSIL, HSIL, CC, and FP. Decreased LBs percentages were most common in groups HSIL and CC. The proportion of anaerobic bacterial species was inversely correlated with LBs, while no patterns were found for aerobic species (data not shown). Although in groups HSIL, CC, and FP, the values of all biomarkers mostly were moderately or highly aberrant, in groups NILM, LSIL, and FN, we did not find any obvious associations between increased HR-HPV DNA load, a decreased LBs percentage, and aberrant RNA expression. To compare the distribution of DF values, HPV loads and the percentage of LBs in the sample, box-whisker plots were constructed, see Figure 5. The calculation of the Spearman’s coefficient (rs) showed a weak correlation between the DF-values versus HPV loads and DF-values versus LBs percentage: 0.546 and ‑0.527, respectively.
Figure 5. Distribution of DF values, HPV viral loads and percentage of LBs in the analyzed sample of 167 cervical cytological preparations. p-values are given for comparisons for which differences were significant (p < 0.05/(7*3) = 2.4*10-3, with Bonferroni correction).
3.5 A prototype molecular test suitable for routine use
To devise a prototype test, the multiplex analysis of five mRNAs (providing an opportunity to use classifiers 1–3) in two tubes (1: CDKN2A+TMPRSS4+CRNN, 2: MAL+ECM1) was optimized in the format of lyophilized ready-for-use PCR mixes (Figure 6A). In the remaining material of 120 out of the 167 cytological smears (52 NILMs, 9 LSILs, 38 HSILs, and 21 CCs), diagnostic characteristics equivalent (sensitivity 88.14%, specificity 86.9%) to those of the nonmultiplex assay (sensitivity 88.14%, specificity 88.52%) were obtained for the same set of cytological smears. Spearman’s coefficient of correlation between DF values of the two methods was high (rs = 0.945). Figure 6B shows examples of RT-qPCR results for eight random cervical cytological smears with different diagnoses and cultured HeLa cells for comparison.
Figure 6. A prototype molecular test for cervical pathology detection. (A) To perform the test, it is necessary to reconstitute lyophilized ready-for-use PCR mixes in 50 µL of an eluate (including addition of a nucleic-acid sample: two tubes per sample), to mix and place them in a thermocycler and select an appropriate protocol of RT-qPCR. (B) Examples of RT-qPCR results for cervical cytological smears with different diagnoses (2 NILMs, 2 LSILs, 2 HSILs, and 2 CCs) and for cultured HeLa cells in five replicates. The colors indicate signals in different fluorescence channels: red, ROX (crosses: CDKN2A, squares: ECM1); green, FAM (circles: TMPRSS4, diamonds: MAL); orange, HEX (triangles: CRNN).
Given that the accuracy of the obtained DF values partly depends on systematic error of RT-qPCR, we analyzed the samples in five replicates to estimate the contribution of this error. From the combinations of the obtained Cq values, all possible DF values (classifier 1) were computed (five biomarkers to the power of five replicates = 3125 DF values for each cytological smear). Their distribution for eight cytological smears is illustrated in Figure 7 and coefficients of variation (CV) are given in Table 4 below.
Figure 7. The distribution of DF values calculated from all possible combinations of Cq values from five RT-qPCR replicates for five mRNAs (3125 DF values for each of eight cytological smears).
Table 4. Examples of the prototype test’s results for eight cervical cytological samples with different diagnoses (2 NILMs, 2 LSILs, 2 HSILs, and 2 CCs) and cultured HeLa cells: each mRNA’s Cq obtained from the amplification plots in Figure 6B and DF values calculated from them using classifier 1.
Table 4 also shows the average Cq values, the DF values calculated from them, and the corresponding test results reflecting the risk of ≥HSIL for each of the nine samples.
The DF values’ variability associated with systematic error of RT-qPCR was low for all samples (CV ≤ 12.6%), and in none of the nine cases did it lead to overlaps with another diagnostic group.
4 Discussion
Every year, more and more studies come out on cellular molecular biomarkers of severe cervical lesions and cancer. These markers may improve the effectiveness of cervical screening in several ways. Firstly, they can compensate for the mediocre diagnostic sensitivity of cytology both by detecting missed severe lesions (owing to technological features of cytology and the influence of the human factor) and by identifying early molecular abnormalities that precede morphological ones and indicate a high risk of disease progression (31, 32). Secondly, the use of cellular biomarkers in combination with HPV testing provides an opportunity for risk stratification and triage to reduce the frequency of overdiagnosis and unnecessary referrals for colposcopy in case of transient infections. In addition, the analysis of cellular targets can help to detect both HPV-negative cases and those associated with infection with rare HPV genotypes not covered by the HPV tests (the prevalence of such cases will increase with the dissemination of vaccination against the main HR-HPV genotypes). A wide variety of biomarkers have already been proposed for these purposes. At present, there are a number of established tests based on different types of biomarkers: proteins: CINtec® PLUS (p16 and Ki67; sensitivity 86.7%, specificity 95.2% for ≥HSIL) (33) and its analogs; methylation of cellular DNA: GynTect® (ASTN1, DLX1, ITGA4, RXFP3, SOX17, and ZNF671; sensitivity 59.7%, specificity 98.0% for ≥HSIL) (34), WID®-can (DPP6, RALYL, and GSX1; sensitivity 77.0%, specificity 76.9% for ≥HSIL) (35), QIAsure Methylation Test (FAM19A4 and MIR124-2; sensitivity 63.3%, specificity 67.4% for ≥HSIL) (36), and other gene panels; and miRNAs: NOVAprep® miR-Cervix (miR-21, -29b, -145, -451a, -1246, and -1290; sensitivity 79.2%, specificity 79.3% for NILM vs HSIL) (37). There are no established tests for triage based on the analysis of cellular mRNAs yet; however, diagnostic utility of biomarkers of this type is stated in some studies [e.g (38–42)].
In the present work, we developed a prototype test for detection of cervical lesions at a ≥HSIL stage (increased risk of ≥HSIL) by analyzing a small set of mRNAs in cytological smears. The test (based on CDKN2A, MAL, TMPRSS4, CRNN, and ECM1) has characteristics comparable to those of established tests (sensitivity 89.7%, specificity 87.6% for ≥HSIL). By using pairs of oppositely deregulated mRNAs, we managed to improve diagnostic accuracy of the method compared to our previous results (26), while reducing the number of analyzed biomarkers. The sensitivity and specificity of the assay can be adjusted by changing the cutoffs. Nonetheless, the suitability of these changes—in relation to the choice of different diagnostic strategies for solving specific clinical problems—should be confirmed in further studies. The contribution of each mRNA to a classification result needs to be clarified in a larger set of cytological smears that includes a variety of diagnostic categories (including ambiguous ones such as atypical squamous cells of undetermined significance, ASCUS) to find a balance between diagnostic parameter values and the number of analyzed targets. This is also important from an economic point of view because using a smaller number of biomarkers can reduce the cost of the assay.
The proportion of discordant results in this study was 12% (20 out of 167 for classifier 1). All of them can be tentatively categorized into two groups depending on the proximity of their DF values to the classifier cutoff: Group I [4 out of 20 (20%): DF values are far from the classifier cutoff and the results on most mRNAs in the extended analysis of samples correspond to their characteristic expression in groups NILM (2 FNs) and ≥HSIL (2 FPs)] and Group II [16 out of 20 (80%): DF values are close to the classifier cutoff (within 10% of values above and below it) and the expression of most mRNAs is intermediate between (or similar to) their characteristic expression levels in groups ≤LSIL (for FNs) and ≥HSIL (for FPs)]. There are various possible reasons for such results, including
1. dilution of atypical cells in cytological smears with normal cells (FNs from groups I and II);
2. errors of the reference method: failure to detect severe lesions or overestimation of lesion severity by the cytologist owing to the human factor (FNs and FPs from groups I and II);
3. intermediate states that are difficult to interpret by any methods and probably require assignment to a gray zone (FNs and FPs from group II);
4. errors in molecular classification owing to systematic error at any stage of the analysis (FNs and FPs from group II);
5. detection of early molecular abnormalities with a high risk of progression, without pronounced morphological atypia (FPs from group II);
6. insufficient specificity of biomarkers, leading to false positive results in cases of transient infections or any other diseases or in age-related aberrations (FPs from group II);
7. special molecular subtypes that are poorly identified by selected mRNAs (FNs from group II).
Using additional molecular biomarkers, we were unable to identify specific causes of each discrepant result. Nevertheless, in cases where the results on most biomarkers were consistent, it can be assumed that the reasons for such results are not systematic error or other errors in the molecular analysis. In our view, the following may be useful in addressing this issue and facilitating decision-making:
● collecting additional information about the age of patients, their genetic characteristics, and comorbidities;
● counting of different types of cells in cytological smears (calculating the proportions of atypical and healthy cells to assess heterogeneity);
● repeating various stages of the molecular analysis;
● getting a second opinion from another cytologist on the diagnosis for the same cytological smear or a redone one.
The results of the extended analysis of the remaining 147 samples, which were identified by the molecular test consistently with their morphological diagnoses, did not allow us to confidently distinguish any clusters or molecular subclasses, partly owing to the small sample size. Nonetheless, in groups NILM and LSIL, a subgroup of samples was identified that differed from the rest: normal expression of upregulated mRNAs characteristic of ≤LSIL and aberrant expression of all downregulated mRNAs characteristic of ≥HSIL. Most of our downregulated biomarkers are associated with some specific epithelial molecular processes (e.g., epithelial differentiation, keratinization, and apical transport in polarized epithelial cells), and upregulated ones are related to the cell cycle (regulation of replication, proliferation, apoptosis, and cell adhesion). Therefore, we can suggest two possible reasons for such an expression profile. The first one is early molecular abnormalities associated with neoplasia that have an uncertain risk of progression (there is a pronounced decrease in epithelial function, but no loss of cell cycle control). The second reason is an unrelated neoplasia condition or disease that is also characterized by underexpression of the selected mRNAs (i.e., insufficient specificity of the biomarkers). Four samples with similar expression profiles were also present in ≥HSIL groups, thus possibly also indicating that this is some special “molecular subclass” or dilution of atypical cells with normal cells in the cytological smears. Another notable finding is that in ≥HSIL groups, there were cytological smears with highly aberrant expression of most biomarkers, but baseline expression of CDKN2A. These cytological smears may represent a subgroup of CDKN2A-negative lesions mentioned in ref (43). Alternatively, this result may be due to the influence of some factors on gene expression that we did not account for. The above findings confirm the necessity to use combinations of biomarkers (instead of stand-alone ones) because this approach increases the likelihood of their mutual compensation for each other’s insufficient sensitivity or specificity, including those caused by the existence of different molecular subclasses.
When regarding the results of mRNA testing (DF-values) in the context of HPV infection and cervicovaginal microbiome (CVM) condition, we observed the weak correlation between these indicators. Microbial dysbiosis is now recognized as an additional risk factor for HPV persistence and cervical neoplasia. Normally, in women of reproductive age, the CVM is dominated by LBs, which prevent infections by maintaining a weakly acidic pH, producing lysozyme, hydrogen peroxide, bacteriocins, preventing biofilm formation and having a stimulatory influence on local immunity (44, 45). The CMV dysbiosis, the most pronounced feature of which being the decrease in the concentration of LBs, is usually a consequence of weakening of either systemic or local immunity, which, in its turn, is modulated by the CVM (46, 47). Dysbiosis is accompanied by growing biodiversity and an increased risk of infections caused by various pathogens. The dysbiotic CVM raises the risk of HR-HPV infection and persistence, which in turn trigger SIL development. As the result, cervical lesions are generally accompanied by the CVM dysbiosis (23, 48, 49). In the present work, we chose as an indicator of dysbiosis the relative proportion of LBs DNA in total bacterial DNA as the most universal parameter that could be easily estimated using a simple PCR-based method. In our sample, we observed significant differences in the proportion of LBs between patients with different diagnoses. However, they did not differ significantly between NILM and LSIL, as well as between LSIL and HSIL (see Figure 5) and were weakly correlated with DF values. Thus, despite the fact that CVM analysis can act as a triage tool, this may require more complex methods than a simple assessment of the relative proportion of LBs. Of note is that our sample was enriched in women with no LBs dominance, which is most likely due to age-related effects.
While the possibility of using CVM analysis as a triage tool has only been discussed relatively recently, attempts to use viral load and HPV genotyping for the same purposes have a long history. As HR-HPV testing cannot differentiate productive from transforming HPV infections, does not identify HR-HPV-negative lesions, and also has a low positive predictive value (PPV) of HSIL and CC detection, researchers are trying to analyze additional parameters of HPV infection to increase PPV. Among them are viral load and genotype. Unfortunately, both are not informative enough. The high HPV viral load in older ages is considered a surrogate marker of the HPV persistence pointing to an increased risk of malignant transformation. However, although HPV viral load may be an indicator of progression of precancerous cervical lesions, this association was shown to be age-, genotype-, and population-dependent, which limits its use as a triage indicator (50, 51). Currently, so-called “partial genotyping” of HR-HPV is commonly used in cervical screening algorithms; this is specific identification of the most aggressive genotypes (HPV16 and HPV18), which implies a more conservative approach to the management of women in whom less aggressive genotypes are found (52). Nevertheless, some studies indicate that (a) “more aggressive” genotypes do not significantly differ in the probability and rate of clearance, and (b) their contribution to the incidence of cervical pathology may be affected by the vaccination programs targeting these types. In this regard, extended genotyping may be more useful for management, which, however, involves more complex interpretation algorithms that have yet to be refined (53, 54). In our study sample, the relative frequency of HPV16 increased from NILM to CC groups (see Supplementary Table S3), which supports the concept of partial genotyping. As for the viral loads, they were not able to discriminate LSILs, HSILs and CCs (Figure 5), in line with our own and other studies, and also weakly correlated with DF values among these patient subgroups. The low correlation between DF values, HR-HPV viral loads and LBs DNA percentage, allows them to be considered as separate risk factors for cervical pathology, which could complement each other in the management of patients.
Some limitations of the study should be mentioned. The first one is the small sample size, and the second one is prevalence (of samples with various diagnoses) that differs substantially from that in the real-world patient population (where the proportion of HSIL and cancer is < 4%). These shortcomings could distort the calculation of diagnostic characteristics, leading to some overfitting. Thirdly, there were no diagnoses of ASCUS in our sample; the interpretation and classification of this pathology by means of molecular tools is one of the most urgent tasks. Another possible source of bias in the training of the 428 classifiers and in the calculation of diagnostic characteristics was the lack of histological verification of NILM and LSIL diagnoses (which is not stipulated for them in clinical guidelines). Moderate sensitivity of cytology as a reference method may lead to the presence of missed severe lesions in this group. Even in our small study population, there were three patients with histological diagnoses of HSIL and CC, but a negative cytology result (NILM), which, however, were correctly assigned by our classifier to group ≥HSIL.
It is also important to mention that a number of factors, not all of which we were able to account for, may directly or indirectly influence mRNA expression in cervical epithelial cells. This may, to a greater or lesser extent, compromise the usefulness of any selected biomarkers. E.g., age-related changes have been documented for both transcripts (p16INK4a and p14ARF) encoded by our key biomarker CDKN2A (55). Co-infections, age, and inflammation are themselves factors that influence carcinogenesis. Moreover, they manifest cross-effects (age affects the peculiarity of the course of infection, which is directly related to inflammation etc). Age-related changes in the transformation zone can affect the characteristics of the course of HPV infection (56), being in complex interactions with the activity of representatives of the cervical microbiome (57). The menstrual cycle significantly affects the cervical transcriptome (58). Furthermore, MAL and TMPRSS4 were noted in this work as differentially expressed genes in the late secretory phase. However, patients with this cycle phase were not included in our sample (based on inclusion criterion number 2 from subsection 2.2). Another possible problem is that a cervical smear may contain not only epithelial cells but also immune system cells expressing some mRNAs from a selected set, which, depending on the level of immune infiltration, may affect their integral assessments by a molecular test. However, for all the mRNAs we selected, cancer-related expression changes have been confirmed in several studies, including the current one. Further clinical validation of the developed method on larger sample sets will allow more accurate conclusions to be drawn about the influence of the above-mentioned or other factors on the accuracy of the molecular testing result.
Thus, in this paper, we propose a method for effective detection of cervical lesions at a ≥HSIL stage (ROC AUC 0.935, sensitivity 89.7%, specificity 87.6%) via an RT-qPCR assay of five cellular mRNAs with oppositely deregulated expression in cytological preparations. Among its potential advantages are high throughput of the technique and convenience, resulting from multiplex analysis of targets and the use of lyophilized ready-for-use PCR mixes, adaptability to various clinical tasks (diagnostic strategies) owing to the adjustable cutoff for the analyzed semiquantitative characteristics, as well as the ability to conduct the analysis in parallel with HPV testing (in the same laboratory, by the same operator, on a single instrument and reagent base). Such a test can be useful at different stages of cervical examination, both in primary screening (in combination with cytology, to confirm its results, or with HPV testing, for risk stratification) and as an auxiliary tool for additional examinations in intermediate and controversial cases (LSIL, HSIL, and probably ASCUS), in well-equipped laboratories where RT-qPCR is available. In the future, for introduction into clinical practice, the method requires i) extended validation within a clinical study on a large number of cytological smears with different diagnoses (including ASCUS) as well as ii) resolving issues related to the heterogeneity of cells in cytological smears and identifying the gray zone.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by Ethics Committee of federal government-funded institution N.N. Petrov National Medical Research Center of Oncology (extract 11/31 from protocol №3 from 16.02.2023; Saint Petersburg, Russia). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
AA: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing. MI: Conceptualization, Project administration, Writing – original draft, Writing – review & editing. ST: Data curation, Funding acquisition, Investigation, Writing – review & editing. VD: Data curation, Investigation, Writing – review & editing. SK: Resources, Writing – review & editing. AS: Resources, Writing – review & editing. NA: Resources, Writing – review & editing. AM: Conceptualization, Resources, Writing – review & editing. SG: Data curation, Software, Writing – review & editing. EA: Funding acquisition, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was funded by the Russian Science Foundation, project No. 20-14-00074-P. The reagents used were provided by AO Vector-Best. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article, or the decision to submit it for publication.
Conflict of interest
AA, MI, ST, VD, SG and EA are employed by AO Vector-Best.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1491737/full#supplementary-material
References
1. Wright TC, Stoler MH, Behrens CM, Sharma A, Sharma K, Apple R. Interlaboratory variation in the performance of liquid-based cytology: Insights from the ATHENA trial. Int J Cancer. (2014) 134:1835–43. doi: 10.1002/IJC.28514
2. Kelly H, Jaafar I, Chung M, Michelow P, Greene S, Strickler H, et al. Diagnostic accuracy of cervical cancer screening strategies for high-grade cervical intraepithelial neoplasia (CIN2+/CIN3+) among women living with HIV: A systematic review and meta-analysis. EClinicalMedicine. (2022) 53:101645. doi: 10.1016/J.ECLINM.2022.101645
3. Bigras G, Wilson J, Russell L, Johnson G, Morel D, Saddik M. Interobserver concordance in the assessment of features used for the diagnosis of cervical atypical squamous cells and squamous intraepithelial lesions (ASC-US, ASC-H, LSIL and HSIL). Cytopathology. (2013) 24:44–51. doi: 10.1111/J.1365-2303.2011.00930.X
4. Koliopoulos G, Nyaga VN, Santesso N, Bryant A, Martin-Hirsch PPL, Mustafa RA, et al. Cytology versus HPV testing for cervical cancer screening in the general population. Cochrane Database Syst Rev. (2017) 2017:CD008587. doi: 10.1002/14651858.CD008587.PUB2
5. Cuzick J, Clavel C, Petry KU, Meijer CJLM, Hoyer H, Ratnam S, et al. Overview of the European and North American studies on HPV testing in primary cervical cancer screening. Int J Cancer. (2006) 119:1095–101. doi: 10.1002/IJC.21955
6. Goodman A. HPV testing as a screen for cervical cancer. BMJ. (2015) 350:h2372. doi: 10.1136/BMJ.H2372
7. Hwang H, Follen M, Guillaud M, Scheurer M, MacAulay C, Staerkel GA, et al. Cervical cytology reproducibility and associated clinical and demographic factors. Diagn Cytopathol. (2020) 48:35. doi: 10.1002/DC.24325
8. Sørbye SW, Suhrke P, Revå BW, Berland J, Maurseth RJ, Al-Shibli K. Accuracy of cervical cytology: comparison of diagnoses of 100 Pap smears read by four pathologists at three hospitals in Norway. BMC Clin Pathol. (2017) 17:18. doi: 10.1186/S12907-017-0058-8
9. Monsonego J, Bohbot JM, Pollini G, Krawec C, Vincent C, Merignargues I, et al. Performance of the Roche AMPLICOR human papillomavirus (HPV) test in prediction of cervical intraepithelial neoplasia (CIN) in women with abnormal PAP smear. Gynecol Oncol. (2005) 99:160–8. doi: 10.1016/J.YGYNO.2005.05.030
10. Kang M, Ha SY, Cho HY, Chung DH, Kim NR, An J, et al. Comparison of papanicolaou smear and human papillomavirus (HPV) test as cervical screening tools: can we rely on HPV test alone as a screening method? An 11-year retrospective experience at a single institution. J Pathol Transl Med. (2020) 54:112–8. doi: 10.4132/JPTM.2019.11.29
11. Fokom-Domgue J, Combescure C, Fokom-Defo V, Tebeu PM, Vassilakos P, Kengne AP, et al. Performance of alternative strategies for primary cervical cancer screening in sub-Saharan Africa: systematic review and meta-analysis of diagnostic test accuracy studies. BMJ. (2015) 351:h3084. doi: 10.1136/BMJ.H3084
12. Cuzick J, Cadman L, Mesher D, Austin J, Ashdown-Barr L, Ho L, et al. Comparing the performance of six human papillomavirus tests in a screening population. Br J Cancer. (2013) 108:908. doi: 10.1038/BJC.2013.22
13. Cuzick J, Arbyn M, Sankaranarayanan R, Tsu V, Ronco G, Mayrand MH, et al. Overview of human papillomavirus-based and other novel options for cervical cancer screening in developed and developing countries. Vaccine. (2008) 26 Suppl 10:K29–41. doi: 10.1016/J.VACCINE.2008.06.019
14. Chan CK, Aimagambetova G, Ukybassova T, Kongrtay K, Azizan A. Human papillomavirus infection and cervical cancer: epidemiology, screening, and vaccination—Review of current perspectives. J Oncol. (2019) 2019:3257939. doi: 10.1155/2019/3257939
15. Xing B, Guo J, Sheng Y, Wu G, Zhao Y. Human papillomavirus-negative cervical cancer: A comprehensive review. Front Oncol. (2020) 10:606335. doi: 10.3389/FONC.2020.606335
16. Bogani G, Sopracordevole F, Di Donato V, Ciavattini A, Ghelardi A, Lopez S, et al. High-risk HPV-positive and -negative high-grade cervical dysplasia: Analysis of 5-year outcomes. Gynecol Oncol. (2021) 161:173–8. doi: 10.1016/J.YGYNO.2021.01.020
17. Wentzensen N, Schiffman M, Palmer T, Arbyn M. Triage of HPV positive women in cervical cancer screening. J Clin Virol. (2016) 76:S49. doi: 10.1016/J.JCV.2015.11.015
18. Onyango CG, Ogonda L, Guyah B, Shiluli C, Ganda G, Orang’o OE, et al. Novel biomarkers with promising benefits for diagnosis of cervical neoplasia: a systematic review. Infect Agent Cancer. (2020) 15:68. doi: 10.1186/S13027-020-00335-2
19. Molina MA, Carosi Diatricch L, Castany Quintana M, Melchers WJG, Andralojc KM. Cervical cancer risk profiling: molecular biomarkers predicting the outcome of hrHPV infection. Expert Rev Mol Diagn. (2020) 20:1099–120. doi: 10.1080/14737159.2020.1835472
20. Chen S, Yang X, Yu C, Zhou W, Xia Q, Liu Y, et al. The potential of circRNA as a novel diagnostic biomarker in cervical cancer. J Oncol. (2021) 2021:5529486. doi: 10.1155/2021/5529486
21. He J, Huang B, Zhang K, Liu M, Xu T. Long non-coding RNA in cervical cancer: From biology to therapeutic opportunity. Biomedicine Pharmacotherapy. (2020) 127:110209. doi: 10.1016/J.BIOPHA.2020.110209
22. Peng L, Yuan X, Jiang B, Tang Z, Li GC. LncRNAs: key players and novel insights into cervical cancer. Tumor Biol. (2016) 37:2779–88. doi: 10.1007/S13277-015-4663-9/METRICS
23. Curty G, de Carvalho PS, Soares MA. The role of the cervicovaginal microbiome on the genesis and as a biomarker of premalignant cervical intraepithelial neoplasia and invasive cervical cancer. Int J Mol Sci. (2020) 21:222. doi: 10.3390/IJMS21010222
24. Olivas AD, Barroeta JE, Lastra RR. Overview of ancillary techniques in cervical cytology. Acta Cytol. (2023) 67:119–28. doi: 10.1159/000528931
25. Ivanov MK, Titov SE, Glushkov SA, Dzyubenko VV, Malek AV, Arkhangelskaya PA, et al. Detection of high-grade neoplasia in air-dried cervical PAP smears by a microRNA-based classifier. Oncol Rep. (2018) 39:1099. doi: 10.3892/OR.2018.6214
26. Ivanov MK, Titov SE, Dzyubenko VV, Glushkov SA, Krasilnikov SE, Mansurova AS, et al. Detection of Cervical Lesions and Cancer in Air-Dried Cytologic Smears by Combined Analysis of mRNA and miRNA Expression Levels. J Mol Diagnostics. (2021) 23:541–54. doi: 10.1016/j.jmoldx.2021.01.016
27. Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2–ΔΔCT method. Methods. (2001) 25:402–8. doi: 10.1006/METH.2001.1262
28. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in python (2012). Available online at: http://arxiv.org/abs/1201.0490 (Accessed January 19, 2024).
29. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. (2020) 17:261–72. doi: 10.1038/S41592-019-0686-2
30. Ivanov MK, Brenner EV, Hodkevich AA, Dzyubenko VV, Krasilnikov SE, Mansurova AS, et al. Cervicovaginal-microbiome analysis by 16S sequencing and real-time PCR in patients from novosibirsk (Russia) with cervical lesions and several years after cancer treatment. Diagnostics. (2023) 13:140. doi: 10.3390/DIAGNOSTICS13010140/S1
31. Güzel C, van Sten-van’t Hoff J, de Kok IMCM, Govorukhina NI, Boychenko A, Luider TM, et al. Molecular markers for cervical cancer screening. Expert Rev Proteomics. (2021) 18:675–91. doi: 10.1080/14789450.2021.1980387
32. Nedjai B, Reuter C, Ahmad A, Banwait R, Warman R, Carton J, et al. Molecular progression to cervical precancer, epigenetic switch or sequential model? Int J Cancer. (2018) 143:1720. doi: 10.1002/IJC.31549
33. Ikenberg H, Bergeron C, Schmidt D, Griesser H, Alameda F, Angeloni C, et al. Screening for cervical cancer precursors with p16/ki-67 dual-stained cytology: results of the PALMS study. JNCI J Natl Cancer Institute. (2013) 105:1550. doi: 10.1093/JNCI/DJT235
34. Schmitz M, Eichelkraut K, Schmidt D, Zeiser I, Hilal Z, Tettenborn Z, et al. Performance of a DNA methylation marker panel using liquid-based cervical scrapes to detect cervical cancer and its precancerous stages. BMC Cancer. (2018) 18:1197. doi: 10.1186/S12885-018-5125-8
35. Schreiberhuber L, Barrett JE, Wang J, Redl E, Herzog C, Vavourakis CD, et al. Cervical cancer screening using DNA methylation triage in a real-world population. Nat Med. (2024) 30(8):2251–7. doi: 10.1038/s41591-024-03014-6
36. Dippmann C, Schmitz M, Wunsch K, Schütze S, Beer K, Greinke C, et al. Triage of hrHPV-positive women: comparison of two commercial methylation-specific PCR assays. Clin Epigenet. (2020) 12:171. doi: 10.1186/S13148-020-00963-W
37. Kniazeva M, Zabegina L, Shalaev A, Smirnova O, Lavrinovich O, Berlev I, et al. NOVAprep-miR-cervix: new method for evaluation of cervical dysplasia severity based on analysis of six miRNAs. Int J Mol Sci. (2023) 24:24. doi: 10.3390/IJMS24119114/S1
38. Del Pino M, Svanholm-Barrie C, Torné A, Marimon L, Gaber J, Sagasta A, et al. MRNA biomarker detection in liquid-based cytology: A new approach in the prevention of cervical cancer. Modern Pathol. (2015) 28:312–20. doi: 10.1038/MODPATHOL.2014.106
39. Wang S, Chen X. Identification of potential biomarkers in cervical cancer with combined public mRNA and miRNA expression microarray data analysis. Oncol Lett. (2018) 16:5200. doi: 10.3892/OL.2018.9323
40. Wong YF, Cheung TH, Tsao GSW, Lo KWK, Yim SF, Wang VW, et al. Genome-wide gene expression profiling of cervical cancer in Hong Kong women by oligonucleotide microarray. Int J Cancer. (2006) 118:2461–9. doi: 10.1002/IJC.21660
41. Diéguez NC, Martí C, Rodríguiez-Trujillo A, Barnadas E, Torné A, del Ordi J, et al. Detection of mRNA of CDKN2A, MKi67 and TOP2A in liquid-based cytology as biomarker of high grade lesions of uterine cervix. Int J Gynecologic Cancer. (2019) 29:A36. doi: 10.1136/IJGC-2019-ESGO.39
42. Saffi Junior MC, Duarte I da S, Brito RB de O, Prado GG, Makabe S, Dellê H, et al. Correlation between indoleamine 2,3 dioxygenase mRNA and CDKN2A/p16 mRNA: a combined strategy to cervical cancer diagnosis. Med Oncol. (2016) 33:132. doi: 10.1007/S12032-016-0844-7
43. Rokutan-Kurata M, Minamiguchi S, Kataoka TR, Abiko K, Mandai M, Haga H. Uterine cervical squamous cell carcinoma without p16 (CDKN2A) expression: Heterogeneous causes of an unusual immunophenotype. Pathol Int. (2020) 70:413–21. doi: 10.1111/PIN.12930
44. Yang X, Da M, Zhang W, Qi Q, Zhang C, Han S. Role of Lactobacillus in cervical cancer. Cancer Manag Res. (2018) 10:1219–29. doi: 10.2147/CMAR.S165228
45. Dai W, Du H, Li S, Wu R. Cervicovaginal microbiome factors in clearance of human papillomavirus infection. Front Oncol. (2021) 11:722639. doi: 10.3389/FONC.2021.722639
46. Audirac-Chalifour A, Torres-Poveda K, Bahena-Román M, Téllez-Sosa J, Martínez-Barnetche J, Cortina-Ceballos B, et al. Cervical microbiome and cytokine profile at various stages of cervical cancer: A pilot study. PloS One. (2016) 11:e0153274. doi: 10.1371/JOURNAL.PONE.0153274
47. Łaniewski P, Barnes D, Goulder A, Cui H, Roe DJ, Chase DM, et al. Linking cervicovaginal immune signatures, HPV and microbiota composition in cervical carcinogenesis in non-Hispanic and Hispanic women. Sci Rep. (2018) 8:7593. doi: 10.1038/S41598-018-25879-7
48. Łaniewski P, Cui H, Roe DJ, Barnes D, Goulder A, Monk BJ, et al. Features of the cervicovaginal microenvironment drive cancer biomarker signatures in patients across cervical carcinogenesis. Sci Rep. (2019) 9:7333. doi: 10.1038/S41598-019-43849-5
49. Molina MA, Leenders WPJ, Huynen MA, Melchers WJG, Andralojc KM. Temporal composition of the cervicovaginal microbiome associates with hrHPV infection outcomes in a longitudinal study. BMC Infect Dis. (2024) 24:552. doi: 10.1186/S12879-024-09455-1
50. Dong B, Sun P, Ruan G, Huang W, Mao X, Kang Y, et al. Type-specific high-risk human papillomavirus viral load as a viable triage indicator for high-grade squamous intraepithelial lesion: a nested case- control study. Cancer Manag Res. (2018) 10:4839–51. doi: 10.2147/CMAR.S179724
51. Zhou Y, Shi X, Liu J, Zhang L. Correlation between human papillomavirus viral load and cervical lesions classification: A review of current research. Front Med (Lausanne). (2023) 10:1111269. doi: 10.3389/FMED.2023.1111269
52. Hashim D, Engesæter B, Baadstrand Skare G, Castle PE, Bjørge T, Tropé A, et al. Real-world data on cervical cancer risk stratification by cytology and HPV genotype to inform the management of HPV-positive women in routine cervical screening. Br J Cancer. (2020) 122:1715–23. doi: 10.1038/S41416-020-0790-1
53. Demarco M, Hyun N, Carter-Pokras O, Raine-Bennett TR, Cheung L, Chen X, et al. A study of type-specific HPV natural history and implications for contemporary cervical cancer screening programs. EClinicalMedicine. (2020) 22:100293. doi: 10.1016/J.ECLINM.2020.100293
54. Su P, Ma J, Yu L, Tang S, Sun P. Clinical significance of extended high-risk human papillomavirus genotyping and viral load in cervical cancer and precancerous lesions. Gynecology Obstetrics Clin Med. (2023) 3:22–9. doi: 10.1016/J.GOCM.2023.01.001
55. Von Keyserling H, Kühn W, Schneider A, Bergmann T, Kaufmann AM. p16INK4a and p14ARF mRNA expression in Pap smears is age-related. Mod Pathol. (2012) 25:465–70. doi: 10.1038/MODPATHOL.2011.179
56. Castle PE, Jeronimo J, Schiffman M, Herrero R, Rodríguez AC, Bratti MC, et al. Age-related changes of the cervix influence human papillomavirus type distribution. Cancer Res. (2006) 66:1218–24. doi: 10.1158/0008-5472.CAN-05-3066
57. López-Filloy M, Cortez FJ, Gheit T, Cruz y Cruz O, Cruz-Talonia F, Chávez-Torres M, et al. Altered vaginal microbiota composition correlates with human papillomavirus and mucosal immune responses in women with symptomatic cervical ectopy. Front Cell Infect Microbiol. (2022) 12:884272. doi: 10.3389/FCIMB.2022.884272
Keywords: cervical cancer, squamous intraepithelial lesion, cervical screening, cytological smear, molecular biomarker, cellular mRNA, CDKN2A, RT-qPCR
Citation: Artyukh AA, Ivanov MK, Titov SE, Dzyubenko VV, Krasilnikov SE, Shumeikina AO, Afanasev NA, Malek AV, Glushkov SA and Agletdinov EF (2025) Detection of cervical precancerous lesions and cancer by small-scale RT-qPCR analysis of oppositely deregulated mRNAs pairs in cytological smears. Front. Oncol. 14:1491737. doi: 10.3389/fonc.2024.1491737
Received: 05 September 2024; Accepted: 10 December 2024;
Published: 07 January 2025.
Edited by:
Kunqi Chen, Fujian Medical University, ChinaReviewed by:
Mingzhu Li, Peking University People’s Hospital, ChinaErdong Wei, University of Minnesota Medical Center, United States
Copyright © 2025 Artyukh, Ivanov, Titov, Dzyubenko, Krasilnikov, Shumeikina, Afanasev, Malek, Glushkov and Agletdinov. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Anastasia A. Artyukh, QXJ0eXVraEB2ZWN0b3ItYmVzdC5ydQ==; Mikhail K. Ivanov, aXZhbm92bWtAdmVjdG9yLWJlc3QucnU=