From phenotypical investigation to RNA-sequencing for gene expression analysis: A workflow for single and pooled rare cells

Rossi, Tania; Angeli, Davide; Martinelli, Giovanni; Fabbri, Francesco; Gallerani, Giulia

doi:10.3389/fgene.2022.1012191

ORIGINAL RESEARCH article

Front. Genet., 14 November 2022

Sec. RNA

Volume 13 - 2022 | https://doi.org/10.3389/fgene.2022.1012191

This article is part of the Research TopicWomen in RNA: 2022View all 4 articles

From phenotypical investigation to RNA-sequencing for gene expression analysis: A workflow for single and pooled rare cells

Tania Rossi¹*

Davide Angeli²

Giovanni Martinelli³

Francesco Fabbri¹^†

Giulia Gallerani¹^†

¹Biosciences Laboratory, IRCCS Istituto Romagnolo per lo Studio dei Tumori (IRST) “Dino Amadori”, Meldola, Italy
²Unit of Biostatistics and Clinical Trials, IRCCS Istituto Romagnolo per lo Studio dei Tumori (IRST) “Dino Amadori”, Meldola, Italy
³Scientific Directorate, IRCCS Istituto Romagnolo per lo Studio dei Tumori (IRST) “Dino Amadori”, Meldola, Italy

Combining phenotypical and molecular characterization of rare cells is challenging due to their scarcity and difficult handling. In oncology, circulating tumor cells (CTCs) are considered among the most important rare cell populations. Their phenotypic and molecular characterization is necessary to define the molecular mechanisms underlying their metastatic potential. Several approaches that require cell fixation make difficult downstream molecular investigations on RNA. Conversely, the DEPArray technology allows phenotypic analysis and handling of both fixed and unfixed cells, enabling a wider range of applications. Here, we describe an experimental workflow that allows the transcriptomic investigation of single and pooled OE33 cells undergone to DEPArray analysis and recovery. In addition, cells were tested at different conditions (unfixed, CellSearch fixative (CSF)- and ethanol (EtOH)-fixed cells). In a forward-looking perspective, this workflow will pave the way for novel strategies to characterize gene expression profiles of rare cells, both single-cell and low-resolution input.

Introduction

In recent years, advances in the development of low-input RNA-sequencing protocols have enabled the characterization of rare cells (Jindal et al., 2018; Nguyen et al., 2018; Rossi and Zamarchi, 2019; Negishi et al., 2022). Circulating tumor cells (CTCs) represent a typical case in point of rare cells in cancer (Yang et al., 2019; Rossi et al., 2021a), and are often investigated through the gold standard enumeration method CellSearch after fixation for prognostic purposes. Another frequently used approach, consisting in the DEPArray platform, allows for the immunophenotypic analysis and handling of single fixed or unfixed cells by exploiting the dielectrophoretic principle. However, experimental workflows for transcriptomic analyses of CTCs analysed using both CellSearch and DEPArray are still lacking.

Herein, we describe an experimental workflow that allows for the analysis, from phenotypical to transcriptomic, of single cells and 10-cell pools. Our findings are pioneering and pave the way to new applications in the study of rare cells like CTCs.

Materials and methods

Study design

In this study, we tested the ability of the QIAseq UPX 3’ Transcriptome kit (QIAGEN, Germantown, MD, United States) to provide valuable gene expression data on cells isolated using the DEPArray NxT platform (Menarini Silicon Biosystems, Castel Maggiore, Italy). In particular, samples were tested based on (i) condition (CellSearch fixative (CSF) and ethanol (EtOH)-fixed cells, unfixed cells) and (ii) sample type (single cell, 10-cells pools, RNA).

Cell culture and fixation

OE33 commercial cells were cultured in RPMI 1640 + 2 mM Glutamine + 10% Foetal Bovine Serum (FBS; Gibco; Thermo Fisher Scientific, Inc., Waltham, MA, United States). For sample preparation, OE33 cells were dissociated through trypsinization, then cellular pellet was split in two for CSF and EtOH fixation, respectively.

CSF fixation was performed using the fixative contained in CellSave Preservative tubes. More specifically, fixation was performed with 100 µl of preservative diluted 1:10 in PBS1X at room temperature. EtOH fixation was performed by adding 200 µl of ice-cold PBS1X and 800 µl of ice-cold EtOH dropwise. Fixation was carried out on ice for 15 min. Successively, each fixed sample was washed with ice cold PBS1X and resuspended with 900 µl of ice-cold PBS1X, then split as follows: (i) 800 µl into a 1,5 ml tube, centrifugated and supernatant discarded and immediately stored at −80°C (for RNA extraction) and (ii) 100 µl into a 1,5 ml tube, centrifugated and supernatant discarded (for DEPArray analysis).

RNA extraction from cellular pellet

We performed RNA extraction from cellular pellets (CSF and EtOH fixed, unfixed) using the RNeasy Mini Kit (QIAGEN, Germantown, MD, United States) following the instruction provided by the manufacturer, including a DNA digestion step with DNAse. RNA was quantified by Spectrophotometer Nanodrop-ND-1000 (Thermo Fisher Scientific, Waltham, MA, United States) and stored at −80°C until downstream analysis.

DEPArray analysis

CSF- and EtOH-fixed samples for DEPArray analysis were stained with EpCAM-PE 1:10 (clone HEA-125; Miltenyi Biotech, Bergisch Gladbach, Germany), CKs-PE 1:10 (clone C11; Aczon, Bologna, Italy), cMET-APC 1:10 (clone 95106; R&D Systems, McKinley Pl NE, MN, United States) antibodies and Hoechst 33342 (1 µg/ml; Life Technologies, Carlsbad, CA, United States) for nuclear staining. Samples were resuspended in DEPArray™ buffer for fixed cells (Menarini Silicon Biosystems) for DEPArray analysis and sorting. For unfixed OE33 cells, samples were stained only with cell surface antigen EpCAM-PE 1:10 (clone HEA-125; Miltenyi Biotech) and SYTO-16 Green Fluorescent Nucleic Acid Stain (10 nM; Invitrogen, Waltham, MA, United States) and resuspended in RPMI 1640 medium (Gibco) + 10% FBS (Gibco). At this phase, we assessed staining quality and routability, intended as the percentage of the number of cells routable within the DEPArray cartridge compared to the number of identified cells. To this purpose, the DEPArray instrument at the opening of the CellBrowser™ automatically displays a subpopulation of cells trapped in the cages of the cartridge, thus suitable for routing (routable cells) according to the instrument’s manual. By contrast, un-routable events may include spurious events, cell debris, small cell fragments and cells with impaired cell membrane integrity. For downstream analyses, we isolated sixteen single cells and sixteen 10-cell pools for CSF and EtOH samples, and eight single cells and eight 10-cell pools for the unfixed sample. After recovery, each sample was subjected to PBS1X washing under a sterile hood, and volume reduction (to approximatively ∼2 µl) was performed automatically using VR NxT (Menarini Silicon Biosystems). Samples were immediately stored at −80°C until library preparation.

Library preparation and sequencing

Libraries were prepared using the QIAseq 3’ UPX Transcriptome kit (96-M, QIAGEN) starting from the above-mentioned single cells and 10-cell pools isolated using DEPArray, and 8 replicates of each RNA sample extracted from fixed (CSF and EtOH) and unfixed OE33 cell pellet. In addition, we included in library preparation the Xpress Ref Universal Total RNA (QIAGEN) (8 replicates) as a reference sample. Cell lysis was carried out directly on the 0,2 ml tube used for DEPArray recovery and under a sterile hood to avoid contaminations. Reactions were kept at room temperature for 30 min, and immediately stored at −80°C for 48 h after lysis. During the reverse transcription step, each sample was assigned a different Cell-ID for downstream demultiplexing and each RNA molecule was tagged with a unique molecular index (UMI), allowing to combine together all individually tagged cDNAs. Finally, libraries were checked for quality using the Agilent High Sensitivity DNA kit (Agilent, Waldbronn, Germany), while concentration was assessed using the QIAseq Library Quantification Assay kit (QIAGEN). After calculation of pM concentration, equimolar pools were prepared, denatured at 1 nM and loaded in a V3-150 cycles cartridge at 3pM concentration. Sequencing 100 × 27 was performed on MiSeq Sequencing system (Illumina Inc., San Diego, CA, United States) including custom primer for Read 2 provided with the kit, following the instructions of the manufacturer.

Bioinformatic analyses

In this study, bioinformatic and statistical analyses were conducted using the toolkit for NGS data CLC Genomics Workbench version 22.0.1 (QIAGEN), Biomedical analysis plugin. Briefly, raw fastq data generated by the MiSeq instrument were uploaded, and demultiplexing of Cell-IDs was carried out using the function “Analyze QIAseq Samples”. Demultiplexed reads were analyzed using the ready-to-use workflow “Quantify QIAseq UPX 3’”. In this step, reads are trimmed and mapped to targets to quantify gene expression based on merging and then counting UMIs. For differential gene expression analysis, gene expression tracks were prepared by comparing the transcriptomic profile of each sample to the reference XpressRef Universal Total RNA. Data were normalized using the global TMM (Trimmed Mean of M-Method) as suggested by the software for whole transcriptome RNA-sequencing. To identify differentially expressed genes, the fold change threshold was set at 1,5, and p-values < 0,05 computed using GLM model were considered statistically significant. Shapiro-Wilk test and histograms were prepared using GraphPad Prism 8.0.2.

Results

DEPArray analysis and sample preparation for sequencing

In our study, the DEPArray instrument was used for the phenotypic analysis and isolation of fixed (CSF and EtOH) and unfixed OE33 cells as single cells and 10-cell pools (Figure 1).

FIGURE 1

FIGURE 1. Schematic representation of the workflow of the present study.

At first instance, we aimed at evaluating cell routability for each sample we found that EtOH-fixed cells had the highest routability percentage (82%) compared to CSF-fixed cells (72%). Unfixed OE33 cells showed a routability percentage of 80%. Concerning staining, both fixed and unfixed cells had a proper quality, and no false negatives were detected (Figure 2).

FIGURE 2

FIGURE 2. DEPArray images of the most representative OE33 cells. For fixed cells, the DAPI channel was used for nuclear staining using Hoechst 33342, PE channel for epithelial tag [anti-EPCAM and anti-cytokeratins (CKs)], and APC channel for anti-c-MET staining. For unfixed cells, FITC channel was used for nuclear detection (SYTO-16) and PE channel for anti-EpCAM staining.

Comparison of mapping efficiency

Firstly, to obtain preliminary information concerning the quality of our data, we investigated mapping efficiency among samples. In Table 1, we report for each sample type and condition the average raw total reads, percentage of reads mapping to genome and to genes and descriptive statistics. To examine the distribution of replicates in each sample we applied the Shapiro-Wilk test, where p-values < 0.05 indicates that data significantly deviate from a normal distribution (Shapiro and Wilk, 1965) (Table 1). Due to the data variations among groups, we could not perform further statistical analysis (i.e., statistical comparison).

TABLE 1

TABLE 1. Report of average total and mapped to genome reads for each sample type and condition.

At first instance, we found that the number of generated reads varied among samples. In the EtOH-fixed samples, except for the RNA samples, we obtained the lowest numbers of total reads, probably as a consequence of the effect of EtOH fixation in the qualitative and quantitative assessment of libraries.

The XPress Ref Universal Total RNA, which was used as a reference, displayed the highest average percentage of reads mapped (90,43%). Similar data were observed for RNA samples extracted from unfixed (88,74%) and CSF- and EtOH- fixed (85,11% and 88,69% respectively) OE33 cell pellets. Concerning the mapping efficiency to coding regions, we observed reproducible values among the RNA samples.

In 10-cell pools, the fixation using the CSF fixative appeared more efficient compared to the other conditions. In fact, we found that the CSF-fixed samples showed the highest percentage of average mapped reads (67,64%) compared to unfixed (49,59%) and EtOH-fixed (39,81%) OE33 pools. While the mapping efficiency of reads to coding regions was comparable between unfixed and CSF-fixed 10-cell pools, EtOH-fixed 10-cell pools displayed the worst results (46,53%). By contrast, in all the single cell samples the average percentage of mapped reads, as well as the percentage of reads mapped to coding regions, appeared nearly comparable regardless of the fixation conditions.

Globally, our findings indicate that the application of fixatives does not negatively influence the sequencing quality in terms of reads mapping to the genome in RNA samples, while CSF fixation seems to save a high number of reads by mapping to the genome in 10-cell pool samples. Concerning single cells, mapping efficiency is lower compared to other sample types, regardless of the condition.

Comparison of features detected

To get an estimation of the genes detected, we investigated the amount of genes mapped by at least one read. To this purpose, we filtered genes having a value greater than 0 in the Total Gene Reads column (the number of reads that are mapped to a gene) from the gene expression track provided by CLC Genomics Workbench (Table 2).

TABLE 2

TABLE 2. Number of features having > 0 mapping reads for each sample type and condition.

Among the RNA samples, we found that the unfixed sample had the highest average number of features detected, whereas both CSF- and EtOH-fixed RNA samples showed comparable values. However, this discrepancy could be imputable to the higher number of total reads used for the unfixed RNA samples, as previously reported in Table 1.

In the 10-cell pool samples, unfixed samples yielded as expected the higher mean amount of features detected (1.567 genes). At the same time, by comparing the results obtained from fixed samples, while EtOH fixed samples showed the lowest values (mean 376,50 genes) we found a higher number of detected features in 10-cell CSF fixed pools (mean 947,7 genes).

Again, unfixed single cells further demonstrated a higher number of mean features detected (796,1 genes). In fixed samples, our findings highlight a comparable amount of features detected regardless of the fixative used, having Single cell CSF- and EtOH-fixed samples 343,7 and 313,9 genes detected, respectively.

In general, our data highlight that the lack of fixation provides the best results in terms of features detected in all the sample types (RNA, 10-cell pools and single cells). At the same time, while the fixation seems not to impact the quality of the sequencing in terms of detected features in RNA samples, EtOH fixation is the worst in terms of detected features when working with 10-cell pools and single cells.

Genes detected as expressed

In order to further investigate gene expression, we binned the annotated genes in 5 groups based on their Reads Per Kilobase per Million reads (RPKM): (a) 1 < RPKM ≤ 100, (b) 100 < RPKM ≤ 1.000, (c) 1.000 < RPKM ≤ 10.000 and (d) RPKM > 10.000 (Table 3).

TABLE 3

TABLE 3. Number of genes detected binned by reads per kilobase per million reads (RPKM).

The RPKM data was obtained from the Gene Expression Track generated as output by the CLC Genomics Workbench.

As expected, the RNA samples displayed a higher amount of genes expressed at low levels compared to the other samples. However, although the number of genes is comparable, the RNA OE33 EtOH fixed sample tends to present a lower number of genes in each group compared to the CSF-fixed and the unfixed RNA samples. In the 10-cell pools samples, the analysis failed to detect genes with 1 < RPKM≤100, and genes with 100 < RPKM ≤ 1.000 were identified only in unfixed and CSF-fixed samples. Again, the EtOH fixed sample demonstrated a lower efficiency compared to the other samples. Conversely, the unfixed and CSF-fixed sample showed comparable data for genes having 1.000 < RPKM ≤ 10.000 and RPKM > 10.000 (Figure 3).

FIGURE 3

FIGURE 3. Detected genes binned by reads per kilobase per million reads (RPKM) in 10-cell pools and single cells based on their condition.

Finally, in single cell samples genes with 100 < RPKM ≤ 1.000 were found only in unfixed samples, which also yielded the higher amount of genes with 1.000 < RPKM ≤ 10.000 compared to the fixed samples. Simultaneously, we observed that all the samples shared a comparable number of highly expressed genes with RPKM > 10.000.

Globally, these findings demonstrate that the lack of fixation is the optimal condition among all the tested conditions for gene expression analysis, while EtOH fixation proved to be the worst condition.

Similarity of gene expression profiles

Besides getting insights in the sequencing quality in terms of features detected and genes identified as expressed, we aimed at assessing the level of similarity of gene expression profiles among the samples regardless of their RPKM. To this purpose, gene expression analyses were performed by normalizing the transcriptomic profile of each sample for the XPress Ref Universal Total RNA (Reference) to generate gene expression profiles, intended as a list of upregulated and downregulated genes compared to the Reference. Then, we assessed the level of similarity of expression profiles in function of the fixative condition for each samples type (RNA, 10-cell pool, single cell) by evaluating the generated Venn diagrams (Figure 4).

FIGURE 4

FIGURE 4. Venn diagrams of differentially expressed genes in (A) RNA samples, (B) 10-cell pools, (C) single cells, (D) unfixed RNA, 10-cell pools and single cells, (E) unfixed RNA and CSF-fixed 10-cell pools and single cells, (F) unfixed RNA and EtOH-fixed 10-cell pools and single cells.

First, we found that among the RNA samples, the lack of fixation allowed the identification of an increased number of differentially expressed genes compared to the reference (6.367 genes). In addition, 2.193 genes emerged as differentially expressed among the RNA samples regardless of their fixation condition, although CSF-fixed RNA displayed a higher amount of differentially expressed genes shared with the unfixed RNA (3.783 genes) compared to EtOH-fixed RNA (2.742 genes; Figure 4A). Concerning 10-cell pool samples, CSF-fixation provided a higher number of differentially expressed genes (2.052) compared to unfixed (1.599) and EtOH-fixed (388) samples. Overall, 157 genes were shared by all the samples, regardless of their condition, while 1.005 were differentially expressed genes in both CSF-fixed and unfixed 10-cell pools. 264 genes were shared between EtOH-fixed and unfixed 10-cell pools (Figure 4B). Concerning single cells, unfixed samples identified the higher number of differentially expressed genes (642). In CSF- and EtOH-fixed single cells we found respectively 147 and 89 differentially expressed genes in common with unfixed single cells, while 51 genes were shared regardless of their condition (Figure 4C).

Then, we aimed at identifying the fixative condition that allows the achievement of the most reliable OE33 gene expression profile. To this purpose, we assumed the gene expression profile of RNA from OE33 cells unfixed as the most reliable and examined the resulting Venn diagrams (Figures 4D–F). Merging data from unfixed samples highlights that RNA, 10-cell pools and single cells samples share 273 differentially expressed genes (Figure 4D). The list of these genes is reported in Supplementary Table S1. By contrast, gene expression profiles comparison of RNA unfixed vs. CSF-fixed and EtOH-fixed samples resulted 183 (Figure 4E; Supplementary Table S2) and 68 (Figure 4F; Supplementary Table S3) genes, respectively. Interestingly, 10-cell pools CSF-fixed shared with RNA unfixed gene expression profile a higher number of genes (1.247) compared to 10-cell unfixed pools (1.096).

Globally, our findings highlight that transcriptomic profile of unfixed samples are the most reliable, in particular at single cell level. Concerning 10-cell pools, CSF fixation led to the most reliable gene expression profile, compared to the other condition.

Discussion

Up to now, the development of even more efficient RNA-sequencing techniques and protocols has revolutionized scientific research in the oncology field (Hong et al., 2020; Lei et al., 2021), especially at the resolution of a few down to a single cell. This approach is highly fascinating when applied to rare cells such as CTCs, as their number is often low and requires dedicated workflow for their RNA analysis (Hwang et al., 2018). However, despite the emerging technologies in CTC detection and isolation, this operation remains challenging. One of the most prominent approaches for CTC detection and recovery consists in the DEPArray technology, which proved to be a high-performance ally for the identification and recovery of CTCs and other rare cells by the expression of specific markers (Rossi et al., 2020; Rossi et al.,. 2021b; Gallerani et al., 2021;Rossi et al.,. 2022). Lovero et al. (2022) recently developed a targeted RNA-sequencing assay including a panel of 134 genes involved in metastasis process in DEPArray-isolated CTCs from stage IV breast cancer but no protocols for 3′ transcriptomic analyses through sequencing starting from DEPArray-isolated cells have been identified so far for gene expression purposes. In this study, we describe a protocol for the 3’ RNA-sequencing of OE33 cells isolated by DEPArray NxT platform as single or pooled cells at different conditions of fixation.

The 3’ RNA-sequencing approach is also exploited by the 10X Chromium platform, which is one of the most outstanding emerging technologies for single cell analysis. However, having this technique a capture rate of about 50% (Zheng et al., 2017) and being CTCs underrepresented compared to other cells in the blood (Ferreira et al., 2016), the 10X Chromium technology is not fully applicable to rare cells without a pre-enrichment step (Pauken et al., 2021). At the same time, considered that a high percentage of patients could not have CTCs, the 10X Chromium analysis could fail and be unnecessarily expensive. By contrast, although being a time-consuming procedure, the herein described workflow allows to discriminate positive and negative patients, ensuring to perform downstream analysis on phenotypically investigated CTCs, and to obtain RNA-sequencing data on pure CTCs.

Since one key point in the DEPArray cell recovery procedure is related to the ability of treated cells to be moved within the cartridge, we firstly tested cell quality, and consequently routability (Williamson et al., 2018). At first instance, we decided to include in our test the fixative contained within the CellSave preservative tubes, as they are routinely used for CTC enumeration with the CellSearch instrument. However, the formulation of this preservative is patented and unknown. In literature alcohol-based fixatives, such as methanol and EtOH, are largely used for transcriptomic analyses as their working principle is based on dehydration, thus avoiding chemical modification, and leading to the isolation of high-quality RNA (Alles et al., 2017; Channathodiyil and Houseley, 2021). Hence, EtOH-based fixation was included in our experiments. Based on our results, CSF-fixation negatively affects routability, and seems to have a stronger impact on cell quality. In fact, routability issues could be associated with the presence of cellular debris within the main chamber of the DEPArray cartridge as a consequence of CSF-fixation, making difficult for the instrument to calculate recovery paths. On the other hand, EtOH-fixed cells have a routability rate comparable to unfixed cells.

Next, we aimed at checking whether fixation of OE33 pellet have an impact on the quality of data obtained from the 3’ RNA-sequencing. Based on our results, cell fixation does not negatively impact on RNA quality in terms of mapping efficiency and number of features detected (genes with at least 1 mapping read). Moreover, RNA from fixed cells allows the detection of genes expressed at very low levels (with RPKM value comprised between 1 and 100), accordingly to data from unfixed cells-derived RNA. However, RNA from CSF-fixed cells rather than EtOH had a gene expression profile more similar to RNA from unfixed OE33 cells.

Considered the positive rates of fixed cell routability and the quality of sequencing data from fixed OE33 cells RNA, we decided to proceed with 3’ RNA library preparation starting from unfixed and fixed pooled and single cells, followed by sequencing and bioinformatic analyses through the toolkit for NGS data CLC Genomics Workbench.

Concerning 10-cell pools, our findings highlight that CSF-fixation seems to significantly guarantee a high percentage of mapping reads (67,64%) compared to EtOH (39,82%). In addition, the number of features detected in CSF-fixed pools is in accordance with data from unfixed 10-cell pools, although a fewer amount of less expressed genes was detected. By contrast, EtOH fixation demonstrated a poor efficiency in mapping efficiency and number of features detected, and their sequencing allowed only the detection of genes with a relatively high expression (RPKM>1.000). Again, the CSF-fixed 10-cell pools showed gene expression profiles closest to the unfixed 10-cell pools and RNA, further confirming the limited efficiency of EtOH fixation in this workflow. In addition, we observed that fixed 10-cell pools showed a higher number of genes with RPKM>10.000 compared to unfixed pools. It is known that fixation in some cases may induce expression changes (Kuzmin et al., 2014). In addition, Wang et al. (2021) observed some transcripts had higher expression in fixed cells rather than in unfixed, suggesting their enrichment during library preparation and data normalization.

Lastly, we found a deep discrepancy between results obtained from unfixed and fixed OE33 single cells compared to the other sample types. In fact, while CSF-fixation resulted in acceptable sequencing data in RNA and 10-cell pools, we found that both CSF and EtOH preservatives had equally poor efficiency in number of features detected, and detection of genes with low expression levels (RPKM < 1.000). On the other hand, although the mapping efficiency was comparable to fixed single cells, we observed an increased number of features detected in unfixed single OE33 cells, and improved detection of genes with low expression (100 < RPKM < 1.000). Again, the increased number of genes with RPKM>10.000 may be imputable to library preparation and data normalization (Wang et al., 2021). Further deepen analyses revealed that unfixed single cells shared the major part of their gene expression signature with matched unfixed cells-derived RNA.

Collectively, we found that unfixed cells can be phenotypically analysed and recovered by using DEPArray, and 3’ RNA-sequencing through our workflow provide reliable gene expression results. By contrast, while fixation using EtOH is discouraged, CSF-fixation is suitable in order to get gene expression data in 10-cell pools, but not single cells.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: GEO, GSE212814.

Author contributions

Conceptualization: TR. Methodology: TR and GG. Formal analysis: TR, DA, and GG. Investigation: TR and GG. Data curation: TR. Writing—Original Draft: TR and GG. Writing—Review and Editing: TR, DA, GM, FF, and FF. Visualization: TR and GG. Project administration: TR.

Acknowledgments

This work was partly supported thanks to the contribution of Ricerca Corrente by the Italian Ministry of Health within the research line “Precision, gender and ethnicity-based medicine and geroscience: genetic-molecular mechanisms in the development, characterization and treatment of tumors”. GG is currently working at Department of Experimental, Diagnostic and Specialty Medicine (DIMES), University of Bologna, Bologna, Italy.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.1012191/full#supplementary-material

References

Alles, J., Karaiskos, N., Praktiknjo, S. D., Grosswendt, S., Wahle, P., Ruffault, P.-L., et al. (2017). Cell fixation and preservation for droplet-based single-cell transcriptomics. BMC Biol. 15, 44. doi:10.1186/s12915-017-0383-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Channathodiyil, P., and Houseley, J. (2021). Glyoxal fixation facilitates transcriptome analysis after antigen staining and cell sorting by flow cytometry. PLOS ONE 16, e0240769. doi:10.1371/journal.pone.0240769

PubMed Abstract | CrossRef Full Text | Google Scholar

Ferreira, M. M., Ramani, V. C., and Jeffrey, S. S. (2016). Circulating tumor cell technologies. Mol. Oncol. 10 (3), 374–394. doi:10.1016/j.molonc.2016.01.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Gallerani, G., Rossi, T., Valgiusti, M., Angeli, D., Fici, P., De Fanti, S., et al. (2021). Single-cell NGS-based analysis of copy number alterations reveals new insights in circulating tumor cells persistence in early-stage breast cancer. Cancers 13, E2490. doi:10.3390/cancers12092490

CrossRef Full Text | Google Scholar

Hong, M., Tao, S., Zhang, L., Diao, L.-T., Huang, X., Huang, S., et al. (2020). RNA sequencing: New technologies and applications in cancer research. J. Hematol. Oncol. 13, 166. doi:10.1186/s13045-020-01005-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Hwang, B., Lee, J. H., and Bang, D. (2018). Single-cell RNA sequencing technologies and bioinformatics pipelines. Exp. Mol. Med. 50 (8), 96–14. doi:10.1038/s12276-018-0071-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Jindal, A., Gupta, P., Jayadeva, , , and Sengupta, D. (2018). Discovery of rare cells from voluminous single cell expression data. Nat. Commun. 9 (1), 4719. doi:10.1038/s41467-018-07234-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuzmin, A. N., Pliss, A., and Prasad, P. N. (2014). Changes in biomolecular profile in a single nucleolus during cell fixation. Anal. Chem. 86 (21), 10909–10916. doi:10.1021/ac503172b

PubMed Abstract | CrossRef Full Text | Google Scholar

Lei, Y., Tang, R., Xu, J., Wang, W., Zhang, B., Liu, J., et al. (2021). Applications of single-cell sequencing in cancer research: Progress and perspectives. J. Hematol. Oncol. 14, 91. doi:10.1186/s13045-021-01105-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Lovero, D., D’Oronzo, S., Palmirotta, R., Cafforio, P., Brown, J., Wood, S., et al. (2022). Correlation between targeted RNAseq signature of breast cancer CTCs and onset of bone-only metastases. Br. J. Cancer 126, 419–429. doi:10.1038/s41416-021-01481-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Negishi, R., Yamakawa, H., Kobayashi, T., Horikawa, M., Shimoyama, T., Koizumi, F., et al. (2022). Transcriptomic profiling of single circulating tumor cells provides insight into human metastatic gastric cancer. Commun. Biol. 5, 20. doi:10.1038/s42003-021-02937-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Nguyen, A., Khoo, W. H., Moran, I., Croucher, P. I., and Phan, T. G. (2018). Single cell RNA sequencing of rare immune cell populations. Front. Immunol. 9, 1553. doi:10.3389/fimmu.2018.01553

PubMed Abstract | CrossRef Full Text | Google Scholar

Pauken, C. M., Kenney, S. R., Brayer, K. J., Guo, Y., Brown-Glaberman, U. A., and Marchetti, D. (2021). Heterogeneity of circulating tumor cell neoplastic subpopulations outlined by single-cell transcriptomics. Cancers 13 (19), 4885. doi:10.3390/cancers13194885

PubMed Abstract | CrossRef Full Text | Google Scholar

Rossi, E., and Zamarchi, R. (2019). Single-cell analysis of circulating tumor cells: How far have we come in the -omics era? Front. Genet. 10, 958. doi:10.3389/fgene.2019.00958

PubMed Abstract | CrossRef Full Text | Google Scholar

Rossi, T., Angeli, D., Tebaldi, M., Fici, P., Rossi, E., Rocca, A., et al. (2022). Dissecting molecular heterogeneity of circulating tumor cells (CTCs) from metastatic breast cancer patients through copy number aberration (CNA) and single nucleotide variant (SNV) single cell analysis. Cancers 14, 3925. doi:10.3390/cancers14163925

PubMed Abstract | CrossRef Full Text | Google Scholar

Rossi, T., Gallerani, G., Angeli, D., Cocchi, C., Bandini, E., Fici, P., et al. (2020). Single-cell NGS-based analysis of copy number alterations reveals new insights in circulating tumor cells persistence in early-stage breast cancer. Cancers 12, 2490. doi:10.3390/cancers12092490

PubMed Abstract | CrossRef Full Text | Google Scholar

Rossi, T., Gallerani, G., Martinelli, G., Maltoni, R., and Fabbri, F. (2021a). Circulating tumor cells as a tool to untangle the breast cancer heterogeneity issue. Biomedicines 9, 1242. doi:10.3390/biomedicines9091242

PubMed Abstract | CrossRef Full Text | Google Scholar

Rossi, T., Palleschi, M., Angeli, D., Tebaldi, M., Martinelli, G., Vannini, I., et al. (2021b). Case report: Analysis of circulating tumor cells in a triple negative spindle-cell metaplastic breast cancer patient. Front. Med. 8, 689895. doi:10.3389/fmed.2021.689895

PubMed Abstract | CrossRef Full Text | Google Scholar

Shapiro, S. S., and Wilk, M. B. (1965). An analysis of variance test for normality (complete samples). Biometrika 52 (3–4), 591–611. doi:10.1093/biomet/52.3-4.591

CrossRef Full Text | Google Scholar

Wang, X., Yu, L., and Wu, A. R. (2021). The effect of methanol fixation on single-cell RNA sequencing data. BMC genomics 22 (1), 420. doi:10.1186/s12864-021-07744-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Williamson, V. R., Laris, T. M., Romano, R., and Marciano, M. A. (2018). Enhanced DNA mixture deconvolution of sexual offense samples using the DEPArray^TM system. Forensic Sci. Int. Genet. 34, 265–276. doi:10.1016/j.fsigen.2018.03.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, C., Xia, B.-R., Jin, W.-L., and Lou, G. (2019). Circulating tumor cells in precision oncology: Clinical applications in liquid biopsy and 3D organoid model. Cancer Cell Int. 19, 341. doi:10.1186/s12935-019-1067-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng, G. X., Terry, J. M., Belgrader, P., Ryvkin, P., Bent, Z. W., Wilson, R., et al. (2017). Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049. doi:10.1038/ncomms14049

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: RNA-sequencing, NGS, single cell analysis, rare cells, deparray, gene expression, circulating tumor cells

Citation: Rossi T, Angeli D, Martinelli G, Fabbri F and Gallerani G (2022) From phenotypical investigation to RNA-sequencing for gene expression analysis: A workflow for single and pooled rare cells. Front. Genet. 13:1012191. doi: 10.3389/fgene.2022.1012191

Received: 05 August 2022; Accepted: 28 October 2022;
Published: 14 November 2022.

Edited by:

Sandra Marcia Muxel, University of São Paulo, Brazil

Reviewed by:

Yuanhang Liu, Mayo Clinic, United States
Clotilde Costa, Sanitary Research Institute Foundation of Santiago de Compostela, Spain

Copyright © 2022 Rossi, Angeli, Martinelli, Fabbri and Gallerani. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tania Rossi, dGFuaWEucm9zc2lAaXJzdC5lbXIuaXQ=

^†These authors have contributed equally to this work and share last authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.