Skip to main content

ORIGINAL RESEARCH article

Front. Cell Dev. Biol., 04 March 2022
Sec. Membrane Traffic and Organelle Dynamics
This article is part of the Research Topic Exosomes: Message in a Vesicle View all 17 articles

A Novel Tissue Atlas and Online Tool for the Interrogation of Small RNA Expression in Human Tissues and Biofluids

Eric Alsop&#x;Eric Alsop1Bessie Meechoovet&#x;Bessie Meechoovet1Robert KitchenRobert Kitchen2Thadryan SweeneyThadryan Sweeney2Thomas G. BeachThomas G. Beach3Geidy E. SerranoGeidy E. Serrano3Elizabeth HutchinsElizabeth Hutchins1Ionita GhiranIonita Ghiran4Rebecca ReimanRebecca Reiman1Michael SyringMichael Syring1Michael HsiehMichael Hsieh1Amanda Courtright-LimAmanda Courtright-Lim1Nedyalka ValkovNedyalka Valkov2Timothy G. WhitsettTimothy G. Whitsett1Jorge RakelaJorge Rakela5Paul PockrosPaul Pockros6Joel RozowskyJoel Rozowsky7Juan Gallego,,Juan Gallego8,9,10Matthew J. HuentelmanMatthew J. Huentelman1Ravi ShahRavi Shah2Peter NakajiPeter Nakaji11M. Yashar S. KalaniM. Yashar S. Kalani12Louise LaurentLouise Laurent13Saumya Das&#x;Saumya Das2Kendall Van Keuren-Jensen
&#x;Kendall Van Keuren-Jensen1*
  • 1Neurogenomics Division, The Translational Genomics Research Institute, Phoenix, AZ, United States
  • 2Cardiovascular Research Center, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States
  • 3Banner Sun Health Research Institute, Sun City, AZ, United States
  • 4Department of Medicine, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, MA, United States
  • 5Mayo Clinic, Scottsdale, AZ, United States
  • 6Division of Gastroenterology/Hepatology, Scripps Clinic, La Jolla, CA, United States
  • 7Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, United States
  • 8Institute for Behavioral Science, The Feinstein Institute for Medical Research, Manhasset, NY, United States
  • 9Division of Psychiatry Research, The Zucker Hillside Hospital, Glen Oaks, NY, United States
  • 10Department of Psychiatry, Zucker School of Medicine at Hofstra/Northwell, Hempstead, NY, United States
  • 11Department of Neurosurgery, Banner Health, Phoenix, AZ, United States
  • 12Department of Neurosurgery, St. John Medical Center, Tulsa, OK, United States
  • 13Department of Obstetrics, Gynecology, and Reproductive Sciences, University of California, San Diego, San Diego, CA, United States

One promising goal for utilizing the molecular information circulating in biofluids is the discovery of clinically useful biomarkers. Extracellular RNAs (exRNAs) are one of the most diverse classes of molecular cargo, easily assayed by sequencing and with expressions that rapidly change in response to subject status. Despite diverse exRNA cargo, most evaluations from biofluids have focused on small RNA sequencing and analysis, specifically on microRNAs (miRNAs). Another goal of characterizing circulating molecular information, is to correlate expression to injuries associated with specific tissues of origin. Biomarker candidates are often described as being specific, enriched in a particular tissue or associated with a disease process. Likewise, miRNA data is often reported to be specific, enriched for a tissue, without rigorous testing to support the claim. Here we provide a tissue atlas of small RNAs from 30 different tissues and three different blood cell types. We analyzed the tissues for enrichment of small RNA sequences and assessed their expression in biofluids: plasma, cerebrospinal fluid, urine, and saliva. We employed published data sets representing physiological (resting vs. acute exercise) and pathologic states (early- vs. late-stage liver fibrosis, and differential subtypes of stroke) to determine differential tissue-enriched small RNAs. We also developed an online tool that provides information about exRNA sequences found in different biofluids and tissues. The data can be used to better understand the various types of small RNA sequences in different tissues as well as their potential release into biofluids, which should help in the validation or design of biomarker studies.

Introduction

Extracellular RNAs (exRNAs) have been identified in every biofluid tested to-date (Yáñez-Mó et al., 2015; Yeri et al., 2017; Murillo et al., 2019; Srinivasan et al., 2019; Hulstaert et al., 2020) and are found within extracellular vesicles (EVs) and associated with RNA binding proteins or lipoprotein particles (Valadi et al., 2007; Skog et al., 2008; Arroyo et al., 2011; Turchinovich et al., 2011; Tabet et al., 2014). While longer RNAs (>200 nucleotides) can be detected in biofluids (Wei et al., 2017; Everaert et al., 2019; Hulstaert et al., 2020; Rodosthenous et al., 2020), most studies have focused on small RNA biotypes (<50 nucleotides) (Weiland et al., 2012; Max et al., 2018) for their potential as biomarkers (Witwer, 2015; Nik Mohamed Kamal and Shahidan, 2019). Each biofluid has a distinct small RNA profile. For example, plasma has large amounts of miRNAs, but also high levels of YRNA fragments (Dhahbi et al., 2013; Max et al., 2018; Driedonks et al., 2020). Urine and cerebrospinal fluid (CSF) samples, by comparison, have large numbers of tRNA fragments (tRFs), and smaller numbers of miRNAs (Ben-Dov et al., 2016; Yeri et al., 2017; Godoy et al., 2018; Srinivasan et al., 2019; Tosar and Cayota, 2020). Biofluids also contain measurable piRNA, snoRNA, snRNA, rRNA, fragments of protein-coding genes and lncRNAs found in biofluids (Ishikawa et al., 2017; Yeri et al., 2017; Ferrero et al., 2018; Umu et al., 2018; Akat et al., 2019; Dhahbi et al., 2019; Srinivasan et al., 2019).

These distinct RNA profiles in biofluids suggest coordination of biological processes, such as specific cargo loading into EVs [reviewed in (O’Brien et al., 2020)] and enhanced EV contributions from organs that produce or are more proximally perfused by the biofluid interrogated. While blood cells certainly contribute to RNA patterns found in plasma/serum (Savelyeva et al., 2017; Max et al., 2018), the contributions of specific tissues to acellular biofluids such as urine and CSF are poorly investigated (Cui and Cui, 2020). These are important considerations as many biomarker studies place greater importance on “tissue-specific” miRNAs even though the data used to support these designations are generally poorly defined. Differences in less-studied circulating exRNA biotypes (such as tRNAs/tRFs) have exhibited increased tissue and disease specificity (Magee et al., 2019; Qu et al., 2019; Maleki Dana et al., 2020). For example, the abundance of tRNAs varies temporally and between tissues, and are demonstrated to play a role in a variety of diseases (Dittmar et al., 2006; Sagi et al., 2016; Pan, 2018). piRNAs have also been examined for somatic expression and tissue-enrichment (Martinez et al., 2015; Perera et al., 2019; Chen et al., 2020). While substantial, long RNA tissue atlases have been developed (GTEx, FANTOM, Human Body Atlas), there are fewer, less complete tissue atlases for small RNAs available (Landgraf et al., 2007; Ludwig et al., 2016; Rahman et al., 2020). These tissue atlases for small RNAs have one or more limitations: lack of consistency in analytical technique, insufficient number of replicates per sample type, or samples from disease cases without sufficient numbers of normal controls. Here we describe a tissue atlas of small RNA-seq from postmortem samples from 90 human tissue samples (30 tissues with three biological replicates each) and four biological replicates each of red blood cells (RBCs), monocytes, and peripheral blood mononuclear cells (PBMCs). To make this data uniform, easily accessible and usable to other investigators, we have also processed all of the data with a consistent bioinformatic pipeline, exceRpt (Rozowsky et al., 2019), which assesses all RNA species and fragments that align to the human genome.

Many bioinformatic tools and small RNA databases, such as miRbase, rely on alignment with the canonical sequence for each miRNA or small RNA, which is typically the first discovered sequence and/or the sequence thought to be most abundantly expressed. While consensus sequences are relied upon for naming and organizing RNA biotypes, it is well known that miRNA sequence variation exists, giving rise to isomiRs and diverse RNA fragments (Luciano et al., 2004; Ebhardt et al., 2009; Lee et al., 2010; Martí et al., 2010). IsomiRs are sequences that exhibit slight nucleotide differences (isoforms) from the canonical sequence in miRBase (Guo and Lu, 2010). miRNAs are the most diverse class of small RNA with 1000s of isomiRs that are often not included in downstream analysis, because many analyses pipelines collapse/sum isomiR counts and attribute them to the corresponding canonical mature sequence. Not only are these other isomiRs and RNA fragment sequences measurable, but some are found at higher levels in certain tissues and biofluids than the canonical sequence (Guo and Chen, 2014; Haseeb et al., 2017; Wu et al., 2018). Tissue-dependent expression of isomiRs were found to be biologically active and to target mRNAs in tissues (Londin et al., 2015). Thus, a more comprehensive view of unique RNA sequences is likely to expand the repertoire of disease- and tissue-representative biomarkers (Guo and Chen, 2014; Haseeb et al., 2017; Wu et al., 2018), which in turn may have utility as biofluid-based biomarkers to enable liquid tissue biopsy. For example, in a 2017 paper, Telonis et al. described the ability to discriminate between 32 different cancers in TCGA, based solely on the presence and absence of isomiR sequences (Telonis et al., 2017). They found that some isomiR sequences were specific to some tissues and cancers. Telonis went on to apply this approach in breast cancer where they used isomiRs to identify breast cancer subtypes (Telonis et al., 2015). The shifting of a single base can change the targeting of isomiRs for their downstream mRNA target (Karali et al., 2016; Yu et al., 2017).

RNA-sequencing (RNA-Seq) has had a substantial impact on identifying and measuring EV exRNA cargo and has been gaining momentum as a technique for diagnosing and monitoring disease (Van Keuren-Jensen et al., 2014; Byron et al., 2016; Marco-Puche et al., 2019). Many assays require targeting of specific RNA sequences, and thus limit the scope of detected RNAs. Thus, RNA-seq offers not only the ability to monitor a larger number of canonical small RNAs, but also enables monitoring of non-canonical sequences, such as isomiRs, which may be superior discriminators for disease and tissue-enrichment, identifying functional changes, differentiating between disease and health or distinguishing between source tissues. Changes at the 5′ end of isomiRs can change the position of the seed sequence [nucleotides 2-7; (Bartel, 2009)] thereby altering the binding to the target mRNA (Bartel, 2009; Ellwanger et al., 2011), while nucleotides at the 3’ end of the isomiR can also change the stability and targeting of the RNA (Grimson et al., 2007; Agarwal et al., 2015). Although the functions associated with most isomiRs are not known, given the lack of studies in this area, the inclusion of these reproducible sequences increases the number of tissue-enriched targets. It is currently unclear how accurately these slight differences in nucleotides can be measured by qRT-PCR or other methods (Witwer, 2015; Avendaño-Vázquez and Flores-Jasso, 2020).

Here we focused on the use of RNA-seq data to identify individual RNA sequences found in tissues and biofluids. We are aware that there are several advanced adaptations to small RNA sequencing that offer enhanced detection of tRNA fragments with base modifications, mRNA fragments, or sequences with 5′ or 3′ end modifications more comprehensively (Abdelhamid et al., 2014; Cozen et al., 2015; Gogakos et al., 2017; Giraldez et al., 2019; Xu et al., 2019). However, for these experiments, we chose commonly used sequencing kits for their reproducibility, uniformity (Giraldez et al., 2018; Yeri et al., 2018) and their ability to be directly compared across biofluid samples that are already sequenced and readily obtained from several diverse cohorts. We believe that our findings, and the creation of a searchable, online tool, will aid researchers validating their own small RNA studies across tissue types and biofluids.

Materials and Methods

Tissue Samples

RNA Isolation

Fresh frozen tissue was sliced into pieces that were no greater than 0.5 mm in thickness on dry ice to maintain integrity and placed in 1 ml of RNA Later ICE. The RNA later-Ice immersed tissue incubated for at least 16 h at −20°C to allow for absorption. The tissue was then immediately placed in a glass 16 mm × 100 mm Covaris tube and mixed with 400 μL of cold lysis/binding buffer from the mirVana miRNA Isolation Kit. The Covaris was set to treat each sample at a peak power of 452, a duty factor of 17.4, and cycles/burst of 280. Each sample was then treated for 15–30 s in a Covaris S220 sonicator. Lysate was transferred to a new pre- chilled 2 ml Eppendorf tube with 40 μL of miRNA homogenate solution, and incubated on ice for 3–5 min. Total RNA was extracted according to manufacturer’s protocol with the Total Exosome RNA and Protein isolation kit (cat# 4478545, Thermofisher Scientific, Waltham, MA). RNA was eluted using 100 μL of 95°C ultrapure water. Total RNA was treated with DNase, Turbo DNA-free kit (AM 1907, Thermofisher Scientific, Waltham, MA). DNA-free Total RNA was cleaned and concentrated using Zymo clean and concentrator kit (R1016) and eluted in 28 μL of ultrapure water. RIN values for tissues can be found in Supplementary Table S1.

Perkin Elmer Small RNA Library Preparation

Small RNA libraries were generated using NEXTflex Small RNA Library Prep Kit v3 following the manufacturer’s instructions with 250 ng of RNA using 100% adapter and 16 cycles of PCR amplification. Following PCR amplification, libraries between 140 and 160 bp in size were gel purified using 6% TBE gels followed by ethanol precipitation and resuspension in 11 μL of ultra-pure water.

Small RNA Library QC and Pooling

Gel purified libraries were quantified using Agilent 2100 Bioanalyzer with High-Sense DNA chips. Equimolar amounts of libraries were pooled and quantified by Bioanalyzer with High-Sense DNA chips. Pooled libraries were normalized and denatured at a working concentration range of 6–8 pM with 5% PhiX spike-in for flow cell cluster generation. Samples were sequenced until new miRNA detection plateaued for each tissue (Supplementary Figure S1).

Blood Cells

Peripheral Blood Mononuclear Cells and Washed Red Blood Cells

Human blood samples were collected with written consent from donors ≥18 years of age under an IRB protocol approved by the Human Research Protections Programs at UCSD. Biofluid samples were labeled with study identifiers.

Whole blood was collected from two male and two female healthy, non-pregnant adult donors, 22–50 years of age. For each donor, blood was collected in the following order: ∼8 ml into a serum BD Vacutainer collection tube (Becton Dickinson, PN 368045) followed by 3 ml × 4.5 ml into CTAD (0.11 M buffered trisodium citrate, 15 M theophylline, 3.7 M adenosine, 0.198 M dipyridamole) collection tubes (Becton Dickinson, PN 367947). The serum tubes were held at room temperature for 20 min prior to centrifugation at 2000 g for 5 min with no brake. 500 µL aliquots were transferred from the clear upper serum layer into screw cap 2 ml centrifuge tubes and frozen at −80°C until they were processed. PBMC, platelets, and washed red blood cells (RBC) were purified from the CTAD tubes. Wide-bore pipette tips were used at all relevant steps to reduce cell shearing and lysis.

For PBMCs, the CTAD tubes were centrifuged at 100 x g for 20 min with no brake and all but ∼100 µL of the supernatant was removed and discarded. For the PBMCs, the remaining supernatant, buffy coat, and a small portion of the RBCs were transferred from the CTAD tubes into a fresh 15 ml tube. Freshly prepared Prostaglandin I2 (PGI2) (Abcam, ab120912-1 mg) was added to ∼2 mM final concentration. The material was gently inverted several times to mix and centrifuged at 100 x g for 20 min with no brake. The supernatant material was removed to near completion and the pellet was mixed in 10 ml of RBC lysis buffer (150 mM NH4Cl, 10 mM NaHCO3, 1.27 mM EDTA) placed at room temperature for 20 min. The material was centrifuged at 500 x g for 5 min and the supernatant was discarded. The pellet was washed twice with 10 ml of Dulbecco’s phosphate-buffered saline (DPBS) each time and centrifuged as before. The pellet was gently resuspended in 2 ml of DPBS and the material was transferred to a 2 ml centrifuge tube and centrifuged as before. The supernatant was carefully removed and the pellet material in the tube was placed at −80°C until processed.

The remaining RBCs within the CTAD tubes were transferred into a 50 ml conical tube and DPBS was added to 50 ml. The cells were centrifuged at 500 x g for 5 min with no brake and the supernatant was decanted. This washing process was repeated two more times. 200 µL aliquots of the remaining washed RBC pellet were transferred into 2 ml screw cap tubes and stored at −80°C until processed.

Monocytes

Human peripheral blood was obtained from health adult volunteers in accordance with the guidelines of the IRB of Beth Israel Deaconess Medical Center after informed consent was obtained in accordance with the Declaration of Helsinki. Ten ml of blood from healthy donors were collected via cubital venipuncture into a syringe prefilled with 2.3 ml of 6% Dextran 500 (Sigma-Aldrich, St. Louis, MO) and 1 ml of 3.2% Sodium Citrate (Sigma-Aldrich). After gentle mixing the blood was sedimented for 45 min with the syringe’s nozzle up. The RBC-free fraction was washed once by centrifugation at 2000 g for 10 min. The resulting pellet was resuspended in 0.5 ml of HBSS++.

The cells were sorted using a Becton Dickinson FACSAria IIu cell sorter equipped with five lasers (350, 405, 488, 561, and 640 nm). The cell populations were sorted through a 70 μm nozzle tip at a sheath pressure of 70 psi and a drop drive frequency of 90–95 kHz. A highly pure sorting modality (4-way purity sorting for FACS Aria, Masks at 0-32-0) was chosen for cell sorting. The flow rate was maintained at an approximate speed of 10,000 events/second. Monocytes were gated based on FSC/SSC properties. The FSC values are proportional to the diameter of the interrogated cells, whereas the SSC values provide information about the internal complexity of the interrogated cell or its granularity. Sorted cells were collected in 5 ml polypropylene tubes containing 1 ml collection medium (RPMI supplemented with 50% FBS, 100 μg/ml gentamicin, 4 mM L-glutamine, 20 mM HEPES) and stored at −80°C until processed.

Biofluid Samples

Collection

The cell-free plasma, urine and saliva samples were from a previously published study (Yeri et al., 2017). Cell-free RNA isolations methods were optimized and compared to identify the best return of small RNAs from biofluid samples (Burgos et al., 2013; Burgos and Van Keuren-Jensen, 2014). Samples were collected from male college athletes ages 18–25. All human subjects provided written consent form prior to enrollment. All samples were collected with consent and approval from the Western Institutional Review Board (WIRB) study ID# 1307009395, dbGaP accession phs001258. Blood samples were collected in EDTA tubes and placed with ice packs until they were processed within 2–3 h of blood draw. Samples were spun for 10 min at 700 x g at 4°C in an Eppendorf 5804 R centrifuge using an A-4-44 rotor to remove cells and debris. The supernatant (plasma) was collected, and stored in 1 ml aliquots at −80°C for RNA isolation and sequencing. Urine was collected in sterile cups and placed in a cooler with ice packs and processed within 2–3 h of collection. Samples were spun at 1900 x g for 10 min at 4°C and 15 ml were pipetted into a 50 ml conical tube for storage at −80°C. Saliva samples were collected by passive drool to collect into a 50 ml conical tube. The sample was spun at 1900 x g for 10 min at 4°C, and 1 ml volumes were placed into 2  ml microcentrifuge tubes and stored at −80°C.

The CSF-EV and plasma-EV samples were collected under the approval of the Institutional Review Board of the North Shore-Long Island Jewish Health System. Lumbar puncture, performed using a standard technique with a 25-gauge, Whitacre point spinal needle after subcutaneous lidocaine was applied. The procedure was conducted with patient sitting up and all procedures took place at 2 p.m. local time. 15–25 ml of CSF was obtained from each subject. The first 2–3 ml obtained after collecting the fluid were sent for routine testing (cell count, proteins, glucose, and VDRL). CSF samples with macroscopic blood as consequence of traumatic procedures were not included in these analyses. A blood draw to obtain 10 ml of peripheral blood into an EDTA tube was also obtained prior to the lumbar puncture procedure. Both CSF and blood samples were centrifuged at 2,000 x g for 10 min at 4°C. Both sample types were flash frozen with liquid nitrogen and stored at −80°C at the biorepository of the Feinstein Institute for Medical Research.

RNA Isolation

CSF-EV and Plasma-EV Samples RNA Extraction

We followed the protocol in the Qiagen exoRNeasy Serum/Plasma kit (Cat No./ID: 77044) to capture extracellular vesicles on an affinity membrane, and then to lyse the vesicles and isolate the extracellular RNA. Briefly, 1 ml plasma samples were thawed at room temperature and centrifuged at 3,000 x g for 5 min at 4°C to remove debris (in an Eppendorf 5804 R centrifuge using an A-4-44). Samples were then applied to the column following the manufacturer instructions. This kit and the EVs isolated have been previously characterized (Enderle et al., 2015; Tang et al., 2017; Srinivasan et al., 2019; Kalani et al., 2020).

Cell-Free Plasma, Saliva, Urine RNA Isolation

The samples used were from a previously published study (Yeri et al., 2017). Plasma and saliva samples were isolated (1 ml) using the mirVana miRNA Isolation Kit (ThermoFisher Scientific, Waltham, MA, AM1560) according to (Burgos et al., 2013). Samples were DNase treated using TURBO DNA-free Kit (ThermoFisher Scientific, Waltham, MA, AM 1907). Samples were then cleaned and concentrated using Zymo RNA Clean and Concentrator (Zymo Research, Irvine, CA, R1016) using Protocol: Purification of small and large RNAs into separate fractions and combining the fractions at the end. Urine samples (15 ml) were isolated with Urine Total RNA Purification Maxi Kit, Slurry Format (Norgen Biotek Corp., Thorold, ON, Canada, Cat#29600). Samples were DNase treated on column using RNase-Free DNase Set (Qiagen, Germantown, MD, cat# 79254). Samples were concentrated by Speed Vacuum.

Sample Preparation for Sequencing

CSF-EV and Plasma-EV Samples Small RNA Library Preparation

Small RNA libraries were generated using NEXTflex Small RNA Library Prep Kit v2 (BIOO) with 16 cycles of PCR amplification. Amplified libraries were resolved on a 6% TBE gel for size selection. The 140- to 160-nucleotide bands corresponding to adapter-ligated libraries were excised and recovered in a DNA elution buffer followed by ethanol precipitation and resuspension in 11 μL of ultra-pure water. Gel purified libraries were quantified using Agilent 2100 Bioanalyzer with High-Sense DNA chips. Equimolar amounts of libraries were pooled and quantified by Bioanalyzer with High-Sense DNA chips. Pooled libraries were normalized and denatured at a working concentration range of 6–8 pM with 5% PhiX spike-in for flow cell cluster generation.

Cell-Free Plasma, Saliva, Urine Sequencing

The samples employed were previously published (Yeri et al., 2017). Plasma, saliva and urine RNAs were quantified in triplicate using Quant-iT Ribogreen RNA Assay kit, Low-Range protocol (R11490; ThermoFisher). The Illumina small RNA TruSeq kit (RS-200–0048; Illumina) was used for sequencing all samples. RNA input for plasma and saliva was 10–20 ng for all samples and the RNA input for urine was 30 ng for all samples. The reagents from the Illumina TruSeq kit were halved (Burgos et al., 2013). Samples were assigned and unique index. 16 PCR cycles were used for all samples. Samples were pooled and placed on Illumina V3 single read flowcells (GD-401-3001; Illumina).

Liver Fibrosis Samples

The samples employed were previously published (Van Keuren‐Jensen et al., 2016). All patients were consented at the Mayo and Scripps Clinics and the Institutional Review Boards of both sites approved the collection of the samples. For liver and serum samples, the miRVana kit (AM1560; Life Technologies) was used according to the manufacturer’s protocol with a slight modification for two phenol chloroform extractions (Burgos et al., 2013). TruSeq small RNA Library preparation Kit (Illumina RS‐200‐0048) was used. Kit reagents were used in a half reaction. Each sample was assigned one of 48 indices. 200 ng of RNA were used from tissue samples, with 15 PCR cycles. The total RNA isolated from each serum sample volume (∼1 ml) was used for small RNA sequencing. Samples were pooled and placed on a single read Illumina V3 flowcell (GD‐401‐3001). One lane of the flowcell was loaded with PhiX as a reference lane to help with low nucleotide diversity in microRNA.

Exercise Samples

The samples employed were previously published (Shah et al., 2017). All subjects provided written informed consent under approved Institutional Review Board protocols at the Beth Israel Deaconess Medical Center and Massachusetts General Hospital. Plasma samples (1 ml) were isolated using a modified mirVana PARIS protocol (AM1556, Life Technologies) with sequential phenol-chloroform extractions (Burgos et al., 2013). RNA was cleaned and concentrated (R1016, Zymo Research, Irvine, CA). Sample preparation was done using the Illumina small RNA TruSeq kit (RS-200-0048, Illumina). The kit reagents were halved at all steps with 16 rounds of PCR amplification. Libraires were quantified with the Agilent High Sensitivity DNA Kit (5067-4626, Agilent) and sequenced on an Illumina flowcell.

Stroke Samples

The samples employed were previously published (Kalani et al., 2020). This study was approved by the institutional review boards of St. Joseph’s Hospital and Medical Centre. Blood was collected in EDTA tubes. Extracellular vesicles were isolated from the plasma samples using the exoRNeasy Serum/Plasma kit from Qiagen. We followed the protocol in the Qiagen exoRNeasy Serum/Plasma kit (Cat No./ID: 77044) including a centrifugation at 3000 g for 5 min at 4°C to remove debris, after thawing and before RNA isolation. A total of 10 ng of isolated RNA was used in sample preparation (Quant-iT Ribogreen RNA Assay kit, Low-Range protocol (R11490; ThermoFisher). Samples were sequenced as described above for the liver fibrosis and exercise samples, with 16 PCR cycles.

Data Analysis

Processing

All samples were processed using the exceRpt small RNA pipeline (Rozowsky et al., 2019). The gene count tables for miRNAs, tRFs, piRNAs, protein coding genes and yRNAs were taken directly from the summary tables generated by exceRpt. To generate the count tables of sequences in each sample, the adapter trimmed fastq files were first compared against the human genome mapped BAM files from exceRpt to find reads which mapped to the human genome. The genome mapped reads where then identified in the transcriptome mapped BAM file to identify the biotypes of the sequence. Since multi-mappers are common, biotype was assigned in the same priority order as the exceRpt pipeline: miRNA, YRNA, tRNA, piRNA, protein coding, followed by a bin of any other biotype in the annotation. Reads with the same sequence (after adapter trimming) were then collapsed into a counts table with biotype and gene annotations. All counts tables (both genes and sequences) were then loaded into R (v. 4.1.0) for further analysis. Rarefaction curves were generated in R using the vegan package (https://CRAN.R-project.org/package=vegan) to assess sequencing saturation.

Tissue Atlas

The count tables from the tissue atlas samples were filtered in R to only genes and sequences present at >25 read counts in >2 of the 3 biological replicates. This allowed us to both include sequences which were well expressed while not removing sequences which were specific to a single tissue type. After filtering, read counts were normalized using median of ratios in DESeq2 (Love et al., 2014). Normalized counts were using to generate diversity and counts boxplots for each tissue type and a UMAP of all sequences was generated using the uwot package in R (https://CRAN.R-project.org/package=uwot). Tissue-enriched genes and sequences were determined using the TissueEnrich package in R with the foldChangeThreshold set to 7 (Jain and Tuteja, 2019). Sequences with specificity of “All” were removed, leaving a table of tissue-enriched, group-enriched and tissue-enhanced sequences and genes. Genes and sequences are assigned using the algorithm from the Human Protein Atlas (Uhlén et al., 2015) and can be grouped as follows: 1) Tissue-Enriched: Genes with an expression level greater than 1 normalized count that also have at least five-fold higher expression levels in a single tissue compared to all other tissues; 2) Group-Enriched: Genes with an expression level greater than 1 normalized count that also have at least five-fold higher expression levels in a group of four tissues compared to all other tissues; 3) Tissue-Enhanced: Genes with an expression level greater than 1 normalized count that also have at least five-fold higher expression levels in a single tissue compared to the average levels in all other tissues, and that are not considered Tissue-Enriched or Group-Enriched. Throughout the paper, these three categories were combined and termed; tissue-elevated.

Biofluids

The count tables from the various projects above were processed in a consistent manner. All were loaded into R as counts matrices of genes or sequences. Stacked bar plots of biofluid genome mapping and biotype diversity was generated from the raw count matrices. For further analysis, each biofluid type was filtering independently with a gene or sequence requiring >10 read counts in >50% of the samples for inclusion. As with the tissue atlas samples, filtered count matrices were normalized using median of ratios in DESeq2. Tissue-elevated sequences in biofluids were found based on the overlap of filtered sequences in biofluids samples with tissue-elevated sequences identified above.

Diversity Analysis

We use the term gene diversity in tissues to refer to the traditional analysis of collapsing all detected isoforms of a gene to a single parent gene and counting only the parent. We use the term “sequence diversity” as the total number of unique sequences with a normalized expression >10 in any tissue, without collapsing to the canonical parent gene. Similarly, in biofluids, gene diversity was determined by collapsing all isoforms of a gene to the canonical mature sequence and allowing them to count for that gene, sequence diversity in biofluids were determined by summing the number of detected sequences with normalized counts >10 in a biofluid and allowing them to be counted independently. Sequences detected in the biofluid samples with >10 read counts in 50% of the samples from that biofluid, and were listed as tissue-enriched, tissue-enhanced or group-enriched – combined they are designated tissue-elevated. Sequences that were elevated in multiple tissues were counted for each tissue.

Differential Expression Analysis

For analyses of liver fibrosis, exercise and stroke, all biofluid samples were loaded into R, filtered as above, and analyzed for differentially expressed sequences using DESeq2. Volcano plots of differentially expressed sequences were generated and colored based on the RNA biotype of the sequences. Differentially expressed sequences (Benjamini-Hochberg p-adjusted values <0.05) were compared to tissue elevated sequences and one sequence was highlighted in a bar graph displaying expression levels.

Results

Small RNA Profiles Differ Across Biofluids

We, and others, have found that each biofluid contains a distinct small RNA profile (ref 20; 1, 3), as can be observed in the percentage of reads mapped to the genome (Figure 1A) or transcriptome (Figure 1B). For this paper, we compare data from a previously published cohort: 179 cell-free plasma samples, 44 cell-free saliva samples, and 203 cell-free urine samples (Yeri et al., 2017). We also included 37 CSF and 35 matched plasma samples isolated with exoRNeasy to enrich for exRNAs contained in EVs.

FIGURE 1
www.frontiersin.org

FIGURE 1. Biofluid Profiles. For the biofluids, CSF-EVs, cell-free plasma, plasma-EVs, cell-free saliva and cell-free urine, the associated small RNAs were sequenced and aligned to the human genome (A). In Panel (A), the reads are aligned to the transcriptome, genome, rRNA, UniVec (laboratory contaminants), are categorized as unmapped, or reads that fail QC. In Panel (B), the transcriptome mapped reads are taken out and further broken down into small RNA biotypes; miRNA, tRNA, yRNA, piRNA, protein-coding fragments, and other (these are categories of RNA that annotate in GENCODE, lncRNA fragments, pseudogenes, etc).

We show that plasma and urine samples show the highest mapping rates to the human genome, while saliva and CSF samples have lower mapping rates (Figure 1A). CSF typically has very little measurable exRNA (Hulstaert et al., 2020; Burgos et al., 2013; Kopkova et al., 2018; Saugstad et al., 2017; Waller et al., 2017). These low-level RNA inputs make these samples more susceptible to amplification of synthetic (adapter) oligos and common laboratory contaminants (UniVec). Taking the portion of the CSF that maps to the transcriptome (Figure 1B), many of the exRNAs found in CSF are miRNA and tRNA, with >25% going to yRNA, piRNA and protein-coding fragments (Hulstaert et al., 2020; Godoy et al., 2018). Saliva contains high RNA concentrations, but a large fraction of this RNA is not human, with significant contributions from bacterial sequences (Li et al., 2018; Takeshita et al., 2016). Of the reads that map to the human transcriptome (Figure 1B), cell-free plasma is dominated by miRNA and yRNA fragments (yRFs), and EVs enriched from plasma samples have slightly higher levels of mapped miRNA. Finally, urine has low miRNA content, with high levels of tRFs as previously reported (Yeri et al., 2017; Godoy et al., 2018; Srinivasan et al., 2019).

Generation of a Tissue Atlas of Small RNAs

To examine whether tissue-elevated, small RNA sequences (comprised of miRNA, piRNA, snoRNA, snRNA, tRNA and YRNA fragments, etc) were found in specific biofluids and correlated with tissue abundances, we produced a high-quality tissue atlas of small RNAs across 30 human tissues. Tissue samples were received from the Brain and Body Donation Program (BBDP) at Banner Sun Health Research Institute (http://www.brainandbodydonationprogram.org); samples were rapidly collected with a median postmortem interval of 3 h and verified to be pathologically normal (see Methods; donor demographic information is available in Supplementary Table S2). Samples were sequenced deeply with at least 9.8 million reads mapping to miRNA. Supplementary Figure S1 displays rarefaction curves that have plateaued for new miRNA detection in each tissue. The three biological replicates were averaged together for each tissue, the human genome mapping rate was high for all tissues (Figure 2A). The RNA biotypes that make up the transcriptome alignments for each tissue are shown in Figure 2B. Gallbladder has high levels of tRFs and piRNA sequence alignments; whereas, monocytes and PBMCs display higher levels of yRFs, potentially contributing to the high abundance of yRFs in plasma samples.

FIGURE 2
www.frontiersin.org

FIGURE 2. Tissue Profiles. For thirty different tissues, PBMCs, monocytes, and red blood cells we isolated the RNA and sequenced their small RNA contents. In Panel (A), the reads are aligned to the transcriptome, genome, rRNA, UniVec (contaminants), are unmapped, or reads that fail QC. In Panel (B), the transcriptome mapped reads are removed and further broken down into small RNA biotypes; miRNA, tRNA, yRNA, piRNA, protein-coding fragments, and other (these are categories of RNA that annotate in GENCODE, lncRNA fragments, pseudogenes, etc).

Detection and Classification of Tissue-Elevated Sequences

There is overlap in the expression of canonical miRNAs between different tissues, as well as some unique tissue-elevated miRNAs. We sought to expand the number of tissue-elevated sequences by identifying abundant sequences that do not conform to canonical small RNA annotations. We performed a strict alignment of small RNA sequences from the biofluids and tissues using the exceRpt pipeline (see Materials and Methods). We used miR-451a and miR-30a-5p as examples to demonstrate the presence of the mature canonical sequences and the number of isomiRs present in plasma samples. There are 73 isomiRs of miR-451a with >1 read in >25% of plasma samples (Supplementary Table S3). The mature canonical sequence for miR-451a, highlighted in Figure 3A, is not the most abundant sequence (Guo and Chen, 2014; Liang et al., 2017). MiRNA miR-451a (Griffiths-Jones, 2004; Kozomara et al., 2019) is highly abundant in RBCs (Sun et al., 2020) and we used a heatmap to display expression (Figure 3B) of all miR-451a isomiRs in RBC, PBMCs and monocytes. miR-451a is a dicer-independent miRNA (Liang et al., 2017) while the second example in Figure 3A is from a dicer-dependent miRNA, miR-30a-5p. miR-30a-5p has 18 detectable isomiRs with >1 read in >25% of cell-free plasma samples with many of them more abundantly detected than the highlighted mature sequence found in miRBase.

FIGURE 3
www.frontiersin.org

FIGURE 3. Sequence Variation and Categorization. Panel (A) highlights the canonical miRNA sequence that is entered in miRbase for two miRNAs, miR-451a and miR-30a-5p. The other sequences are isomiRs of the mature sequence (highlighted) listed in miRbase. The table shows the normalized expression for the sequences in plasma from five different subjects and the average across the 179 cell-free plasma samples. In some cases, an isomiR has higher expression in plasma than the highlighted canonical sequence. miR-451a is highly expressed in red blood cells, as can be seen in the heat map in Panel (B). Panel (B) displays the canonical and isomiR sequences for miR-451a found in red blood cells, PBMCs, and monocytes. The most abundant sequence detected in urine samples is displayed in Figure 3c and is = similar to both a piRNA and a tRNA sequence.

The most abundant sequence in urine is GCA​TTG​GTG​GTT​CAG​TGG​TAG​AAT​TCT​CGC (Yeri et al., 2017), which matches a piRNA entry (hsa_piR_020326) with one less 5’ G, as well as being an exact match for a portion of a tRF (Figure 3C). Despite being an exact match, typical tRFs and those used by the tRF database in our alignment pipeline, exceRpt (Rozowsky et al., 2019), are longer than the sequence we detect (see red shading to indicate the detected fragment compared with the full-length fragment in the database). A recent paper introduced a tRF database called MINTbase (Wang et al., 2019), which describes and counts the sequence abundantly detected here in urine, as an exact match to a tRF (Figure 3C). It is important to keep these alignment and naming challenges in mind, as these sequences are short, and in many cases, cannot be classified with complete certainty back to the original parent RNA.

Using the tissue atlas of small RNAs that we generated, we next explored sequences for tissue-elevation. We generated a list of sequences at least seven-fold enriched in a maximum of four tissues using the R package, TissueEnrich (Jain and Tuteja, 2019). The TissueEnrich package contains three categories of “enrichment.” We employed tissue-elevated as the combination of the three categories (“tissue-enriched,” “group-enriched” and “tissue-enhanced”). The reason for four tissues was due in part to having multiple regions for a particular tissue (e.g., central nervous system - cerebellum, cerebral cortex, cerebral cortex-white matter, spinal cord). Tissue-elevated sequences and their fold-enrichments are available in Supplementary Table S4. Figure 4 describes the number of genes (4a) or individual sequences (4b) that were identified to be elevated in each tissue. Figure 4A represents the number of genes detected when all isomiRs and sequence fragments have been collapsed down to the parent/canonical gene. The sequences in Figure 4B represent individual isomiRs, and unique RNA fragments of genes that are expressed at a minimum of 25 counts in at least 2/3 biological replicate samples. The numbers of these tissue-elevated sequences that go to miRNA and piRNA for each tissue and biofluid are displayed in Table 1 and the full table with all RNA biotypes can be found in Supplementary Table S5. The graphs and tables can be used to determine that RBCs and monocytes, and tissues such as bladder and gall bladder, have fewer distinct tissue-defining sequences, while interestingly – jejunum – has many tissue-elevated sequences. Table 1 and Supplementary Table S5 describe the sequences detected in each tissue broken out by small RNA biotype. First column for each RNA biotype is the number of detected genes identified for each tissue (all isomiRs and similar fragments are collapsed and counted with the canonical sequence), followed by the full number of isomiRs or unique fragments (not collapsed; column two) for that RNA biotype, for each tissue. We also include the number of tissue-elevated genes (column three) and fragments/isoforms (column four) for each RNA biotype in each tissue. While the pattern of tissue-elevated sequences is similar in Figures 4A,B, the scales are very different. For example, adrenal gland has 1,572 tissue-elevated genes when collapsed (Figure 4A) and has 31,917 tissue-elevated unique sequences with >10 reads in at least 2 of the three replicate tissue samples (Figure 4B). We also observe low small RNA diversity in monocytes and RBCs.

FIGURE 4
www.frontiersin.org

FIGURE 4. Tissue Diversity. Panel (A) displays the number of genes detected using small RNASeq in each tissue. In this case, all isomiRs, isoforms, and fragments are collapsed to the parent gene and counted once. In Panel (B), all unique sequences, isomiRs, isoforms, and fragments are counted separately and summed up.

TABLE 1
www.frontiersin.org

TABLE 1. Tissue-Enriched Sequences. Displays the # of select RNA biotypes detected, the number of isoforms detected, and the number of tissue-enriched sequences observed for each tissue.

Figure 5 displays the normalized counts for a pancreas-elevated miRNA, miR-216-5p, as well as the normalized counts for miR-216-5p and its isomiRs in all tissues. The canonical sequence has 50% of the reads and the other 50% of miR-216-5p reads are distributed to its 6 isomiRs. All of these sequences are elevated in pancreas, compared to other tissues.

FIGURE 5
www.frontiersin.org

FIGURE 5. Tissue Elevation. miR-216-5p is elevated in pancreas. The expression of the mature sequence is highlighted in Panel. The stacked bar plot displays the relative proportion of other miR-216-5p isomiRs, all of which are elevated in pancreas.

Tissue-Enriched miRNAs Cluster in Tissues and Blood Cell Types

We wanted to determine how well tissues clustered based on their related sequences. Figure 6A is a UMAP using all sequences to cluster the tissues and blood cells and Figure 6B UMAP is subset to using tissue-elevated miRNA and isomiR sequences from each tissue for clustering. Most tissue replicates show consistency and cluster closely, regardless of the sequences used to generate them. Using only the tissue-elevated sequences improved clustering in the CNS tissues. Supplementary Figure S2 displays tissue clustering using the tissue-enriched tRFs, yRFs and piRNAs, each of which display less tissue specificity than tissue-elevated miRNAs/isomiRs and greater variability among tissue replicates.

FIGURE 6
www.frontiersin.org

FIGURE 6. Tissue Clustering. We clustered the tissues using all sequences in Panel (A). Using tissue-elevated miRNA sequences in Panel (B), the tissue samples clustered into tighter groups. In both figures, the tissue replicates show low variability.

Tissue-Elevated Sequences are Detected in Biofluids

We sequenced 30 tissues, representing most major organ systems in the body and classified tissue-elevated small RNA sequences. We next wanted to measure the number of tissue-elevated genes and sequences that were detectable in biofluids: CSF-EVs, plasma-EVs, cell-free plasma, cell-free saliva, and cell-free urine. Figure 7 displays the diversity of tissue-elevated genes (Figure 7A) and sequences (Figure 7B, including isomiRs and RNA fragments) detected in each biofluid. Table 1 provides the number of tissue-enriched sequences, taken from the tissue atlas portion of the paper, and present in each biofluid (Supplementary Table S5 provides the data for all RNA biotypes). Plasma samples contain the largest number of tissue-elevated sequences compared to the other biofluids.

FIGURE 7
www.frontiersin.org

FIGURE 7. Biofluid Diversity. Panels (A, B) display the number of tissue-enriched genes identified in the small RNA sequenced data. The parent tissue-elevated genes that were detected in each biofluid are displayed in Panel (A). CSF had the lowest number of tissue-elevated genes and saliva had the highest. Panel (B) displays the number of tissue-elevated sequences that were detected in each biofluid.

We looked more closely at the number of detectable tissue-elevated sequences in cell-free plasma, plasma-EVs, cell-free saliva, cell-free urine, and CSF-EVs (Figure 8). We examined tissue-elevated miRNAs and isomiRs, since they are the most diverse sequences and clustered tissues well in Figure 6B. In each panel, Figures 8A–E, the first stacked bar plot represents the number of tissue-enriched miRNA and isomiR sequences detected in that biofluid with >10 counts in >50% of samples, and the second stacked bar plot is the sum of all tissue-elevated counts for each tissue. If two tissues share a tissue-elevated sequence, the sequences are counted for both tissues. For Figure 8, the number of tissue-elevated sequences detected in each biofluid are provided in Supplementary Table S6, the number of tissue-elevated sequences and the fraction of the reads they account for in each biofluid are provided. Cell-free plasma contained tissue-elevated sequences from the largest number of tissues and CSF from the least number of tissues. There are 474 tissue-enriched RBC miRNA/isomiR sequences found in cell-free plasma, accounting for an average of 82% (874,975 counts) of the tissue-enriched sequences detected in cell-free plasma. Saliva samples display high levels of RBC sequences as well as sequences from esophagus and submandibular gland (Figure 8C). Interestingly, pancreas-enriched sequences are also found in high numbers. Kidney and brain tissue-enriched sequences tended to overlap, and both urine and CSF display high levels of kidney and CNS tissue-elevated sequences. The two sequences most abundantly detected in CSF are tissue-elevated for both cerebellum and kidney, the canonical sequence for miR-204 and an isomiR-204, and are counted for both tissues. Tissue-elevated sequence abundance from RBCs make up the majority of the overall number of counts in plasma-EVs. 3% of the tissue-enriched sequences in plasma-EVs were from skeletal muscle and other tissues that cluster with skeletal muscle in the UMAP in Figure 6B. We would like to point out caveats to our approach: 1) tissue-elevation does not equal tissue-specificity, sequences can be associated with and expressed in more than one tissue, 2) we did not sequence every tissue or cell type in the body, 3) each tissue has a different number of elevated sequences, tissues with fewer sequences were not weighted differently from tissues with a large number of sequences, and 4) tissue vascularity and size were not accounted for.

FIGURE 8
www.frontiersin.org

FIGURE 8. Tissue-elevated miRNA/isomiR Sequence Detection in Biofluids. The number of tissue-elevated sequences that were detected in each biofluid and their abundance are displayed in Panel. The number of tissue-elevated sequences detected for each tissue in cell-free plasma are shown in the first stacked bar plot Panel (A). Tissue-elevated sequences from 31 different tissue and cell types were detected (the numbers are found in Supplementary Table S5). The second stacked bar plot in Panel (A) displays the fraction of tissue-elevated reads going to each sample type. 69% of the tissue-elevated reads in cell-free plasma go to red blood cells. The number of tissue-elevated sequences detected and the fraction of tissue-elevated reads going to each tissue are displayed for plasma EVs [Panel (B)], saliva [Panel (C)], Urine [Panel (D)], and CSF [Panel (E)].

tRFs illustrate one area of caution when examining these data. Forty-six tissue-elevated tRF sequences were detected in cell-free plasma, 14 of which were tissue enriched in gallbladder. Gallbladder had a high level of tRFs compared to other tissues (Figure 2). The tRF with the sequence “TCC​CTG​GTG​GTC​TAG​TGG​TTA​GGA​TTC​GGC​GC” comprises 10.2% of gallbladder tissue small RNASeq reads. Because it is so highly expressed in the gallbladder, it is counted as ‘elevated’ in gallbladder by the criteria we used in this paper. It is nearly 10-fold more highly expressed in gallbladder than in the next most abundant tissue, Ileum. However, as can be seen in Table 2, this tRF is highly expressed in most tissues, but low in blood cell types. This illustrates that while this gallbladder-elevated tRF is detectable in every biofluid we examined, it is high expression in all tissues makes it a poor marker of tissue specificity.

TABLE 2
www.frontiersin.org

TABLE 2. Tissue-enriched gallbladder tRF. A tRF sequence that is enriched in gallbladder is also highly expressed in many tissues.

Circulating, Tissue-Elevated Small RNAs are Differentially Expressed in Physiological (Exercise) and Pathologic States (Liver Fibrosis or Ischemic Stroke)

To assess the potential for tissue-elevated isomiRs to be differentially expressed in physiological or pathological states, we re-examined previously published data from three different studies. The first study compared blood samples from subjects with Hepatitis C and liver fibrosis stage 1 (early) and liver fibrosis stage 4 (advanced) (Van Keuren‐Jensen et al., 2016). Figure 9A shows the number of differentially expressed and tissue-elevated sequences and their RNA biotypes identified in blood samples. miRNAs or isomiRs that were tissue-elevated, in any tissue, are blue dots in the volcano plot, tissue-elevated piRNAs or sequence isoforms are yellow dots, tissue-elevated tRNA sequences are green, tissue-elevated YRNA sequences are orange, and sequences that were differentially expressed but not tissue-elevated are gray. Sequences that do not make the cutoff for p-value or fold-change are black. These RNA biotypes represented the largest number of differentially expressed sequences. We focused on a tissue-elevated isomiR in Figure 9A (miR-122 isomiR, TGG​AGT​GTG​ACA​ATG​GTG​TTT), indicated by the red arrow. The expression level of this isomiR in tissues is displayed in Figure 9B. This miR-122 isomiR, like the canonical miRNA, is liver elevated and expressed very highly.

FIGURE 9
www.frontiersin.org

FIGURE 9. Differentially Expressed Tissue-Elevated Sequences. We examined three different conditions for differentially expressed sequences: (A) displays the differentially expressed tissue-elevated sequences found in blood samples from Hepatitis C patients with liver fibrosis stage 1 or stage 4. Panel (B) is a tissue-elevated isomiR that is highly expressed in liver. (C) are samples taken from participants before and after exercise, and the expression level of a muscle-elevated isomiR across tissues (D). The final example is from a comparison of plasma samples taken from individuals that had either an ischaemic stroke or a subarachnoid hemorrhage, Panel (E). In Panel (F), we selected a CNS-elevated isomiR of miR-181-5p to illustrate tissue-elevation.

Towards employing the tool to distinguish sequences differences in a physiological example, we reanalyzed changes in miRNA detection in plasma samples pre- and post-exercise. Samples were taken from 26 participants at baseline and immediately after peak exercise using the Bruce Treadmill test. Using all unique sequences and isomiRs, we compared baseline expression to expression immediately after exercise (Shah et al., 2017). Figure 9C displays the number of tissue-elevated sequences following exercise. An isomiR of miR-10b, which is tissue-elevated in skeletal muscle was significantly down-regulated following exercise, Figure 9D.

In the final dataset, we examined plasma samples from patients with two types of stroke, either ischaemic stroke (cerebral vessel occlusion) or a subarachnoid hemorrhage. We examined the differentially expressed sequences that may be able to differentiate these two types of stroke and did differential expression analysis of plasma samples. From our differentially expressed sequences, we focused on a CNS tissue-elevated sequence, an isomiR of miR-181-5p, that was downregulated in stroke and upregulated in subarachnoid hemorrhage, Figure 9E. In Figure 9F, you can see the elevation of this sequence across the CNS tissues.

The exRNA Expression Atlas—an Online Tool to Display Tissue-Elevated Sequences in Biofluids

To facilitate the widest utility of this data, we created a searchable, online tool with tissue-elevated sequences, and the biofluids in which they are present. Figure 10 displays output capabilities of the tool, the exRNA Expression Atlas (https://data.omics.kitchen/miRNAatlas/). To illustrate how this tool works, we chose miR-1298 as an example of a miRNA that is highly enriched for tissues of the central nervous system [Figure 10A; (Godoy et al., 2018)]. Figure 10B displays the biofluids in which this miRNA can be detected. miR-1298 is most highly detected in CSF with minimal counts in other biofluids for most subjects. Knowing the expression of a miRNA such as this one in tissues, and its detection in CSF, may help prioritize biomarker candidates. As such, this searchable, online tool may assist in the validation of studies aiming to demonstrate tissue-specificity for RNA species.

FIGURE 10
www.frontiersin.org

FIGURE 10. exRNA Expression Atlas. In (A) we display the expression of a tissue-elevated miRNA for the CNS (miR-1298) across all tissues. Panel (B) displays the detection of tissue-elevated miR-1298 in different biofluids. As can be seen in this figure, this enriched CNS miRNA is detectable in CSF, but poorly detected in the other biofluids. These data will provide insight regarding which biofluids hold which tissue-elevated sequences.

Discussion

Small RNA databases and resources currently exist, but are subject to confounders, such as a lack of overlapping tissues in each dataset, differential RNA isolations that can lead to potential batch effects, and inconsistent sequencing/alignment of the reads across studies. To address these unmet needs, we created a tissue atlas of small RNAs in 30 human tissues with three replicates and multiple blood cell types. Strengths of our study are the inclusion of pathologically normal tissue with low postmortem interval and uniformity in RNA isolation and sequencing and bioinformatic handling. We found that tissues had varying percentages of RNA biotypes and diversity of tissue-elevated sequences. One recent paper that has similarly brought together these disparate resources is by Rahman et al. (Rahman et al., 2020), where the investigators examined small RNA biotypes from different tissues to come up with a set that they found to be tissue-enriched.

Using tissue-elevated sequences, especially miRNAs, we clustered the tissue samples with low variability among biological tissue replicates. Tissue-elevated tRFs and yRFs successfully clustered the samples, despite greater variability among replicates. Surprisingly, as a category, the number of tissue-elevated tRF sequences were high for several tissues: gallbladder has 2283 tissue-enriched tRFs, adrenal gland has 3805 tissue-enriched tRFs, and transverse colon has 3782 tissue-enriched tRFs. The use of all unique sequences provided us with interesting observations about some of the tissue-elevated sequences. For example, by expanding the miRNA biotype to include all isomiRs, we found a heart enriched miRNA that was more highly expressed in right heart than the left heart. These data provided us with an opportunity to use these tissue-elevated sequences to explore their detection in biofluids. One qualification for this dataset is that the RNA from the tissue samples was isolated and prepped uniformly, however different RNA isolation and sample preparation protocols, optimized for low input RNA, were used for the cell-free biofluids and EVs. A second qualification for this dataset is the use of postmortem tissues from older individuals, while most of the biofluid samples are from much younger subjects. It is known that small RNA profiles can change with age (Fehlmann et al., 2020).

There are several previously published approaches to deconvolute the origin of specific RNAs from a varied set of tissue types, but this has not been widely extended to biofluids or EVs. In this case, we decided not to do deconvolution for examining the tissue contributions in biofluids, given the lack of known correspondence between enrichment of sequences in a given tissue with the release of EVs bearing that RNA cargo from that tissue into biofluids. Instead, we relied on a more “direct” approach to measure the diversity and abundance of tissue-elevated sequences detected in each biofluid based on the total frequency of tissue-elevated sequences detected in each biofluid. A unique finding of this approach is the presence of similar transcripts across biologically divergent tissue types. For example, renal and brain RNAs were abundant in samples from the CSF. The biological etiology of this is elusive, but is certainly plausible given the similar physiologic function of filtration and electrolyte regulation in both urine and CSF. The two sequences with the highest detection in CSF are tissue-enriched for both kidney and cerebellum and are detected with >1000 counts each in CSF, miR-204-5p and an isomiR of that miRNA.

In the examination of differentially expressed tissue-elevated sequences found in blood, it was interesting to note that we were able to identify tissue sequences related to the disease or event. Because we do not have corresponding tissue samples from these individuals, it is hard to know if these changes directly correlate with the tissue, or if there is different usage or packaging of these sequences into the extracellular biofluid.

Clinically, this simplistic “deconvolution” within biofluids may facilitate targeting those RNAs that may be more specific for disease monitoring. Certainly, this tissue-enrichment estimation is imperfect. First, we do not have many replicate tissues and samples from every tissue or cell type within that tissue. Nevertheless, we believe that the current study provides a framework to build shared resources to assist in studies of tissue- and cell-specific RNAs and their presence in biofluids (exRNAs). Some obvious confounding factors to target in future experimental designs include: 1) some tissues may actively release more EVs or EVs with more/specific RNA cargo, 2) vascularization of the tissue may lead to greater abundance in plasma, and 3) selective RNA loading that does not contain tissue enriched sequences for some of the tissues will lead to the appearance of low abundance in plasma, and 4) understanding expression level changes with disease, injury, stress, development or aging. In addition, care to avoid overinterpretation of “concentration” of a given exRNA species is needed: for example, the abundant gallbladder-enriched tRF is also abundant across many other tissues. Ultimately, with the incorporation of larger numbers of matched tissue and biofluid samples, these data may expand the repertoire of disease-relevant exRNA biomarkers. We have created a tool (the exRNA Expression Atlas) to display tissue-elevated sequences and their abundance in biofluids that is freely accessible to the scientific community.

Data Availability Statement

All sequencing data is available through the ERCC exRNA Atlas: https://exrna-atlas.org/exat/datasets (EXR-KJENS1WBaSro-AN), https://exrna-atlas.org/exat/datasets (EXRKJENS1sPlvS2AN), https://exrna-atlas.org/exat/datasets (EXR-KJENS1sPlvS2-AN), https://exrna-atlas.org/exat/datasets (EXR-KJENS1TISSUEDATA-AN); and through accession number phs001258.v1.p1 in dbGaP: https://www.ncbi.nlm.nih.gov/gap/.

Ethics Statement

The studies involving human participants were reviewed and approved by the appropriate institutional review board. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

Study Design: EA, BM, RK, LL, SD, and KV. Data Collection and Interpretation: EA, BM, RK, TS, TB, GES, EH, IG, RR, MS, MH, AC-L, NV, TW, JRa, PP, JRo, JG, MJH, RS, LL, SD, and KV. Writing: EA, RK, TW, RS, LL, SD, and KV. Figures: EA, RK, TW, SD, and KV.

Funding

This work was funded by a grant from the National Institutes of Health-National Center for Translational Sciences (UH3 TR000891-03S1). The Brain and Body Donation Program is supported by the National Institute of Neurological Disorders and Stroke (U24 NS072026 National Brain and Tissue Resource for Parkinson’s Disease and Related Disorders), the National Institute on Aging (P30 AG19610 Arizona Alzheimer’s Disease Core Center), the Arizona Department of Health Services (contract 211002, Arizona Alzheimer’s Research Center), the Arizona Biomedical Research Commission (contracts 4001, 0011,05-90 and 1001 to the Arizona Parkinson’s Disease Consortium). The work was also funded by the Flinn Foundation (Award #s 1994 and 2037).

Conflict of Interest

KV and SD are members of the scientific advisory board at Dyrnamix. Their involvement in this company had no bearing on the work performed in this manuscript. KV is on the scientific advisory board of HTG. Her involvement has no bearing on this work.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We thank NCATS Extracellular RNA Communication Consortium (UH3 TR000891-03S1) for funding and to the directors that provided encouragement and advice, John Satterlee and Dan Tagle. We thank Aleksander Milosavljevic and his group at Baylor for their help in the ERCC DMRR. We are grateful to the Banner Sun Health Research Institute Brain and Body Donation Program of Sun City, Arizona for the provision of human tissue. We also thank the Michael J. Fox Foundation for Parkinson’s Research for their continued support. We thank Jeff Watkins from TGen for his help with figures and beautification of the manuscript. We thank the Center for Collaborative Sequencing at TGen for maintaining the sequencing machines and related equipment. We acknowledge the Dorrance Family Research Fund for supporting studies of extracellular vesicles at TGen.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcell.2022.804164/full#supplementary-material

Supplementary Figure S1 | This graph shows the rarefaction curves of each tissue. As can be seen here, the detection rate of new miRNAs rises steeply and then begins to plateau at 1 million reads and 3 million reads. These data indicate that using the most diverse class of small RNA, we are capturing most of the diversity for each tissue with our read depth.

Supplementary Figure S2 | Supplemental figure 1 displays the UMAP clusters when using either tissue-elevated tRFs, piRNAs, or yRFs. tRFs do a better job of clustering tissues than do the other RNA biotypes displayed here. These sequences are not as tissue-elevated and the replicates are not as close as they are with tissue-elevated miRNA and isomiRs.

Supplementary Table S1 | RIN values for tissue samples.

Supplementary Table S2 | Demographic information for tissue samples.

Supplementary Table S3 | IsomiR sequences for miR-451a in monocytes, PBMCs and RBCs.

Supplementary Table S4 | Tissue-elevated sequences and their fold-enrichments.

Supplementary Table S5 | The number of tissue-elevated sequences that go to each RNA biotype for each tissue and biofluid.

Supplementary Table S6 | The number of tissue-elevated sequences detected in each biofluid.

References

Abdelhamid, R. F., Plessy, C., Yamauchi, Y., Taoka, M., de Hoon, M., Gingeras, T. R., et al. (2014). Multiplicity of 5′ Cap Structures Present on Short RNAs. PLoS One 9 (7), e102895. doi:10.1371/journal.pone.0102895

PubMed Abstract | CrossRef Full Text | Google Scholar

Agarwal, V., Bell, G. W., Nam, J.-W., and Bartel, D. P. (2015). Predicting Effective microRNA Target Sites in Mammalian mRNAs. Elife 4. doi:10.7554/eLife.05005

PubMed Abstract | CrossRef Full Text | Google Scholar

Akat, K. M., Lee, Y. A., Hurley, A., Morozov, P., Max, K. E. A., Brown, M., et al. (2019). Detection of Circulating Extracellular mRNAs by Modified Small-RNA-Sequencing Analysis. JCI Insight 4. doi:10.1172/jci.insight.127317

PubMed Abstract | CrossRef Full Text | Google Scholar

Arroyo, J. D., Chevillet, J. R., Kroh, E. M., Ruf, I. K., Pritchard, C. C., Gibson, D. F., et al. (2011). Argonaute2 Complexes Carry a Population of Circulating microRNAs Independent of Vesicles in Human Plasma. Proc. Natl. Acad. Sci. 108 (12), 5003–5008. doi:10.1073/pnas.1019055108

PubMed Abstract | CrossRef Full Text | Google Scholar

Avendaño‐Vázquez, S. E., and Flores‐Jasso, C. F. (2020). Stumbling on Elusive Cargo: How isomiRs challenge microRNA Detection and Quantification, the Case of Extracellular Vesicles. J. Extracellular Vesicles 9 (1), 1784617. doi:10.1080/20013078.2020.1784617

CrossRef Full Text | Google Scholar

Bartel, D. P. (2009). MicroRNAs: Target Recognition and Regulatory Functions. Cell 136 (2), 215–233. doi:10.1016/j.cell.2009.01.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Ben-Dov, I. Z., Whalen, V. M., Goilav, B., Max, K. E. A., and Tuschl, T. (2016). Cell and Microvesicle Urine microRNA Deep Sequencing Profiles from Healthy Individuals: Observations with Potential Impact on Biomarker Studies. PLoS One 11 (1), e0147249. doi:10.1371/journal.pone.0147249

PubMed Abstract | CrossRef Full Text | Google Scholar

Burgos, K. L., Javaherian, A., Bomprezzi, R., Ghaffari, L., Rhodes, S., Courtright, A., et al. (2013). Identification of Extracellular miRNA in Human Cerebrospinal Fluid by Next-Generation Sequencing. RNA 19 (5), 712–722. doi:10.1261/rna.036863.112

PubMed Abstract | CrossRef Full Text | Google Scholar

Burgos, K. L., and Van Keuren-Jensen, K. (2014). RNA Isolation for Small RNA Next-Generation Sequencing from Acellular Biofluids. Methods Mol. Biol. 1182, 83–92. doi:10.1007/978-1-4939-1062-5_8

PubMed Abstract | CrossRef Full Text | Google Scholar

Byron, S. A., Van Keuren-Jensen, K. R., Engelthaler, D. M., Carpten, J. D., and Craig, D. W. (2016). Translating RNA Sequencing into Clinical Diagnostics: Opportunities and Challenges. Nat. Rev. Genet. 17 (5), 257–271. doi:10.1038/nrg.2016.10

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, L., Chen, Z., Simões, A., Wu, X., Dai, Y., DiPietro, L. A., et al. (2020). Site-Specific Expression Pattern of PIWI-Interacting RNA in Skin and Oral Mucosal Wound Healing. Ijms 21 (2), 521. doi:10.3390/ijms21020521

PubMed Abstract | CrossRef Full Text | Google Scholar

Cozen, A. E., Quartley, E., Holmes, A. D., Hrabeta-Robinson, E., Phizicky, E. M., and Lowe, T. M. (2015). ARM-seq: AlkB-Facilitated RNA Methylation Sequencing Reveals a Complex Landscape of Modified tRNA Fragments. Nat. Methods 12 (9), 879–884. doi:10.1038/nmeth.3508

PubMed Abstract | CrossRef Full Text | Google Scholar

Cui, C., and Cui, Q. (2020). The Relationship of Human Tissue microRNAs with Those from Body Fluids. Sci. Rep. 10 (1), 5644. doi:10.1038/s41598-020-62534-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Dhahbi, J. M., Spindler, S. R., Atamna, H., Boffelli, D., Mote, P., and Martin, D. I. K. (2013). 5′-YRNA Fragments Derived by Processing of Transcripts from Specific YRNA Genes and Pseudogenes Are Abundant in Human Serum and Plasma. Physiol. Genomics 45 (21), 990–998. doi:10.1152/physiolgenomics.00129.2013

PubMed Abstract | CrossRef Full Text | Google Scholar

Dhahbi, J., Nunez Lopez, Y. O., Schneider, A., Victoria, B., Saccon, T., Bharat, K., et al. (2019). Profiling of tRNA Halves and YRNA Fragments in Serum and Tissue from Oral Squamous Cell Carcinoma Patients Identify Key Role of 5′ tRNA-Val-CAC-2-1 Half. Front. Oncol. 9, 959. doi:10.3389/fonc.2019.00959

PubMed Abstract | CrossRef Full Text | Google Scholar

Dittmar, K. A., Goodenbour, J. M., and Pan, T. (2006). Tissue-specific Differences in Human Transfer RNA Expression. Plos Genet. 2 (12), e221. doi:10.1371/journal.pgen.0020221

PubMed Abstract | CrossRef Full Text | Google Scholar

Driedonks, T. A. P., Mol, S., Bruin, S., Peters, A. L., Zhang, X., Lindenbergh, M. F. S., et al. (2020). Y‐RNA Subtype Ratios in Plasma Extracellular Vesicles Are Cell Type‐ Specific and Are Candidate Biomarkers for Inflammatory Diseases. J. Extracellular Vesicles 9 (1), 1764213. doi:10.1080/20013078.2020.1764213

CrossRef Full Text | Google Scholar

Ebhardt, H. A., Tsang, H. H., Dai, D. C., Liu, Y., Bostan, B., and Fahlman, R. P. (2009). Meta-analysis of Small RNA-Sequencing Errors Reveals Ubiquitous post-transcriptional RNA Modifications. Nucleic Acids Res. 37 (8), 2461–2470. doi:10.1093/nar/gkp093

PubMed Abstract | CrossRef Full Text | Google Scholar

Ellwanger, D. C., Büttner, F. A., Mewes, H.-W., and Stümpflen, V. (2011). The Sufficient Minimal Set of miRNA Seed Types. Bioinformatics 27 (10), 1346–1350. doi:10.1093/bioinformatics/btr149

PubMed Abstract | CrossRef Full Text | Google Scholar

Enderle, D., Spiel, A., Coticchia, C. M., Berghoff, E., Mueller, R., Schlumpberger, M., et al. (2015). Characterization of RNA from Exosomes and Other Extracellular Vesicles Isolated by a Novel Spin Column-Based Method. PLoS One 10 (8), :e0136133. doi:10.1371/journal.pone.0136133

PubMed Abstract | CrossRef Full Text | Google Scholar

Everaert, C., Helsmoortel, H., Decock, A., Hulstaert, E., Van Paemel, R., Verniers, K., et al. (2019). Performance Assessment of Total RNA Sequencing of Human Biofluids and Extracellular Vesicles. Sci. Rep. 9 (1), 17574. doi:10.1038/s41598-019-53892-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Fehlmann, T., Lehallier, B., Schaum, N., Hahn, O., Kahraman, M., Li, Y., et al. (2020). Common Diseases Alter the Physiological Age-Related Blood microRNA Profile. Nat. Commun. 11 (1), 5958. doi:10.1038/s41467-020-19665-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Ferrero, G., Cordero, F., Tarallo, S., Arigoni, M., Riccardo, F., Gallo, G., et al. (2018). Small Non-coding RNA Profiling in Human Biofluids and Surrogate Tissues from Healthy Individuals: Description of the Diverse and Most Represented Species. Oncotarget 9 (3), 3097–3111. doi:10.18632/oncotarget.23203

PubMed Abstract | CrossRef Full Text | Google Scholar

Giraldez, M. D., Spengler, R. M., Etheridge, A., Godoy, P. M., Barczak, A. J., Srinivasan, S., et al. (2018). Comprehensive Multi-center Assessment of Small RNA-Seq Methods for Quantitative miRNA Profiling. Nat. Biotechnol. 36 (8), 746–757. doi:10.1038/nbt.4183

PubMed Abstract | CrossRef Full Text | Google Scholar

Giraldez, M. D., Spengler, R. M., Etheridge, A., Goicochea, A. J., Tuck, M., Choi, S. W., et al. (2019). Phospho‐RNA‐seq: a Modified Small RNA‐seq Method that Reveals Circulating mRNA and lncRNA Fragments as Potential Biomarkers in Human Plasma. EMBO J. 38 (11). doi:10.15252/embj.2019101695

PubMed Abstract | CrossRef Full Text | Google Scholar

Godoy, P. M., Bhakta, N. R., Barczak, A. J., Cakmak, H., Fisher, S., MacKenzie, T. C., et al. (2018). Large Differences in Small RNA Composition between Human Biofluids. Cel Rep. 25 (5), 1346–1358. doi:10.1016/j.celrep.2018.10.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Gogakos, T., Brown, M., Garzia, A., Meyer, C., Hafner, M., and Tuschl, T. (2017). Characterizing Expression and Processing of Precursor and Mature Human tRNAs by Hydro-tRNAseq and PAR-CLIP. Cel Rep. 20 (6), 1463–1475. doi:10.1016/j.celrep.2017.07.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Griffiths-Jones, S. (2004). The microRNA Registry. Nucleic Acids Res. 32 (Database issue), 109D–111D. doi:10.1093/nar/gkh023

PubMed Abstract | CrossRef Full Text | Google Scholar

Grimson, A., Farh, K. K.-H., Johnston, W. K., Garrett-Engele, P., Lim, L. P., and Bartel, D. P. (2007). MicroRNA Targeting Specificity in Mammals: Determinants beyond Seed Pairing. Mol. Cel 27 (1), 91–105. doi:10.1016/j.molcel.2007.06.017

CrossRef Full Text | Google Scholar

Guo, L., and Chen, F. (2014). A challenge for miRNA: Multiple isomiRs in miRNAomics. Gene 544 (1), 1–7. doi:10.1016/j.gene.2014.04.039

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, L., and Lu, Z. (2010). Global Expression Analysis of miRNA Gene Cluster and Family Based on isomiRs from Deep Sequencing Data. Comput. Biol. Chem. 34 (3), 165–171. doi:10.1016/j.compbiolchem.2010.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Haseeb, A., Makki, M. S., Khan, N. M., Ahmad, I., and Haqqi, T. M. (2017). Deep Sequencing and Analyses of miRNAs, isomiRs and miRNA Induced Silencing Complex (miRISC)-Associated miRNome in Primary Human Chondrocytes. Sci. Rep. 7 (1), 15178. doi:10.1038/s41598-017-15388-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Hulstaert, E., Morlion, A., Avila Cobos, F., Verniers, K., Nuytens, J., Vanden Eynde, E., et al. (2020). Charting Extracellular Transcriptomes in the Human Biofluid RNA Atlas. Cel Rep. 33 (13), 108552. doi:10.1016/j.celrep.2020.108552

CrossRef Full Text | Google Scholar

Ishikawa, T., Haino, A., Seki, M., Terada, H., and Nashimoto, M. (2017). The Y4-RNA Fragment, a Potential Diagnostic Marker, Exists in Saliva. Non-coding RNA Res. 2 (2), 122–128. doi:10.1016/j.ncrna.2017.07.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Jain, A., and Tuteja, G. (2019). TissueEnrich: Tissue-specific Gene Enrichment Analysis. Bioinformatics 35 (11), 1966–1967. doi:10.1093/bioinformatics/bty890

PubMed Abstract | CrossRef Full Text | Google Scholar

Kalani, M. Y. S., Alsop, E., Meechoovet, B., Beecroft, T., Agrawal, K., Whitsett, T. G., et al. (2020). Extracellular microRNAs in Blood Differentiate between Ischaemic and Haemorrhagic Stroke Subtypes. J. Extracellular Vesicles 9 (1), 1713540. doi:10.1080/20013078.2020.1713540

CrossRef Full Text | Google Scholar

Karali, M., Persico, M., Mutarelli, M., Carissimo, A., Pizzo, M., Singh Marwah, V., et al. (2016). High-resolution Analysis of the Human Retina miRNome Reveals isomiR Variations and Novel microRNAs. Nucleic Acids Res. 44 (4), 1525–1540. doi:10.1093/nar/gkw039

PubMed Abstract | CrossRef Full Text | Google Scholar

Kopkova, A., Sana, J., Fadrus, P., Machackova, T., Vecera, M., Vybihal, V., et al. (2018). MicroRNA Isolation and Quantification in Cerebrospinal Fluid: A Comparative Methodical Study. PLoS One 13 (12), e0208580. doi:10.1371/journal.pone.0208580

PubMed Abstract | CrossRef Full Text | Google Scholar

Kozomara, A., Birgaoanu, M., and Griffiths-Jones, S. (2019). miRBase: from microRNA Sequences to Function. Nucleic Acids Res. 47 (D1), D155–D162. doi:10.1093/nar/gky1141

PubMed Abstract | CrossRef Full Text | Google Scholar

Landgraf, P., Rusu, M., Sheridan, R., Sewer, A., Iovino, N., Aravin, A., et al. (2007). A Mammalian microRNA Expression Atlas Based on Small RNA Library Sequencing. Cell 129 (7), 1401–1414. doi:10.1016/j.cell.2007.04.040

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, L. W., Zhang, S., Etheridge, A., Ma, L., Martin, D., Galas, D., et al. (2010). Complexity of the microRNA Repertoire Revealed by Next-Generation Sequencing. RNA 16 (11), 2170–2180. doi:10.1261/rna.2225110

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, F., Kaczor-Urbanowicz, K. E., Sun, J., Majem, B., Lo, H.-C., Kim, Y., et al. (2018). Characterization of Human Salivary Extracellular RNA by Next-Generation Sequencing. Clin. Chem. 64 (7), 1085–1095. doi:10.1373/clinchem.2017.285072

PubMed Abstract | CrossRef Full Text | Google Scholar

Liang, T., Yu, J., Liu, C., and Guo, L. (2017). IsomiR Expression Patterns in Canonical and Dicer-independent microRNAs. Mol. Med. Rep. 15 (3), 1071–1078. doi:10.3892/mmr.2017.6117

PubMed Abstract | CrossRef Full Text | Google Scholar

Londin, E., Loher, P., Telonis, A. G., Quann, K., Clark, P., Jing, Y., et al. (2015). Analysis of 13 Cell Types Reveals Evidence for the Expression of Numerous Novel Primate- and Tissue-specific microRNAs. Proc. Natl. Acad. Sci. USA 112 (10), E1106–E1115. doi:10.1073/pnas.1420955112

PubMed Abstract | CrossRef Full Text | Google Scholar

Love, M. I., Huber, W., and Anders, S. (2014). Moderated Estimation of Fold Change and Dispersion for RNA-Seq Data with DESeq2. Genome Biol. 15 (12), 550. doi:10.1186/s13059-014-0550-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Luciano, D. J., Mirsky, H., Vendetti, N. J., and Maas, S. (2004). RNA Editing of a miRNA Precursor: FIGURE 1. RNA 10 (8), 1174–1177. doi:10.1261/rna.7350304

PubMed Abstract | CrossRef Full Text | Google Scholar

Ludwig, N., Leidinger, P., Becker, K., Backes, C., Fehlmann, T., Pallasch, C., et al. (2016). Distribution of miRNA Expression across Human Tissues. Nucleic Acids Res. 44 (8), 3865–3877. doi:10.1093/nar/gkw116

PubMed Abstract | CrossRef Full Text | Google Scholar

Magee, R., Londin, E., and Rigoutsos, I. (2019). TRNA-derived Fragments as Sex-dependent Circulating Candidate Biomarkers for Parkinson's Disease. Parkinsonism Relat. Disord. 65, 203–209. doi:10.1016/j.parkreldis.2019.05.035

PubMed Abstract | CrossRef Full Text | Google Scholar

Maleki Dana, P., Mansournia, M. A., and Mirhashemi, S. M. (2020). PIWI-interacting RNAs: New Biomarkers for Diagnosis and Treatment of Breast Cancer. Cell Biosci 10, 44. doi:10.1186/s13578-020-00403-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Marco-Puche, G., Lois, S., Benítez, J., and Trivino, J. C. (2019). RNA-seq Perspectives to Improve Clinical Diagnosis. Front. Genet. 10, 1152. doi:10.3389/fgene.2019.01152

PubMed Abstract | CrossRef Full Text | Google Scholar

Martí, E., Pantano, L., Bañez-Coronel, M., Llorens, F., Miñones-Moyano, E., Porta, S., et al. (2010). A Myriad of miRNA Variants in Control and Huntington's Disease Brain Regions Detected by Massively Parallel Sequencing. Nucleic Acids Res. 38 (20), 7219–7235. doi:10.1093/nar/gkq575

PubMed Abstract | CrossRef Full Text | Google Scholar

Martinez, V. D., Vucic, E. A., Thu, K. L., Hubaux, R., Enfield, K. S. S., Pikor, L. A., et al. (2015). Unique Somatic and Malignant Expression Patterns Implicate PIWI-Interacting RNAs in Cancer-type Specific Biology. Sci. Rep. 5, 10423. doi:10.1038/srep10423

PubMed Abstract | CrossRef Full Text | Google Scholar

Max, K. E. A., Bertram, K., Akat, K. M., Bogardus, K. A., Li, J., Morozov, P., et al. (2018). Human Plasma and Serum Extracellular Small RNA Reference Profiles and Their Clinical Utility. Proc. Natl. Acad. Sci. USA 115 (23), E5334–E5343. doi:10.1073/pnas.1714397115

PubMed Abstract | CrossRef Full Text | Google Scholar

Murillo, O. D., Thistlethwaite, W., Rozowsky, J., Subramanian, S. L., Lucero, R., Shah, N., et al. (2019). exRNA Atlas Analysis Reveals Distinct Extracellular RNA Cargo Types and Their Carriers Present across Human Biofluids. Cell 177 (2), 463–477. doi:10.1016/j.cell.2019.02.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Nik Mohamed Kamal, N. N. S. B., and Shahidan, W. N. S. (2019). Non-Exosomal and Exosomal Circulatory MicroRNAs: Which Are More Valid as Biomarkers? Front. Pharmacol. 10, 1500. doi:10.3389/fphar.2019.01500

PubMed Abstract | CrossRef Full Text | Google Scholar

O’Brien, K., Breyne, K., Ughetto, S., Laurent, L. C., and Breakefield, X. O. (2020). RNA Delivery by Extracellular Vesicles in Mammalian Cells and its Applications. Nat. Rev. Mol. Cel Biol 21 (10), 585–606. doi:10.1038/s41580-020-0251-y

CrossRef Full Text | Google Scholar

Pan, T. (2018). Modifications and Functional Genomics of Human Transfer RNA. Cell Res 28 (4), 395–404. doi:10.1038/s41422-018-0013-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Perera, B. P. U., Tsai, Z. T.-Y., Colwell, M. L., Jones, T. R., Goodrich, J. M., Wang, K., et al. (2019). Somatic Expression of piRNA and Associated Machinery in the Mouse Identifies Short, Tissue-specific piRNA. Epigenetics 14 (5), 504–521. doi:10.1080/15592294.2019.1600389

PubMed Abstract | CrossRef Full Text | Google Scholar

Qu, A., Wang, W., Yang, Y., Zhang, X., Dong, Y., Zheng, G., et al. (2019). A Serum piRNA Signature as Promising Non-invasive Diagnostic and Prognostic Biomarkers for Colorectal Cancer. Cmar Vol. 11, 3703–3720. doi:10.2147/CMAR.S193266

PubMed Abstract | CrossRef Full Text | Google Scholar

Rahman, R.-U., Liebhoff, A.-M., Bansal, V., Fiosins, M., Rajput, A., Sattar, A., et al. (2020). SEAweb: the Small RNA Expression Atlas Web Application. Nucleic Acids Res. 48 (D1), D204–D219. doi:10.1093/nar/gkz869

PubMed Abstract | CrossRef Full Text | Google Scholar

Rodosthenous, R. S., Hutchins, E., Reiman, R., Yeri, A. S., Srinivasan, S., Whitsett, T. G., et al. (2020). Profiling Extracellular Long RNA Transcriptome in Human Plasma and Extracellular Vesicles for Biomarker Discovery. iScience 23 (6), 101182. doi:10.1016/j.isci.2020.101182

PubMed Abstract | CrossRef Full Text | Google Scholar

Rozowsky, J., Kitchen, R. R., Park, J. J., Galeev, T. R., Diao, J., Warrell, J., et al. (2019). exceRpt: A Comprehensive Analytic Platform for Extracellular RNA Profiling. Cel Syst. 8 (4), 352–357. doi:10.1016/j.cels.2019.03.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Sagi, D., Rak, R., Gingold, H., Adir, I., Maayan, G., Dahan, O., et al. (2016). Tissue- and Time-specific Expression of Otherwise Identical tRNA Genes. Plos Genet. 12 (8), e1006264. doi:10.1371/journal.pgen.1006264

PubMed Abstract | CrossRef Full Text | Google Scholar

Saugstad, J. A., Lusardi, T. A., Van Keuren-Jensen, K. R., Phillips, J. I., Lind, B., Harrington, C. A., et al. (2017). Analysis of Extracellular RNA in Cerebrospinal Fluid. J. Extracellular Vesicles 6 (1), 1317577. doi:10.1080/20013078.2017.1317577

CrossRef Full Text | Google Scholar

Savelyeva, A. V., Kuligina, E. V., Bariakin, D. N., Kozlov, V. V., Ryabchikova, E. I., Richter, V. A., et al. (2017). Variety of RNAs in Peripheral Blood Cells, Plasma, and Plasma Fractions. Biomed. Res. Int. 2017, 1–10. doi:10.1155/2017/7404912

PubMed Abstract | CrossRef Full Text | Google Scholar

Shah, R., Yeri, A., Das, A., Courtright-Lim, A., Ziegler, O., Gervino, E., et al. (2017). Small RNA-Seq during Acute Maximal Exercise Reveal RNAs Involved in Vascular Inflammation and Cardiometabolic Health: Brief Report. Am. J. Physiology-Heart Circulatory Physiol. 313 (6), H1162–H1167. doi:10.1152/ajpheart.00500.2017

CrossRef Full Text | Google Scholar

Skog, J., Würdinger, T., van Rijn, S., Meijer, D. H., Gainche, L., Curry, W. T., et al. (2008). Glioblastoma Microvesicles Transport RNA and Proteins that Promote Tumour Growth and Provide Diagnostic Biomarkers. Nat. Cel Biol 10 (12), 1470–1476. doi:10.1038/ncb1800

CrossRef Full Text | Google Scholar

Srinivasan, S., Yeri, A., Cheah, P. S., Chung, A., Danielson, K., De Hoff, P., et al. (2019). Small RNA Sequencing across Diverse Biofluids Identifies Optimal Methods for exRNA Isolation. Cell 177 (2), 446–462. doi:10.1016/j.cell.2019.03.024

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, L., Yu, Y., Niu, B., and Wang, D. (2020). Red Blood Cells as Potential Repositories of MicroRNAs in the Circulatory System. Front. Genet. 11, 442. doi:10.3389/fgene.2020.00442

PubMed Abstract | CrossRef Full Text | Google Scholar

Tabet, F., Vickers, K. C., Cuesta Torres, L. F., Wiese, C. B., Shoucri, B. M., Lambert, G., et al. (2014). HDL-transferred microRNA-223 Regulates ICAM-1 Expression in Endothelial Cells. Nat. Commun. 5, 3292. doi:10.1038/ncomms4292

PubMed Abstract | CrossRef Full Text | Google Scholar

Takeshita, T., Kageyama, S., Furuta, M., Tsuboi, H., Takeuchi, K., Shibata, Y., et al. (2016). Bacterial Diversity in Saliva and Oral Health-Related Conditions: the Hisayama Study. Sci. Rep. 6, 22164. doi:10.1038/srep22164

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, Y.-T., Huang, Y.-Y., Zheng, L., Qin, S.-H., Xu, X.-P., An, T.-X., et al. (2017). Comparison of Isolation Methods of Exosomes and Exosomal RNA from Cell Culture Medium and Serum. Int. J. Mol. Med. 40 (3), 834–844. doi:10.3892/ijmm.2017.3080

CrossRef Full Text | Google Scholar

Telonis, A. G., Loher, P., Jing, Y., Londin, E., and Rigoutsos, I. (2015). Beyond the One-Locus-One-miRNA Paradigm: microRNA Isoforms Enable Deeper Insights into Breast Cancer Heterogeneity. Nucleic Acids Res. 43 (19), 9158–9175. doi:10.1093/nar/gkv922

PubMed Abstract | CrossRef Full Text | Google Scholar

Telonis, A. G., Magee, R., Loher, P., Chervoneva, I., Londin, E., and Rigoutsos, I. (2017). Knowledge about the Presence or Absence of miRNA Isoforms (isomiRs) Can Successfully Discriminate Amongst 32 TCGA Cancer Types. Nucleic Acids Res. 45 (6), 2973–2985. doi:10.1093/nar/gkx082

PubMed Abstract | CrossRef Full Text | Google Scholar

Tosar, J. P., and Cayota, A. (2020). Extracellular tRNAs and tRNA-Derived Fragments. RNA Biol. 17 (8), 1149–1167. doi:10.1080/15476286.2020.1729584

PubMed Abstract | CrossRef Full Text | Google Scholar

Turchinovich, A., Weiz, L., Langheinz, A., and Burwinkel, B. (2011). Characterization of Extracellular Circulating microRNA. Nucleic Acids Res. 39 (16), 7223–7233. doi:10.1093/nar/gkr254

PubMed Abstract | CrossRef Full Text | Google Scholar

Uhlén, M., Fagerberg, L., Hallström, B. M., Lindskog, C., Oksvold, P., Mardinoglu, A., et al. (2015). Tissue-based Map of the Human Proteome. Science 347 (6220), 1260419. doi:10.1126/science.1260419

PubMed Abstract | CrossRef Full Text | Google Scholar

Umu, S. U., Langseth, H., Bucher-Johannessen, C., Fromm, B., Keller, A., Meese, E., et al. (2018). A Comprehensive Profile of Circulating RNAs in Human Serum. RNA Biol. 15 (2), 242–250. doi:10.1080/15476286.2017.1403003

PubMed Abstract | CrossRef Full Text | Google Scholar

Valadi, H., Ekström, K., Bossios, A., Sjöstrand, M., Lee, J. J., and Lötvall, J. O. (2007). Exosome-mediated Transfer of mRNAs and microRNAs Is a Novel Mechanism of Genetic Exchange between Cells. Nat. Cel Biol 9 (6), 654–659. doi:10.1038/ncb1596

CrossRef Full Text | Google Scholar

Van Keuren‐Jensen, K. R., Malenica, I., Courtright, A. L., Ghaffari, L. T., Starr, A. P., Metpally, R. P., et al. (2016). Micro RNA Changes in Liver Tissue Associated with Fibrosis Progression in Patients with Hepatitis C. Liver Int. 36 (3), 334–343. doi:10.1111/liv.12919

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Keuren-Jensen, K., Keats, J. J., and Craig, D. W. (2014). Bringing RNA-Seq Closer to the Clinic. Nat. Biotechnol. 32 (9), 884–885. doi:10.1038/nbt.3017

PubMed Abstract | CrossRef Full Text | Google Scholar

Waller, R., Wyles, M., Heath, P. R., Kazoka, M., Wollff, H., Shaw, P. J., et al. (2017). Small RNA Sequencing of Sporadic Amyotrophic Lateral Sclerosis Cerebrospinal Fluid Reveals Differentially Expressed miRNAs Related to Neural and Glial Activity. Front. Neurosci. 11, 731. doi:10.3389/fnins.2017.00731

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Yang, Y., Tan, X., Mao, X., Wei, D., Yao, Y., et al. (2019). Identification of tRNA-Derived Fragments Expression Profile in Breast Cancer Tissues. Cg 20 (3), 199–213. doi:10.2174/1389202920666190326145459

PubMed Abstract | CrossRef Full Text | Google Scholar

Wei, Z., Batagov, A. O., Schinelli, S., Wang, J., Wang, Y., El Fatimy, R., et al. (2017). Coding and Noncoding Landscape of Extracellular RNA Released by Human Glioma Stem Cells. Nat. Commun. 8 (1), 1145. doi:10.1038/s41467-017-01196-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Weiland, M., Gao, X.-H., Zhou, L., and Mi, Q.-S. (2012). Small RNAs Have a Large Impact. RNA Biol. 9 (6), 850–859. doi:10.4161/rna.20378

PubMed Abstract | CrossRef Full Text | Google Scholar

Witwer, K. W. (2015). Circulating microRNA Biomarker Studies: Pitfalls and Potential Solutions. Clin. Chem. 61 (1), 56–63. doi:10.1373/clinchem.2014.221341

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, C. W., Evans, J. M., Huang, S., Mahoney, D. W., Dukek, B. A., Taylor, W. R., et al. (2018). A Comprehensive Approach to Sequence-Oriented IsomiR Annotation (CASMIR): Demonstration with IsomiR Profiling in Colorectal Neoplasia. BMC Genomics 19 (1), 401. doi:10.1186/s12864-018-4794-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, H., Yao, J., Wu, D. C., and Lambowitz, A. M. (2019). Improved TGIRT-Seq Methods for Comprehensive Transcriptome Profiling with Decreased Adapter Dimer Formation and Bias Correction. Sci. Rep. 9 (1), 7953. doi:10.1038/s41598-019-44457-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Yáñez-Mó, M., Siljander, P. R.-M., Andreu, Z., Bedina Zavec, A., Borràs, F. E., Buzas, E. I., et al. (2015). Biological Properties of Extracellular Vesicles and Their Physiological Functions. J. Extracellular Vesicles 4, 27066. doi:10.3402/jev.v4.27066

CrossRef Full Text | Google Scholar

Yeri, A., Courtright, A., Danielson, K., Hutchins, E., Alsop, E., Carlson, E., et al. (2018). Evaluation of Commercially Available Small RNASeq Library Preparation Kits Using Low Input RNA. BMC Genomics 19 (1), 331. doi:10.1186/s12864-018-4726-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Yeri, A., Courtright, A., Reiman, R., Carlson, E., Beecroft, T., Janss, A., et al. (2017). Total Extracellular Small RNA Profiles from Plasma, Saliva, and Urine of Healthy Subjects. Sci. Rep. 7, 44061. doi:10.1038/srep44061

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, F., Pillman, K. A., Neilsen, C. T., Toubia, J., Lawrence, D. M., Tsykin, A., et al. (2017). Naturally Existing Isoforms of miR-222 Have Distinct Functions. Nucleic Acids Res. 45 (19), 11371–11385. doi:10.1093/nar/gkx788

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: small RNA, plasma, urine, saliva, cerebrospinal fluid, extracellular vesicle, tissue atlas, extracellular RNA

Citation: Alsop E, Meechoovet B, Kitchen R, Sweeney T, Beach TG, Serrano GE, Hutchins E, Ghiran I, Reiman R, Syring M, Hsieh M, Courtright-Lim A, Valkov N, Whitsett TG, Rakela J, Pockros P, Rozowsky J, Gallego J, Huentelman MJ, Shah R, Nakaji P, Kalani MYS, Laurent L, Das S and Van Keuren-Jensen K (2022) A Novel Tissue Atlas and Online Tool for the Interrogation of Small RNA Expression in Human Tissues and Biofluids. Front. Cell Dev. Biol. 10:804164. doi: 10.3389/fcell.2022.804164

Received: 28 October 2021; Accepted: 28 January 2022;
Published: 04 March 2022.

Edited by:

Georg Haase, INSERM U1106 Institut de Neurosciences des Systèmes, France

Reviewed by:

Axel Freischmidt, University of Ulm, Germany
Silvia Fischer, University of Giessen, Germany

Copyright © 2022 Alsop, Meechoovet, Kitchen, Sweeney, Beach, Serrano, Hutchins, Ghiran, Reiman, Syring, Hsieh, Courtright-Lim, Valkov, Whitsett, Rakela, Pockros, Rozowsky, Gallego, Huentelman, Shah, Nakaji, Kalani, Laurent, Das and Van Keuren-Jensen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Kendall Van Keuren-Jensen, kjensen@tgen.org

These authors have contributed equally to this work and share first authorship

These authors share senior authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.