ORIGINAL RESEARCH article

Front. Plant Sci., 26 July 2012

Sec. Plant Genetics and Genomics

Volume 3 - 2012 | https://doi.org/10.3389/fpls.2012.00165

A Survey of MicroRNA Length Variants Contributing to miRNome Complexity in Peach (Prunus persica L.)

Moreno Colaiacovo, Letizia Bernardo, Isabella Centomani, Cristina Crosatti, Lorenzo Giusti, Luigi Orrù, Gianni Tacconi, Antonella Lamontanara, Luigi Cattivelli, Primetta Faccioli *

CRA Genomics Research Centre, Fiorenzuola d’Arda, Italy

Abstract

MicroRNAs (miRNAs) are short non-coding RNA molecules produced from hairpin structures and involved in the regulation of gene expression, with major roles in plant development and stress response. Although each miRNA annotated in miRBase (www.mirbase.org) is a single defined sequence, with no further details on possible variation in sequence length, isomiRs (the population of miRNA variants arising from the same precursor) have been identified in several species and could represent a way of broadening the regulatory network of the cell. Next-generation sequencing makes it possible to assess the entire miRNA repertoire, including isomiRs, comprehensively and accurately. The aim of this work was to survey the complexity of the peach miRNome by Illumina high-throughput sequencing of miRNAs in three replicates of five biological samples taken from different peach organs and/or phenological stages. A total of 392 isomiRs (miRNA- and miRNA*-related), corresponding to 26 putative miRNA coding loci, were identified by miRDeep-P and analyzed. The presence of the same isomiRs in different biological replicates of a sample and in different tissues demonstrates that the generation of most of the detected isomiRs is not random. The degree of mature sequence heterogeneity differs markedly among loci. The results of the present work can thus contribute to a deeper view of miRNome complexity and to a better understanding of the mechanism of action of these tiny regulators.

Introduction

MicroRNAs (miRNAs) are short non-coding RNA molecules produced from hairpin structures and involved in the regulation of gene expression, with major roles in plant development and stress response. MiRNAs are transcribed as a primary transcript that folds into an imperfect stem-loop (hairpin) structure, which is then cleaved by a Dicer-like (DCL) RNase III enzyme named DCL1. The cleavage releases a short duplex: one strand, designated miRNA*, is typically degraded, while the other strand is incorporated into the RNA-induced silencing complex (RISC), where it mediates mRNA recognition and cleavage or translational repression (Jones-Rhoades et al., 2006; Voinnet, 2009; Xie et al., 2010).

Although each miRNA annotated in miRBase (www.mirbase.org) is a single defined sequence, with no further details on possible variation in sequence length, isomiRs (the population of miRNA variants arising from the same precursor) have been identified in several species and could represent a way of broadening the regulatory network of the cell (Ebhardt et al., 2009; Guo and Lu, 2010).

Vaucheret (2009) demonstrated the biological significance of mature miRNA length heterogeneity in Arabidopsis, where ath-miR168 can be processed into two miRNA isoforms, 21 nt and 22 nt in length, with different activities on AGO1 homeostasis (AGO1 is the Argonaute1 protein, a component of the RISC complex that catalyzes broad miRNA- and siRNA-guided mRNA cleavage and translational repression; Voinnet, 2009).

Alterations in miRNA end sequences can strongly affect miRNA function, because the identity of the 5′ terminal nucleotide is the major determinant of AGO protein association (Takeda et al., 2008). For example, Mi et al. (2008) found that AGO1 (which predominates in the miRNA-mediated pathway) harbors miRNAs that favor a 5′ terminal uridine. A change of the 5′ terminal nucleotide of a miRNA predictably redirected it into a different AGO complex and altered its biological activity. Additionally, the thermodynamic stability at the 5′ end of the strand was reported to be likely to affect loading into the AGO complex (Eamens et al., 2009).

An accurate profile of the entire miRNA population of a biological sample provides useful information on miRNA activity and it can be used to compare the distribution of miRNA sequence variants in different samples. In fact, although the distribution of isomiRs across samples has been previously shown to be generally similar, examples in which the dominant isomiR is different from sample to sample have been found in animals (Lee et al., 2012) and could imply a functional role for specific isomiR sequences, besides affecting the accuracy and consistency of miRNA measurement.

This work aims to survey the complexity of the peach miRNome by Illumina high-throughput sequencing, analyzing the miRNA population of a set of samples representative of different tissues and developmental stages.

Materials and Methods

Plant material and RNA extraction

A 12-year-old tree of the yellow-fleshed cv. Maycrest (Prunus persica (L.) Batsch), grafted on a wild seedling and grown in Palazzolo di Sona, Verona, Italy (45.457°N, 10.822°E), was used as the source of plant material. Each sample was collected by pooling material from three different branches of the same plant. Four phenological stages (Chapman and Catlin, 1976) were considered: swollen bud, half-inch green, pink, and bloom. Leaf and flower swollen buds were collected 41 days before flowering (DBF), half-inch leaves 21 DBF, and pink flower buds six DBF. A code was assigned to each sample: BF, pink; F, bloom; GF, swollen flower bud; O, half-inch green; GL, swollen leaf bud. Tissues were frozen in liquid nitrogen immediately after collection. Total RNA was extracted from three independent samples with the Plant Total RNA Purification Kit (NORGEN Biotek Corp., Thorold, ON, Canada) following the manufacturer's instructions. RNA quality and concentration were evaluated with the Agilent 2100 Bioanalyzer RNA 6000 Nano assay (Agilent Technologies, Santa Clara, CA, USA).

Small RNA library construction and sequencing

Small RNA libraries were prepared with the TruSeq Small RNA Sample Prep Kit (Illumina, San Diego, CA, USA) following the manufacturer's instructions. Briefly, 1 μg of total RNA was ligated with adapters at the 3′ and 5′ ends, without any size fractionation. Adapter-ligated RNA was reverse-transcribed with SuperScript II Reverse Transcriptase (Invitrogen, Carlsbad, CA, USA), then PCR-amplified (15 cycles). Samples were barcoded using 15 variants of the reverse primer provided with the kit. Libraries were pooled and then gel-purified after electrophoresis on a 6% TBE PAGE gel. Library quality and concentration were evaluated with the Agilent 2100 Bioanalyzer DNA 1000 assay. The resulting cDNAs were sequenced on the Illumina Genome Analyzer IIx.

Data analysis

Reads were filtered with the UEA sRNA plant toolkit (footnote 2; Moxon et al., 2008) to remove adaptor sequences, reads shorter than 18 nt or longer than 24 nt, low-complexity reads, reads matching rRNAs or tRNAs, and reads that did not match the peach genomic sequence made available by the International Peach Genome Initiative (www.rosaceae.org/peach/genome); only sequences with a full-length perfect match to the genome were retained. Reads from one randomly chosen replicate of each biological sample were then analyzed with miRDeep-P (Yang and Li, 2011; default parameters) to identify miRNA loci expressed in all five tested samples (GF, GL, BF, F, O). Reads associated with these loci were then also searched for in the remaining two replicates.
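The length filter and the distinction between redundant and non-redundant read counts can be illustrated with a minimal sketch. It is not the UEA toolkit's implementation: the function name collapse_reads and the example reads are hypothetical, and the adaptor, low-complexity, rRNA/tRNA and genome-matching filters are deliberately left out.

```python
from collections import Counter

MIN_LEN, MAX_LEN = 18, 24  # length window stated in the text (nt)


def collapse_reads(reads):
    """Keep reads of 18-24 nt and count how often each sequence occurs."""
    kept = [r for r in reads if MIN_LEN <= len(r) <= MAX_LEN]
    return Counter(kept)


# Hypothetical adaptor-trimmed reads, for illustration only.
example_reads = [
    "TCGGACCAGGCTTCATTCCCC",  # 21 nt, kept
    "TCGGACCAGGCTTCATTCCCC",  # identical to the read above
    "TGACAGAAGAGAGTGAGCAC",   # 20 nt, kept
    "ACGTACGT",               # 8 nt, too short
    "A" * 30,                 # 30 nt, too long
]

counts = collapse_reads(example_reads)
redundant = sum(counts.values())  # all retained reads
non_redundant = len(counts)       # distinct retained sequences
print(redundant, non_redundant)
```

Under these assumptions, the redundant count corresponds to the first figure in each filtered column of Table 1 and the non-redundant count to the figure in parentheses.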

Read counts for each variant were divided by the total number of reads matching the peach genome in the same sample and scaled to 1,000,000 reads (reads per million). Reads that could be related to more than one locus were assigned by miRDeep-P to all the possible loci.
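The normalization amounts to a reads-per-million calculation, sketched below. The function name is hypothetical, and it is an assumption of this sketch that the denominator is the redundant number of genome-matching reads reported for each library in Table 1 (797297 for BF1).

```python
def reads_per_million(count, genome_matching_reads):
    """Scale a raw read count to a library of 1,000,000 genome-matching reads."""
    return count / genome_matching_reads * 1_000_000


# Assumed denominator: genome-matching reads of library BF1 (Table 1).
bf1_total = 797297

# Hypothetical raw counts of 1, 2 and 3 reads in BF1.
rpm_values = [round(reads_per_million(c, bf1_total), 1) for c in (1, 2, 3)]
print(rpm_values)
```

Under this assumption, raw counts of 1, 2 and 3 reads in BF1 would be reported as about 1.3, 2.5 and 3.8, which is in line with the smallest non-zero values listed for BF1 in Table 2.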

The isomiRs of each putative locus were compared by BLAST against miRBase (release 18) to identify the loci related to previously known miRNAs (Kozomara and Griffiths-Jones, 2011). The BLAST search against mature sequences used the following parameters: outfmt 6, task blastn, dust “no,” e-value 10, word_size 7, reward 2, num_alignments 10. The BLAST search against precursor sequences used the following parameters: outfmt 6, task blastn, dust “no,” e-value 10e−3, num_alignments 10.
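As an illustration, the two parameter sets can be written out as BLAST+ blastn argument lists. This is only a sketch: the query file isomirs.fa and the database names mirbase_mature and mirbase_hairpin are hypothetical placeholders, the commands are assembled but not executed, and how a given BLAST+ release handles num_alignments together with tabular output is not checked here.

```python
def blastn_args(query, db, evalue, extra=()):
    """Assemble a blastn command line with the options shared by both searches."""
    args = [
        "blastn",
        "-query", query,
        "-db", db,
        "-task", "blastn",
        "-dust", "no",
        "-evalue", evalue,
        "-num_alignments", "10",
        "-outfmt", "6",
    ]
    args.extend(extra)
    return args


# Search against mature sequences: adds word_size 7 and reward 2.
mature_cmd = blastn_args(
    "isomirs.fa", "mirbase_mature", "10",
    extra=("-word_size", "7", "-reward", "2"),
)

# Search against precursor sequences: stricter e-value, no extra options.
precursor_cmd = blastn_args("isomirs.fa", "mirbase_hairpin", "10e-3")

print(" ".join(mature_cmd))
print(" ".join(precursor_cmd))
```

Either list could be passed to subprocess.run once real FASTA and database paths are substituted.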

The correlation between biological replicates was evaluated by calculating the Pearson coefficient for all the possible pairs of replicates belonging to the same biological sample, as well as for pairs of samples from different tissues for the sake of comparison. One sequence (HE860285), whose expression level was abnormally high, was removed from the set because its presence drove the Pearson correlation to almost 1 in every comparison, irrespective of the tissue.
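The effect of a single very abundant sequence on the Pearson coefficient can be shown with a small sketch. The expression vectors below are invented for illustration and are not taken from Table 2; the function assumes that neither vector is constant.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("vectors must have the same length (at least 2)")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Hypothetical normalized counts; the last entry mimics one dominant sequence.
sample_a = [1.0, 5.0, 3.0, 280000.0]
sample_b = [5.0, 1.0, 3.0, 300000.0]

r_with_outlier = pearson(sample_a, sample_b)
r_without_outlier = pearson(sample_a[:-1], sample_b[:-1])
print(r_with_outlier, r_without_outlier)
```

With the dominant value included, the coefficient is close to 1 even though the remaining values are anti-correlated, which is why excluding such a sequence makes the comparison between samples more informative.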

To identify miRNA isomiRs that were differentially expressed among the biological samples, a t-test was performed for all possible pairwise comparisons. An isomiR was considered differentially expressed in a specific comparison if its p-value was below 0.05. The whole set of reads associated with the miRNA loci was then used for hierarchical clustering in R, applying the Canberra metric to calculate the distances between the expression vectors of the samples across the reads.
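For reference, the Canberra distance between two expression vectors is the sum of |x_i - y_i| / (|x_i| + |y_i|) over all reads. The sketch below is a plain Python version, not the R code used in the study; it simply skips terms in which both values are zero, whereas R's dist function is documented as treating such terms as missing, so results may differ when many reads are absent from both samples. The example vectors are hypothetical.

```python
def canberra(x, y):
    """Canberra distance; terms with a zero denominator (0 vs 0) are skipped."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    total = 0.0
    for a, b in zip(x, y):
        denom = abs(a) + abs(b)
        if denom == 0:
            continue  # read absent from both samples: no contribution
        total += abs(a - b) / denom
    return total


# Hypothetical normalized counts for three reads in two samples.
sample_1 = [0.0, 2.5, 10.0]
sample_2 = [0.0, 7.5, 10.0]

distance = canberra(sample_1, sample_2)
print(distance)
```

Because each term is bounded by 1, a very abundant read contributes no more to the distance than a rare one, unlike the Euclidean distance.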

MiRNA target identification was carried out using psRNATarget (footnote 3; Dai and Zhao, 2011) with default parameters. To score the complementarity between a small RNA and its target transcript, psRNATarget applies the scoring schema of miRU (Zhang, 2005). The maximum expectation is the score threshold: a small RNA/target site pair is discarded if its score is greater than the threshold. The default cut-off threshold is 3.0.
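The threshold rule can be summarized in a few lines. This sketch only mirrors the rule stated above and does not reproduce psRNATarget's scoring; the tuple layout (small RNA, target, expectation) and the example identifiers and scores are hypothetical.

```python
MAX_EXPECTATION = 3.0  # default cut-off threshold stated in the text


def keep_pairs(pairs, max_expectation=MAX_EXPECTATION):
    """Discard small RNA/target pairs whose score exceeds the threshold."""
    return [p for p in pairs if p[2] <= max_expectation]


# Hypothetical (small RNA, target transcript, expectation score) records.
predicted = [
    ("isomiR_a", "transcript_1", 1.5),
    ("isomiR_a", "transcript_2", 3.0),
    ("isomiR_b", "transcript_3", 3.5),
]

retained = keep_pairs(predicted)
print(retained)
```

A pair scoring exactly 3.0 is retained, since only scores greater than the threshold are discarded.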

The accessibility of the mRNA target site to the small RNA has been identified as one of the important factors in target recognition, because secondary structure (stems, etc.) around the target site prevents the small RNA and the mRNA target from making contact. The psRNATarget server employs RNAup (Mückstein et al., 2006) to calculate target accessibility, expressed as the energy required to open (unpair) the secondary structure around the target site on the target mRNA (usually the region complementary to the small RNA plus upstream and downstream flanks). This value is denoted UPE: the lower the energy, the higher the likelihood that the small RNA can contact (and cleave) the target mRNA.

All the miRNA-related sequences were submitted to the EMBL database, whereas the sequencing raw reads were submitted to the NCBI SRA (BioProject accession: PRJNA167962).

Results

Sequencing peach small RNA libraries

Illumina deep sequencing was used to profile the whole miRNA set of five different samples corresponding to different organs and/or phenological stages. Three replicates were analyzed for each sample. A total of 40,764,330 sequence reads were obtained and filtered as reported in Section “Materials and Methods.” Details on the results of each filtering step are reported in Table 1. On average, 2,717,622 raw reads and 664,777 clean reads perfectly matching the genome were obtained for each of the 15 libraries.

Table 1

| Sample | Raw reads | Matching adaptors | Matching adaptors (18–24 nt) | Low-complexity filtered (non-redundant) | rRNA/tRNA removed (non-redundant) | Matching peach genome (non-redundant) |
|---|---|---|---|---|---|---|
| BF1 | 2842653 | 2332661 | 1592671 | 1582689 (356161) | 1335567 (333478) | 797297 (207233) |
| BF2 | 2553116 | 2124140 | 1570583 | 1560824 (370103) | 1401892 (351620) | 817438 (220223) |
| BF3 | 2641037 | 2184480 | 1646360 | 1635958 (387915) | 1466005 (368373) | 853043 (235873) |
| F1 | 2523898 | 1819981 | 1180424 | 1173273 (215571) | 858130 (194546) | 368958 (78332) |
| F2 | 2898014 | 2361457 | 1389585 | 1381108 (233414) | 1040918 (214222) | 630870 (115644) |
| F3 | 3613383 | 2923242 | 1768783 | 1757830 (320484) | 1362239 (297258) | 700729 (143030) |
| GF1 | 2696289 | 2170904 | 1354915 | 1346476 (307349) | 1200810 (289976) | 774146 (190043) |
| GF2 | 3722325 | 2848155 | 1962881 | 1950819 (429497) | 1570664 (405303) | 928251 (248399) |
| GF3 | 2952035 | 2254276 | 1327544 | 1319295 (308967) | 1145330 (291401) | 609728 (168939) |
| GL1 | 2357377 | 1707677 | 1072569 | 1065997 (241873) | 870779 (220909) | 481523 (122824) |
| GL2 | 2304406 | 1729282 | 1080322 | 1073580 (253835) | 929602 (234995) | 460632 (125267) |
| GL3 | 1822754 | 1345521 | 745352 | 740785 (207198) | 616318 (190449) | 338168 (111111) |
| O1 | 3704334 | 2907197 | 1811057 | 1799817 (327514) | 1384994 (302970) | 890186 (182703) |
| O2 | 1896816 | 1509747 | 1068609 | 1061991 (226499) | 911190 (208620) | 624084 (129672) |
| O3 | 2235893 | 1753142 | 1286833 | 1278775 (282053) | 1093002 (263673) | 696601 (162637) |

Number of reads retained after each filtering step, down to the filtered reads perfectly matching the peach genome, in each of the tested samples.

BF, pink; F, bloom; GF, swollen flower bud; O, half-inch green; GL, swollen leaf bud.
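The totals and averages quoted in the text can be recomputed from Table 1 with a short sketch. The raw-read and genome-matching (redundant) counts below are copied from the table; the variable names are illustrative.

```python
# (raw reads, reads matching the peach genome) per library, from Table 1
TABLE1 = {
    "BF1": (2842653, 797297), "BF2": (2553116, 817438), "BF3": (2641037, 853043),
    "F1": (2523898, 368958), "F2": (2898014, 630870), "F3": (3613383, 700729),
    "GF1": (2696289, 774146), "GF2": (3722325, 928251), "GF3": (2952035, 609728),
    "GL1": (2357377, 481523), "GL2": (2304406, 460632), "GL3": (1822754, 338168),
    "O1": (3704334, 890186), "O2": (1896816, 624084), "O3": (2235893, 696601),
}

n_libraries = len(TABLE1)
total_raw = sum(raw for raw, _ in TABLE1.values())
total_clean = sum(clean for _, clean in TABLE1.values())

mean_raw = total_raw / n_libraries
mean_clean = total_clean / n_libraries

print(total_raw)          # 40764330
print(round(mean_raw))    # 2717622
print(round(mean_clean))  # 664777
```

The recomputed values agree with the total of 40,764,330 reads and the averages of 2,717,622 raw and 664,777 clean reads per library.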

One replicate (randomly chosen and numbered as “1”) of each sample was subsequently analyzed with miRDeep-P, which identified the putative miRNA coding loci of the peach genome expressed in the five tested samples (reported in Files S1–S5 in Supplementary Material and summarized in File S6 in Supplementary Material). According to the miRDeep-P results, 26 putative miRNA coding loci were expressed in all samples. The putative precursors ranged from 41 nt to 227 nt in length (average 104 nt), while the average mature miRNA size was 22 nt.

These 26 miRNAs were selected and, for each of them, the associated reads were searched for in all the replicates of each sample. The results (miRNA- and miRNA*-associated reads) are reported in Table 2 for each locus. The correspondence between locus name and locus position can be found in File S7 in Supplementary Material and retrieved at www.rosaceae.org/peach/genome.

Table 2

miRNA | Reads | EMBL accession number | BF1 | BF2 | BF3 | F1 | F2 | F3 | GF1 | GF2 | GF3 | GL1 | GL2 | GL3 | O1 | O2 | O3
1_10AGTTTGTGCGTGAATCGAACCHE8629972.51.21.235.211.145.72.64.34.9010.95.91.100
CAGTTTGTGCGTGAATCGAACHE8604296.39.819.924.47.9203.910.814.88.317.45.93.408.6
TTAGATTCACGCACAAACHE8629990000001.32.21.6000000
TTAGATTCACGCACAAACTHE8604293.86.14.75.404.33.94.33.32.16.501.11.60
TTAGATTCACGCACAAACTCHE8630012.56.14.78.105.71.34.31.604.304.53.24.3
TTAGATTCACGCACAAACTCGHE86029366.59383.2208.771.3125.667.2106.7132.8105.9125.9230.72740.158.9
1_15AACCACAAATCTCTTGGACTCCTGHE8604301.301.20001.31.1000001.60
AAGAGATTTGTGGTTACTCACHE863003001.20001.32.21.604.3301.61.4
AAGAGATTTGTGGTTACTCACCHE8604302.52.40001.401.10000000
AAGAGATTTGTGGTTACTCACCGHE86300507.31.20002.61.11.60001.102.9
AAGAGATTTGTGGTTACTCACCGTHE86043112.518.412.95.43.22.914.29.76.64.22.206.76.47.2
AGAGATTTGTGGTTACTCACHE8630076.32.43.52.70001.102.1001.11.61.4
AGAGATTTGTGGTTACTCACCGHE8604311.34.94.701.602.6002.1031.14.84.3
AGAGATTTGTGGTTACTCACCGTHE86300901.22.301.6001.11.60002.202.9
AGAGATTTGTGGTTACTCACCGTTHE860297117.9162.7126.627.136.525.77165.749.245.743.420.780.952.934.5
ATTTACATCCAACGGTGAGTAACCHE8604320000000002.12.20000
CAAGAGATTTGTGGTTACTCAHE86301100001.600000031.100
CAAGAGATTTGTGGTTACTCACCHE8604321.301.200002.200001.100
CAAGAGATTTGTGGTTACTCACCGHE86301302.42.32.73.24.32.62.20000000
CCAAGAGATTTGTGGTTACTCAHE8604330000000000001.100
TCCAAGAGATTTGTGGTTACTCACHE8630151.300001.40002.100001.4
1_25CGAAACCTCCCATTCCAAHE8604331.3000001.32.202.12.23000
GAGAGGTTGCCGGAAAGAHE8630170000000002.100000
GGGTGAGAGGTTGCCGGAAAHE860434002.30000002.10001.64.3
GGGTGAGAGGTTGCCGGAAAGHE8630192.59.816.400010.3148.212.515.211.815.730.438.8
GGGTGAGAGGTTGCCGGAAAGAHE86043432.624.552.803.2096.9206.8157.4105.9121.6106.575.391.3208.2
GGGTGAGAGGTTGCCGGAAAGAAHE8630210000001.300003000
GGTGAGAGGTTGCCGGAAAGAATHE8604350000000004.200000
TCCGAAACCTCCCATTCCAAHE8630231.301.2001.43.9000002.201.4
TCCGAAACCTCCCATTCCAATHE8604353.81.21.20001.31.10000001.4
TCCGAAACCTCCCATTCCAATGHE8630250000003.900000000
TTCCGAAACCTCCCATTCCAAHE86043617.630.617.65.46.315.729.749.627.933.239.126.613.511.28.6
TTCCGAAACCTCCCATTCCAATHE8630271.32.41.22.701.46.51.13.302.2303.20
TTGGGTGAGAGGTTGCCGGAAHE8604360000000000001.100
TTGGGTGAGAGGTTGCCGGAAAHE8630292.500002.91.300000000
TTTCCGAAACCTCCCATTHE8604372.52.41.2001.402.21.604.331.100
TTTCCGAAACCTCCCATTCHE8630313.81.24.7008.67.85.43.32.14.331.100
TTTCCGAAACCTCCCATTCCHE8604378.88.623.45.49.517.134.923.72310.421.732.24.84.3
TTTCCGAAACCTCCCATTCCAHE86303315.11112.904.85.729.722.613.116.621.731.105.7
TTTCCGAAACCTCCCATTCCAAHE8603041135.11196.41134.8401.1391.5687.918022047.91835.21277.21417.6777.7433.6387.8446.5
TTTCCGAAACCTCCCATTCCAATHE86043818.815.916.48.16.312.814.221.514.820.828.233.44.85.7
1_26AAAAAGACTCAACAACCCATGTTTHE8630350000000002.100000
AAAAGACTCAACAACCCATGTHE86043801.21.22.701.400002.2001.60
AAAAGACTCAACAACCCATGTTTHE8630370000000002.100000
AAAGACTCAACAACCCATGTHE8604391.300000000000000
AAAGGCATAGTAGGGTTTAGGAHE8630390000001.300000001.4
AAAGGCATAGTAGGGTTTAGGAAGHE8604393.801.20006.58.63.32.14.30001.4
AAGGCATAGTAGGGTTTAGGAAGTHE8630411.300000000000000
ACCCCGCCCATTCCAAATATTHE8604400002.701.41.30000001.60
ACCCCGCCCATTCCAAATATTTHE863043000001.41.300000000
ATATTTTCTAAGCCTACTGTCHE8604407.53.75.98.13.28.62220.526.216.61311.813.54.87.2
CAAATATTTTCTAAGCCTACTGTCHE8630450002.700000000000
CATAGTAGGGTTTAGGAAHE8604410000001.3002.100000
CATAGTAGGGTTTAGGAAGTTHE8630470000001.300000000
CATAGTAGGGTTTAGGAAGTTTHE860441002.3001.40002.100000
CATAGTAGGGTTTAGGAAGTTTTHE8630491.31.22.301.62.95.23.21.62.1001.102.9
CATAGTAGGGTTTAGGAAGTTTTTHE8604427.57.34.75.43.24.310.311.93.310.4032.21.64.3
CTTTGCCAACCCCGCCCATTCCHE8630512.50001.62.96.55.43.30002.201.4
CTTTGCCAACCCCGCCCATTCCAHE8604421.300000001.6000000
CTTTGCCAACCCCGCCCATTCCAAHE86305301.21.20005.21.14.916.62.28.91.101.4
GAAAGGCATAGTAGGGTTTAGGAHE8604431.301.200001.102.1001.11.60
GAAAGGCATAGTAGGGTTTAGGAAHE8630551.305.92.705.73.96.5000001.64.3
GCCAACCCCGCCCATTCCAAHE8604430000000000001.100
GGAATGAGCGTGTTGGAAAHE8630571.301.2000000000001.4
GGAATGAGCGTGTTGGAAAAHE8604441.31.200001.30002.201.11.61.4
GGAATGAGCGTGTTGGAAAAGHE8630597.54.93.52.73.24.35.28.64.910.46.535.63.212.9
GGAATGAGCGTGTTGGAAAAGAHE8604442.54.95.900092.28.229.115.211.801.60
GGAATGAGCGTGTTGGAAAAGAAHE8630610000001.3002.1001.100
TATTTTCTAAGCCTACTGTCHE860445001.200003.204.200000
TCTAAGCCTACTGTCTTTCCCHE863063000002.92.61.102.1002.21.60
TCTAAGCCTACTGTCTTTCCCTHE860445000001.41.3000001.11.60
TGCCAACCCCGCCCATTCCAHE8630651.300001.4000000000
TGCCAACCCCGCCCATTCCAAHE8604466.31.22.3004.32.62.21.66.22.25.92.23.21.4
TGCCAACCCCGCCCATTCCAAAHE8630676.34.92.35.404.310.311.916.414.528.28.96.787.2
TGGAATGAGCGTGTTGGAAAAHE8604461.30000001.10000000
TTCTTTGCCAACCCCGCCCATTHE8630691.300001.42.601.64.24.331.100
TTGCCAACCCCGCCCATTHE8604471.31.20001.402.24.92.100000
TTGCCAACCCCGCCCATTCHE8630713.82.42.32.7007.810.86.64.22.201.100
TTGCCAACCCCGCCCATTCCHE86044726.324.515.210.84.817.17140.978.735.326.153.210.12412.9
TTGCCAACCCCGCCCATTCCAHE86307353.73.55.401.410.35.49.82.18.75.93.46.45.7
TTGCCAACCCCGCCCATTCCAAHE860305180.6163.9138.3140.985.6119.9260.9266.1477.3344.7382.1275126.9203.5189.5
TTGCCAACCCCGCCCATTCCAAAHE860448002.301.61.42.63.23.302.201.11.62.9
TTGCCAACCCCGCCCATTCCAAATHE8630751.30000000000001.60
TTTGAAGCAGATGATGGAACHE8604480000001.300000000
TTTGCCAACCCCGCCCATHE86307702.400001.303.302.201.101.4
TTTGCCAACCCCGCCCATTHE8604492.5002.7006.508.22.12.2006.40
TTTGCCAACCCCGCCCATTCHE86307951.21.22.701.43.92.28.24.2032.24.81.4
TTTGCCAACCCCGCCCATTCCHE86044920.119.614.110.87.912.867.226.973.833.241.250.315.719.27.2
TTTGCCAACCCCGCCCATTCCAHE86308158.63.52.71.61.4229.713.110.4135.95.63.25.7
TTTGCCAACCCCGCCCATTCCAAHE860450115.47170.359.636.575.6148.6136.8239.5240.9223.6171.557.3134.683.3
TTTGCCAACCCCGCCCATTCCAAAHE86308301.21.20003.92.211.504.311.8000
1_29AGGTGGGCATACTGCCAACTGHE8604503.82.42.313.64.8101.3000003.41.61.4
ATTGGCATTCTGTCCACCTCCHE86308501.2001.600000001.100
TGGCATTCTGTCCACCTCCHE8604511.300001.400002.20000
TTGGCATTCTGTCCACCTHE8630876.36.1713.611.17.12.61.11.62.105.9000
TTGGCATTCTGTCCACCTCHE86045118.824.515.213.628.527.11.35.48.22.14.3001.68.6
TTGGCATTCTGTCCACCTCCHE86030789.1121.189.1127.4141.1225.580.153.9592726.120.720.235.325.8
TTGGCATTCTGTCCACCTCCTHE86308936.429.436.310.826.962.823.315.113.12.14.334.54.81.4
TTGGCATTCTGTCCACCTCCTCHE8604521.300000000000000
1_3AACATGATCATCCGAATGATHE8630910000000000001.100
AATGCTGTCTGGTTCGAGAHE8604521.32.41.201.62.901.13.32.12.2001.61.4
ACCAGGCTTCATTCCCCCHE8630931.300000000000000
ATCCGAATGATCTCGGACCAGGCTHE8604530000000000001.100
ATCTCGGACCAGGCTTCATTCCCCHE8630956.314.717.609.511.42.61.106.26.501.14.85.7
ATGCTGTCTGGTTCGAGAHE8604530002.700000000000
CGGACCAGGCTTCATTCCHE86309700000001.1002.202.200
CGGACCAGGCTTCATTCCCHE8604541.32.402.71.601.35.41.60031.100
CGGACCAGGCTTCATTCCCCHE863099282.2208289.6336.1187182.7235.1213.3203.4265.8251.8174.5449.3299.6328.7
CGGACCAGGCTTCATTCCCCCHE8604541.31.202.73.21.41.31.102.14.3001.60
CTCGGACCAGGCTTCATTCCHE86310102.41.22.71.61.41.301.60001.100
CTCGGACCAGGCTTCATTCCCHE86045566.583.265.6132.842.8129.999.79.810.417.417.716.946.543.1
CTCGGACCAGGCTTCATTCCCCHE863103151.8165.2175.8192.4130189.8165.3159.4146126.7147.613675.3187.5183.7
CTCGGACCAGGCTTCATTCCCCCHE860455000001.41.3002.100000
GAATGCTGTCTGGTTCGAGACHE8631053.81.205.401.40000001.11.60
GACCAGGCTTCATTCCCCHE8604561.300000001.6000000
GGAATGCTGTCTGGTTCGAHE8631070000001.300000000
GGAATGCTGTCTGGTTCGAGAHE86045612.517.112.924.44.8303.94.34.98.317.429.64.5010
GGAATGCTGTCTGGTTCGAGACHE86310956.18.22.74.817.101.13.34.24.30002.9
GGACCAGGCTTCATTCCCHE8604571.301.2000000000000
GGACCAGGCTTCATTCCCCHE863111146.7154.1143184.3136.3148.4155161.6134.5189191147.9171.9174.7193.8
TCGGACCAGGCTTCATTCHE86045758.964.865.643.428.531.442.654.939.424.928.229.647.254.540.2
TCGGACCAGGCTTCATTCCHE863113440.2408.6385.7417.4261.5412.4384.9339.3332.9180.7256.2283.9159.5280.4254.1
TCGGACCAGGCTTCATTCCCHE860458706.1675.3720.9441.8321.8449.5586.5627546.1388.4525.4387.4410512.8541.2
TCGGACCAGGCTTCATTCCCCHE860285282744.1279717.9296598.2273500243934.6239360.7306036.6275122.8248117.2282794.4317557.2199637.5217491.6348033.3350352.6
TCGGACCAGGCTTCATTCCCCCHE863115209.5212.9233.3384.9160.1276.9170.5137.9165.6272.1191168.6174.1282236.9
TCTCGGACCAGGCTTCATTCCHE860458110.4179.8155.9273.7190.2332.511.68.614.824.999.911.828.157.745.9
TCTCGGACCAGGCTTCATTCCCHE86311701.21.25.405.700002.2001.61.4
TCTCGGACCAGGCTTCATTCCCCHE8604592.5119.48.14.812.894.38.26.24.35.94.59.610
TCTCGGACCAGGCTTCATTCCCCCHE8631191.300000000000000
1_32ATTGACAGAAGAGAGTGAGCACHE8604591.3000001.300000000
GACAGAAGAGAGTGAGCACHE8631211.31.20000000000000
GCTCATGTCTCTTTCTGTCAGCHE86046052.472.74.84.301.102.1001.11.61.4
GCTCATGTCTCTTTCTGTCAGCTHE8631232.500001.4000000000
TGACAGAAGAGAGTGAGCAHE8604601.301.2000000000000
TGACAGAAGAGAGTGAGCACHE86031171.580.797.3181.63844.229.753.931.249.817.411.811.211.220.1
TGACAGAAGAGAGTGAGCACAHE8631252.53.7001.60001.62.100000
TGCTCATGTCTCTTTCTGTCAGCHE86046102.42.38.11.601.3002.10001.60
TTGACAGAAGAGAGTGAGCACHE8631278.814.715.221.73.27.12.601.610.400000
1_44AGGTGGTCAGCATGTCAAACTHE8604613.82.43.52.7005.29.700032.23.25.7
TGGCATTCTGTCCACCTCCHE8604511.300001.400002.20000
TTGGCATTCTGTCCACCTHE8630876.36.1713.611.17.12.61.11.62.105.9000
TTGGCATTCTGTCCACCTCHE86045118.824.515.213.628.527.11.35.48.22.14.3001.68.6
TTGGCATTCTGTCCACCTCCHE86030789.1121.189.1127.4141.1225.580.153.9592726.120.720.235.325.8
TTTGGCATTCTGTCCACCTCCHE8631291.301.20001.31.11.62.1001.100
1_5TACAATGAAATCACGGCCHE8604620000000002.100000
TATAAAGAGATGTACTGGACCHE8631313.82.42.32.7002.62.202.14.302.21.61.4
TTATACAATGAAATCACGGHE8604620000000000001.100
TTATACAATGAAATCACGGCHE8602888.823.2275.47.97.114.215.13.316.623.920.720.217.618.7
TTATACAATGAAATCACGGCCHE860287125.4174.9219.270.513081.3107.2131.4147.6280.4230.1378.5179.7126.6277.1
TTATACAATGAAATCACGGCCGHE860286129.2116.2148.929.871.337.163.375.462.3170.3141.1195.285.483.3208.2
10_1ACAGGGAACAGGTAGAGCAHE8631332.51.22.300001.100001.11.60
ACAGGGAACAGGTAGAGCATGHE8604632.53.78.221.702.91.32.2000004.80
ATGCACTGCCTCTTCCCTGGCHE8631352.53.74.72.71.65.71.3000002.281.4
TGCACTGCCTCTTCCCTGHE86046311.33.775.408.63.93.2000394.85.7
TGCACTGCCTCTTCCCTGGHE86313726.32236.316.31.68.67.810.83.32.1036.72411.5
TGCACTGCCTCTTCCCTGGCHE86046447.735.532.8146.414.365.611.612.98.22.18.78.915.71615.8
TGCACTGCCTCTTCCCTGGCTHE860428154.3137137.216853.9119.995.672.290.216.619.514.883.181.764.6
TGCACTGCCTCTTCCCTGGCTGHE8631392.56.13.510.81.65.71.32.21.602.202.21.64.3
2_31CCAAAGGGATCGCATTGATCTHE8604640002.71.60000000000
TCCAAAGGGATCGCATTGAHE86314101.21.20000002.100000
TCCAAAGGGATCGCATTGATHE86046501.20001.41.31.10000000
TCCAAAGGGATCGCATTGATCHE86033412.52218.846.144.444.237.543.116.422.823.941.41.13.27.2
TCCAAAGGGATCGCATTGATCTHE8631433.812.215.21920.611.411.619.48.28.34.331.102.9
TCCAAAGGGATCGCATTGATCTAHE8604650005.41.64.3000000000
TCGATGCGATCCCTTGGGAHE8631451.300000000000000
TCGATGCGATCCCTTGGGAAGHE8603371.34.93.567.830.155.71.34.31.612.56.532.21.65.7
TCGATGCGATCCCTTGGGAAGTHE8604660002.71.60000000000
TGATATTGGATCGATGCGATCHE8631470000001.300000000
3_16ATTGTAGGAATGGGCTGTTTGHE8604662.501.20001.300000000
CCCAAGCCCGCCCATTCCHE8631490000002.600000000
CCCAAGCCCGCCCATTCCAHE8604670000001.31.10000000
CTTCCCAAGCCCGCCCATTCCAHE8631510000001.300000000
GGAATGGGCTGTTTGGGAHE86046713.87.317.616.328.512.83.97.511.510.417.420.711.214.410
GGAATGGGCTGTTTGGGATHE86315351.24.701.62.91.32.206.24.302.21.65.7
GGAATGGGCTGTTTGGGATGHE86046870.256.392.616868.292.810.324.82322.817.420.769.656.161.7
GGAATGGGCTGTTTGGGATGAHE860347100.384.4150.1311.71111772249.650.847.834.753.2197.786.5104.8
GGAATGGGCTGTTTGGGATGAAHE8631550002.701.40000001.100
GGAATGGGCTGTTTGGGATGAAAGHE8604680002.7000000002.201.4
TAGGAATGGGCTGTTTGGGAHE8631572.51.23.501.60000000001.4
TTCCCAAGCCCGCCCATTHE8604690000000002.100000
TTCCCAAGCCCGCCCATTCHE8631591.300000000000000
TTCCCAAGCCCGCCCATTCCHE86046903.7010.807.16.54.34.96.28.73001.4
TTCCCAAGCCCGCCCATTCCAHE863161002.301.61.4001.62.100000
TTCCCAAGCCCGCCCATTCCAAHE860348112.9130.999.6216.8123.6182.7217225.2288.7278.3212.8115.347.238.563.2
TTGTAGGAATGGGCTGTTTGGGAHE8604700000000002.100000
TTTCTTTCATCCCAAACAGCCHE8631630000001.300000000
3_28ATGGTGTCATCCCTCCTGTGACCHE8604700000000002.100000
CCAAATTGAGAGAGAGAGAGAGAGHE8631651.300000000000000
CCATCTTCCTGTGACATGAACHE8604710002.701.40002.100000
CGCAGGAGAGATGGCACTGHE8631670000000002.12.23000
GGTGTCATCCCTCCTGTGACCHE86047100001.600002.12.20000
TCCATCTTCCTGTGACATGAHE8631690002.700001.602.23000
TCGCAGGAGAGATGGCACHE8604722.50000001.1004.33000
TCGCAGGAGAGATGGCACTGHE8631717.52.42.35.4005.2149.835.339.147.3005.7
TCGCAGGAGAGATGGCACTGTHE8604721.302.38.10101.35.43.38.315.217.71.101.4
TCGCAGGAGAGATGGCACTGTCHE86035615.19.815.251.51938.54074.344.3110.11651393.44.87.2
TCGCAGGAGAGATGGCACTGTCTHE8631730002.700001.6000000
TGGTGTCATCCCTCCTGTGACCHE8604730008.11.64.302.24.910.48.729.61.101.4
TTCCATCTTCCTGTGACATGAHE8631750002.73.21.42.61.11.62.14.323.7000
TTCGCAGGAGAGATGGCACHE8604730000000002.100000
TTCGCAGGAGAGATGGCACTGTCHE863177001.2001.42.61.11.62.12.23001.4
4_21CCCTGCAGTACCTTCCTTTACCCHE860474001.22.700000000000
GGAGCGACCTGGGATCACATGHE86317901.21.221.71.615.70000001.100
GTGTTCTCAGGTCGCCCCTGHE860474002.3001.401.13.30002.200
TGTGTTCTCAGGTCGCCCCHE8631813.82.43.58.14.84.30002.103014.47.2
TGTGTTCTCAGGTCGCCCCTHE86047501.202.70002.200033.41.68.6
TGTGTTCTCAGGTCGCCCCTGHE860366356.2210.4280.2311.7313.9216.93171.154.160.236.935.5540.3700.2723.5
5_14AATGTTGTCTGGCTCGAGHE86318301.23.5001.40000001.11.61.4
AATGTTGTCTGGCTCGAGGHE8604756.313.578.101.46.58.606.210.906.728.811.5
AATGTTGTCTGGCTCGAGGCCHE8631850002.7001.31.11.604.301.11.60
AATGTTGTCTGGCTCGAGGCCCHE8604760000001.31.10000000
AATGTTGTCTGGCTCGAGGCCCCTHE86318700000002.23.32.103000
ACCAGGCTTCATTCCCCCHE8630931.300000000000000
ACGTCGGACCAGGCTTCATTCHE8604760000000002.100000
ACGTCGGACCAGGCTTCATTCCCCHE8631891.301.20001.301.600001.61.4
ATGTTGTCTGGCTCGAGGHE8604771.32.43.500000002.201.11.62.9
ATTTGGTTCTACATTTAGTGACHE8631911.300000000000000
CGGACCAGGCTTCATTCCHE86309700000001.1002.202.200
CGGACCAGGCTTCATTCCCHE8604541.32.402.71.601.35.41.60031.100
CGGACCAGGCTTCATTCCCCHE863099282.2208289.6336.1187182.7235.1213.3203.4265.8251.8174.5449.3299.6328.7
CGGACCAGGCTTCATTCCCCCHE8604541.31.202.73.21.41.31.102.14.3001.60
CGTCGGACCAGGCTTCATTCCHE8604771.301.201.6002.20000000
CGTCGGACCAGGCTTCATTCCCHE863193000001.40000003.400
CGTCGGACCAGGCTTCATTCCCCHE8604783.82.41.20001.303.302.202.21.64.3
GAATGTTGTCTGGCTCGAHE8631951.31.200001.301.60001.100
GAATGTTGTCTGGCTCGAGGHE860478513.5710.81.62.91.32.2002.233.414.45.7
GAATGTTGTCTGGCTCGAGGCHE86319701.22.32.70002.20003001.4
GAATGTTGTCTGGCTCGAGGCCHE86047902.40001.40000001.11.60
GAATGTTGTCTGGCTCGAGGCCCCHE863199000000003.32.100000
GACCAGGCTTCATTCCCCHE8604561.300000001.6000000
GGAATGTTGTCTGGCTCGHE86047938.924.544.516.31.65.7926.91814.521.720.722.549.763.2
GGAATGTTGTCTGGCTCGAHE863201101124.65.44.81.411.616.214.822.88.732.55.6824.4
GGAATGTTGTCTGGCTCGAGHE8604806.33.710.616.34.806.58.63.38.34.311.84.56.414.4
GGAATGTTGTCTGGCTCGAGGHE863203165.6156.6174.773.226.949.974.9106.780.458.1117.2263.2175.2211.5353.1
GGAATGTTGTCTGGCTCGAGGCHE8604801.31.21.20002.61.11.62.10304.80
GGACCAGGCTTCATTCCCHE8604571.301.2000000000000
GGACCAGGCTTCATTCCCCHE863111146.7154.1143184.3136.3148.4155161.6134.5189191147.9171.9174.7193.8
GTCGGACCAGGCTTCATTCHE8632050000001.300000000
GTCGGACCAGGCTTCATTCCHE86048100000001.100001.100
GTCGGACCAGGCTTCATTCCCHE86320764117.486.786.749.1109.949.15642.629.128.2397.794.568.9
GTCGGACCAGGCTTCATTCCCCHE86048121.326.929.329.89.514.33129.12333.230.417.751.762.537.3
GTCGGACCAGGCTTCATTCCCCCHE86320902.47001.47.88.602.14.304.54.87.2
GTTGTCTGGCTCGAGGCCHE8604820000001.300000000
TAAATGTAGAACCAAATGATCTHE8632111.300000000000000
TCACTAAATGTAGAACCAAATGHE8604820000000000001.100
TCGGACCAGGCTTCATTCHE86045758.964.865.643.428.531.442.654.939.424.928.229.647.254.540.2
TCGGACCAGGCTTCATTCCHE863113440.2408.6385.7417.4261.5412.4384.9339.3332.9180.7256.2283.9159.5280.4254.1
TCGGACCAGGCTTCATTCCCHE860458706.1675.3720.9441.8321.8449.5586.5627546.1388.4525.4387.4410512.8541.2
TCGGACCAGGCTTCATTCCCCHE860285282744.1279717.9296598.2273500243934.6239360.7306036.6275122.8248117.2282794.4317557.2199637.5217491.6348033.3350352.6
TCGGACCAGGCTTCATTCCCCCHE863115209.5212.9233.3384.9160.1276.9170.5137.9165.6272.1191168.6174.1282236.9
TGTCTGGCTCGAGGCCCCTAHE8632130002.700000000000
5_3CCCGCCTTGCATCAACTGHE860483002.300001.106.203000
CCCGCCTTGCATCAACTGAAHE863215001.2002.91.31.13.36.28.75.901.62.9
CCCGCCTTGCATCAACTGAATHE86048335.14438.775.930.195.662263.9242.7255.4455.9257.320.2139.4208.2
CCGCCTTGCATCAACTGAATHE8632172.501.20002.6002.14.30000
CGCTTGGTGCAGGTCGGGAHE86048400000001.11.62.103000
CGCTTGGTGCAGGTCGGGAAHE86321900000001.13.36.22.231.101.4
CGCTTGGTGCAGGTCGGGAACHE8604840002.70000002.20000
GCTTGGTGCAGGTCGGGAAHE8632210000001.300000000
GGGTCCCGCCTTGCATCAACHE8604850000000004.22.20000
GGTCCCGCCTTGCATCAACTGAATHE863223002.30000002.12.20000
TCGCTTGGTGCAGGTCGGGAHE86048511.320.815.248.86.314.327.126.968.978.956.465.15.619.218.7
TCGCTTGGTGCAGGTCGGGAAHE860370259.6254.5184219.5136.3186.9193.8339.3501.9494.3579.6520.5104.5208.3249.8
TCGCTTGGTGCAGGTCGGGAACTHE86322500001.603.91.18.210.44.3301.64.3
TGGGTCCCGCCTTGCATCAACHE8604866.39.812.924.414.328.528.424.839.445.752.182.84.512.810
TGGGTCCCGCCTTGCATCAACTHE8632270002.700001.6000000
TGGTGCAGGTCGGGAACTGCTHE8604861.31.21.20000004.200000
TTGGTCGGTGGGTGCGAAATGGGTHE8632290000000002.100000
6_29AAGCTCAGGAGGGATAGCHE8604871.300000000000000
AAGCTCAGGAGGGATAGCGCHE8632310002.701.401.11.6000000
AAGCTCAGGAGGGATAGCGCCHE86038870.264.886.7178.9218.7118.4153.7149.7173.8211.8254224.7141.5158.6113.4
AGCTCAGGAGGGATAGCGCCHE86048701.205.41.600002.1001.100
CGCTATCCATCCTGAGTTTCHE86323301.21.28.1192.9001.6000000
CGCTATCCATCCTGAGTTTCAHE8604881.31.24.713.642.827.12.65.41.602.201.100
TATTGCGCTATCCATCCTGAGTTHE8632350000001.300000000
TCCATCCTGAGTTTCATGGCTHE8604881.301.2000000000000
TTGCGCTATCCATCCTGAGHE8632370000001.300000000
6_30AAGCTGCCAGCATGATCTGAGCHE8604890000000000001.14.80
AGATCATGTGGTAGCTTCATCHE86323950000002.21.60001.16.47.2
CTAGATCATGTGGTAGCTTCATCHE8604891.3000001.301.60001.11.60
GAAGCTGCCAGCATGATCTGHE8632410000000000001.102.9
GAAGCTGCCAGCATGATCTGAHE8604901.301.20000000001.102.9
GATCATGTGGTAGCTTCATCHE86324315.11114.100010.38.68.200037.151.341.6
GCTAGATCATGTGGTAGCTTCATCHE86049000001.600000001.104.3
TAGATCATGTGGTAGCTTCATCHE8632450000000000001.100
TGAAGCTGCCAGCATGATHE8604910000000000001.100
TGAAGCTGCCAGCATGATCHE86324776.548.962.12.74.82.925.845.2412.18.7021.325.631.6
TGAAGCTGCCAGCATGATCTHE860284170.6172.5161.81917.430153.715388.610.415.214.835.936.948.8
TGAAGCTGCCAGCATGATCTGHE8603991434.81105.91518.143.4174.499.9496627321.510.41311.8775.1796.4785.2
TGAAGCTGCCAGCATGATCTGAHE8604001056.1677.7907.359.657.134.3511.5667.9408.416.66.53302.2427.8447.9
TGAAGCTGCCAGCATGATCTGAGCHE8604912.5000001.32.21.60002.21.62.9
TGTTGAAGCTGCCAGCATGATCHE8632491.300000000000000
6_4AAGCTCAGGAGGGATAGCHE8604871.300000000000000
AAGCTCAGGAGGGATAGCGCHE8632310002.701.401.11.6000000
AAGCTCAGGAGGGATAGCGCCHE86038870.264.886.7178.9218.7118.4153.7149.7173.8211.8254224.7141.5158.6113.4
AGCTCAGGAGGGATAGCGCCHE86048701.205.41.600002.1001.100
CGCTATCTATCCTGAGTTTCAHE8604920000001.301.68.34.30000
6_7AATTACTACTTTTGAGTGGTTAHE8632511.300000000000000
ATCTTTCCCAATCCACCCAHE8604920000000000001.100
ATCTTTCCCAATCCACCCATGCCHE8632531012.23.52.71.61.411.616.211.510.46.5314.633.620.1
CATGGGTAAGTGGGGAAGAHE860493002.3001.40002.100000
CATGGGTAAGTGGGGAAGATGHE86325518.815.919.937.923.835.71.34.36.64.2035.6812.9
CATGGGTAAGTGGGGAAGATGAHE86049356.13.52.76.32.92.64.302.16.502.21.61.4
CTTTCCCAATCCACCCATGCHE8632570000000002.100000
CTTTCCCAATCCACCCATGCCHE86049401.21.22.7001.31.11.64.205.91.100
TCCCAATCCACCCATGCCHE863259000000001.62.100000
TCTTTCCCAATCCACCCAHE8604943.84.93.52.73.21.401.19.88.308.9011.27.2
TCTTTCCCAATCCACCCATHE8632611.30000001.100001.100
TCTTTCCCAATCCACCCATGHE8604951.3000002.6002.1001.100
TCTTTCCCAATCCACCCATGCHE8632632.51.23.501.601.32.21.66.22.201.14.81.4
TCTTTCCCAATCCACCCATGCCHE860389652.2652726.8379.4271.1298.3746.6554.8657.7830.7640.4505.7497.6645.7551.2
TCTTTCCCAATCCACCCATGCCTHE8604955000002.601.60001.101.4
TGGCATGGGTAAGTGGGGAAGAHE8632650000000002.1001.100
TTAGGTTTCCTCTTATTCATCCHE86049616.339.123.42.74.82.901.11.68.34.317.710.11610
TTCCCAATCCACCCATGCCTHE8632672.500001.41.31.10000000
TTCCCAATCCACCCATGCCTTHE86049601.21.201.6001.14.902.201.11.62.9
TTTCCCAATCCACCCATGCCTHE8632690002.7001.31.11.62.14.3001.60
TTTCCCAATCCACCCATGCCTTHE8604973.83.73.52.74.81.403.23.310.44.304.53.24.3
TTTCCCAATCCACCCATGCCTTAHE86327101.21.204.81.41.32.21.602.2304.81.4
TTTCCTCTTATTCATCCCTCTHE8604970000001.300000000
7_23AAGAAAGCTGTGGGAGAACATHE8632730002.700000000000
AAGAAAGCTGTGGGAGAACATGGCHE86049801.200001.3002.1001.100
CACAGCTTTCTTGAACTTHE8632751.33.71.22.73.21.4000000001.4
CCACAGCTTTCTTGAACTHE8604980002.701.4000000000
CTCAAGAAAGCTGTGGGAGAHE8632772.51.24.72.704.30002.12.201.14.84.3
GCTCAAGAAAGCTGTGGGAGAHE8604996.318.410.68.16.314.31.31.11.66.22.28.95.685.7
TATAAACAAGTCCTGGTCATGCTTHE8632790002.700000000000
TCCACAGCTTTCTTGAACTHE8604991.3002.70001.1002.20001.4
TCCACAGCTTTCTTGAACTTHE8632813.83.74.75.44.84.31.303.30033.404.3
TTCCACAGCTTTCTTGAAHE8605000002.70001.10000000
TTCCACAGCTTTCTTGAACHE8632831015.912.927.14637.12.64.34.92.105.94.54.85.7
TTCCACAGCTTTCTTGAACTHE86050076.5132.187.9146.4271.1289.716.820.511.58.34.323.740.435.351.7
TTCCACAGCTTTCTTGAACTTHE8604131138.81645.41659.91658.72919.82253.4198.9377.1216.5211.8149.8165.6949.2741.9808.2
TTCCACAGCTTTCTTGAACTTCHE8632852.502.35.404.301.100003.401.4
7_24ACACTGTGGCTCGTTGTGTTGTCAHE8605010000000000001.100
ACGTTATGTTGTCAAATTGTCHE8632870000001.31.10000000
ATGTTGTCAAATTGTCAATCHE86050100001.61.41.301.6005.9000
CAACGTGACAACACAACGAGCHE8632891.301.2002.9001.6000000
CAACGTGACAACACAACGAGCCHE8605021015.912.98.17.912.82.65.46.68.30001.62.9
CACGTTATGTTGTCAAATTGTCHE8632910002.701.4000000000
TATGTTGTCAAATTGTCAATHE8605020000001.300000000
TATGTTGTCAAATTGTCAATCHE86041416.39.824.624.412.72024.521.529.510.417.414.812.44.85.7
TGAACACAAAGATACATGCCCGHE8632930000001.300000000
TTGACAACGTGACAACACAACHE86050301.21.20005.22.21.604.33000
7_25GACAGAAGAGAGTGAGCACHE8631211.31.20000000000000
GCTCACTTCTCTCTCTGTCAGCHE8632950000000002.100000
TGACAGAAGAGAGTGAGCAHE8604601.301.2000000000000
TGACAGAAGAGAGTGAGCACHE86031171.580.797.3181.63844.229.753.931.249.817.411.811.211.220.1
TGACAGAAGAGAGTGAGCACAHE8631252.53.7001.60001.62.100000
8_16CCGACAAGCGTGCTCTCTCTCGTTHE8605031.300000000000000
GTGCTCTCTCTTGTTGTCATGHE8632971.33.74.713.64.801.303.32.16.502.26.45.7
TGACAACGAGAGAGAGCACHE8605042.500000000000000
TGACAACGAGAGAGAGCACGHE8632990002.701.4003.3000000
TGACAACGAGAGAGAGCACGCHE86042375.335.521.129.823.841.420.745.232.839.569.562.15.614.48.6
TTGACAACGAGAGAGAGCACHE8605042.51.21.2001.4001.602.20001.4
TTGACAACGAGAGAGAGCACGHE8633016.3008.14.82.93.96.53.32.110.901.11.60
TTGACAACGAGAGAGAGCACGCHE8605050000000000001.100
TTGTCGGCACCCATGAAAGGGCCAHE8633030002.700000000000
TTTGACAACGAGAGAGAGCACHE8605052.501.22.704.31.33.21.64.26.531.11.60
8_19AATGTCGTCTGGTTCGAGAHE863305001.22.73.21.400000001.60
AATGTCGTCTGGTTCGAGATCHE8605061.300001.4000000000
ATTTCGGACCAGGCTTCATTCHE8633073.83.73.52.71.64.31.300003000
ATTTCGGACCAGGCTTCATTCCCCHE86050601.202.70000002.20000
CGGACCAGGCTTCATTCCHE86309700000001.1002.202.200
CGGACCAGGCTTCATTCCCHE8604541.32.402.71.601.35.41.60031.100
CGGACCAGGCTTCATTCCCCHE863099282.2208289.6336.1187182.7235.1213.3203.4265.8251.8174.5449.3299.6328.7
CGGACCAGGCTTCATTCCCCTHE8633092.56.11.28.115.94.301.14.92.12.201.13.20
GAATGTCGTCTGGTTCGAGAHE8605071.37.32.32.708.6000000002.9
GACCAGGCTTCATTCCCCHE8604561.300000001.6000000
GACCAGGCTTCATTCCCCTCAHE8633110002.700000000000
GATTTCGGACCAGGCTTCATTCCCHE8605071.3002.700000000000
GGAATGTCGTCTGGTTCGAHE8633131.31.2001.62.91.31.11.62.1031.101.4
GGAATGTCGTCTGGTTCGAGAHE86050820.115.919.921.728.535.72.65.48.24.2033.44.85.7
GGAATGTCGTCTGGTTCGAGATHE8633150002.700000000000
GGACCAGGCTTCATTCCCHE8604571.301.2000000000000
GGACCAGGCTTCATTCCCCHE863111146.7154.1143184.3136.3148.4155161.6134.5189191147.9171.9174.7193.8
GGGAATGTCGTCTGGTTCGAGHE8605081.300000000000000
TCGGACCAGGCTTCATTCHE86045758.964.865.643.428.531.442.654.939.424.928.229.647.254.540.2
TCGGACCAGGCTTCATTCCHE863113440.2408.6385.7417.4261.5412.4384.9339.3332.9180.7256.2283.9159.5280.4254.1
TCGGACCAGGCTTCATTCCCHE860458706.1675.3720.9441.8321.8449.5586.5627546.1388.4525.4387.4410512.8541.2
TCGGACCAGGCTTCATTCCCCHE860285282744.1279717.9296598.2273500243934.6239360.7306036.6275122.8248117.2282794.4317557.2199637.5217491.6348033.3350352.6
TCGGACCAGGCTTCATTCCCCTHE86331737.641.639.927.112.721.459.447.44131.228.235.533.73240.2
TCGGACCAGGCTTCATTCCCCTCHE860509001.20000002.100000
TTCGGACCAGGCTTCATTCCHE8633191.302.304.81.401.10000000
TTCGGACCAGGCTTCATTCCCHE860509184.4223.9195.8273.7416.9299.749.162.547.624.917.423.735.949.743.1
TTCGGACCAGGCTTCATTCCCCHE863321153168.8158.3219.5280.6191.2117.5101.395.12732.620.735.957.744.5
TTCGGACCAGGCTTCATTCCCCTHE8605101.30001.60000000000
TTGAGGGGAATGTCGTCTGGHE8633231.300000000000000
TTTCGGACCAGGCTTCATTCCHE86051067.710485.6103187114.241.325.924.64.26.520.714.611.215.8
8_21CACGTGCTCCCCTTCTCCHE8633250000000002.100000
CACGTGCTCCCCTTCTCCAACHE8605112.51.22.304.81010.34.311.512.517.420.73.41.64.3
TGGAGAAGCAGGGCACGTGCAHE86042455.230.624.662.37.945.714.22821.320.817.429.66.784.3

Normalized read counts (raw counts divided by the total number of reads with a perfect match to the peach genome and scaled to 1,000,000 reads) for the 26 putative miRNA coding loci that, according to miRDeep-P, were expressed in all 15 samples.

BF, pink; F, bloom; GF, swollen flower bud; O, half-inch green; GL, swollen leaf bud.
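The normalization described in the table note can be sketched as follows. This is only a minimal illustration, assuming that the raw count of a read and the total number of reads perfectly matching the genome are both available; the function name and the example numbers are hypothetical and are not taken from the study.

```python
def reads_per_million(raw_count, total_perfect_match_reads):
    """Scale a raw read count to reads per 1,000,000 perfectly matching reads."""
    if total_perfect_match_reads <= 0:
        raise ValueError("total_perfect_match_reads must be positive")
    return raw_count / total_perfect_match_reads * 1_000_000


# Hypothetical example: 50 raw reads in a library with 2,000,000 matching reads.
normalized = reads_per_million(50, 2_000_000)
print(round(normalized, 1))
```

Dividing by the number of genome-matching reads rather than by the raw library size keeps the values comparable across libraries with different proportions of unmappable reads.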

IsomiRs identification and analysis

IsomiRs at each locus were searched against miRBase with BLAST. In some cases no mismatches were found with the conserved sequences present in miRBase (e.g., miR403, miR394, miR166, miR156), while in others mismatches were present and related to differences in the sequence and/or its length. Detailed BLAST results are reported in File S8 in Supplementary Material, which covers both mature sequences (sheet “mature”) and precursor sequences (sheet “precursors”) deposited in miRBase. The file reports the matching sequence with the lowest e-value. When several matching sequences belonging to different miRNA families had the same e-value, all of them were reported.
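The reporting rule described above (keep the hit with the lowest e-value, or all hits when several tie) can be sketched as follows. This is a minimal illustration and not the authors' pipeline; the record layout, the `best_hits` name, and the example hits are hypothetical.

```python
def best_hits(hits):
    """Return all hits that share the lowest e-value (ties are all kept)."""
    if not hits:
        return []
    lowest = min(hit["evalue"] for hit in hits)
    return [hit for hit in hits if hit["evalue"] == lowest]


# Hypothetical BLAST hits for one isomiR; family labels are placeholders.
example_hits = [
    {"family": "family_A", "evalue": 2e-4},
    {"family": "family_B", "evalue": 2e-4},
    {"family": "family_C", "evalue": 0.5},
]
for hit in best_hits(example_hits):
    print(hit["family"], hit["evalue"])
```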

Some miRNA families have more than one putative locus; miRDeep-P therefore assigned common reads to all the possible loci. Both miRNA and miRNA*-related reads were identified at each locus. In some cases putative miRNAs* were identified on the basis of the alignment orientation (± with respect to the mature miRNA sequence deposited in miRBase), while in others the miRNA* sequences were already deposited in miRBase. The results in Table 2 show that some loci are characterized by a larger set of variants than others.

In the majority of loci the most frequent read was the same in all the tested samples and across all the replicates of a sample (Table 2). Only in a few cases were differences detected among samples or among replicates of the same sample. Locus 3_16 is particularly interesting because in all the replicates of sample O the most frequent read is the one corresponding to miRNA* (Table 2).
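The check described above, whether the most frequent read at a locus is the same across samples and replicates, can be sketched as follows. The data layout, the function names, and the counts are hypothetical and serve only as an illustration.

```python
def most_frequent_read(read_counts):
    """Return the read with the highest count (first one wins on ties)."""
    return max(read_counts, key=read_counts.get)


def dominant_reads(locus_counts):
    """Map each library (sample replicate) to its most frequent read."""
    return {lib: most_frequent_read(counts) for lib, counts in locus_counts.items()}


def is_stable(locus_counts):
    """True when every library has the same most frequent read."""
    return len(set(dominant_reads(locus_counts).values())) == 1


# Hypothetical normalized counts for one locus in three libraries.
example = {
    "BF_rep1": {"read_1": 120.0, "read_2": 30.5},
    "BF_rep2": {"read_1": 98.2, "read_2": 41.0},
    "O_rep1": {"read_1": 12.3, "read_2": 55.9},
}
print(dominant_reads(example))
print(is_stable(example))
```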

In some loci the second most frequent read related to the mature miRNA was also the same in all the replicates of a sample and in all the samples. The second most frequent read was often produced by a different cutting site at the 5′ or 3′ end. As reported above, miRNA*-related reads were also identified by miRDeep-P for most of the 26 loci, and length variability was detected at both the 5′ and 3′ ends.
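How a length variant relates to another read at its 5′ and 3′ ends can be sketched as follows. This is a simplified illustration, not the method used in the study: it assumes the two reads overlap without mismatches and differ only by a small shift, and the function name, the `max_shift` and `min_overlap` parameters, and the choice of reference read are hypothetical. The three sequences are reads listed under locus 6_30 in Table 2.

```python
def end_offsets(reference, variant, max_shift=3, min_overlap=15):
    """Return (five_prime, three_prime) offsets of variant relative to reference.

    five_prime > 0: variant starts downstream (5' trimmed); < 0: 5' extended.
    three_prime > 0: variant ends downstream (3' extended); < 0: 3' trimmed.
    Returns None when no mismatch-free alignment is found within max_shift.
    """
    for shift in sorted(range(-max_shift, max_shift + 1), key=abs):
        # Variant position i is aligned with reference position i + shift.
        start = max(0, -shift)
        stop = min(len(variant), len(reference) - shift)
        if stop - start < min_overlap:
            continue
        if all(variant[i] == reference[i + shift] for i in range(start, stop)):
            return shift, shift + len(variant) - len(reference)
    return None


reference = "TGAAGCTGCCAGCATGATCTG"   # chosen as reference for illustration only
print(end_offsets(reference, "TGAAGCTGCCAGCATGATCTGA"))
print(end_offsets(reference, "GAAGCTGCCAGCATGATCTGA"))
print(end_offsets(reference, "TGAAGCTGCCAGCATGATCT"))
```

Shifts are tried in order of increasing absolute value, so for repetitive sequences the smallest shift that aligns without mismatches is reported.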

Target analysis was carried out by psRNATarget. The whole set of targets identified is reported in File S9 in Supplementary Material.

Intra- and inter-samples analysis

The average Pearson correlation between all the possible pairs of replicates belonging to the same biological sample was calculated in order to evaluate whether it agreed with the “Standards, guidelines, and best practices for RNA-seq” adopted by the ENCODE Consortium. Average correlation coefficients were 0.98 for BF, 0.95 for F, 0.98 for GF, 0.95 for GL, and 0.97 for O. For the sake of completeness, and to allow a comparison between related and unrelated samples, we also calculated the average Pearson correlation between samples of different tissues, which was 0.66 on the basis of the reads reported in Table 2. All the Pearson coefficients are reported in File S10 in Supplementary Material. Figure 1A reports the results obtained by clustering the five samples on the basis of all the read frequencies at the 26 loci analyzed (average frequencies of three replicates; miRNA*-related reads were included; reads assigned by miRDeep-P to more than one locus were counted once). Additionally, a clustering analysis was performed by considering only the count of the most frequent read in each locus. This analysis included those loci where the most frequent read was the same in all the samples (16 different reads, Figure 1B). Figure A1 in Appendix reports the clustering results obtained without averaging the three replicates of each sample. As can be seen, replicates are always grouped correctly.
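The replicate comparison described above, an average of the Pearson coefficients over all pairs of replicates of one sample, can be sketched as follows. This is a minimal illustration using only the standard library; the function names and the toy read-count vectors are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def mean_replicate_correlation(replicates):
    """Average Pearson coefficient over all pairs of replicates."""
    pairs = list(combinations(replicates, 2))
    return sum(pearson(a, b) for a, b in pairs) / len(pairs)


# Hypothetical normalized counts of the same reads in three replicates.
toy_replicates = [
    [10.0, 200.0, 35.0, 0.0],
    [12.0, 180.0, 40.0, 1.0],
    [9.0, 210.0, 30.0, 0.0],
]
print(round(mean_replicate_correlation(toy_replicates), 3))
```

With three replicates per sample there are three pairs to average, which matches the experimental design described in the text.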

Figure 1

A t-test was also performed for all the possible comparisons of biological samples (File S11 in Supplementary Material). The most frequent isomiR (highlighted in yellow in File S11 in Supplementary Material) is frequently the one able to distinguish the highest number of samples (e.g., locus 4_21, locus 6_4). Some miRNA-related reads are able to differentiate most of the analyzed samples: e.g., miR398 and miR167 gave 8 significant comparisons out of 10.
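The figure of 10 comparisons follows from pairing the five samples in all possible ways. The sketch below enumerates these pairs and computes a t statistic for one pair. The text does not state which variant of the t-test was used, so the Welch (unequal variance) form shown here is an assumption, and the replicate values are hypothetical.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, variance

SAMPLES = ["BF", "F", "GF", "GL", "O"]


def sample_pairs(samples):
    """All unordered pairs of samples to be compared."""
    return list(combinations(samples, 2))


def welch_t(a, b):
    """Welch t statistic for two groups of replicate values (assumed variant)."""
    standard_error = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / standard_error


print(len(sample_pairs(SAMPLES)))
# Hypothetical normalized counts of one read in three replicates of two samples.
print(round(welch_t([1.0, 2.0, 3.0], [4.0, 5.0, 6.0]), 3))
```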

Discussion

To assess the putative biological significance of isomiRs in peach, in the present study we carried out miRNA profiling by sequencing three replicates of five biological samples arising from a set of different organs and/or phenological stages. Variants of miRNAs are commonly found in deep sequencing experiments, but their functional meaning and stability are still under investigation in plants.

Twenty-six putative miRNA loci expressed in all the samples analyzed were identified by miRDeep-P and analyzed for miRNA population heterogeneity. The length of miRNA-associated reads ranged from 18 nt to 24 nt. Several previous works reported a miRNA length in plants ranging from 22 nt to 24 nt. The identification of miRNA*-associated reads provides further evidence for the reliability of the loci identified by miRDeep-P.

All the analyzed loci show miRNA length variants but tend to maintain the uridine at the 5′ end in those cases where uridine is the first base of the most abundant isomiR. As reported above, uridine is the nucleotide most frequently associated with AGO1, which may explain the tendency to maintain it at the 5′ end. Ebhardt et al. (2009) reported examples of miRNAs with 5′ deletions and 3′ uridine additions that result in a different distribution among AGO complexes. As an example, ath-miR822 was found to reside almost exclusively in the AGO1 complex, while its modified variant with a U deletion at the 5′ end and a UU addition at the 3′ end was found equally in AGO1 and AGO4 complexes.

The difference in read count between the most frequent read and the second most frequent read varies among loci, being minimal in some cases (e.g., locus 1_5) and substantial in others (e.g., loci 4_21, 6_4). In some loci the second most frequent read was the same in all the replicates of a sample and in all the different samples. The presence of the same isomiRs in different biological replicates of a sample and in different tissues demonstrates that the generation of most of the detected isomiRs is not random.

The importance of evaluating the correlation between biological replicates in RNA-seq experiments has been discussed previously in several papers (Oshlack et al., 2010; Hansen et al., 2011). As reported above, the correlation among biological replicates was calculated to check the reliability of the experiment on the basis of the “Standards, guidelines, and best practices for RNA-seq” adopted by the ENCODE Consortium, which require that the Pearson correlation of gene expression between two biological replicates, for RNAs that are detected in both samples using RPKM or read counts, should be between 0.92 and 0.98. In the present work, the average Pearson correlation between all the possible pairs of replicates belonging to the same biological sample was greater than or equal to 0.95 for all the tested samples, in agreement with the required standards. Clustering results and t-tests, reported in Figure 1 and File S11 in Supplementary Material, respectively, show that it is possible to clearly distinguish among samples and to group them in a functional way. However, when considering Figure A1 in Appendix, obtained without averaging the replicates of each sample, it should be noted that clustering results seem to be more reliable when only the most frequent read is taken into account: BF (pink) and F (bloom) are two subsequent phenological stages, so a closer relationship between them is expected.

The co-existence of different variants with a similar level of expression could imply a biological role for all of them. Locus 1_26 is such an example: in this case there are two prevalent isomiRs (HE860305 and HE860450) that differ by one T at the 5′ end. For both isomiRs there are also variants at the 3′ end with different lengths.

Target analysis carried out with psRNATarget (File S9 in Supplementary Material) revealed that in many cases isomiRs share the same target. However, because AGO invariably catalyzes the cleavage of targets opposite the bond between nucleotides 10 and 11 from the 5′ end of the miRNA, the cleavage products differ when there is a shift toward the 5′ end or a nucleotide addition at the 5′ end of the mature miRNA sequence. Differences in cleavage sites among members of the same miRNA family were recently studied in rice by Jeong et al. (2011), who highlighted a different abundance of specific cleavage sites among plant organs.
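The effect of a 5′ shift on the cleavage position can be made concrete with a small coordinate sketch. It assumes perfect antiparallel pairing, 1-based target coordinates, and a known target position paired with the first miRNA nucleotide; the function name and the coordinates are hypothetical.

```python
def cleavage_bond(target_pos_of_mirna_nt1, five_prime_offset=0):
    """Return the two target positions flanking the cleaved bond.

    miRNA nucleotide k pairs with target position p - (k - 1), where p is the
    target position paired with miRNA nucleotide 1. Cleavage occurs opposite
    the bond between miRNA nucleotides 10 and 11, i.e., between target
    positions p - 10 and p - 9.

    five_prime_offset > 0 means the isomiR starts that many nucleotides
    downstream of the reference miRNA 5' end; < 0 means a 5' extension.
    """
    p = target_pos_of_mirna_nt1 - five_prime_offset
    return (p - 10, p - 9)


# Hypothetical target site: reference miRNA nucleotide 1 pairs with position 100.
print(cleavage_bond(100))        # reference miRNA
print(cleavage_bond(100, 1))     # isomiR lacking the first 5' nucleotide
print(cleavage_bond(100, -1))    # isomiR with one extra 5' nucleotide
```

A variant that differs only at the 3′ end leaves `five_prime_offset` at zero, so under these assumptions its predicted cleavage position is unchanged.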

A very interesting finding relates to the biological role of miRNA*. Despite the general consensus that miRNAs* have no regulatory activity, several recent publications have provided evidence of their biological function (Mah et al., 2010). In our results, isomiRs were also found for miRNAs*. As an example, at locus 3_16 the conserved miRNA* has a high number of length variants, most of them due to a variable 3′ end. Locus 3_16 codes for miR482: the miRNA* sequence deposited in miRBase was actually the most frequent read (HE860347) in all three replicates of sample O (half-inch green), with an average miRNA/miRNA* ratio of 0.4. GF and GL showed an average miRNA/miRNA* ratio of 5.7, while in BF and F the ratio was close to one in two out of three replicates. Similar results were previously found in mammals by Kuchenbauer et al. (2012), who classified miRNA/miRNA* ratios into groups, showing that about 50% of all miRNA duplexes had high ratios (>100), consistent with a strong preferential processing of one dominant miRNA strand. About 24% had intermediate ratios (between 100 and 10), about 13% showed low ratios (between 10 and 1), while another 13% showed inverted ratios (<1). The finding that miRNAs can display tissue-dependent arm selection challenges the general consensus that only one strand is highly dominant for any given miRNA duplex and offers insights into the possible biological function of the selective accumulation of miRNA*. A recent review by Sunkar et al. (2012) discusses several studies showing that miRNAs* tend to accumulate at high levels under particular conditions. As an example, miR393* accumulates at a high level during infection of Arabidopsis leaves by P. syringae and promotes plant resistance to bacterial infection. miR399* accumulates at high levels during phosphate deprivation in Arabidopsis, and miR395* accumulates at high levels in Sorghum grown in optimal nutrient conditions.
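The ratio groups quoted from Kuchenbauer et al. (2012) can be expressed as a small classifier. This is an illustrative sketch: the text gives the ranges but not how values falling exactly on a boundary (10 or 1) are assigned, so the handling of those boundaries below is an assumption, as are the function name and the example counts.

```python
def classify_ratio(mirna_count, star_count):
    """Assign a miRNA/miRNA* ratio to one of the four groups quoted in the text."""
    if star_count == 0:
        return "undefined"      # no miRNA* reads: the ratio cannot be computed
    ratio = mirna_count / star_count
    if ratio > 100:
        return "high"
    if ratio >= 10:             # boundary handling is an assumption
        return "intermediate"
    if ratio >= 1:              # boundary handling is an assumption
        return "low"
    return "inverted"


# Hypothetical counts chosen to reproduce the average ratios quoted in the text.
print(classify_ratio(4, 10))    # ratio 0.4, as for sample O at locus 3_16
print(classify_ratio(57, 10))   # ratio 5.7, as for GF and GL
```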

psRNATarget was used to investigate possible target genes for miR482 and miR482* at locus 3_16. miR482 targets a peach sequence coding for a “probable receptor-like protein kinase” (expectation = 2, target accessibility = 17.288), while miR482* targets a NADH dehydrogenase gene (expectation = 3, target accessibility = 8.463). Examples of different targets for a miRNA/miRNA* pair are reported in previous studies (Sunkar et al., 2012): miR393 and miR393* target two entirely different gene families (TIR1 and SNARE), both involved in pathogen resistance of the host plant. The possibility that target-dependent strand selection, based on the presence in the cell of miRNA or miRNA* targets, might influence the selection of the active miRNA arm has been discussed by other authors. For instance, Chatterjee and Grosshans (2009) reported that mRNAs can stabilize their cognate miRNAs, thus suggesting a coordinated RISC assembly that depends on the levels of a miRNA and its target.

Results obtained in the present work contribute to a deeper view of the miRNome complexity and to a better exploration of the mechanism of action of these tiny regulators. The exact definition of the entire repertoire of peach miRNAs is in fact a prerequisite for a correct description of the miRNAs whose expression is altered in response to specific developmental conditions or environmental stimuli. Future experiments based on small RNA-seq coupled with RNA-seq on the same samples will be carried out to highlight more clearly the possible biological role of isomiRs in plants.

Supplementary Material

The Supplementary Material for this article can be found online at: http://www.frontiersin.org/Plant_Genetics_and_Genomics/10.3389/fpls.2012.00165/abstract

File S1

Reports the miRNA coding loci identified by miRDeep-P in pink sample.

File S2

Reports the miRNA coding loci identified by miRDeep-P in bloom sample.

File S3

Reports the miRNA coding loci identified by miRDeep-P in swollen flower bud sample.

File S4

Reports the miRNA coding loci identified by miRDeep-P in half-inch green sample.

File S5

Reports the miRNA coding loci identified by miRDeep-P in swollen leaf bud sample.

File S6

Reports a summary of the miRNA coding loci identified by miRDeep-P.

File S7

Reports the link between locus name and locus position.

File S8

Reports the results of the blast analysis against known plant miRNAs.

File S9

Reports target analysis for all the identified isomiRs.

File S10

Reports Pearson correlation coefficients between all the possible pairs of replicates belonging to the same biological sample, as well as samples from different tissues.

File S11

Reports the results of the t-test which was performed for all the possible comparisons of biological samples. The most frequent isomiR in each locus is highlighted in yellow.

Acknowledgments

We thank Keith Anthony Grimaldi for helping with the preparation of the manuscript. The present work has been supported by the Drupomics Project (Italian Ministry for Agriculture). We acknowledge the International Peach Genome Initiative for pre-publication access to the peach genome sequence.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • Chapman, P. J., and Catlin, G. A. (1976). Growth stages in fruit trees from dormant to fruit set. N. Y. Food Life Sci. Bull. 58.

  • Chatterjee, S., and Grosshans, H. (2009). Active turnover modulates mature miRNA activity in Caenorhabditis elegans. Nature 461, 546–549. doi: 10.1038/nature08349

  • Dai, X., and Zhao, P. X. (2011). PsRNATarget: a plant small RNA target analysis server. Nucleic Acids Res. 39, W155–W159. doi: 10.1093/nar/gkr319

  • Eamens, A. L., Smith, N. A., Curtin, S. J., Wang, M. B., and Waterhouse, P. M. (2009). The Arabidopsis thaliana double-stranded RNA binding protein DRB1 directs guide strand selection from microRNA duplexes. RNA 15, 2219–2235. doi: 10.1261/rna.1646909

  • Ebhardt, H. A., Tsang, H. H., Dai, D. C., Liu, Y., Bostan, B., and Fahlman, P. (2009). Meta-analysis of small RNA-sequencing errors reveals ubiquitous post-transcriptional RNA modifications. Nucleic Acids Res. 37, 2461–2470. doi: 10.1093/nar/gkp093

  • Guo, L., and Lu, Z. (2010). Global expression analysis of miRNA gene cluster and family based on isomiRs from deep sequencing data. Comput. Biol. Chem. 34, 165–171. doi: 10.1016/j.compbiolchem.2010.06.001

  • Hansen, K. D., Wu, Z., Irizarry, R. A., and Leek, J. T. (2011). Sequencing technology does not eliminate biological variability. Nature Biotechnol. 29, 572–573. doi: 10.1038/nbt.1910

  • Jeong, D.-H., Park, S., Zhai, J., Gurazada, S. G. R., de Paoli, E., Meyers, B. C., and Green, P. J. (2011). Massive analysis of rice small RNAs: mechanistic implications of regulated microRNAs and variants for differential target RNA cleavage. Plant Cell 23, 4185–4207. doi: 10.1105/tpc.111.089045

  • Jones-Rhoades, M. W., Bartel, D. P., and Bartel, B. (2006). MicroRNAs and their regulatory roles in plants. Annu. Rev. Plant Biol. 57, 19–53. doi: 10.1146/annurev.arplant.57.032905.105218

  • Kozomara, A., and Griffiths-Jones, S. (2011). miRBase: Integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 39, D152–D157. doi: 10.1093/nar/gkq1027

  • Kuchenbauer, F., Mah, S. M., Heuser, M., McPherson, A., Rüschmann, J., Rouhi, A., Berg, T., Bullinger, L., Argiropoulos, B., Morin, R. D., Lai, D., Starczynowski, D. T., Karsan, A., Eaves, C. J., Watahiki, A., Wang, Y., Aparicio, S. A., Ganser, A., Krauter, J., Döhner, H., Döhner, K., Marra, M. A., Camargo, F. D., Palmqvist, L., Buske, C., and Humphries, R. K. (2012). Comprehensive analysis of mammalian miRNA* species and their role in myeloid cells. Blood 118, 3350–3358. doi: 10.1182/blood-2010-10-312454

  • Lee, L. W., Zhang, S., Etheridge, A., Ma, L., Martin, D., and Galas, D. (2012). Complexity of the microRNA repertoire revealed by next-generation sequencing. RNA 16, 2170–2180. doi: 10.1261/rna.2225110

  • Mah, S. M., Buske, C., Humphries, R. K., and Kuchenbauer, F. (2010). miRNA*: a passenger stranded in RNA-induced silencing complex? Crit. Rev. Eukaryot. Gene Expr. 20, 141–148. doi: 10.1615/CritRevEukarGeneExpr.v20.i2.40

  • Mi, S., Cai, T., Hu, Y., Chen, Y., Hodges, E., Ni, F., Wu, L., Li, S., Zhou, H., Long, C., Chen, S., Hannon, G. J., and Qi, Y. (2008). Sorting of small RNAs into Arabidopsis argonaute complexes is directed by the 5′ terminal nucleotide. Cell 133, 116–127. doi: 10.1016/j.cell.2008.02.034

  • Moxon, S., Schwach, F., Dalmay, T., MacLean, D., Studholme, D. J., and Moulton, V. (2008). A toolkit for analyzing large-scale plant small RNA datasets. Bioinformatics 24, 2252–2253. doi: 10.1093/bioinformatics/btn428

  • Mückstein, U., Tafer, H., Hackermüller, J., Bernhart, S. H., Stadler, P. F., and Hofacker, I. L. (2006). Thermodynamics of RNA–RNA binding. Bioinformatics 22, 1177–1182. doi: 10.1093/bioinformatics/btl024

  • Oshlack, A., Robinson, M. D., and Young, M. D. (2010). From RNA-seq reads to differential expression results. Genome Biol. 11, 220. doi: 10.1186/gb-2010-11-12-220

  • Sunkar, R., Li, Y.-F., and Jagadeeswaran, G. (2012). Functions of microRNAs in plant stress responses. Trends Plant Sci. 17, 196–203. doi: 10.1016/j.tplants.2012.01.010

  • Takeda, A., Iwasaki, S., Watanabe, T., Utsumi, M., and Watanabe, Y. (2008). The mechanism selecting the guide strand from small RNA duplexes is different among argonaute proteins. Plant Cell Physiol. 49, 493–500. doi: 10.1093/pcp/pcn043

  • Vaucheret, H. (2009). AGO1 homeostasis involves differential production of 21-nt and 22-nt miR168 species by miR168a and miR168b. PLoS ONE 4, e6442. doi: 10.1371/journal.pone.0006442

  • Voinnet, O. (2009). Origin, biogenesis, and activity of plant microRNAs. Cell 136, 669–687. doi: 10.1016/j.cell.2009.01.046

  • Xie, Z., Khanna, K., and Ruan, S. (2010). Expression of microRNAs and its regulation in plants. Semin. Cell Dev. Biol. 21, 790–797. doi: 10.1016/j.semcdb.2010.03.012

  • Yang, X., and Li, L. (2011). miRDeep-P: A computational tool for analyzing the microRNA transcriptome in plants. Bioinformatics 27, 2614–2615. doi: 10.1093/bioinformatics/btr041

  • Zhang, Y. (2005). miRU: An automated plant miRNA target prediction server. Nucleic Acids Res. 33(Suppl. 2), W701–W704. doi: 10.1093/nar/gki479

Appendix

Figure A1

Summary

Keywords

microRNA, isomiRs, next generation sequencing

Citation

Colaiacovo M, Bernardo L, Centomani I, Crosatti C, Giusti L, Orrù L, Tacconi G, Lamontanara A, Cattivelli L and Faccioli P (2012) A Survey of MicroRNA Length Variants Contributing to miRNome Complexity in Peach (Prunus Persica L.). Front. Plant Sci. 3:165. doi: 10.3389/fpls.2012.00165

Received

03 April 2012

Accepted

04 July 2012

Published

26 July 2012

Volume

3 - 2012

Edited by

Takuji Sasaki, National Institute of Agrobiological Sciences, Japan

Reviewed by

Zhixi Tian, Chinese Academy of Sciences, China; Takeshi Itoh, National Institute of Agrobiological Sciences, Japan

Copyright

*Correspondence: Primetta Faccioli, CRA Genomics Research Centre, via S.Protaso 302, I-29017 Fiorenzuola d’Arda (Pc), Italy. e-mail:

Moreno Colaiacovo and Letizia Bernardo have contributed equally to this work.

This article was submitted to Frontiers in Plant Genetics and Genomics, a specialty of Frontiers in Plant Science.
