- 1Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
- 2Department of Biology, The Bioinformatics Centre, University of Copenhagen, Copenhagen, Denmark
- 3Department of Surgical Gastroenterology and Endocrinology, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
- 4Department of Clinical Physiology, Nuclear Medicine and PET, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
The tumorigenesis of small intestinal neuroendocrine tumors (SI-NETs) is poorly understood. Recent studies have associated alternative polyadenylation (APA) with proliferation, cell transformation, and cancer. Polyadenylation is the process in which the pre-messenger RNA is cleaved at a polyA site and a polyA tail is added. Genes with two or more polyA sites can undergo APA. This produces two or more distinct mRNA isoforms with different 3′ untranslated regions. Additionally, APA can also produce mRNAs containing different 3′-terminal coding regions. Therefore, APA alters both the repertoire and the expression level of proteins. Here, we used high-throughput sequencing data to map polyA sites and characterize polyadenylation genome-wide in three SI-NETs and a reference sample. In the tumors, 16 genes showed significant changes of APA pattern, which lead to either the 3′ truncation of mRNA coding regions or 3′ untranslated regions. Among these, 11 genes had been previously associated with cancer, with 4 genes being known tumor suppressors: DCC, PDZD2, MAGI1, and DACT2. We validated the APA in three out of three cases with quantitative real-time-PCR. Our findings suggest that changes of APA pattern in these 16 genes could be involved in the tumorigenesis of SI-NETs. Furthermore, they also point to APA as a new target for both diagnostic and treatment of SI-NETs. The identified genes with APA specific to the SI-NETs could be further tested as diagnostic markers and drug targets for disease prevention and treatment.
Introduction
Small intestinal neuroendocrine tumors (SI-NETs) are a homogeneous group of tumors originating primarily from the serotonin producing enterochromaffin neuroendocrine cells dispersed as solitary cells between the mucosal cells in the small intestine (1). Most SI-NETs tend to be slow growing with a low Ki-67 proliferation index (2). The primary tumor, regional node metastases, and related mesenteric fibrosis may give rise to local symptoms as bowel obstruction and/or vascular encasement. The presence of liver metastases may give rise to the carcinoid syndrome including diarrhea, facial flushing, and on sight cardiac and pulmonary disease, caused by release of serotonin and vasoactive peptides from the liver metastases (2).
Both the incidence and prevalence of SI-NETs have been increasing during the last three decades (3). Nowadays, SI-NETs are the second most prevalent gastrointestinal cancer after colonic cancer (4). Radical surgical intervention is the only curative treatment. However, SI-NETs are difficult to diagnose, as the symptoms often are unspecific. Therefore, many patients are first diagnosed once metastases have occurred and curative surgery is no longer an option (1).
Polyadenylation is a co-transcriptional process that consists of cleavage of the pre-messenger RNA (pre-mRNA) at a polyA site and addition of a polyA tail to the 5′ cleavage product. This process is necessary for normal mRNA 3′-end formation, transcription termination, and mRNA export from the nucleus (5, 6). Studies have shown that almost all eukaryotic pre-mRNAs and several non-coding transcripts have polyA sites and are polyadenylated. More than two-thirds of human genes have multiple polyA sites and can therefore undergo alternative polyadenylation (APA) (7). This produces two or more distinct mRNA isoforms with different 3′ untranslated regions (3′ UTRs). In some cases, APA also produces mRNAs containing different 3′-terminal coding regions. Therefore, APA alters both the repertoire and the expression level of proteins (Figure 1).
Figure 1. Types of APA. UTR-APA: APA utilizing polyA sites located in the 3′ UTR of the last exon is called UTR-APA and results in mRNAs with the same coding region, but with different 3′ UTR length. CR-APA: APA utilizing polyA sites located in introns, or in the coding region of exons, is called CR-APA and results in mRNAs with 3′ truncated coding regions. Adapted from Ref. (53)
Alterations in polyadenylation can cause various diseases (8, 9) and several studies have linked cellular proliferation, transformation, and cancer progression to APA. Proliferation correlates with enhanced proximal polyA site usage leading to expression of mRNAs with 3′ truncated coding regions (10) and truncated 3′ UTRs (5, 10). Cell transformation is associated with 3′ UTR truncation (11–14) and changed APA pattern (10, 15), where transformed cells express mRNAs with shorter 3′ UTRs compared to non-transformed cells with similar proliferation rate (12). Furthermore, 3′ UTR shortening is linked to poor cancer prognosis (16). The APA pattern also differs according to cancer subtype (11), suggesting that the APA pattern could be used as a biomarker for cancer classification.
Considering the previous evidence, we hypothesized that APA leading to 3′ truncation of mRNAs could participate in the tumorigenesis of SI-NETs. To investigate this hypothesis we used high-throughput sequencing data to map polyA sites and characterize APA in SI-NETs genome-wide. We compared polyA site usage in a SI-NET group of three samples, with polyA site usage in a normal neuroendocrine reference sample, pituitary (PIT), and discovered 16 genes with significant APA specific to the SI-NETs, which lead to either 3′ truncation of mRNA coding regions or 3′ UTRs.
Materials and Methods
Human Clinical Samples and RNA Isolation
Surgical specimens from SI-NETs were obtained from three patients undergoing surgery at the Department of Surgical Gastroenterology, Rigshospitalet (Table 1). The study was approved by the regional scientific ethical committee (journal number 01 313726). Signed, informed consent was obtained from the patients. Immediately after tumor resection, tumor tissue samples from the surgical specimens were placed in RNAlater® (Applied Biosystems, Carlsbad, CA, USA) for overnight incubation. Samples were subsequently stored at −80°C until RNA was extracted using TRIzol (Life Technologies, Carlsbad, CA, USA). The reference sample (PIT), human pituitary RNA, was purchased from BioChain (Hayward, CA, USA).
Sequencing
Total RNA from the three SI-NETs and the reference sample (PIT) were sequenced with single-molecule direct RNA-sequencing (DRS) at Helicos BioSciences (Boston, MA, USA) (17). The 3′ blocking reaction was performed with 3′ deoxyATP (Jena Biosciences, Germany) by incubating the reaction mixture at 37°C for 30 min.
Read Processing and Mapping
Raw DRS reads starting with one or more Ts were trimmed as these can be due to inefficient locking of the RNA with the polyT primer at the beginning of the sequencing (17). Trimmed reads were mapped to the genome (hg19) with HeliSphere software (http://sourceforge.net/projects/openhelisphere/) using a seed length of 18 nt and allowing only one mismatch in the seed region. Only uniquely mapped reads with a minimum length of 21 nt and a minimum score of 4 where further analyzed. We additionally discarded all reads that may arise from internal priming events. DRS reads are sequenced by synthesis, and thus, they are reverse complementary to the original RNA fragments (17). We checked the 10-nt immediately downstream of the read end in the complementary strand for adenines. In the case that the first six or more of these nucleotides were adenines the read was discarded (14). A summary of the data after the different steps can be seen in Table 2.
Identification of polyA Sites
It is known that the cleavage process is inefficient and can vary within a few nucleotides (18). To account for this variability, we cluster iteratively the 5′ end of mapped reads that were <24 nt apart (14, 18). For each of these clusters, the cleavage site is defined as the median position of the cleavage sites in the cluster (Figure 2). As previous works have shown that APA is very common and that even very infrequently used polyA sites are real (7), we kept all polyA sites that have at least one read in at least two samples, or that have two or more reads. Next, each of the clusters was annotated according to the location of the cleavage site using the genome annotation from Ensembl66 (19). PolyA sites were classified as 5′ UTR, CODING, or 3′ UTR if overlapping these regions of coding genes; ncGENE if overlapping a non-coding gene; PROMOTER if they were located in the 1000 bases upstream of a gene; DOWNSTREAM if they were located in the 1000 bases downstream of a gene; and INTERGENIC otherwise (Figure 3A). PolyA sites found in both introns and exons of the coding region are classified as CODING, as usage of these changes the coding region of the mRNA. As DOWNSTREAM polyA sites (≤1 kB) have many more reads than other intergenic polyA sites (Figure 3B), we associated the polyA sites in this proximal intergenic region (≤1 kB) to the upstream gene and included them in the analysis of APA.
Figure 3. (A) Distribution of identified polyA sites across the genome for each of the samples analyzed. (B) Average number of reads per polyA site in different genomic regions.
Identification of APA in SI-NETs
We restricted the analysis to protein coding genes. For each gene, we selected only those polyA sites that have at least 10% of all reads mapped to a gene in a given sample. By doing this, we excluded from the analysis those polyA sites that could have little or no impact on the expression levels or the regulation of the mRNA. For each gene we created a contingency table with the counts of reads for each polyA site, where in each column we had represented a polyA site identified in one or more samples and on each row we had each of the samples. To identify differential polyadenylation we performed fisher exact tests with R (R Development Core Team 2008) for each pair of samples. After multiple test correction, only those cases in which no statistical difference was found between the samples in the SI-NET group [false discovery rate (FDR) >0.1] and significant differences were found for all the samples in the SI-NET group compared to the reference sample (PIT) (FDR <0.1) were considered significant.
Manual Selection of APA Cases
Firstly, we selected the genuine cases of APA, by manually removing genes if: (1) there is no APA within the gene alone, but only in tandem transcripts containing the gene (11 genes removed). (2) APA is caused by an overlapping gene (one gene removed). Secondly, we selected the cases of APA leading to a reasonable expression in the SI-NETs, by manually removing genes if: (3) the implicated polyA sites have <10 reads in 2/3 of the SI-NETs (11 genes removed). Lastly, we selected the cases of APA, which lead to 3′ truncation of the mRNA, by manually removing genes if: (4) the most distal polyA site of the gene is primarily used in the SI-NET group (14 genes removed). (5) The most proximal of the polyA sites is primarily used in the reference sample (PIT) (eight genes removed). The manually removed genes are listed in Table S2 in Supplementary Material.
Quantitative Real-Time-PCR Validation
cDNA was synthesized from the RNA samples using High Capacity cDNA Reverse Transcription Kit (Applied Biosystems, Foster City, CA, USA) according to the manufactures instructions. Quantitative real-time-PCR (Q-RT-PCR) was performed with SYBR® GreenER™ qPCR SuperMix for ABI PRISM® (Life Technologies, Carlsbad, CA, USA) on the ABI PRISM® Instrument (Applied Biosystems, Foster City, CA, USA). Primer sequences were as follows:
• DCC, upstream of proximal polyA site:
DCC-INT5-FW: 5′-acccgtcaccttcttttcct-3′
DCC-INT5-RV: 5′-caccgtgccctattttgtct-3′
• DCC, upstream of distal polyA site:
DCC-INT13-FW: 5′-aaaaacgcagtacggcaact-3′
DCC-INT13-RV: 5′-tgggatgggttctttctcag-3′
• PDZD2, upstream of proximal polyA site:
PDZD2-INT3-FW: 5′-gtgagaccctgtccctacca-3′
PDZD2-INT3-RV: 5′-gtaaagcaaccatgccacct-3′
• PDZD2, upstream of distal polyA site:
PDZD2-EX26-FW: 5′-gtcggcctagagaggtcctt-3′
PDZD2-EX26-RV: 5′-catgcaccttgcactgactt-3′
• LRRFIP1, upstream of proximal polyA site:
LRRFIP1-INT39-FW: 5′-gtcctgcagaaccaaagagc-3′
LRRFIP1-INT39-RV: 5′-aagctttcctgacggtctga-3′
• LRRFIP1, upstream of distal polyA site:
LRRFIP1-EX76-FW: 5′-cgctggaagtgatgatcaaa-3′
LRRFIP1-EX76-RV: 5′-gcttggtaatggcttgtggt-3′
Results
Global Characterization of polyA Site Usage Reveals Similarity between SI-NETs
Our first objective was to map polyA sites in SI-NETs genome-wide and characterize APA. To perform this characterization, total RNA from three SI-NET samples (NE MTT, NE CT1, and NE 2TC) and from a reference sample (PIT) were sequenced using DRS (17). DRS is a well established method, which have been used in several other studies to map polyA sites genome-wide (14, 17, 20, 21). The SI-NETs originate from neuroendocrine cells of the small intestine, which are solitary interspersed cells (22). It is very difficult to identify and collect enough normal neuroendocrine cells as a control sample, and scrapings of the mucosa adjacent to the tumors are thus normally used as reference samples (23). However, we chose to use pituitary as reference sample as we expected that the neuroendocrine cells of the pituitary are phenotypically closer to the neuroendocrine cells of the small intestine than the mixture of different cell types from the mucosal scrapings. From the DRS we obtained a total of 4895411 (NE MTT), 10878794 (NE CT1), and 8859412 (NE 2TC) reads. The final filtered reads were mapped to 221147 (NE MTT), 398528 (NE CT1), and 379890 (NE 2TC) polyA sites. These sites mapped to various genomic regions in the samples (Figure 3). Fisher exact tests found 11031 polyA sites, located in 97% of the expressed protein coding genes, with no statistical difference in usage between the individual SI-NETs (FDR >0.1). This shows similarity between the three SI-NET samples. The sequencing data obtained by DRS have been deposited to the National Center for Biotechnology Information (NCBI)’s Gene Expression Omnibus (GEO), with accession number: GSE56657.
Alternative Usage of polyA Sites in the SI-NETs
The second aim of the project was to identify protein coding genes in SI-NETs with a differential APA pattern compared to the reference sample. Using fisher exact tests we identified 61 genes with statistically significant APA between the SI-NET group and the reference sample (Table S1 in Supplementary Material). From these 61 genes, 45 genes were removed manually, to select only the genuine cases of APA, which lead to either 3′ truncation of mRNA coding regions or 3′ UTRs (Table S2 in Supplementary Material). The 16 selected genes with APA specific to the SI-NETs are described below, including a description of the possible implications for protein isoform and expression (Table S3 and Figure S1 in Supplementary Material). Interestingly, 11 out of these 16 genes have previously been associated with cancer, with 4 of the genes being known tumor suppressors. These 16 genes with APA specific to the SI-NETs could be tested as diagnostic markers and drug targets for disease prevention and treatment.
Genes with UTR-APA Specific to the SI-NETs
Of the 16 genes with APA specific to the SI-NETs, 9 genes had significant UTR-APA leading to 3′ UTR shortening in a larger proportion of mRNAs than in the reference sample. These genes are: PSMD8, DACT2, TM9SF3, CD59, ANKH, CIAO1, SRSF5, MRSP16, and NDUFA6. The shortening of 3′ UTRs plays an important part in post-transcriptional regulation, as the 3′ UTR contains target sites for microRNAs (miRNAs), which control mRNA turnover and translation rate (24). Because of miRNA target site loss, mRNAs with shorter 3′ UTRs are more stable and produce more protein (5, 11, 12). We mapped the target sites of all human miRNAs to the 3′ UTRs of genes displaying UTR-APA (Figure S2 in Supplementary Material). The miRNA target site predictions were downloaded from microRNA.org (25). Only conserved miRNAs with a good score were mapped. We then compared the miRNA target sites in the 3′ UTRs of the nine genes to miRNA-profiles of five primary SI-NET samples from another study (26). We considered only the miRNAs that were detected in a minimum of four out of these five primary SI-NET samples. For all nine genes, except DACT2, the 3′ UTR shortening leads to loss of miRNA target sites for miRNAs detected in SI-NETs, suggesting an upregulated expression for these genes. See Table 3 for a detailed description of the genes with UTR-APA.
Genes with CR-APA Specific to the SI-NETs
Seven of the genes identified had significant CR-APA leading to 3′ truncated coding regions in a larger proportion of mRNAs than in the reference sample. These genes are: DCC, PDZD2, MAGI1, ZCWPW2, LRRFIP1, RIC3, and THADA. CR-APA does not lead to nonsense-mediated decay and the expressed mRNAs thus translate into C-terminally truncated protein isoforms that could be non-functional or have functions different from the full-length protein (33). The seven genes with CR-APA are described in detail in Table 4.
Q-RT-PCR Validation of APA
Three genes were selected for Q-RT-PCR validation: DCC, PDZD2, and LRRFIP1. cDNA was synthesized from total RNA from all four samples. Primers were designed as sets targeting the regions just upstream of a proximal and a distal polyA site with significant differential usage between the SI-NET group and PIT. The proximal primer sets targeted intronic sequences, corresponding to 3′ UTRs of alternative last exons, and the distal primer sets targeted either intronic sequences, corresponding to 3′ UTRs of alternative last exons, or the 3′ UTRs of the last exon. As the proximal primer sets targeted intronic sequences, the detection of a product from these sets could only come from usage of the adjacent polyA site, as the intron is otherwise spliced out. Proximal and distal isoforms were thus individually measured. The genes were used as their own controls, as there were two sets of primers for each gene; one upstream of a proximal polyA site and one upstream of a distal polyA site. The Ct values of the proximal and distal amplicon were thus compared. Q-RT-PCR was run on triplicates of each sample with all six primer sets. Median Ct values from the triplicates were used for further analysis with individual Ct values deviating more than two from the median considered as outliers and removed (2 of the 72 Ct values removed). In contrast to our results obtained from DRS (Table 5), we found a higher expression of the proximal isoform of DCC and LRRFIP1, as well as a lower expression of the proximal isoform of PDZD2 in all three SI-NETs as well as in the reference sample (Table 6). However, when comparing the results for the SI-NETs with our reference sample, we found a higher expression of the proximal LRRFIP1 isoform in all three SI-NETs compared to our reference sample. For DCC and PDZD2, we found a higher expression of the proximal isoform in two out of the three SI-NETs compared to our reference sample (Table 7). These Q-RT-PCR results validate the presence of the polyA sites and the APA of these genes. They also indicate that these three genes indeed undergo differential APA leading to the expression of a larger amount of mRNAs with 3′ truncation of the coding region in the SI-NETs than in the reference sample.
Table 7. Expression ratio of (short isoform/long isoform) normalized to reference sample, derived from Q-RT-PCR data.
Discussion
This is the first study to map polyA sites and characterize APA genome-wide in SI-NETs. To map the polyA site usage genome-wide we sequenced total RNA from three SI-NET samples (NE MTT, NE CT1, and NE 2TC) and a reference sample (PIT) using DRS (17). DRS is a well established method, which have been used in several other studies to map polyA sites genome-wide (14, 17, 20, 21). Selected polyA sites have been confirmed in all cases with PCR in three of those studies (14, 17, 21). We selected three genes for Q-RT-PCR validation: DCC, PDZD2, and LRRFIP1. For these three genes we validated the presence of all tested polyA sites with Q-RT-PCR and thus the APA of the selected genes. However, we did not find the same ratio of usage between the proximal and distal polyA sites with Q-RT-PCR as we found with DRS (Tables 5 and 6). When comparing our Q-RT-PCR results in the SI-NETs with the reference sample, they however indicate that the three genes in the SI-NETs indeed undergo differential APA leading to expression of a larger amount of mRNAs with 3′ truncation of the coding region than in the reference sample (Table 7). The quantitative differences found in expression ratios of the short and long isoforms derived from DRS and Q-RT-PCR could be due to biases and artifacts introduced during the synthesis of cDNA, contamination of cDNA samples with genomic DNA and problems with one or more of the primer sets.
When studying SI-NETs the optimal control sample would be normal small intestinal neuroendocrine cells. These solitary cells, which are interspersed among the other cells of the mucosa, are however difficult to isolate (23). In one study, human neuroendocrine cells have been isolated by pronase/collagenase digestion, gradient centrifugation, and FACS, but this method is complicated and requires at least 5–8 cm of normal ileum (41). Additionally, it is not certain if the RNA content is damaged during those processes. Because of these difficulties, scrapings from the mucosa adjacent to the tumors are normally used as reference samples. This mucosa is however composed of a mix of various cell types, primarily enterocytes and goblet cells. We believe that this mixture of cells from the mucosa, despite a common origin with the neuroendocrine cells (42), is phenotypically farther from intestinal neuroendocrine cells than neuroendocrine cells elsewhere in the body. We therefore chose to use pituitary as a reference sample as it mainly contains neuroendocrine cells.
Using Fisher exact tests, we found 11031 polyA sites, located in 97% of the expressed protein coding genes, with no statistical difference in usage between the individual SI-NETs (FDR >0.1). This shows similarity between the three SI-NET samples. Using the same Fisher exact tests to compare APA in the SI-NETs and in the reference sample, we identified 16 genes with significant APA specific to the SI-NETs, which lead to either 3′ truncation of mRNA coding regions or 3′ UTRs (Figure S1 in Supplementary Material). These 16 genes with APA specific to the SI-NETs could be tested as diagnostic markers and drug targets for disease prevention and treatment.
Nine of the 16 genes had significant UTR-APA in the SI-NETs leading to 3′ UTR shortening in a larger proportion of mRNAs than in the reference sample. Interestingly, four genes out of the nine with UTR-APA have been found to be either overexpressed in cancer: PSMD8 (27), CD59 (29), and SRSF5 (32) or is suggested to be overexpressed: ANKH (30). The UTR-APA in these genes produces mRNA isoforms that lack one or more target sites for miRNAs present in SI-NETs. These mRNAs consequently escape a part of the miRNA regulation and, therefore, should be more stable and produce more protein (5, 11, 12). Thus, the changes predicted in these nine genes suggest that UTR-APA could contribute to the SI-NET tumorigenesis. One of these genes, DACT2, was previously found as a tumor suppressor protein (28). The UTR-APA in this gene leads to 3′ UTR shortening in a larger proportion of mRNAs in the SI-NETs than in the reference sample. However, as no miRNA target sites are found in the shortened part of the 3′ UTR, it is unclear which role this 3′ UTR region has. Apart from miRNAs, RNA binding proteins (RBPs) also target the 3′ UTR of mRNAs and can affect the stability of mRNAs (43). Thus, it is possible that the shortening of the 3′ UTR affects the stability of DACT2 by removing putative binding sites of RBPs.
Seven of the 16 genes had significant CR-APA in the SI-NETs leading to 3′ truncated coding regions in a larger proportion of mRNAs than in the reference sample. CR-APA alters the coding region of mRNAs and thus the isoform of the produced protein. Remarkably, three out of the seven genes with CR-APA have tumor suppressive functions: DCC (34), PDZD2 (35), and MAGI1 (37). The CR-APA in these genes produces mRNA isoforms lacking several 3′-terminal exons, translating into highly C-terminally truncated proteins. These protein isoforms lack the parts responsible for the tumor suppressive functions. The CR-APA in DCC, PDZD2, and MAGI1 could thus likely be contributing to the tumorigenesis in SI-NETs. DCC is located in chromosome 18, where there have been reported frequent copy-number losses in SI-NETs (44, 45). The copy-number losses might truncate the DCC gene, forcing the usage of the proximal polyA site. Another gene with CR-APA, THADA, has been hypothesized to contribute to the tumorigenesis of thyroid adenomas in its truncated form (40). The CR-APA found in the SI-NETs produces an mRNA even more 3′-terminally truncated and could thus also be a candidate contributing to the tumorigenesis of SI-NETs. One gene stands out in the CR-APA group, LRRFIP1, as it has been shown to be overexpressed in cancer cell lines (38) and to be pro-metastatic (39). The CR-APA of this gene in the SI-NETs gives rise to an mRNA isoform, with a 3′-end corresponding to the 3′-end of the 4.2 kB mRNA overexpressed in cancer cell lines (38). CR-APA of LRRFIP1 could thus also be a contributor to the tumorigenesis in SI-NETs, as the major mRNA isoform expressed in the SI-NETs is likely equal to that overexpressed in the cancer cell lines.
The suggested participation of APA in the tumorigenesis of SI-NETs could lead to new medical treatments of SI-NETs. This is particularly interesting as polyA sites of interest can be very specifically and effectively blocked and patterns of APA can thus be changed. Different techniques utilizing antisense elements have already been used to effectively inhibit polyA site usage. They include antisense oligonucleotides (46), siRNAs (47), and U1 modifications (48–52). Such techniques could, e.g., be used to force the expression of the full-length mRNAs of the tumor suppressor genes DCC, PDZD2, and MAGI1.
As this study was performed on only three SI-NET samples the findings should be confirmed in a larger cohort. Preferably, future studies on APA in SI-NETs should also include alternative reference tissues and use different methods for characterizing poly(A) site usage. The findings could also be tested in other types of NETs. If the findings are confirmed, the effects of blocking the implicated proximal poly(A) sites with antisense elements should be tested experimentally.
In conclusion, we have mapped for the first time polyA sites genome-wide in SI-NETs. We found 11031 polyA sites, located in 97% of the expressed protein coding genes, with no statistical difference in usage between the individual SI-NETs. This shows similarity between the three SI-NET samples. In the SI-NETs, we identified 16 genes with significant APA leading to either 3′ truncation of coding regions or 3′ UTRs in a larger proportion of mRNAs than in the reference sample (Tables 3 and 4). The APA was confirmed in three genes using Q-RT-PCR: DCC, PDZD2, and LRRFIP1. However, we did not find the same expression ratios between the proximal and distal isoforms for these genes with Q-RT-PCR as with DRS (Tables 5 and 6).
As 11 of the 16 identified genes have previously been associated with cancer, it is likely that APA in these genes could take part in the SI-NET tumorigenesis. Future studies will have to confirm these findings in a larger cohort. The identified 16 genes with APA specific to the SI-NETs could be tested as diagnostic markers and as drug targets for disease prevention and treatment.
Author Contributions
Study concept and design (Anders Rehfeld, Lennart Friis-Hansen). Acquisition of data (Anders Rehfeld, Lennart Friis-Hansen). Analysis and interpretation of data (Anders Rehfeld, Mireya Plass, Anders Krogh, Lennart Friis-Hansen). Drafting of the manuscript (Anders Rehfeld, Mireya Plass, Lennart Friis-Hansen). Critical revision of the manuscript for important intellectual content (Anders Rehfeld, Mireya Plass, Kristina Døssing, Ulrich Knigge, Andreas Kjær, Anders Krogh, Lennart Friis-Hansen). Statistical analysis (Mireya Plass, Anders Krogh). Obtained funding (Anders Rehfeld, Anders Krogh, Lennart Friis-Hansen). Administrative, technical, or material support (Lennart Friis-Hansen). Study supervision (Lennart Friis-Hansen).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The work in laboratories of the authors was supported by the Danish MRC, Rigshospitalet. Funding: this work was supported by, The Danish Council for Independent Research – Medical Sciences, The Danish Council for Strategic Research, The Novo Nordisk Foundation, and Rigshospitalet.
Supplementary Material
The Supplementary Material for this article can be found online at http://www.frontiersin.org/Journal/10.3389/fendo.2014.00046/abstract
References
1. Öberg K, Knigge U, Kwekkeboom D, Perren A. Neuroendocrine gastro-entero-pancreatic tumors: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann Oncol (2012) 23(Suppl 7):vii124–30. doi: 10.1093/annonc/mds295
2. Janson ET, Sørbye H, Welin S, Federspiel B, Grønbaek H, Hellman P, et al. Nordic guidelines 2010 for diagnosis and treatment of gastroenteropancreatic neuroendocrine tumours. Acta Oncol (2010) 49:740–56. doi:10.3109/0284186X.2010.492791
3. Lawrence B, Gustafsson BI, Chan A, Svejda B, Kidd M, Modlin IM. The epidemiology of gastroenteropancreatic neuroendocrine tumors. Endocrinol Metab Clin North Am (2011) 40:1–18. doi:10.1016/j.ecl.2010.12.005
4. Schimmack S, Svejda B, Lawrence B, Kidd M, Modlin IM. The diversity and commonalities of gastroenteropancreatic neuroendocrine tumors. Langenbecks Arch Surg (2011) 396:273–98. doi:10.1007/s00423-011-0739-1
5. Sandberg R, Neilson JR, Sarma A, Sharp PA, Burge CB. Proliferating cells express mRNAs with shortened 3’ untranslated regions and fewer microRNA target sites. Science (2008) 320:1643–7. doi:10.1126/science.1155390
6. Ji Z, Tian B. Reprogramming of 3’ untranslated regions of mRNAs by alternative polyadenylation in generation of pluripotent stem cells from different cell types. PLoS One (2009) 4:e8419. doi:10.1371/journal.pone.0008419
7. Derti A, Garrett-Engele P, Macisaac KD, Stevens RC, Sriram S, Chen R, et al. A quantitative atlas of polyadenylation in five mammals. Genome Res (2012) 22:1173–83. doi:10.1101/gr.132563.111
8. Danckwardt S, Hentze MW, Kulozik AE. 3’ End mRNA processing: molecular mechanisms and implications for health and disease. EMBO J (2008) 27:482–98. doi:10.1038/sj.emboj.7601932
9. Rehfeld A, Plass M, Krogh A, Friis-Hansen L. Alterations in polyadenylation and its implications for endocrine disease. Front Endocrinol (Lausanne) (2013) 4:53. doi:10.3389/fendo.2013.00053
10. Elkon R, Drost J, van Haaften G, Jenal M, Schrier M, Vrielink JAO, et al. E2F mediates enhanced alternative polyadenylation in proliferation. Genome Biol (2012) 13:R59. doi:10.1186/gb-2012-13-7-r59
11. Singh P, Alley TL, Wright SM, Kamdar S, Schott W, Wilpan RY, et al. Global changes in processing of mRNA 3’ untranslated regions characterize clinically distinct cancer subtypes. Cancer Res (2009) 69:9422–30. doi:10.1158/0008-5472.CAN-09-2236
12. Mayr C, Bartel DP. Widespread shortening of 3’UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells. Cell (2009) 138:673–84. doi:10.1016/j.cell.2009.06.016
13. Morris AR, Bos A, Diosdado B, Rooijers K, Elkon R, Bolijn AS, et al. Alternative cleavage and polyadenylation during colorectal cancer development. Clin Cancer Res (2012) 18:5256–66. doi:10.1158/1078-0432.CCR-12-0543
14. Lin Y, Li Z, Ozsolak F, Kim SW, Arango-Argoty G, Liu TT, et al. An in-depth map of polyadenylation sites in cancer. Nucleic Acids Res (2012) 40:8460–71. doi:10.1093/nar/gks637
15. Fu Y, Sun Y, Li Y, Li J, Rao X, Chen C, et al. Differential genome-wide profiling of tandem 3’ UTRs among human breast cancer and normal cells by high-throughput sequencing. Genome Res (2011) 21:741–7. doi:10.1101/gr.115295.110
16. Lembo A, Di Cunto F, Provero P. Shortening of 3’UTRs correlates with poor prognosis in breast and lung cancer. PLoS One (2012) 7:e31129. doi:10.1371/journal.pone.0031129
17. Ozsolak F, Platt AR, Jones DR, Reifenberger JG, Sass LE, McInerney P, et al. Direct RNA sequencing. Nature (2009) 461:814–8. doi:10.1038/nature08390
18. Tian B, Hu J, Zhang H, Lutz CS. A large-scale analysis of mRNA polyadenylation of human and mouse genes. Nucleic Acids Res (2005) 33:201–12. doi:10.1093/nar/gki158
19. Flicek P, Ahmed I, Amode MR, Barrell D, Beal K, Brent S, et al. Ensembl 2013. Nucleic Acids Res (2013) 41:D48–55. doi:10.1093/nar/gks1236
20. Ozsolak F, Kapranov P, Foissac S, Kim SW, Fishilevich E, Monaghan AP, et al. Comprehensive polyadenylation site maps in yeast and human reveal pervasive alternative polyadenylation. Cell (2010) 143:1018–29. doi:10.1016/j.cell.2010.11.020
21. Sherstnev A, Duc C, Cole C, Zacharaki V, Hornyik C, Ozsolak F, et al. Direct sequencing of Arabidopsis thaliana RNA reveals patterns of cleavage and polyadenylation. Nat Struct Mol Biol (2012) 19:845–52. doi:10.1038/nsmb.2345
22. Ahlman H, Nilsson O. The gut as the largest endocrine organ in the body. Ann Oncol (2001) 12(Suppl 2):S63–8. doi:10.1093/annonc/12.suppl_2.S63
23. Leja J, Essaghir A, Essand M, Wester K, Oberg K, Tötterman TH, et al. Novel markers for enterochromaffin cells and gastrointestinal neuroendocrine carcinomas. Mod Pathol (2009) 22:261–72. doi:10.1038/modpathol.2008.174
24. Friedman RC, Farh KK-H, Burge CB, Bartel DP. Most mammalian mRNAs are conserved targets of microRNAs. Genome Res (2009) 19:92–105. doi:10.1101/gr.082701.108
25. Betel D, Wilson M, Gabow A, Marks DS, Sander C. The microRNA.org resource: targets and expression. Nucleic Acids Res (2008) 36:D149–53. doi:10.1093/nar/gkm995
26. Li S-C, Essaghir A, Martijn C, Lloyd RV, Demoulin J-B, Oberg K, et al. Global microRNA profiling of well-differentiated small intestinal neuroendocrine tumors. Mod Pathol (2013) 26(5):685–96. doi:10.1038/modpathol.2012.216
27. Deng S, Zhou H, Xiong R, Lu Y, Yan D, Xing T, et al. Over-expression of genes and proteins of ubiquitin specific peptidases (USPs) and proteasome subunits (PSs) in breast cancer tissue observed by the methods of RFDD-PCR and proteomics. Breast Cancer Res Treat (2007) 104:21–30. doi:10.1007/s10549-006-9393-7
28. Hou J, Liao L-D, Xie Y-M, Zeng F-M, Ji X, Chen B, et al. DACT2 is a candidate tumor suppressor and prognostic marker in esophageal squamous cell carcinoma. Cancer Prev Res (Phila) (2013) 6:791–800. doi:10.1158/1940-6207.CAPR-12-0352
29. Li B, Lin H, Fan J, Lan J, Zhong Y, Yang Y, et al. CD59 is overexpressed in human lung cancer and regulates apoptosis of human lung cancer cells. Int J Oncol (2013) 43:850–8. doi:10.3892/ijo.2013.2007
30. Kloth JN, Oosting J, van Wezel T, Szuhai K, Knijnenburg J, Gorter A, et al. Combined array-comparative genomic hybridization and single-nucleotide polymorphism-loss of heterozygosity analysis reveals complex genetic alterations in cervical cancer. BMC Genomics (2007) 8:53. doi:10.1186/1471-2164-8-53
31. Johnstone RW, Wang J, Tommerup N, Vissing H, Roberts T, Shi Y. Ciao 1 is a novel WD40 protein that interacts with the tumor suppressor protein WT1. J Biol Chem (1998) 273:10880–7. doi:10.1074/jbc.273.18.10880
32. Gautrey HL, Tyson-Capper AJ. Regulation of Mcl-1 by SRSF1 and SRSF5 in cancer cells. PLoS One (2012) 7:e51497. doi:10.1371/journal.pone.0051497
33. Lejeune F, Maquat LE. Mechanistic links between nonsense-mediated mRNA decay and pre-mRNA splicing in mammalian cells. Curr Opin Cell Biol (2005) 17:309–15. doi:10.1016/j.ceb.2005.03.002
34. Duman-Scheel M. Deleted in colorectal cancer (DCC) pathfinding: axon guidance gene finally turned tumor suppressor. Curr Drug Targets (2012) 13:1445–53. doi:10.2174/138945012803530215
35. Tam CW, Liu VWS, Leung WY, Yao K-M, Shiu SYW. The autocrine human secreted PDZ domain-containing protein 2 (sPDZD2) induces senescence or quiescence of prostate, breast and liver cancer cells via transcriptional activation of p53. Cancer Lett (2008) 271:64–80. doi:10.1016/j.canlet.2008.05.047
36. Yeung M-L, Tam TSM, Tsang ACC, Yao K-M. Proteolytic cleavage of PDZD2 generates a secreted peptide containing two PDZ domains. EMBO Rep (2003) 4:412–8. doi:10.1038/sj.embor.embor804
37. Zaric J, Joseph J-M, Tercier S, Sengstag T, Ponsonnet L, Delorenzi M, et al. Identification of MAGI1 as a tumor-suppressor protein induced by cyclooxygenase-2 inhibitors in colorectal cancer cells. Oncogene (2012) 31:48–59. doi:10.1038/onc.2011.218
38. Rikiyama T, Curtis J, Oikawa M, Zimonjic DB, Popescu N, Murphy BA, et al. GCF2: expression and molecular analysis of repression. Biochim Biophys Acta (2003) 1629:15–25. doi:10.1016/S0167-4781(03)00156-8
39. Ariake K, Ohtsuka H, Motoi F, Douchi D, Oikawa M, Rikiyama T, et al. GCF2/LRRFIP1 promotes colorectal cancer metastasis and liver invasion through integrin-dependent RhoA activation. Cancer Lett (2012) 325:99–107. doi:10.1016/j.canlet.2012.06.012
40. Rippe V, Drieschner N, Meiboom M, Murua Escobar H, Bonk U, Belge G, et al. Identification of a gene rearranged by 2p21 aberrations in thyroid adenomas. Oncogene (2003) 22:6111–4. doi:10.1038/sj.onc.1206867
41. Modlin IM, Kidd M, Pfragner R, Eick GN, Champaneria MC. The functional characterization of normal and neoplastic human enterochromaffin cells. J Clin Endocrinol Metab (2006) 91:2340–8. doi:10.1210/jc.2006-0110
42. Modlin IM, Oberg K, Chung DC, Jensen RT, de Herder WW, Thakker RV, et al. Gastroenteropancreatic neuroendocrine tumours. Lancet Oncol (2008) 9:61–72. doi:10.1016/S1470-2045(07)70410-2
43. Keene JD. RNA regulons: coordination of post-transcriptional events. Nat Rev Genet (2007) 8:533–43. doi:10.1038/nrg2111
44. Hashemi J, Fotouhi O, Sulaiman L, Kjellman M, Höög A, Zedenius J, et al. Copy number alterations in small intestinal neuroendocrine tumors determined by array comparative genomic hybridization. BMC Cancer (2013) 13:505. doi:10.1186/1471-2407-13-505
45. Andersson E, Swärd C, Stenman G, Ahlman H, Nilsson O. High-resolution genomic profiling reveals gain of chromosome 14 as a predictor of poor outcome in ileal carcinoids. Endocr Relat Cancer (2009) 16:953–66. doi:10.1677/ERC-09-0052
46. Vorlová S, Rocco G, Lefave CV, Jodelka FM, Hess K, Hastings ML, et al. Induction of antagonistic soluble decoy receptor tyrosine kinases by intronic polyA activation. Mol Cell (2011) 43:927–39. doi:10.1016/j.molcel.2011.08.009
47. Vickers TA, Crooke ST. siRNAs targeted to certain polyadenylation sites promote specific, RISC-independent degradation of messenger RNAs. Nucleic Acids Res (2012) 40:6223–34. doi:10.1093/nar/gks239
48. Fortes P, Cuevas Y, Guan F, Liu P, Pentlicky S, Jung SP, et al. Inhibiting expression of specific genes in mammalian cells with 5’ end-mutated U1 small nuclear RNAs targeted to terminal exons of pre-mRNA. Proc Natl Acad Sci U S A (2003) 100:8264–9. doi:10.1073/pnas.1332669100
49. Beckley SA, Liu P, Stover ML, Gunderson SI, Lichtler AC, Rowe DW. Reduction of target gene expression by a modified U1 snRNA. Mol Cell Biol (2001) 21:2815–25. doi:10.1128/MCB.21.8.2815-2825.2001
50. Blazquez L, Gonzalez-Rojas SJ, Abad A, Razquin N, Abad X, Fortes P. Increased in vivo inhibition of gene expression by combining RNA interference and U1 inhibition. Nucleic Acids Res (2012) 40:e8. doi:10.1093/nar/gkr956
51. Goraczniak R, Behlke MA, Gunderson SI. Gene silencing by synthetic U1 adaptors. Nat Biotechnol (2009) 27:257–63. doi:10.1038/nbt.1525
52. Vickers TA, Sabripour M, Crooke ST. U1 adaptors result in reduction of multiple pre-mRNA species principally by sequestering U1snRNP. Nucleic Acids Res (2011) 39:e71. doi:10.1093/nar/gkr150
Keywords: carcinoids, neuroendocrine tumors, gene transcription, molecular biology, polyadenylation
Citation: Rehfeld A, Plass M, Døssing K, Knigge U, Kjær A, Krogh A and Friis-Hansen L (2014) Alternative polyadenylation of tumor suppressor genes in small intestinal neuroendocrine tumors. Front. Endocrinol. 5:46. doi: 10.3389/fendo.2014.00046
Received: 09 February 2014; Accepted: 22 March 2014;
Published online: 15 April 2014.
Edited by:
Stephen Plymate, University of Washington, USAReviewed by:
Marialuisa Appetecchia, Regina Elena National Cancer Institute IRCCS, ItalyCarl Friedrich Classen, University Children’s Hospital, Germany
Copyright: © 2014 Rehfeld, Plass, Døssing, Knigge, Kjær, Krogh and Friis-Hansen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lennart Friis-Hansen, Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, Copenhagen DK 2100, Denmark e-mail: lfh@dadlnet.dk
†Anders Rehfeld and Mireya Plass have contributed equally to this work.