Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 13 October 2022
Sec. Genomic Assay Technology
This article is part of the Research Topic Multi-Omics Analysis During the Animal Development View all 4 articles

Functional evaluation of LTR-derived lncRNAs in porcine oocytes and zygotes with RNA-seq and small RNA-seq

Xu Yang&#x;Xu Yang1Jingzhang Ji&#x;Jingzhang Ji1Hongdi Cui&#x;Hongdi Cui1Qi Zhao&#x;Qi Zhao2Chunming Ding
Chunming Ding1*Chang Xu
Chang Xu3*
  • 1School of Laboratory Medicine and Life Sciences, Wenzhou Medical University, Wenzhou, Zhejiang, China
  • 2Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou Medical University, Wenzhou, Zhejiang, China
  • 3Department of Colorectal Surgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang, China

Long noncoding RNAs (lncRNAs) are increasingly being recognized as modulators of early embryonic development in mammals. However, they are seldom investigated in pigs. Here, to annotate full-length RNA transcripts, we performed annotation using a newly developed computational pipeline—an RNA-seq and small RNA-seq combined strategy—using our previously obtained RNA-seq and small RNA-seq data from porcine oocytes and zygotes. As evidenced by the length comparison, the frequency of the core promoter, and the polyadenylation signal motifs, the transcripts appear to be full-length. Furthermore, our strategy allowed the identification of a large number of endogenous retrovirus-associated lncRNAs (ERV-lncRNAs) and found that some of them were highly expressed in porcine zygotes, as compared to oocytes. Through the knockdown strategy, two ERV-lncRNAs (TCONS_00035465 and TCONS_00031520) were identified as playing potential roles in the early embryo development of pigs, laying a foundation for future research.

Introduction

Long noncoding RNA (lncRNA) is a genomic transcription product of more than 200 nt with no or very little protein-coding ability (Jarroux et al., 2017; Ransohoff et al., 2018). lncRNAs can be divided into intragenic lncRNAs and intergenic lncRNAs, according to their derivation (Bouckenheimer et al., 2016). In mice and humans, more than two-thirds of lncRNAs are endogenous retrovirus- (ERV-) associated (Wang et al., 2016; Ransohoff et al., 2018). In addition, the LTR regions of ERVs enriched with pluripotency-related lncRNAs, such as the human pluripotency associated transcripts (HPATs), are involved in the formation of blastocyst ICMs and the maintenance of pluripotency (Durruthy-Durruthy et al., 2016). These findings point to likely roles for ERV-associated lncRNAs in the maintenance of pluripotency and lineage commitment. However, the functions of ERV-associated lncRNAs are difficult to evaluate due to their very low expression levels.

The development of early mammalian embryos, from zygote to blastocyst, is a strictly regulated process (Jukam et al., 2017). lncRNAs have been proven to participate in a wide variety of developmental stages. For example, we have reported that an ERV-associated lncRNA, lincGET, is identified as one of the earliest known lineage regulators to bias cell fate in the two-cell embryo, by promoting the nuclear localization of CARM1, and is essential for embryo development (Wang et al., 2016). However, almost all studies have been conducted in mouse embryos (Hamazaki et al., 2015; Wang et al., 2016; Wang et al., 2018; Geng et al., 2022). Given the poor conservation of ERV-associated lncRNAs among different species, it is essential to demonstrate their functional roles in the early embryos of other mammals. Pig, as an important livestock animal that shares conserved principles of early development with humans, is considered a human disease model and a potential organ donor for xenotransplantation in regenerative medicine (Platt, 1998; Lu et al., 2019; Zhou et al., 2022). Thus, the investigation of transcription element- (TEs-) associated lncRNAs in porcine early embryos can provide clues for understanding the early embryonic development of humans.

With the development of RNA sequencing (RNA-seq) technology, de novo transcripts annotation is becoming easier and easier (Reuter et al., 2015; Van Borm et al., 2015). However, due to the inevitable degradation of RNA during library building, annotation is unable to reach the 5′ ends of genes, especially the low-expressed ones; this constrains the functional annotation of lncRNAs. Our teams developed a computational pipeline, RSCS, to assemble RNA-seq and small RNA-seq data. This study raises the exciting possibility that the combination of RNA-seq and sRNA-seq could allow the determination of the 5′ and 3′ ends of low-expressed transcripts from small samples, such as from early embryos.

In this study, we combined our previous RNA-seq and sRNA-seq data on mature porcine oocytes and zygotes, respectively, to identify lncRNAs. We identified thousands of novel lncRNAs, and most of them seem to be exhibited full-length, with TSS and terminators. We also characterized a large number of LTR-derived lncRNAs (LTR-lncRNAs) with 5′ ends located in specific LTR retrotransposon families and identified two candidates that are essential for the early development of pigs.

Materials and methods

Porcine embryo culture and collection

All experiments were performed according to the guidelines of the State Key Laboratory Animal Care and Use Committee. The procedure for porcine IVF has been described previously (Kong et al., 2020). Briefly, freshly ejaculated sperm-rich fractions were collected from fertile boars. Following a short incubation at 39°C, semen was resuspended and washed three times in DPBS supplemented with 0.1% (w/v) BSA, then centrifuged at 1,500 g for 4 min. A hemocytometer was used to measure spermatozoan concentrations and to determine the proportion of motile sperm. Next, spermatozoa were diluted with modified tris-buffered medium (mTBM) to an optimal concentration. Cumulus-free oocytes were washed three times in mTBM. Approximately 30 oocytes were inseminated in 50 ml mTBM at a final sperm concentration of 300,000/ml for 5 h. Embryos were cultured in porcine zygote medium-3 (PZM-3) at 39°C in 5% CO2 in air. Embryos were collected after IVF at the following time points: one-cell stage (24 h), two-cell stage (40–45 h), four-cell stage (65–72 h), eight-cell stage (84–90 h), morula stage (108–115 h), and blastocyst stage (156–160 h). Additionally, the oocytes were collected after 42 h in vitro maturation. For qPCR, about 50 embryos of each stage were used. Embryo development was then observed every 24 h.

Microinjection

To knock down lncRNAs, we injected about 10 pl 10 μM siRNA targeting lncRNAs into each MII oocyte and then performed IVF to obtain embryos. The sequences of siRNAs are shown in Table 1.

TABLE 1
www.frontiersin.org

TABLE 1. List of major primers and siRNAs.

Real-time RT-PCR (qPCR) analysis

Total RNA was extracted using the PureLink Micro-to-Midi System (Invitrogen) according to the manufacturer’s instructions, and reverse transcription was used to generate cDNAs using the PrimeScript RT Reagent kit (TaKaRa). qPCR was performed using SYBR Premix Ex Taq (TaKaRa) and the 7,500 Real-Time PCR System. The reaction parameters were 95°C for 30 s, followed by 40 two-step cycles of 95°C for 5 s and 60°C for 34 s. Ct values were calculated using Sequence Detection System software, and the amount of target sequence normalized to the reference sequence was calculated as 2−ΔΔCt. All primers and probes were designed using Primer Premier 5.

The RSCS pipeline

The RSCS pipeline includes 1) pre-configuration of sample data, 2) modified transcriptome assembly, and 3) transcript classification and lncRNA prediction.

Pre-configuration of sample data

The raw sequence data, in FASTQ format, were filtered to remove reads with unknown nucleotides, and FastQC (v0.11.5) was used for Illumina reads. Subsequently, Trim Galore (v0.6.4) software was used to discard low-quality reads, trim the adaptor sequences, and eliminate poor-quality bases. Specifically, Trim Galore discarded trimmed reads from the small RNA-seq data with less than 18 nucleotides or more than 50 nucleotides (parameter: “--small_rna --length 18”). RNA-seq and small RNA reads were mapped to 10 mm using HISAT2 (v2.1.0), and the unique reads were used in further analyses.

Modified transcriptome assembly

We then transformed Sequence Alignment/Map- (SAM-) format files to BAM format (a compressed binary version of the SAM format), discarded reads that were not aligned to the reference genome, and sorted the BAM file with SAMtools (v1.9).

For deep annotation, we considered the following features:

• BAM files are generated after alignment and after low-quality mapped reads have been filtered out.

• BAM is the compressed binary version of the SAM format, uses Blocked GNU Zip Format (BGZF) compression, and can support indexes to achieve fast random access by generating BAM index (BAI) files.

The BAM files of RNA-seq and small RNA-seq data obtained at identical biological stages were merged using SAMtools merge, generating a pooled BAM file. Importantly, this study used only reads with MAPQ values over 10 and that mapped to a single locus in the genomes. These steps were performed simultaneously with duplicate marking by running Mark Duplicates on all read groups in each pooled BAM file obtained for a sample; this procedure was repeated for each sample. We then ran base recalibration on the aggregated per-sample BAM files.

Transcript assemblies were performed separately for each sample using StringTie (v1.2.3) (Pertea et al., 2016) with GENCODE annotation (vM20) (Harrow et al., 2006) as a guide. The expression level of each gene was quantified by normalized FPKM (fragments per kilobase of exon per million mapped fragments) using StringTie. For new transcripts, data normalization was performed by transforming summational mapped transcript reads to RPKM.

Transcript classification and lncRNA prediction

We used StringTie merge to merge all the transcripts identified by the RSCS across all stages. The transcripts were annotated by comparison with mouse GENCODE annotation (vM20) using Cuffcompare (v2.2.1). The long-read transcripts were classified into four classes according to their most closely matching GENCODE transcript: “ = ” for “Exact match to annotation,” “e” and “j” for “Potentially novel isoform,” “u” for “Potentially novel gene,” and “Other.”

To identify reliable, multi-exonic long non-coding transcripts, we implemented the following selection criteria: 1) read coverage ≥3 in at least one of the tissues, 2) ≥200 bp, 3) ≥2 exons, 4) low protein-coding potential predicted by CPC2 and CPAT, and 5) no sense overlap with known coding genes derived from the UCSC, Ensembl, or Resfeq databases.

Annotation of LTR-lncRNAs

Coordinates and annotations of TE elements were downloaded from the UCSC Genome Browser (7/3/2012 version of RepeatMasker). lncRNAs overlapping with TE elements were identified as TE-lncRNAs using BEDTools software. ERV-lncRNAs were further identified by RepeatMasker annotation. ERV-lncRNAs with expression levels (FPKM) > 1 were used for further analysis. An ERV-lncRNA PCA plot was performed in the R language.

Statistical analysis

The statistical analyses were performed using R4.0.2. The data are shown as the means ± SEMs. Differences between the results obtained for two groups were evaluated using either two-tailed Student’s t-test or the Wilcoxon rank-sum test. The asterisks indicate significant differences: *p < 0.05, **p < 0.01, and ***p < 0.001.

Results

Annotation of transcripts by combined RNA-seq and sRNA-seq data in porcine oocytes and zygotes

To comprehensively recognize the transcripts expressed in porcine oocytes and zygotes, we created extensive transcriptome profiles, based on million-tag sequences mapping to the reference genome from RNA-seq and small RNA-seq (sRNA-seq) (Table 2). In total, 27,694 and 32,875 high-quality non-redundant transcripts were assembled in oocytes and zygotes, respectively, and sRNA data was involved in the assembly of most transcripts (89.05% in oocytes and 86.21% in zygotes, Figure 1A). Furthermore, more than 50% of sRNA tags contributed to the exons in both oocytes and zygotes, although some tags contributed to the antisense chains (Figure 1B). Most of the sRNA tags contributing to the exons were positioned at the transcript terminals (23,402 vs 4,578 in oocytes and 20,702 vs 3,256 in zygotes, Figure 1C). microRNA constituted most of the terminal small RNA tags (Figure 1D). The density distribution of sRNA showed the same result (Figures 1E,F). Taken together, RNA-seq combined with sRNA-seq could allow for the determination of the 5′ and 3′ ends of low-expressed transcripts in pig oocytes and zygotes.

TABLE 2
www.frontiersin.org

TABLE 2. Aligned tags of RNA-seq and short RNA-seq.

FIGURE 1
www.frontiersin.org

FIGURE 1. Annotation of transcripts by combination of RNA-seq and sRNA-seq data. (A) Pie chart shows that sRNA is involved in splicing in 89.05% and 86.21% of transcripts in porcine oocytes and zygotes, respectively. (B) Percentage of locations where sRNA data contributes. AS: antisense strand. (C) Pie chart shows that over 75% of the sRNA tags were positioned to the 3′ UTR and 5′ UTR of transcripts. UTR: untranslated regions. (D) Percentage of different classes of small RNA constituted over the 3′ UTR and 5′ UTR. (E,F) The density distribution of sRNA shows that most sRNA tags were positioned to the 3′ UTR and 5′ UTR of transcripts.

Transcripts annotated with small RNAs appear full-length

To further verify whether the transcripts obtained by RSCS were full-length transcripts, we analyzed the characteristics of the transcripts we obtained. Based on the above results (Figure 1A), transcripts could be divided into two classes: transcripts with sRNA and transcripts without sRNA. First, the lengths of the two classes of transcripts were statistically analyzed in both oocytes and zygotes (Figures 2A,B). We found that transcript length was significantly greater for transcripts with sRNA than for transcripts without sRNA, indicating that more complete RNA sequences were obtained with sRNA aligned. It has been shown that the first base of complete transcripts biases the purine bases (A and G) (Zecherle et al., 1996; Albersmeier et al., 2017). The first bases of the two classes of transcripts were also detected. The result shows that for more than 60% of transcripts with sRNA, the first base was A or G in both oocytes and zygotes, but this number was only about 50% for transcripts without sRNA in both oocytes and zygotes (Figures 2C,D). Furthermore, it has been reported that a complete transcript should have the transcription initiation sequence TATAA at the 5′ region and the transcription termination sequence AATAA and GC at the 3′ region (Gregor et al., 1986; Huck et al., 1986; Savinkova et al., 2009). Therefore, motif prediction was performed on the 5′ and 3′ regions of oocytes and zygotes. As expected, transcripts with sRNA showed clearer TATAA, AATAAA, and GC motifs than did transcripts without sRNA (Figures 2E,F). Taken together, these data demonstrate that transcripts annotated with small RNAs appear to be full-length.

FIGURE 2
www.frontiersin.org

FIGURE 2. Annotation with small RNAs can produce a more complete transcript. (A,B) Box plot shows that the length of transcripts with sRNA and transcripts without sRNA in porcine oocytes and zygotes, respectively. (C,D) Pie chart shows that the first base for more than 60% of transcripts with sRNA was a purine base in both oocytes and zygotes. (E,F) Frequently observed sequence motifs upstream of the initial and end bases of the transcripts. The region used for motif identification and the p value are shown.

Characterization of lncRNAs in porcine oocytes and zygotes

Next, transcripts with sRNA were divided into two classes: coding transcripts and non-coding transcripts, according to the GENCODE annotation and protein-coding potential of the novel transcripts using CPAT (v2.2.0) and CPC2 (v2.0). In the non-coding transcripts of oocytes, 4.32% were annotated and 95.68% were unannotated; in the non-coding transcripts of zygotes, 3.74% were annotated and 96.26% were unannotated (Figures 3A,B). Consistent with lncRNAs identified in mice and humans, in pigs the transcript lengths and expression levels of the lncRNAs were shorter and lower than those of coding genes (Figures 3C–F).

FIGURE 3
www.frontiersin.org

FIGURE 3. Characterization of lncRNAs in porcine oocytes and zygotes. (A,B) Pie chart shows the classification of the annotated and novel RSCS transcripts according to the GENCODE annotation or protein-coding potential. (C,D) Box plot shows that the expression level of the lncRNAs in pig were lower than in the coding genes. (E,F) Box plot shows that the transcript length of the lncRNAs in pigs was shorter than in the coding genes.

Characterization of LTR-lncRNAs in porcine oocytes and zygotes

Recent studies have revealed that families of ERV-associated lncRNAs are closely related to pluripotency; that the transcription of MERVL and HERVH or HERVK is a hallmark of two-cell embryo-like totipotent mouse embryonic stem cells (ESCs) and naïve-like human ESCs, respectively; and that LTR is an important regulatory region in ERV. To explore the regulatory mechanism of lncRNA on early embryo development in pigs, lncRNA derived from LTR was investigated in both oocytes and zygotes. In oocytes, 30.05% of the non-coding transcripts were associated with LTR and 69.95% were not. In zygotes, 28.65% were associated with LTR and 71.35% were not (Figure 4A). Next, we explored which ERV families these LTR-associated transcripts belong to. Statistical analysis revealed that LTR-associated transcripts in oocytes and zygotes were all from the ERV1, ERVK, ERVL, and MALR families and that most of the transcripts were from the endogenous retrovirus families ERVK and MALR (Figure 4B). Next, the lengths and expression levels of LTR-associated non-coding transcripts with coding transcripts were compared. As expected, the expression levels of LTR-associated non-coding transcripts were lower than for coding transcripts (Figure 4C). Also, the length of LTR-associated non-coding transcripts was lower than that of coding transcripts (Figure 4D).

FIGURE 4
www.frontiersin.org

FIGURE 4. Characterization of LTR-lncRNAs in pigs. (A) Pie chart shows that about 30% lncRNAs are derived from LTR in porcine oocytes and zygotes. (B) LTR-associated transcripts in oocytes and zygotes were all from the ERV1, ERVK, ERVL, and MALR families, and most of the transcripts were from the endogenous retrovirus families ERVK and MALR, ERV: endogenous retrovirus. (C) Box plot shows that the expression level of the LTR-associated non-coding transcripts in pig was lower than the coding transcripts. (D) Box plot shows that the transcript length of the LTR-associated non-coding transcripts in pig was shorter than the coding transcripts.

Implications of LTR-lncRNAs in early embryonic development in pigs

Next, the expression levels of LTR-associated transcripts in oocytes and zygotes were analyzed. First, analysis showed that the number of reads that could be mapped to ERV in in zygotes was higher than in oocytes (Figure 5A). Next, differential expression analysis of ERV-associated lncRNAs from oocytes and zygotes was performed (Figure 5B), with a further screen for highly expressed ERV-associated lncRNAs in oocytes and zygotes (Figure 5B). Among all lncRNAs, five LTR-associated transcripts, LTR11, MLTA0, LTR37A, MLT1H, and MLT1D, were highly expressed in oocytes (Figure 5C), while MLT1F1, LTR16, LTR33, MLT1K, and MLT1F showed higher expression levels in zygotes (Figure 5D). Then, we focused on the highest-expressed LTR in the zygote. The 19 candidate lncRNAs from the LTR mentioned above showed higher expression levels in zygotes than in oocytes, as expected (Figure 5D). To investigate the potential regulatory roles of these 19 lncRNAs in the early embryonic development of pigs, a knockdown experiment was performed. We microinjected specific siRNAs into MII oocytes and observed embryo development at 24, 48, and 156 h after IVF. Among the 19 transcripts, two lncRNAs showed significant effects on embryo development after knockdown, namely TCONS_00035465 and TCONS_00031520 (Table 3). These results suggest that these two lncRNAs may have certain functional mechanisms during early embryonic development in pigs. Next, the expression levels of TCONS_00035465 and TCONS_00031520 at different stages were investigated. TCONS_00035465 showed high expression at the two-cell stage, and TCONS_00031520 showed high expression at the eight-cell stage (Figure 5E). We observed a significant decrease in cleavage rate after the TCONS_00035465 knockdown, suggesting that TCONS_00035465 may participate in the cleavage process. While no significant differences were detected in the cleavage rate and four-cell rate of embryos after TCONS_00031520 knockdown, we observed a significant decrease in the blastocyst rate compared with the control group (Figure 5F). This suggests that TCONS_00031520 may be involved in the development from eight-cell to blastocyst.

FIGURE 5
www.frontiersin.org

FIGURE 5. LTR-lncRNAs are functional in porcine early embryonic development. (A) The number of reads mapped to ERV in zygotes was higher than in oocytes. (B) Differential expression analysis of ERV-associated lncRNAs from oocytes and zygotes. The red dots indicate an LTR that is highly expressed in oocytes, the green dots indicate an LTR that is highly expressed in zygotes, and the blue dots indicate an LTR that has no significant difference between oocytes and zygotes. (C,D) The top five LTR-associated transcripts in oocyte (C) and zygote (D), respectively. (E,F) TCONS_00035465 showed high expression at the two-cell stage, and TCONS_00031520 showed high expression at the eight-cell stage.

TABLE 3
www.frontiersin.org

TABLE 3. The effect of lncRNA knockdown on early embryonic development of pigs.

Discussion

lncRNA is becoming a popular subject in scientific research. Much research has shown that lncRNAs play an important role in regulating stem cell pluripotency, reprogramming, and early embryonic development. Numerous studies have been conducted in mice, humans, and other model organisms to explore how lncRNAs regulate embryonic development and reprogramming. However, to date, research on the mechanisms of porcine preimplantation embryos and pluripotent stem cells has been extremely limited. Currently, known methods of transcribing annotations include 5′ RACE, 3′ RACE, RNA-seq, GAGE-seq, and Nano GAGE-seq (Shiraki et al., 2003; Kodzius et al., 2006; Plessy et al., 2010). RNA-seq is emerging as an attractive method to study gene expression levels in terms of reducing sequencing costs. However, RNA-seq has its own technical limitations. Due to the small number of embryonic cells and their relatively low expression levels, it is difficult to analyze lncRNA in preimplantation pig embryos. In our study, the combination of RNA-seq data and sRNA-seq data improved the incomplete annotation of the transcript, laying an important foundation for subsequent study of the effects of lncRNAs on early pig embryo development after the relatively complete transcript is obtained.

As mentioned above, it is difficult to annotate lncRNAs in early pig embryos, and studies on lncRNAs related to early pig embryo development are relatively fewer than for other species. In this study, RNA-seq and sRNA-seq data of porcine oocytes and zygotes were combined to annotate the transcripts by RSCS. We found that sRNA was involved in the assembly of more than 80% of transcripts. In addition, we used bioinformatics methods to analyze the locations of aligned sRNA and found that most sRNA was located at the 5′ and 3′ regions of the transcripts. These results indicate that the combination of RNA-seq and sRNA-seq data may be helpful in obtaining relatively complete transcript sequences, which can promote the study of the functional mechanism of lncRNA in the future.

The characteristics of the transcripts we obtained were also analyzed. Transcripts with sRNA were longer in length, with more A and G at the first base and more complete motifs than transcripts without sRNA, indicating that the transcripts annotated with small RNAs were full-length.

Some research has shown that ERV plays an important role in early embryonic development, and LTR plays an important role in the mechanism of ERV. Therefore, we explored which of the obtained transcripts were derived from LTR. LTR was found to be associated with 30% of transcripts. These LTR transcripts belong to the ERV family. We found that these transcripts are mainly derived from the ERVK and MALR families. Then, the length and expression of LTR-derived non-coding transcripts and coded transcripts were compared.

Pig is a suitable model animal for human organ donors, and there has been increasing research on early embryonic development and the pluripotency of stem cells in pigs in recent years. We believe that the study of early embryonic development in pigs is crucial for advancing medical and agricultural development. In this study, transcripts with high expression in the zygote were knocked down to observe the embryonic development level. A significant decrease in cleavage rate after TCONS_00035465 knockdown showed that TCONS_00035465 may participate in the cleavage process. While no significant differences were detected in the cleavage rate and four-cell rate of embryos after TCONS_00031520 knockdown, we observed a significant decrease in the blastocyst rate as compared with the control group. This suggests that TCONS_00031520 may be involved in the development from eight-cell to blastocyst.

In summary, we applied the RSCS to annotate augmented but high-confidence potential transcriptomes with more complete and precise contents. Taking advantage of our strategy, we identified many ERV-lncRNAs, which indicates that the set of retrotransposon functions in specific biological events has only just begun to be unraveled. We also screened out two lncRNAs that may have important functions in the early embryonic development of pigs.

Data availability statement

The original contributions presented in the study are included in the article, and further inquiries can be directed to the corresponding authors.

Ethics statement

The animal study was reviewed and approved by the Laboratory Animal Center of Wenzhou Medical University. Written informed consent was obtained from the owners for the participation of their animals in this study.

Author contributions

CD, CX, and XY conceived and designed the study. HC and QZ analyzed the data and JJ wrote the manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Albersmeier, A., Pfeifer-Sancar, K., Ruckert, C., and Kalinowski, J. (2017). Genome-wide determination of transcription start sites reveals new insights into promoter structures in the actinomycete Corynebacterium glutamicum. J. Biotechnol. 257, 99–109. doi:10.1016/j.jbiotec.2017.04.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Bouckenheimer, J., Assou, S., Riquier, S., Hou, C., Philippe, N., Sansac, C., et al. (2016). Long non-coding RNAs in human early embryonic development and their potential in ART. Hum. Reprod. Update 23 (1), 19–40. doi:10.1093/humupd/dmw035

PubMed Abstract | CrossRef Full Text | Google Scholar

Durruthy-Durruthy, J., Sebastiano, V., Wossidlo, M., Cepeda, D., Cui, J., Grow, E. J., et al. (2016). The primate-specific noncoding RNA HPAT5 regulates pluripotency during human preimplantation development and nuclear reprogramming. Nat. Genet. 48 (1), 44–52. doi:10.1038/ng.3449

PubMed Abstract | CrossRef Full Text | Google Scholar

Geng, J., Cui, C., Yin, Y., Zhao, Y., and Zhang, C. (2022). LncRNA NEAT1 affects endometrial receptivity by regulating HOXA10 promoter activity. Cell. Cycle 21, 1932–1944. doi:10.1080/15384101.2022.2075198

PubMed Abstract | CrossRef Full Text | Google Scholar

Gregor, P. D., Kobrin, B. J., Milcarek, C., and Morrison, S. L. (1986). Sequences 3' of immunoglobulin heavy chain genes influence their expression. Immunol. Rev. 89, 31–48. doi:10.1111/j.1600-065x.1986.tb01471.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Hamazaki, N., Uesaka, M., Nakashima, K., Agata, K., and Imamura, T. (2015). Gene activation-associated long noncoding RNAs function in mouse preimplantation development. Development 142 (5), 910–920. doi:10.1242/dev.116996

PubMed Abstract | CrossRef Full Text | Google Scholar

Harrow, J., Denoeud, F., Frankish, A., Reymond, A., Chen, C. K., Chrast, J., et al. (2006). Gencode: Producing a reference annotation for ENCODE. Genome Biol. 7 (1), 1–9. doi:10.1186/gb-2006-7-s1-s4

PubMed Abstract | CrossRef Full Text | Google Scholar

Huck, S., Fort, P., Crawford, D. H., Lefranc, M. P., and Lefranc, G. (1986). Sequence of a human immunoglobulin gamma 3 heavy chain constant region gene: Comparison with the other human C gamma genes. Nucleic Acids Res. 14 (4), 1779–1789. doi:10.1093/nar/14.4.1779

PubMed Abstract | CrossRef Full Text | Google Scholar

Jarroux, J., Morillon, A., and Pinskaya, M. (2017). History, discovery, and classification of lncRNAs. Adv. Exp. Med. Biol. 1008, 1–46. doi:10.1007/978-981-10-5203-3_1

PubMed Abstract | CrossRef Full Text | Google Scholar

Jukam, D., Shariati, S. A. M., and Skotheim, J. M. (2017). Zygotic genome activation in vertebrates. Dev. Cell. 42 (4), 316–332. doi:10.1016/j.devcel.2017.07.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Kodzius, R., Kojima, M., Nishiyori, H., Nakamura, M., Fukuda, S., Tagami, M., et al. (2006). Cage: Cap analysis of gene expression. Nat. Methods 3 (3), 211–222. doi:10.1038/nmeth0306-211

PubMed Abstract | CrossRef Full Text | Google Scholar

Kong, Q., Yang, X., Zhang, H., Liu, S., Zhao, J., Zhang, J., et al. (2020). Lineage specification and pluripotency revealed by transcriptome analysis from oocyte to blastocyst in pig. FASEB J. 34 (1), 691–705.

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, T., Yang, B., Wang, R., and Qin, C. (2019). Xenotransplantation: Current status in preclinical research. Front. Immunol. 10, 3060. doi:10.3389/fimmu.2019.03060

PubMed Abstract | CrossRef Full Text | Google Scholar

Pertea, M., Kim, D., Pertea, G. M., Leek, J. T., and Salzberg, S. L. (2016). Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 11 (9), 1650–1667. doi:10.1038/nprot.2016.095

PubMed Abstract | CrossRef Full Text | Google Scholar

Platt, J. L. (1998). Current status of xenotransplantation: Research and technology. Transpl. Proc. 30 (5), 1630–1633. doi:10.1016/s0041-1345(98)00371-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Plessy, C., Bertin, N., Takahashi, H., Simone, R., Salimullah, M., Lassmann, T., et al. (2010). Linking promoters to functional transcripts in small samples with nanoCAGE and CAGEscan. Nat. Methods 7 (7), 528–534. doi:10.1038/nmeth.1470

PubMed Abstract | CrossRef Full Text | Google Scholar

Ransohoff, J. D., Wei, Y., and Khavari, P. A. (2018). The functions and unique features of long intergenic non-coding RNA. Nat. Rev. Mol. Cell. Biol. 19 (3), 143–157. doi:10.1038/nrm.2017.104

PubMed Abstract | CrossRef Full Text | Google Scholar

Reuter, J. A., Spacek, D. V., and Snyder, M. P. (2015). High-throughput sequencing technologies. Mol. Cell. 58 (4), 586–597. doi:10.1016/j.molcel.2015.05.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Savinkova, L. K., Ponomarenko, M. P., Ponomarenko, P. M., Drachkova, I. A., Lysova, M. V., Arshinova, T. V., et al. (2009). TATA box polymorphisms in human gene promoters and associated hereditary pathologies. Biochemistry. 74 (2), 117–129. doi:10.1134/s0006297909020011

PubMed Abstract | CrossRef Full Text | Google Scholar

Shiraki, T., Kondo, S., Katayama, S., Waki, K., Kasukawa, T., Kawaji, H., et al. (2003). Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc. Natl. Acad. Sci. U. S. A. 100 (26), 15776–15781. doi:10.1073/pnas.2136655100

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Borm, S., Belak, S., Freimanis, G., Fusaro, A., Granberg, F., and Hoper, D. (2015). Next-generation sequencing in veterinary medicine: How can the massive amount of information arising from high-throughput technologies improve diagnosis, control, and management of infectious diseases? Methods Mol. Biol. 1247, 415–436. doi:10.1007/978-1-4939-2004-4_30

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Li, X., Wang, L., Li, J., Zhao, Y., Bou, G., et al. (2016). A novel long intergenic noncoding RNA indispensable for the cleavage of mouse two-cell embryos. EMBO Rep. 17 (10), 1452–1470. doi:10.15252/embr.201642051

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Wang, L., Feng, G., Wang, Y., Li, Y., Li, X., et al. (2018). Asymmetric expression of LincGET biases cell fate in two-cell mouse embryos. Cell. 175 (7), 1887–1901.e18. doi:10.1016/j.cell.2018.11.039

PubMed Abstract | CrossRef Full Text | Google Scholar

Zecherle, G. N., Whelen, S., and Hall, B. D. (1996). Purines are required at the 5' ends of newly initiated RNAs for optimal RNA polymerase III gene expression. Mol. Cell. Biol. 16 (10), 5801–5810. doi:10.1128/mcb.16.10.5801

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, Q., Li, T., Wang, K., Zhang, Q., Geng, Z., and Deng, S. (2022). Current status of xenotransplantation research and the strategies for preventing xenograft rejection. Front. Immunol. 13, 928173. doi:10.3389/fimmu.2022.928173

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: pig, lncRNA, LTR, embryo, RNA-seq, small RNA-seq

Citation: Yang X, Ji J, Cui H, Zhao Q, Ding C and Xu C (2022) Functional evaluation of LTR-derived lncRNAs in porcine oocytes and zygotes with RNA-seq and small RNA-seq. Front. Genet. 13:1023041. doi: 10.3389/fgene.2022.1023041

Received: 19 August 2022; Accepted: 08 September 2022;
Published: 13 October 2022.

Edited by:

Xiaoyu Liu, Tongji University, China

Reviewed by:

Yan Shi, Agricultural Genomics Institute at Shenzhen (CAAS), China
Bingteng Xie, Beijing Institute of Technology, China
Heng Zhang, Westlake University, China

Copyright © 2022 Yang, Ji, Cui, Zhao, Ding and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chunming Ding, Y21kaW5nQGdtYWlsLmNvbQ==; Chang Xu, dGFyeTIwMDJAZm94bWFpbC5jb20=

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.