Literature DB >> 28934224

Identification and analysis of differentially expressed long non-coding RNAs between multiparous and uniparous goat (Capra hircus) ovaries.

Yinghui Ling1,2, Lina Xu3, Long Zhu1,2, Menghua Sui1,2, Qi Zheng1,2, Wenyong Li4, Yong Liu4, Fugui Fang1,2, Xiaorong Zhang1,2.   

Abstract

Long non-coding RNAs (lncRNAs) play important roles in almost all biological processes. However, there is little information on the effects of lncRNAs on ovulation and lambing rates. In the present study, we used high-throughput RNA sequencing to identify differentially expressed lncRNAs between the ovaries of multiparous (Mul) and uniparous (Uni) Anhui White goats. Among the 107,255,422 clean reads, 183,754 lncRNAs were significantly differentially expressed between the Uni and Mul. Among them, 455 lncRNAs were co-expressed between the two samples, whereas, 157,523 lncRNAs were uniquely expressed in the Uni, and 25,776 uniquely lncRNAs were expressed in the Mul. Through Cis role analysis, 24 lncRNAs were predicted to overlap with cis-regulatory elements, which involved in Progesterone-mediated oocyte maturation, Steroid biosynthesis, Oocyte meiosis, and gonadotropin-releasing hormone (GnRH) signaling pathway. These 4 pathways were related to ovulation, and the KEGG pathway analysis on target genes of the differentially expressed lncRNAs confirmed this results. In addition, 10 lncRNAs harbored precursors of 40 miRNAs, such as TCONS_00320849 related to a mature miRNA sequence, miR-365a, which was reported to be related to proliferation, were annotated in the precursor analysis of miRNAs. The present expand the understanding of lncRNA biology and contribute to the annotation of the goat genome. The study will provide a resource for lncRNA studies of ovulation and lambing.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28934224      PMCID: PMC5608193          DOI: 10.1371/journal.pone.0183163

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Long non-coding RNAs (lncRNAs) are non-coding RNA transcripts of more than 200 nucleotides in length. Initially, lncRNA was regarded as transcriptional “noise” that lacked a complete open reading frame and had no ability to encode proteins [1]. Human homologue 19 (H19), found in mice, was the first lncRNA which was discovered in mammals in 1998 [2], then many more lncRNAs were discovered in the following 20 years. Recent studies have shown that lncRNA participates in many important physiological processes, such as X chromosome inactivation [3], dosage compensation [4], and gene imprinting [5]. Additionally, lncRNAs have a significant impact on the diagnosis, treatment, and development of disease. For example, lncRNA plays an important role in epigenetics [6, 7], post-transcriptional regulation [8], maintenance of stem cell pluripotency, and regulation of gene expression during disease processes [9]. Although lncRNAs are widely found in animals, their mechanisms of action are unclear [10, 11]. Using high-throughput poly (A)-independent and strand-specific RNA-seq of rats, lncRNA expression was found to be more tissue specific than that of mRNA [12]. Knockdown of several lncRNAs in mature oocytes increases developmental rates in cattle and leads to larger blastocysts [13]. The sixth CCCTC binding-factor (CTCF) binding site (CCCTC) was identified in H19 DMR, however, had a significant methylation difference between the high- and low-fertility bulls [14]. Boulanger promised that Forkhead box L2 (Foxl2) loss of function dissociated from loss of lncRNA expression is sufficient to cause an XX female-to-male sex reversal in the goat model and, as in the mouse model, an agenesis of eyelids [15]. Ovarian hormones regulate Foxl2 expression and thereby expand the number of genes controlled by the hypothalamic–pituitary–gonad axis that ultimately dictates reproductive fitness [16]. Although lncRNA studies have been begun in several species, the research conducted into the effects of lncRNA on the reproductive rate was relatively limited. High-throughput sequencing can be used to identify the lncRNAs, and bioinformatics analysis can predict the function of lncRNAs, which would provide a basis for animal reproduction study and possibly improve the reproductive rate of goats. Compared with other goats, Anhui White goat (AWG) is known for its higher fertility, precocious puberty, and higher leather quality. The kidding rate of AWG is 2.27–2.39 and the ewes can be in estrus all year round, which makes it to be an ideal model for the study of goat breeding traits [17]. To explore the function of lncRNAs in ovulation and lambing, we used Solexa sequencing to identify differentially expressed lncRNAs between the ovaries of multiparous and uniparous AWGs in the study, and predict the target genes of the differentially expressed lncRNAs. In addition, classified annotation, GO analysis and KEGG pathway were used to analyze the function of target genes. Through Cis role analysis, 24 lncRNAs were predicted to overlap with cis-regulatory elements, which involved in Progesterone-mediated oocyte maturation, Steroid biosynthesis, Oocyte meiosis, and gonadotropin-releasing hormone (GnRH) signaling pathway. These 4 pathways were related to ovulation, and the KEGG pathway analysis on target genes of the differentially expressed lncRNAs confirmed this results. These results will help us to further understand the role of lncRNAs in ovulation traits.

Materials and methods

Experimental animal preparation and library construction

The experimental goats in this study, Anhui White goats (AWG, an indigenous Chinese breed), were obtained from the College of Animal Science and Technology, Anhui Agricultural University, Hefei, China. Based on the long-term observation, laparoscopic and ultrasound detection, 6 target goats, 3 multiparous goats (Mul) and 3 uniparous goats (Uni), which were undergoing oestrus, were selected for their all ovaries. According to the at least 3-year records of lambing, the litter size of Mul and Uni were more than one and only one, respectively. Meanwhile, the physical condition and age, which is 3 to 4 years old, of the experimental samples were basically consistent. To further eliminate the influence of other factors, the unified management system of the field was adopted in the study farm feeding and stabling. The selected goats were killed and dissected, and both whole ovaries from each goat were collected immediately, snap-frozen in liquid nitrogen, and stored at –80°C until use. One whole ovary from each goat was used for RNA-Seq, meanwhile, the other one from the same goat was used for real-time PCR. All experimental procedures involving AWGs performed in the present study had been given prior approval by the ethics committee of Anhui Agricultural University under permit no. AHAU20101025. The total RNA of the ovaries (12 samples) from the 6 goats was extracted using RNAiso Plus (Takara) following the manufacturer’s protocol. The quantity and quality of the total RNA were measured using the Agilent 2100 Bioanalyzer system. To minimize differences among the goats, an equal quantity (10 μg) of total RNA isolated from one ovary from the 3 individual multiparous and uniparous goats, respectively, were pooled for library preparation and Illumina sequence (The Beijing Genomics Institute). In addition, the total RNA from the other one of the ovaries from the 3 individual multiparous and uniparous goat were reverse transcribed, respectively, into cDNA using the AceQ qPCR SYBR Green Master Mix lncRNA RT-PCR Kit (Vazyme Biotech Co., Ltd.) for qRT-PCR verification.

Basic data statistics

Before further analysis, raw data filtering was needed to decrease data noise. "Raw reads" were defined as reads containing the adapter sequence, a high content of unknown bases (reads with more than 10% unknown bases), and low-quality reads (>50% low-quality bases in a read, with a low-quality base defined as a base whose sequencing quality was no more than 10). Then clean reads were mapped to the rRNA reference sequence using SOAP aligner/ SOAP2 short-read alignment software [18] to remove the remaining rRNA reads, and leave the reads that were used for transcriptome assembly and quantification. The statistical analysis of the alignment results is presented for each sample.

Assembly and novel lncRNA prediction

Reads were mapped to the genome and assembled using Cufflinks [19]. Faux-reads were then generated from reference transcripts to capture features in the reference sequence that, due to low coverage, might be missing from the sequencing data. We used a reference annotation-based assembly method to reconstruct the transcripts, while background noise was reduced by using the FPKM (fragments per kilobase of exon per million fragments mapped) value and coverage threshold. These reads were then merged with the sequenced reads for assembly. The set of transcripts generated in the last step was then compared with the reference transcripts to remove the transcripts that were approximately equivalent to the whole or a portion of a reference transcript. Through comparison with the reference, we were able to detect the novel transcripts and calculate the coding potential of these transcripts to identify the novel lncRNA [20]. After the assembly, we compared the assembled transcripts to the reference annotation by Cuffcompare, and categorized the transcripts to 12 classes (=, c, j, e, i, o, p, r, u, x, s and.). Among them, only five candidate categories of transcripts will be extracted, including 'i', 'j', 'o', 'u', and 'x', which may contain novel transcripts. The “u” category meant unknown intergenic transcript. The “x” category meant lncRNAs with exonic overlap with known transcripts, but on the opposite strand. The “i” category contained lncRNAs with transcripts falling entirely within a reference intron. The “j” category meant potentially novel isoforms (fragments) with at least one splice junction shared with the reference transcripts.

Classification and annotation of lncRNAs

The lncRNAs were also classified by their position compared with the reference gene, with different strategies used for function prediction. For antisense lncRNA, we performed free energy calculation to detect possible hybridization sites for lncRNA and mRNA, which might be RNA-RNA interactions. To further reveal potential antisense lncRNA–mRNA interactions, we searched for all antisense lncRNA-mRNA complementary base pair duplexes using RNAplex [21], a tool specially created to rapidly search for short interactions between two long RNAs. If such interactions are found upstream or downstream of a gene, we will search for lncRNAs located in unknown regions. Because lncRNA can be processed to yield small RNAs, small RNA precursors can be predicted. Recent genome-wide studies suggested that the function of a significant fraction of long non-coding transcripts may be to serve as precursors for microRNAs (miRNAs) that are usually generated via the sequential cleavage of long transcripts by the enzymes Drosha and Dicer [22, 23]. Thus, we aligned lncRNAs to miRBase [24] to detect potential pre-miRNAs by selecting those with a hit coverage greater than 90%. The SVM-based software miRPara [25] was also used to predict probable miRNAs. In addition, Rfam divides non-coding RNAs into families based on evolution from a common ancestor. To better annotate novel lncRNAs from an evolutionary standpoint, we classified all predicted novel lncRNAs into different non-coding RNA families using INFERNAL, which categorizes non-coding RNAs and their conserved primary sequence and RNA secondary structure through the use of multiple sequence alignments, consensus secondary structure annotation, and covariance models [26]. These results will classify lncRNA families by their RNA structure and sequence similarities, helping us to reveal the potential functions of the lncRNAs.

Screening of differentially expressed lncRNAs and real-time PCR validation

Cuffdiff was used to calculate expression levels of lncRNAs in more than one condition and test them for significant differences on the basis of FPKM values. To screen differentially expressed genes from the Cuffdiff results, we followed the priority rule: STATUS = OK and P ≤ 0.05. The differentially expressed lncRNAs were screened using NOIseq [27] following the priority rule: filtering condition, log2 (ratio) ≥ 1; probability ≥ 0.8. Real-time PCR (qPCR) was performed to validate the Solexa sequencing data. Eight differentially expressed lncRNAs between the two libraries were randomly selected for qPCR verification. The primers were designed based on the lncRNA sequences (S1 Table). After incubation at 37°C for 1 h and deactivation at 95°C for 5 min, we obtained the template for qPCR. The reaction solution of qPCR contained 2.0 μl cDNA, 10 μl AceQ qPCR SYBR Green Master Mix and ROX Reference Dye 1, 0.4 μl of each primer, and 6.8 μl ddH2O. GAPDH was used as the internal reference gene control. qPCR was performed using standard protocols on the StepOnePlus™ Real-Time PCR System (Thermo Fisher Scientific). The threshold cycle (CT) was obtained from each reaction, and the relative expression level of each lncRNA was evaluated using the 2-ΔΔCt method. Three biological replicates for each selected lncRNA were used. The expression levels of lncRNA in the ovaries of Mul and Uni were determined individually and the t-test was used to examine the significant of expression differences between the two samples using SAS v8.0 software.

Target genes prediction of differentially expressed lncRNAs and function analyses

To explore the functions of lncRNAs, we predicted the target genes of the differentially expressed lncRNAs through Cis role. Cis role is lncRNA action on neighboring target genes. The basic principles of Cis role of target gene is prediction the lncRNA function and the protein coding gene which near the lncRNA’s coordinates. We searched coding genes 100 kb upstream and downstream of lncRNA and then analyzed their function by functional enrichment analysis. Gene Ontology (GO) and KEGG pathway analyses can help us to understand the biological functions of genes [28]. GO enrichment analysis of lncRNA target genes were implemented by the GOseq R package (http://www.bioconductor.org/packages/release/bioc/html/goseq.html), in which gene length bias was corrected. GO terms with corrected P-value less than 0.05 were considered significantly enriched by differential expressed genes. KEGG is database resource for understand high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information. We used KOBAS software (http://kobas.cbi.pku.edu.cn/download.php) to test the statistical enrichment of lncRNA target genes in KEGG pathways. Pathway with corrected FDR-value less than 0.05 were considered significantly enriched by differential expressed genes.

Results

In the base composition of the reads, on the X-axis, 1–90 bp represents “read 1” and 91–180 bp represents “read 2”. The A curve should overlap with the T curve, while the G curve should overlap with the C curve. In this case, the figure showed a balanced composition (S1 Fig). By comparing with the rRNA reference sequence through SOAP aligner/ SOAP2 short-read alignment software, 52,178,296 and 55,077,126 clean reads in the Uni and Mul libraries, respectively, were mapped to the genome. Only 0.02% and 0.03% of the total sequence reads were mapped to single-end in the two libraries, respectively. Additionally, only 0.01% of the total sequence reads were mapped to paired-end mapping reads in the two libraries (S2 Table). Otherwise, about 80% of the total mapped reads were mapped to the reference genome, indicating that the quality of the two libraries is good (S3 Table). The clean reads were mapped to the genome using TopHat (Fig 1). There were 663,188 genes were identified in the gene sequencing coverage statistics of the Uni. Among them, transcript coverage on the genome of 63,140 genes was more than50%, which accounted for 9.24% of the identified genes. Meanwhile, 675,422 genes were identified in the Mul library, and 383,043 genes (accounting for 56.71%), whose transcript coverage on the genome was more than 50%. In total, 511,721 lncRNAs and 13,875 mRNAs were identified in the two libraries. Among them, 271,584 lncRNA and 13,224 mRNA were co-expressed in the two libraries, and 240,137 lncRNAs and 651 mRNAs were respectively expressed in the certain library (Table 1).
Fig 1

Transcript coverage stat of uniparous goats (Uni) and multiparou goats (Mul).

Table 1

Expression statistic of different lncRNAs and mRNA.

Expression typeNovel lncRNAKnown lncRNAmRNA
Expressed in 1 sample2401370651
Expressed in 2 samples271584213224
Total expressed number(ratio)511721213875
Cuffcompare categories were used to distinguish transcripts according to 12 class codes (S4 Table). Only five candidate categories of transcripts will be extracted, including “i”, “j”, “o”, “u” and “x” which may contain novel transcripts. The “u” category contained 525,553 transcripts, which meant unknown intergenic transcript lncRNAs (lincRNAs). The “x” category contained 4,053 transcripts, which meant lncRNAs with exonic overlap with known transcripts, but on the opposite strand. The “i” category contained lncRNAs with transcripts falling entirely within a reference intron. The “j” category meant potentially novel isoforms (fragments) with at least one splice junction shared with the reference transcripts, which contained 38,842 transcripts, and 102 transcripts in “o” category. Due to the complexity of the repeat region, the probability of transcript misassembly is also increased. Thus, we filtered transcripts classified with “r”, which could possibly be artificial fragments. As determined by their genomic locations with respect to nearby genes, the 568,550 lncRNAs included members of all four classes of lncRNAs, with most in the u class, which overlap unknown intergenic transcript lncRNAs. The results of the lncRNA triage were used for further analysis.

Classification and annotation of lncRNA

By examining the up/downstream lncRNAs of a gene, we found 27,228 lncRNAs had up/downstream gene. Of these, 13,069 lncRNAs were located in the antisense strand and 14,159 were located in the sense strand. Therefore, 49% of lncRNAs were downstream of the bracketing genes, and the others were upstream. These lncRNAs could overlap with cis-regulatory elements probably involved in transcriptional regulation. TCONS_00076241, TCONS_00087806, and TCONS_00076240 were upstream of XM_005686239.1 (BUB1) in the oocyte meiosis pathway. TCONS_00248477 was downstream of XM_005676195.1 (MAP3K19) in the GnRH signaling pathway. TCONS_00136407 and TCONS_00146968 were downstream of XM_005689083.1 (MOS) in the progesterone-mediated oocyte maturation pathway. TCONS_00136407 and TCONS_00146968 were downstream of XM_005689083.1 (MOS) in the oocyte meiosis pathway. MOS, BUB1, and MAP3K19 may regulate oocyte meiosis (Table 2).
Table 2

Up/down stream lncRNA of a gene.

PathwaylncRNA IDChrstrandBraketing geneUp/down stream
Progesterone-mediated oocyte maturationTCONS_00390275chr3-XM_005678520.1UPSTREAM_2K
TCONS_00136407chr14+XM_005689083.1DOWNSTREAM_2K
TCONS_00146968chr14-XM_005689083.1DOWNSTREAM_2K
Steroid biosynthesisTCONS_00376376chr3+XM_005678362.1DOWNSTREAM_2K
TCONS_00376377chr3+XM_005678362.1DOWNSTREAM_2K
Oocyte meiosisTCONS_00386107chr3-XM_005678170.1UPSTREAM_2K
TCONS_00386108chr3-XM_005678170.1UPSTREAM_2K
TCONS_00094458chr11-XM_005686832.1UPSTREAM_2K
TCONS_00094457chr11-XM_005686832.1UPSTREAM_2K
TCONS_00259604chr20+XM_005694802.1UPSTREAM_2K
TCONS_00267219chr20-XM_005694802.1DOWNSTREAM_2K
TCONS_00071211chr10-XM_005685706.1DOWNSTREAM_2K
TCONS_00071210chr10-XM_005685706.1DOWNSTREAM_2K
TCONS_00061051chr10+XM_005685706.1DOWNSTREAM_2K
TCONS_00136407chr14+XM_005689083.1DOWNSTREAM_2K
TCONS_00146968chr14-XM_005689083.1DOWNSTREAM_2K
TCONS_00076241chr11+XM_005686239.1UPSTREAM_2K
TCONS_00087806chr11-XM_005686239.1UPSTREAM_2K
TCONS_00076240chr11+XM_005686239.1UPSTREAM_2K
TCONS_00322561chr25+XM_005697767.1UPSTREAM_2K
TCONS_00322565chr25+XM_005697767.1DOWNSTREAM_2K
TCONS_00322562chr25+XM_005697767.1UPSTREAM_2K
GnRH signaling pathwayTCONS_00248477chr2-XM_005676195.1DOWNSTREAM_2K
TCONS_00386107chr3-XM_005678170.1UPSTREAM_2K

Note: Pathway, the KEGG pathway; lncRNA ID, the ID of lncRNA; Chr: chromosome which is the lncRNA came from; Strand, the strand which the lncRNA located; Braketing gene, the nearest gene in upstream or downstream; Up/down stream, whether upstream or downstream is the lncRNA located to a gene.

Note: Pathway, the KEGG pathway; lncRNA ID, the ID of lncRNA; Chr: chromosome which is the lncRNA came from; Strand, the strand which the lncRNA located; Braketing gene, the nearest gene in upstream or downstream; Up/down stream, whether upstream or downstream is the lncRNA located to a gene. We selected 65,536 lncRNAs which coverage were greater than 90% to examine the regulatory functions of the lncRNAs. We aligned lncRNAs to miRBase to determine if they could target known miRNAs: TCONS_00320849 had 16 known precursors, TCONS_00284825 had 10 known precursors, TCONS_00080255 had 7 known precursors, TCONS_00284070, TCONS_00304715, TCONS_00424188, TCONS_00533458, TCONS_00308106, TCONS_00087926, and TCONS_00205426 had 1 known precursor, respectively. Bta-mir-365-1, cfa-mir-493, and bta-mir-493 are precursors of TCONS_00284825, whereas oar-mir-493 is the precursor of TCONS_00320849, which the identities were 100%. We found that TCONS_00087926, TCONS_00304715, TCONS_00424188, and TCONS_00320849 had several novel precursors, but TCONS_00320849 was related to a mature miRNA sequence, miR-365a, which had the same mature miRNA in the miRBase database (Fig 2, S5 and S6 Tables).
Fig 2

The pre-miRNA and secondary structure of oar-mir-493, bta-mir-493 and bta-mir-365-1.

Note: A, known pre-miRNA and secondary structure pictures of oar-mir-493; B, known pre-miRNA and secondary structure pictures of bta-mir-493; C, known pre-miRNA and secondary structure pictures of bta-mir-365-1.

The pre-miRNA and secondary structure of oar-mir-493, bta-mir-493 and bta-mir-365-1.

Note: A, known pre-miRNA and secondary structure pictures of oar-mir-493; B, known pre-miRNA and secondary structure pictures of bta-mir-493; C, known pre-miRNA and secondary structure pictures of bta-mir-365-1. To better annotate the novel lncRNA from an evolutionary standpoint, we classified all of the predicted novel lncRNAs into different non-coding RNA families using INFERNAL. Fifty families were obtained via the lncRNA family prediction. The C/G content was between 25% and 70%. Xist_exon1, TUG1_3, TUG1_4, SOX2OT_exon3, and SMAD5 may regulate reproduction and embryonic development [29-31] (Table 3).
Table 3

Families prediction of lncRNAs.

Family NameFamily AccessionLncRNAStartEndE-valueScoreGC%
MALAT1RF01871TCONS_00360963376838593.70E-0858.20.25
MIAT_exon5_2RF01876TCONS_001946196167001.20E-2099.50.54
MIAT_exon5_2RF01876TCONS_002020163182341.20E-2099.50.54
MIAT_exon5_3RF01877TCONS_001946201704277.30E-82282.50.6
MIAT_exon5_3RF01877TCONS_001946211628.70E-1768.90.48
Xist_exon1RF01880TCONS_00543946212820442.60E-24102.80.52
BC040587RF01884TCONS_001644314072690.0006435.50.47
Evf1_1RF01887TCONS_003933822053472.70E-26133.40.31
TUG1_3RF01891TCONS_0019496798812298.40E-69232.60.5
TUG1_4RF01892TCONS_00194967153617125.00E-51195.80.4
CDKN2BRF01909TCONS_001902004325800.006229.30.36
KCNQ1OT1_2RF01947TCONS_005003702991410.0004226.50.65
KCNQ1OT1_2RF01947TCONS_005125895627200.0004226.50.65
KCNQ1OT1_2RF01947TCONS_00081060183560.009822.30.52
SOX2OT_exon3RF01953TCONS_00031909362871.20E-101321.80.51
SOX2OT_exon3RF01953TCONS_00048514142891.20E-101321.80.51
KCNQ1DNRF01961TCONS_003294082301700.00535.70.28
RMST_2RF01963TCONS_00423556431835.10E-28125.30.5
RMST_7RF01968TCONS_00423569623281.50E-923100.34
RMST_10RF01971TCONS_004235782393881.50E-26110.40.51
ZEB2_AS1_1RF01984TCONS_002475912353615.30E-36147.20.49
ZEB2_AS1_1RF01984TCONS_002329183732475.30E-36147.20.49
ZEB2_AS1_1RF01984TCONS_00274957417543000.0056290.4
ZEB2_AS1_2RF01985TCONS_002329178267664.70E-1686.60.43
DAOARF02091TCONS_001152173731744.10E-53183.60.49
DLEU1_1RF02103TCONS_0010030671971.30E-49187.60.48
DLG2RF02112TCONS_00357136208118936.30E-46195.50.39
DLG2RF02112TCONS_001680672013921.20E-28128.70.28
DLG2RF02113TCONS_003571334052163.70E-39139.30.55
SMAD5RF02174TCONS_000526102801780.002827.40.48
SMAD5RF02174TCONS_000360783624640.002827.40.48
ST7RF02179TCONS_004101444194793.70E-0648.90.34
ST7RF02187TCONS_004101431983913.40E-42194.90.41
ST7RF02188TCONS_004101436728571.10E-30135.90.42
ST7RF02189TCONS_004101421553232.60E-2182.40.44
TCL6_1RF02191TCONS_002768901781.70E-07510.5
TTC28RF02199TCONS_001262282914310.0062360.37
TTC28RF02200TCONS_001484979357660.002225.80.48
WT1RF02204TCONS_001598364115392.20E-1477.60.55
WT1RF02205TCONS_0015983690911433.30E-531840.65
WT1RF02206TCONS_00159836118613072.80E-1582.60.59
WT1RF02208TCONS_00159836318733083.60E-241150.7
WT1RF02209TCONS_001598373426352.50E-106337.50.47
WT1RF02210TCONS_0015983773310031.10E-68264.50.31
ZNFX1RF02215TCONS_001237961597.70E-0842.40.68
ZNFX1RF02216TCONS_001237961792707.50E-1168.80.59
Six3os1_2RF02247TCONS_001146281002830.002824.90.47
Six3os1_2RF02247TCONS_003493644242470.00723.60.48
adapt33_2RF02256TCONS_004627725835120.0003139.20.4
adapt33_2RF02256TCONS_004627735835120.0003139.20.4

Note: Family Name, assigned Family name in Rfam database; Family Accession, Correspond Family accession number; LncRNA, long noncoding RNA; Start, start site of the alignment; End, end site of the alignment; E-value, expect value for the alignment; Score, score of the alignment given by Infernal; GC%, GC content of the conserved region; Rank, Rank of this lncRNA in the corresponding family.

Note: Family Name, assigned Family name in Rfam database; Family Accession, Correspond Family accession number; LncRNA, long noncoding RNA; Start, start site of the alignment; End, end site of the alignment; E-value, expect value for the alignment; Score, score of the alignment given by Infernal; GC%, GC content of the conserved region; Rank, Rank of this lncRNA in the corresponding family.

Differentially expressed lncRNAs and real-time PCR Validation

Compared to the Uni library, 26,187 up-regulated and 157,567 down-regulated lncRNAs were identified in the Mul library (Fig 3). A total of 183,754 lncRNAs were significantly differently expressed between the Mul and Uni with |log2Ratio|2 ≥ 1 and p-value ≤ 0.05. Among them, 455 lncRNAs, which included 188 known genes and 267 predicted transcripts, were co-expressed in the two samples. In the Mul, the highest expressed lncRNA from the co-expressed lnRNAs was XLOC-013414, which FPKM value was 5052.08, whereas, its FPKM in the Uni was 572.03. In the Uni, the highest expressed lncRNA from the co-expressed lnRNAs was 102188530, which FPKM value was 741.67, whereas, its FPKM in the Mul was 109.21. In addition, 157,523 lncRNAs were uniquely expressed in the Uni, and 25,776 uniquely lncRNAs were expressed in the Mul. The expression of uniquely expressed lncRNA was relatively low. Such as the expression of XLOC-505751, which was highest uniquely expressed in the Uni, was 60.29. And the expression of XLOC-263636, which was highest uniquely expressed in the Mul, was 70.38 (S7 Table).
Fig 3

Stat chart of up-down-regulated expressed genes.

To ensure the accuracy and reliability of the RNA-seq results, eight differentially expressed lncRNAs were randomly selected for qPCR to confirm the expression changes. The results showed good consistency between the RNA-seq results and the qPCR results (Fig 4).
Fig 4

Real-time PCR results of randomly selected differentially expressed lncRNAs.

Note: X-axis represents selected 8 differential expressed transcripts in two libraries. Here, GAPDH was chosen as the reference gene. Relative expression value per selected transcripts between uniparous goats (Uni) and multiparous goats (Mul) samples was calculated (y-axis). Superscript letters indicate significant difference at the level of 0.05.

Real-time PCR results of randomly selected differentially expressed lncRNAs.

Note: X-axis represents selected 8 differential expressed transcripts in two libraries. Here, GAPDH was chosen as the reference gene. Relative expression value per selected transcripts between uniparous goats (Uni) and multiparous goats (Mul) samples was calculated (y-axis). Superscript letters indicate significant difference at the level of 0.05.

Function analyses of target genes of differentially expressed lncRNAs

The target genes of the differentially expressed lncRNAs were predicted through Cis role. After searching the coding genes 100 kb upstream and downstream of lncRNA, we analyzed the function of the target genes through GO and KEGG enrichment analysis. In total, there were 32,627 genes were predicted as the target genes of differentially lncRNAs between the Uni and Mul (S8 and S9 Tables). Based on the gene background, there were 439, 432, and 407 target genes clustered into 141, 198, and 868 GO terms in component, function and process ontology, respectively. The GO terms “Cell part”, “Binding” and “Cellular process” were the biggest terms, which included the most target genes, in the above ontologies, respectively. However, compared with the reference gene background, no GO term was significantly enriched (p-value ≤ 0.05). The “Phosphatidylinositol binding” and “Cyclic-nucleotide phosphodiesterase activity” in function ontology, and “Cholesterol metabolic process”, “Cell aging”, “Aging”, “Steroid metabolic process” in process ontology were the only GO terms with corrected p-value < 1 (S10 Table). According to the KEGG pathway analysis, a total of 875 target genes, which showed 22,885 background genes, of the differentially expressed lncRNAs between the Uni and Mul, were annotated for 236 pathways. Among them, 12 pathways, “mTOR signaling pathway (ko04150)”, “Measles (ko05162)”, “Olfactory transduction (ko04740)”, “Regulation of autophagy (ko04140)”, “Insulin signaling pathway (ko04910)”, “RNA degradation (ko03018)”, “RNA polymerase (ko03020)”, “Nucleotide excision repair (ko03420)”, “Adipocytokine signaling pathway (ko04920)”, “Fatty acid biosynthesis (ko00061)” “Purine metabolism (ko00230)”, and “SNARE interactions in vesicular transport (ko04130)”, were significantly enriched (p-value ≤ 0.05). In addition, five pathways, “Progesterone-mediated oocyte maturation (ko04914) (p-value = 0.1484)”, “Steroid biosynthesis (ko00100) (p-value = 0.1511)”, “Oocyte meiosis (ko04114) (p-value = 0.1963)”, “GnRH signaling pathway (ko04912) (p-value = 0.5711)”, and “Steroid hormone biosynthesis (ko00140) (p-value = 0.2323)” were related to the animal reproduction [32-34] (S11 Table).

Discussion

In the study, we used RNA-seq to study the lncRNAs expressed in the ovaries of multiparous (Mul) and uniparous (Uni) goat to explore the function of lncRNAs in ovulation and lambing. In total, 107,255,422 clean reads were obtained from the two libraries, and 52,178,296 and 55,077,126 clean reads in the Uni and Mul libraries, respectively, were mapped to the genome. However, a small part of the lncRNA sequence still could not be aligned to the target genome and located to a specific chromosome. Because the sequence was broken up into smaller pieces called segments, with the software inferring that the read spans a splice junction and estimating the location of the splice sites of the junction. On one hand, we believed that this shortcoming may be because the reference goat genome is still not perfect, with more pieces to be spliced. On the other hand, there may be post-transcriptional RNA editing, resulting in inconsistencies between the lncRNA sequence and the corresponding DNA sequence. Some lncRNAs highly expressed in the ovary may lack a classical poly (A) tail [35]. In the renal medulla, 28% of lncRNAs were found to lack a classical poly (A) tail [12]. Strand-specific sequences rather than 3’-directed cDNA sequences were used in this study. Strand-specific sequences help to determine the polarity of transcripts and improve the accuracy of the quantification of antisense transcripts. When lncRNA and mRNA were isolated from total RNA, the ribo-depleted method was used instead of the poly (A) enrichment method. Large-scale bioinformatic studies suggest that a significant fraction (>24%) of long non-coding transcripts present in cells may lack a classical poly (A) tail. In this study, we searched for lncRNAs located in unknown regions if they could be found upstream or downstream of a gene to predict the function of the lncRNA. TCONS_00076241, TCONS_00087806, and TCONS_00076240 were upstream of XM_005686239.1 (BUB1). TCONS_00248477 was downstream of XM_005676195.1 (MAP3K19). TCONS_00136407 and TCONS_00146968 were downstream of XM_005689083.1 (MOS). Studies have shown that XM_005689083.1 and XM_005686239.1 play a role in meiosis I in the oocyte [36]. XM_005676195.1 may regulate GnRH in the GnRH signaling pathway. Bta-mir-365-1, cfa-mir-493 [37], and bta-mir-493 [38] are precursors of TCONS_00284825 and oar-mir-493 [39] is a precursor of TCONS_00320849 in pre-miRNA prediction. TCONS_00320849 has a mature miRNA sequence, miR-365a, with the same mature sequence in the miRBase database. miR-365 can regulate proliferation by mediating the expression of KRAS [40]. miR-365 regulates cell cycle progression and apoptosis in various types of tumors [41]. Thus TCONS_00136407, TCONS_00146968, and TCONS_00320849 may also play roles in meiosis in the oocyte. To further detect the lncRNAs, which may participate in the ovulation and lambing of goat, we analyzed the differentially expressed lncRNAs between the Uni and Mul. A total of 183,754 lncRNAs were significantly differently expressed between the two libraries. Among them, 455 lncRNAs, which included 188 known genes and 267 predicted transcripts, were co-expressed between the two samples. In addition, 157,523 lncRNAs were uniquely expressed in the Uni, and 25,776 lncRNAs were uniquely expressed in the Mul. Accordingly, we analyzed the lncRNAs which have high expression levels and were significantly difference expressions between the Uni and Mul libearies. XLOC_013414 was differentially expressed lncRNA with the highest expression level in the Uni library, whereas 102188530 (COL1A1) had the highest expression in the Mul library. The function of XLOC_013414 is unknown but COL1A1 polymorphisms may individually play minor roles in osteoporosis and fracture [42]. Meanwhile, XLOC_500835, XLOC_075014, XLOC_523784 were significantly differentially expressed novel lncRNA between the Uni and the Mul. And their target genes were KIAA0368, CNGA3 and MID2, respectively. KIAA0368 as selectively up-regulated by contact allergene, may be potential markers of skin irritation and allergy [43]. Achromatopsia can be caused by mutations in the CNGA3 gene [44]. MID2 leads to misregulation of microtubule organization and the downstream disease pathology associated with X-linked intellectual disabilities [45]. In consideration of the specific and higher expression levels of XLOC_013414 and COL1A1, some further research of their roles in the ovary is warranted. According to the KEGG pathway analysis, a total of 875 target genes, which showed 22,885 background genes, of the differentially expressed lncRNAs between the Uni and Mul were annotated to 236 pathways. Among them, 12 pathways were significantly enriched (p-value ≤ 0.05). In addition, five pathways, “Progesterone-mediated oocyte maturation”, “Steroid biosynthesis”, “Oocyte meiosis”, “GnRH signaling pathway”, and “Steroid hormone biosynthesis” were related to the animal reproduction. An increased FSH secretion level will lead to multiple follicular development and ovulation in the follicular phase [46-48]. In addition, Neat1 knockout mice fail to become pregnant despite normal ovulation [49]. Ovarian hormones can regulate the expression of Foxl2 to expand the number of genes controlled by the hypothalamic–pituitary–gonadal axis, ultimately dictating reproductive fitness [16]. In summary, the differentially expressed profile of lncRNAs was successfully established in the Uni and Mul phase libraries. Through Cis role analysis, 24 lncRNAs were predicted to overlap with cis-regulatory elements, which involved in Progesterone-mediated oocyte maturation, Steroid biosynthesis, Oocyte meiosis, and gonadotropin-releasing hormone (GnRH) signaling pathway. Meanwhile, the KEGG pathway analysis on target genes of the differentially expressed lncRNAs confirmed this results. In addition, 10 lncRNAs harbored precursors of 40 miRNAs, which was reported to be related to proliferation, were annotated in the precursor analysis of miRNAs. The present expand the understanding of lncRNA biology and will provide a resource for lncRNA studies of ovulation and lambing.

Base composition analysis of sample Uni and Mul.

(JPG) Click here for additional data file.

Summary of lncRNA primers sequences for the RT-PCR.

(XLSX) Click here for additional data file.

Statistics of clean data mapping to rRNA by soap.

(XLSX) Click here for additional data file.

Statistics of clean data without rRNA mapping to genome by tophat.

(XLSX) Click here for additional data file.

Cuffcompare category to distinguish transcripts by 12 symbols.

(XLS) Click here for additional data file.

Result of comparison with known pre-miRNA.

(XLS) Click here for additional data file.

Result of pre-miRNA prediction.

(XLSX) Click here for additional data file.

The differentially expressed lncRNAs between the two libraries.

(XLSX) Click here for additional data file.

The protein-coding genes within 100 k downstream of an lncRNA.

(XLSX) Click here for additional data file.

The protein-coding genes within 100 k upstream of an lncRNA.

(XLSX) Click here for additional data file.

10 GO analysis for the target genes of the differentially expressed lncRNAs.

(XLSX) Click here for additional data file.

KEGG pathways for the target genes of the differentially expressed lncRNAs.

(XLS) Click here for additional data file.
  48 in total

1.  Assessing the effect of the CLPG mutation on the microRNA catalog of skeletal muscle using high-throughput sequencing.

Authors:  Florian Caiment; Carole Charlier; Tracy Hadfield; Noelle Cockett; Michel Georges; Denis Baurain
Journal:  Genome Res       Date:  2010-10-13       Impact factor: 9.043

2.  SOAP2: an improved ultrafast tool for short read alignment.

Authors:  Ruiqiang Li; Chang Yu; Yingrui Li; Tak-Wah Lam; Siu-Ming Yiu; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

Review 3.  Long noncoding RNAs: functional surprises from the RNA world.

Authors:  Jeremy E Wilusz; Hongjae Sunwoo; David L Spector
Journal:  Genes Dev       Date:  2009-07-01       Impact factor: 11.361

4.  Differential methylation status of IGF2-H19 locus does not affect the fertility of crossbred bulls but some of the CTCF binding sites could be potentially important.

Authors:  Subas C Jena; Sandeep Kumar; Sandeep Rajput; Bhaskar Roy; Arpana Verma; Arumugam Kumaresan; Tushar K Mohanty; Sachinandan De; Rakesh Kumar; Tirtha K Datta
Journal:  Mol Reprod Dev       Date:  2014-02-18       Impact factor: 2.609

5.  Rod and rod-driven function in achromatopsia and blue cone monochromatism.

Authors:  Anne Moskowitz; Ronald M Hansen; James D Akula; Susan E Eklund; Anne B Fulton
Journal:  Invest Ophthalmol Vis Sci       Date:  2008-09-29       Impact factor: 4.799

6.  Follicle stimulating hormone (FSH) dynamics of low dose step-up ovulation induction with FSH in patients with polycystic ovary syndrome.

Authors:  M van der Meer; P G Hompes; F Scheele; E Schoute; S Veersema; J Schoemaker
Journal:  Hum Reprod       Date:  1994-09       Impact factor: 6.918

7.  Maximal expression of Foxl2 in pituitary gonadotropes requires ovarian hormones.

Authors:  Maria K Herndon; John H Nilson
Journal:  PLoS One       Date:  2015-05-08       Impact factor: 3.240

8.  Characteristics of long non-coding RNAs in the Brown Norway rat and alterations in the Dahl salt-sensitive rat.

Authors:  Feng Wang; Liping Li; Haiming Xu; Yong Liu; Chun Yang; Allen W Cowley; Niansong Wang; Pengyuan Liu; Mingyu Liang
Journal:  Sci Rep       Date:  2014-11-21       Impact factor: 4.379

9.  MicroRNAs for Fine-Tuning of Mouse Embryonic Stem Cell Fate Decision through Regulation of TGF-β Signaling.

Authors:  Christiana Hadjimichael; Christoforos Nikolaou; Joseph Papamatheakis; Androniki Kretsovali
Journal:  Stem Cell Reports       Date:  2016-02-11       Impact factor: 7.765

Review 10.  Considerations when investigating lncRNA function in vivo.

Authors:  Andrew R Bassett; Asifa Akhtar; Denise P Barlow; Adrian P Bird; Neil Brockdorff; Denis Duboule; Anne Ephrussi; Anne C Ferguson-Smith; Thomas R Gingeras; Wilfried Haerty; Douglas R Higgs; Eric A Miska; Chris P Ponting
Journal:  Elife       Date:  2014-08-14       Impact factor: 8.140

View more
  8 in total

Review 1.  Statistical analysis of non-coding RNA data.

Authors:  Qianchuan He; Yang Liu; Wei Sun
Journal:  Cancer Lett       Date:  2018-01-04       Impact factor: 8.679

2.  Long Non-Coding RNA GDAR Regulates Ovine Granulosa Cells Apoptosis by Affecting the Expression of Apoptosis-Related Genes.

Authors:  Yong Wang; Yunxia Guo; Chunhui Duan; Ruochen Yang; Lechao Zhang; Yueqin Liu; Yingjie Zhang
Journal:  Int J Mol Sci       Date:  2022-05-06       Impact factor: 6.208

3.  Identification of Long Noncoding RNAs Involved in Eyelid Pigmentation of Hereford Cattle.

Authors:  Eugenio Jara; Francisco Peñagaricano; Eileen Armstrong; Claudia Menezes; Lucía Tardiz; Gastón Rodons; Andrés Iriarte
Journal:  Front Genet       Date:  2022-05-04       Impact factor: 4.772

4.  Comprehensive analysis of mRNAs and miRNAs in the ovarian follicles of uniparous and multiple goats at estrus phase.

Authors:  Xian Zou; Tingting Lu; Zhifeng Zhao; Guangbin Liu; Zhiquan Lian; Yongqing Guo; Baoli Sun; Dewu Liu; Yaokun Li
Journal:  BMC Genomics       Date:  2020-03-30       Impact factor: 3.969

5.  Identification and Comparative Analysis of Long Non-coding RNAs in High- and Low-Fecundity Goat Ovaries During Estrus.

Authors:  Yaokun Li; Xiangping Xu; Ming Deng; Xian Zou; Zhifeng Zhao; Sixiu Huang; Dewu Liu; Guangbin Liu
Journal:  Front Genet       Date:  2021-06-25       Impact factor: 4.599

6.  Differential Expression and Functional Analysis of CircRNA in the Ovaries of Low and High Fecundity Hanper Sheep.

Authors:  Aiju Liu; Xiaoyong Chen; Menghe Liu; Limeng Zhang; Xiaofei Ma; Shujun Tian
Journal:  Animals (Basel)       Date:  2021-06-23       Impact factor: 2.752

7.  Genome-wide identification of tissue-specific long non-coding RNA in three farm animal species.

Authors:  Colin Kern; Ying Wang; James Chitwood; Ian Korf; Mary Delany; Hans Cheng; Juan F Medrano; Alison L Van Eenennaam; Catherine Ernst; Pablo Ross; Huaijun Zhou
Journal:  BMC Genomics       Date:  2018-09-18       Impact factor: 3.969

8.  Genome‑wide integrated analysis demonstrates widespread functions of lncRNAs in mammary gland development and lactation in dairy goats.

Authors:  Zhibin Ji; Tianle Chao; Zhaohua Liu; Lei Hou; Jin Wang; Aili Wang; Jie Zhou; Rong Xuan; Guizhi Wang; Jianmin Wang
Journal:  BMC Genomics       Date:  2020-03-23       Impact factor: 3.969

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.