| Literature DB >> 27315811 |
Yujiao Dang1, Liying Yan1,2, Boqiang Hu1, Xiaoying Fan1, Yixin Ren1,2, Rong Li1,2, Ying Lian1,2, Jie Yan1,2, Qingqing Li1, Yan Zhang1,2, Min Li1,2, Xiulian Ren1,2, Jin Huang1,2, Yuqi Wu1,2, Ping Liu1,2, Lu Wen1, Chen Zhang1, Yanyi Huang3,4,5, Fuchou Tang6,7,8,9, Jie Qiao10,11,12,13.
Abstract
BACKGROUND: PolyA- RNAs have not been widely analyzed in human pre-implantation embryos due to the scarcity of materials. In particular, circular RNA (circRNA), a novel type of polyA- RNA, has not been characterized during human pre-implantation development.Entities:
Keywords: RNA de novo assembly; SUPeR-seq; circular RNA; human pre-implantation embryos; mRNA quantification
Mesh:
Substances:
Year: 2016 PMID: 27315811 PMCID: PMC4911693 DOI: 10.1186/s13059-016-0991-3
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Morphology of human early embryos and global expression pattern of RefSeq genes during human pre-implantation development. a Microscopy images of human mature oocyte and pre-implantation embryos at zygote (2PN), 2-cell, 4-cell, 8-cell, morula, early blastocyst, blastocyst, and hatched blastocyst stages. Scale bar, 50um. b PCA of the transcriptome of single embryos during human pre-implantation development. c Pearson correlation coefficient heat map of single embryo transcriptomes during human pre-implantation development
Fig. 2mRNA quantification and analysis of differential expressed genes. a mRNA copy numbers in each individual embryo during human pre-implantation development were estimated by the algorithm of ERCC/RGC-A80 normalization. The calculation process is described in the section of “Methods.” b Clusters of differential expressed genes by normalized RPKM accounting the total mRNA content in each sample during pre-implantation development. The first cluster includes 5573 maternal genes, such as ZP family genes, TET3, and GDF9. These 7427 genes from the 2–5 clusters which are activated from the 4-cell stage during the subsequent developmental stages are defined as zygotic genes. The representative GO terms of each cluster and corresponding P value are shown at the right panel
mRNA and circRNA quantification by ERCC/RGC-A80 normalization algorithm
| Stage | Oocyte | Zygote | 2-cell | 4-cell | 8-cell | Morula | Early blastocyst | Blastocyst | Hatched blastocyst |
|---|---|---|---|---|---|---|---|---|---|
| Average mRNA copy number (106) | 66.4 | 29.0 | 24.6 | 17.0 | 15.7 | 38.1 | 45.9 | 104.5 | 150.7 |
| Average circRNA copy number | 56,059 | 48,983 | 80,410 | 190,008 | 148,656 | 149,508 | 93,745 | 40,194 | 28,774 |
| circRNA/hosting gene (median) | 9 % | 7 % | 9 % | 12 % | 14 % | 25 % | 22 % | 17 % | 20 % |
| No. of circRNA genes per embryo | 149 | 299 | 358 | 727 | 602 | 376 | 519 | 252 | 80 |
| No. of circRNA transcript types per embryo | 179 | 392 | 488 | 1,115 | 869 | 509 | 630 | 278 | 85 |
| No. of circRNA reads per embryo | 585 | 1529 | 2248 | 8059 | 8024 | 2702 | 2911 | 838 | 225 |
Fig. 3Genomic features of circRNAs expressed during human pre-implantation development. a Distribution of the number of different types of circRNA transcripts from each circRNA hosting gene. b Distribution of the back-spliced exons in circRNAs. Nearly all (99.9 %) back-spliced exons that contribute to circRNAs are located in the middle of their hosting genes, whereas six are in the first exon and none are in the last exon, as annotated. c An example in which potential extra exons are located upstream of the annotated first exon that participates in the circularization of FAT3. Back-spliced reads of FAT3 circRNA are presented as a red curve. The peaks connected by the green dashed line are the reads mapped to the first exon that participates in the circularization, and the extra exon which is not annotated, simultaneously. d Distribution of the number of back-spliced exons in each circRNA. More than 95 % of circRNAs contain multiple back-spliced exons and more than half of them contain 2–6 exons. The maximum number of exons in a single circRNA was 56. e Length distribution of back-spliced exons. The box plots show that the exon length distribution from the circRNA consisted of a different number of back-spliced exons (***P value = 7.4E-32, Student’s t-test). f Distribution of the number of Alu elements in flanking and all other introns. The median number is given in the bracket. The number of Alu elements in flanking intron (upstream in blue and downstream in purple) is much higher than that in the randomly selected control introns no matter in circRNA (in green) or not (in red)
Fig. 4CircRNA and mRNA expression pattern during human pre-implantation development. a The pattern of circRNAs percentage in their hosting genes changes during pre-implantation development based on the ratio of back-spliced reads in total reads at each junction locus, the equation is: back-spliced reads/(back-spliced reads + forward-spliced reads). b Comparison of the expression levels of circRNA hosting genes and other coding genes in each stage during pre-implantation development. The expression levels of the hosting genes only include linear transcripts by excluding circular transcripts (RPKMdiscounted = RPKM×(1-circRNA ratio in Fig. 4a). The expression levels of circRNA hosting genes are higher than those of other coding genes before the 8-cell stage, whereas the opposite pattern is observed after the 8-cell stage (***P value <0.001, *P value <0.1, Student’s t-test). c The CLR values comparison of hosting genes from maternal genes and zygotic genes, respectively, during human pre-implantation development. d The circRNA percentage in their hosting genes which are maternal genes and have circular transcripts detected before 4-cell stage, as 950 genes from 1554 maternal hosting genes
Fig. 5Comparative analysis of human and mouse circRNAs. a High conservation of circRNA hosting genes between human and mouse pre-implantation embryos. The Venn diagram shows that the majority of genes that express circRNAs in mouse also produce circular transcripts in human embryos. b GO analysis of the top enriched terms of the circRNA hosting genes in human pre-implantation embryos. c The box plots show the numbers of expressed circRNA hosting genes in human embryos (in red) and mouse embryos (in blue), during pre-implantation development, and the number was normalized to the total mapped reads (in millions) after subsampling the sequencing data. d Expression level of circRNA hosting genes in human embryo and mouse embryo. All circRNA hosting genes are plotted according to the value of log10 of (max RPKM during pre-implantation) in human and mouse embryo, respectively. e Heat map of circularized level of highly expressed circRNA hosting genes (RPKM >10) during human and mouse pre-implantation development. The numbers of circular isoforms from each hosting gene were presented and the columns on the top of the heat map show the average number of circular isoforms from each hosting gene who were producing circular transcripts at this stage. The number of circRNA isoforms is frequently higher than that in mouse. The shared, human-specific, and mouse-specific hosting genes are presented, respectively. f Box plots show the distribution of the length of introns flanking the back-spliced sites from the highly expressed hosting genes after subsampling the sequencing data. The median value of intron length is shown in the box. The back-spliced sites of human-specific and mouse-specific circRNAs are converted using liftover tool, respectively. The numbers of introns are indicated in the bracket
Fig. 6Expression pattern of de novo assembled transcripts during human pre-implantation development. a The box plots show that the expression levels of the novel assembled genes are lower than those of the RefSeq genes but higher than those of the lncRNAs from Noncode V4.0 and our previous paper. b Hierarchical clustering analysis of novel genes indicating stage-specific expression patterns during human pre-implantation. Based on the heat map, these novel genes can be divided into three major types: early zygotic genes, maternal genes, and late zygotic genes. c The pattern of total expression levels of all novel 2322 candidate genes revealing that genes enriched before the 8-cell stage are mostly maternal genes