| Literature DB >> 34082708 |
Dahai Liu1, Li Liu2, Kui Duan2, Junqiang Guo3, Shipeng Li2, Zhigang Zhao2, Xiaotuo Zhang4, Nan Zhou3, Yun Zheng5,6.
Abstract
BACKGROUND: Transposable elements (TE) account for more than 50% of human genome. It has been reported that some types of TEs are dynamically regulated in the reprogramming of human cell lines. However, it is largely unknown whether some TEs in Macaca mulatta are also regulated during the reprogramming of cell lines of monkey.Entities:
Keywords: Endogenous retrovirus (ERV); Fibroblast; Induced pluripotent stem cell (iPSC); Long terminal repeat (LTR); Neuroepithelial stem cell (NESC); Transcription; Transposable element (TE)
Mesh:
Substances:
Year: 2021 PMID: 34082708 PMCID: PMC8176597 DOI: 10.1186/s12864-021-07717-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1The clusters of gene with different expression patterns during the reprogramming of fibroblast cells in Macaca mulatta.(A) The clusters of genes and their expression patterns at different stages of reprogramming. The numbers in the plots are the numbers of genes in the clusters. Blue lines indicate the average expression levels of genes in the cluster in different cell lines. Red lines indicate average expressions plus/minus standard deviations. (B) The expressions of two representative genes in each cluster of Part (A). (C) The most significantly enriched GO terms in the G0 cluster. (D) The most significantly enriched GO terms in the G10 cluster. (E) The most significantly enriched GO terms in the G6 cluster. (F) The most significantly enriched GO terms in the G5 cluster. In Part (C) to (F), the top 7 GO terms with the smallest multiple test corrected P-values in the three major GO categories were shown. The source data of Part (C) to (F) are available in Additional file 1: Table S4, S14, S10, and S9, respectively
Fig. 2The clusters of TEs with different expression patterns during the reprogramming of fibroblast cells in Macaca mulatta. (A) The clusters of TEs and their expression patterns at different stages of reprogramming. (B) The categories of TEs in different clusters. Blue lines indicate the average expression levels of genes in the cluster in different cell lines. Red lines indicate average expressions plus/minus standard deviations. (C) - (D) Several TEs that belong to the Cluster T0 of Part (A). (E) - (F) Several TEs that belong to the Cluster T1 in Part (A). (G) - (I) Several MacERV3 integrase elements that belong to the Cluster T3 in Part (A). In Part (C) to (I), one of the two replicated RNA-Seq profiles for each lines were shown
Fig. 3The clusters of LTRs with different expression patterns during the reprogramming of fibroblast cells in Macaca mulatta. In Part (B) to (D), the numbers in the trees are bootstrap values greater than 500 (50%). (A) The clusters of LTRs and their expression patterns at different stages of reprogramming. Blue lines indicate the average expression levels of genes in the cluster in different cell lines. Red lines indicate average expressions plus/minus standard deviations. (B) The phylogenetic tree of LTRs in the Clusters L0 and L3 of Part (A). (C) The phylogenetic tree of LTRs in the Cluster L1 in Part (A). (D) The phylogenetic tree of LTRs in the Cluster L4 in Part (A). (E) The expression levels of LTRs in the Clusters L0 and L3 of Part (A). (F) - (G) The expressions of two solo LTRs in the 7 cell lines. One of the two replicated RNA-Seq profiles for each lines were shown
Fig. 4Validating the expressions of some MacERV3 LTRs with PCR experiments. (A) The relative expressions of 13 MacERV3 LTRs detected with the qRT-PCR experiments. (B) The expressions of the two MacERV3 LTRs in Part (A) detected with RNA-Seq profiles. One of the two replicated RNA-Seq profiles for each lines were shown. (C) The expressions of the LTRs in Part (A) examined with semi-quantitative RT-PCR in the samples used. GAPDH was used as an internal control. Three replicates (r1 to r3) were included for each of the 7 cell lines. (D) The sequence and scores of nucleotides of the PCR product. (E) The genomic loci of MacERV3_LTR2_34. The positions of the PCR product and primers were shown in different lanes below MacERV3_LTR2_34
Fig. 5The enriched sequence motifs in the 60 LTRs activated at the early stages of reprogramming rhesus monkey fibroblast cells to NESCs. (A) The five motifs with the smallest E-values and their corresponding putative TFs. (B) The expression levels of the TFs in Part (A). Only those TFs with at least 5 FPKM in at least one of the 7 samples and standard deviation values of at least 5 FPKM were shown. (C) The distribution of motifs on some of the selected LTRs. M1 to M5 represents Motif 1 to Motif 5 in Part (A), respectively. (D) The bidirectional clustering of TEs and TFs in Part (B) based on the correlation coefficient values of their expression levels
Fig. 6A brief summary of KLF4, LTR and KRAB-ZFP/TRIM28 regulatory systems in iPSCs, NESCs and ESCs of different vertebrates. Arrows from TFs to LTRs mean direct regulatory relations. Dashed arrow from KLF4 to LTRs of MacERV3 means predicted regulatory relations based on motif analysis with MEME and correlation coefficient between expression levels of KLF4 and these LTRs. “?”: The KRAB-ZFP concordant with TRIM28 in human and mouse iPSC generation and the recruitment of TRIM28 to LTRs of MacERV3 by an unknown KRAB for monkey are still to be verified