| Literature DB >> 32640983 |
Handan Can1, Sree K Chanumolu1, Elena Gonzalez-Muñoz2,3, Sukumal Prukudom4, Hasan H Otu5, Jose B Cibelli6.
Abstract
BACKGROUND: Zebrafish is a popular model organism, which is widely used in developmental biology research. Despite its general use, the direct comparison of the zebrafish and human oocyte transcriptomes has not been well studied. It is significant to see if the similarity observed between the two organisms at the gene sequence level is also observed at the expression level in key cell types such as the oocyte.Entities:
Keywords: Oocyte; Orthology; RNA-seq; Transcriptome; Zebrafish
Year: 2020 PMID: 32640983 PMCID: PMC7346435 DOI: 10.1186/s12864-020-06860-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Transcripts per million (TPM) distribution for the nine samples used in our analysis: TPM values are divided into five intervals for each sample and the number of genes in each interval are shown. Biological replicates are indicated with lower case letters, a,b,c. Sample order follows the two human datasets (H1 and H2) followed by our zebrafish dataset (ZF)
Fig. 2Gene types that form an orthologous pair between human and zebrafish
Fig. 3Quantile mapping between pairs of data sets: (a) H1 vs. ZF, (b) H2 vs. ZF, and (c) H1 vs. H2. For each mapping, a heatmap shows the number of common genes in each quantile. For across organism mappings (a and b), Row 11: genes that are expressed in zebrafish, have a high-confident orthologue in human, but are not expressed in human; Row 12: genes that are expressed in zebrafish but do not have a high-confidence orthologue in human; Column 11: genes that are expressed in human, have a high-confident orthologue in zebrafish, but are not expressed in zebrafish; Column 12: genes that are expressed in human but do not have a high-confidence orthologue in zebrafish. For H1-H2 mapping, Row/Column 11 identify the genes that are expressed in only one of the datasets. For each quantile, we also show the average TPM value shown in data value bars with a yellow background. In (d), we summarize the overlap between the top 30% of highly expressed (the 3 × 3 top-left corner of the quantile mappings in a and b) genes that are high-confidence orthologs across the two organisms for the H1 and H2 datasets
Top 25 genes that are orthologous between human and zebrafish and expressed in the top 30% of all three data sets. Average TPM was calculated using all nine samples. For a gene family, e.g., “ribosomal proteins,” only the top representative is listed. The complete list of genes can be found in Supplementary file 3
| Rank | GeneID (ENSG00000+) | Symbol | Description | Average TPM |
|---|---|---|---|---|
| 1 | 198712 | MT-CO2 | Mitochondrially encoded cytochrome c oxidase | 16,481 |
| 3 | 198886 | MT-ND4 | Mitochondrially encoded NADH | 9839 |
| 7 | 198899 | MT-ATP6 | mitochondrially encoded ATP synthase | 8180 |
| 9 | 130816 | DNMT1 | DNA methyltransferase 1 | 5089 |
| 10 | 132646 | PCNA | Proliferating cell nuclear antigen | 4380 |
| 11 | 173207 | CKS1B | CDC28 protein kinase regulatory subunit | 4273 |
| 13 | 138326 | RPS24 | Ribosomal protein | 2701 |
| 15 | 182004 | SNRPE | Small nuclear ribonucleoprotein polypeptide E | 2497 |
| 19 | 137707 | BTG4 | BTG anti-proliferation factor 4 | 1994 |
| 21 | 120533 | ENY2 | Transcription and export complex 2 subunit | 1796 |
| 23 | 113387 | SUB1 | SUB1 homolog, transcriptional regulator | 1663 |
| 24 | 113558 | SKP1 | S-phase kinase associated protein 1 | 1656 |
| 25 | 170315 | UBB | Ubiquitin B | 1611 |
| 27 | 132341 | RAN | Member RAS oncogene family | 1572 |
| 31 | 122674 | CCZ1 | Vacuolar protein trafficking and biogenesis | 1526 |
| 33 | 198668 | CALM | Calmodulin | 1464 |
| 36 | 134057 | CCNB1 | Cyclin B1 | 1384 |
| 37 | 132780 | NASP | Nuclear autoantigenic sperm protein | 1379 |
| 39 | 173812 | EIF1 | Eukaryotic translation initiation factor 1 | 1285 |
| 40 | 221983 | UBA52 | Ubiquitin A−52 residue ribosomal protein fusion product 1 | 1183 |
| 43 | 076043 | REXO2 | RNA exonuclease 2 | 1056 |
| 46 | 115540 | MOB4 | MOB family member 4, phocein | 1007 |
| 49 | 182117 | NOP10 | NOP10 ribonucleoprotein | 962 |
| 56 | 214102 | WEE2 | WEE1 homolog 2 | 809 |
| 58 | 162961 | DPY30 | Histone methyltransferase complex regulatory subunit | 790 |
Fig. 4Sample similarity between the oocytes: (a) Hierarchical clustering and (b) principal components analysis (PCA) of the 9 samples using the 397 highly concordant orthologous genes. In (b), the percent variation explained by each PC is shown in parentheses
Fig. 5Summary of IPA results based on the 397 highly concordant orthologous genes. a, b Top Biofunctions and the most significantly enriched Canonical Pathways identified by IPA. Bars represent the number of genes in the functional category or the canonical pathway (primary y-axis) and the orange line represents the significance of the category or the pathway in -Log(p-value) (secondary y-axis). c Upstream regulators that target a significant portion of the genes in the input list. The inferred activation states of the regulators based on the observed expression of their targets are noted (e.g. an increased expression in targets that are induced by a regulator may imply an “activated” state for the regulator). N/A implies an inconclusive activation state of the regulator. d Number of genes and emerging biological functions in the deduced interaction networks that involve input genes. e Sets of regulators with a combined target gene set that show concordant enrichment in biological functions. Bars represent the total number of genes targeted by each set of regulators. On each bar, the biological functions that are significantly enriched by the target genes are noted
Thirty-six out of the 397 highly concordant orthologous genes with an epigenetic function based on the EpiFactors database. Genes are sorted by decreasing abundance
| Symbol | Description | Function | SpecificTarget | |
|---|---|---|---|---|
| DNMT1 | DNA methyltransferase 1 | DNA modification | dhC | |
| PCNA | Proliferating cell nuclear antigen | Chromatin remodeling | H2A, H2B | |
| SKP1 | S-phase kinase associated protein 1 | Histone modification write cofactor | N/A | |
| ENY2 | ENY2, transcription and export complex 2 subunit | Histone modification erase cofactor | N/A | |
| DPY30 | Dpy-30, histone methyltransferase complex regulatory subunit | Histone modification write cofactor | N/A | |
| NAP1L1 | Nucleosome assembly protein 1 like | Histone chaperone | N/A | |
| MORF4L1 | Mortality factor 4 like 1 | Histone modification read | H4 | |
| NASP | Nuclear autoantigenic sperm protein | Chromatin remodeling | H1 | |
| UHRF1 | Ubiquitin like with PHD and ring finger domains 1 | Histone modification read, Histone modification write cofactor | H3K9me3, H3R2, H3, mCG | |
| PCGF1 | Polycomb group ring finger 1 | Polycomb group (PcG) protein | N/A | |
| EXOSC7 | Exosome component 7 | Scaffold protein, RNA modification | N/A | |
| ACTB | Actin beta | Chromatin remodeling cofactor | N/A | |
| BRD7 | Bromodomain containing 7 | Histone modification read | H3K9ac, H3K14ac, H3K8ac | |
| YWHAB | Tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein beta | Histone modification erase cofactor | N/A | |
| CHEK1 | Checkpoint kinase 1 | Histone modification write | H3.1 | |
| ANP32B | Acidic nuclear phosphoprotein 32 family member B | Histone chaperone | H3, H4 | |
| CDK7 | Cyclin dependent kinase 7 | Histone modification write | H1 | |
| ACTL6A | Actin like 6A | Chromatin remodeling cofactor | N/A | |
| CUL1 | Cullin 1 | Chromatin remodeling cofactor | H3K9me3, H3K36me3, H1.4K26me3 | |
| VRK1 | Vaccinia related kinase 1 | Histone modification write | H3S10, H3T3 | |
| MEAF6 | MYST/Esa1 associated factor 6 | Histone modification write cofactor | H2A, H3K14, H4K5, H4K8, H4K12 | |
| MTA3 | Metastasis associated 1 family member 3 | Chromatin remodeling cofactor | N/A | |
| EXOSC9 | Exosome component 9 | Scaffold protein, RNA modification | N/A | |
| RBX1 | Ring-box 1 | Histone modification write cofactor | H3, H4 | |
| MBIP | MAP 3 K12 binding inhibitory protein 1 | Histone modification write cofactor | N/A | |
| TRIM33 | Tripartite motif containing 33 | Histone modification read | H3K9me3, H3K18ac | |
| HELLS | Helicase, lymphoid specific | Chromatin remodeling | N/A | |
| SIN3A | SIN3 transcription regulator family member A | Histone modification erase cofactor, TF | DNA motif | |
| DPF2 | Double PHD fingers 2 | Chromatin remodeling | N/A | |
| C17orf49 | Chromosome 17 open reading frame 49 | Histone modification read | H3K4me3 | |
| CBX1 | Chromobox 1 | Histone modification read | H3K9me3, H3K27me3 | |
| UBE2N | Ubiquitin conjugating enzyme E2 N | Histone modification write | H2AX | |
| UBE2H | Ubiquitin conjugating enzyme E2 H | Histone modification write | H2A, H2B | |
| KANSL2 | KAT8 regulatory NSL complex subunit 2 | Histone modification write cofactor | H5 | |
| APEX1 | Apurinic/apyrimidinic endodeoxyribonuclease 1 | DNA modification cofactor | N/A | |
| DDB1 | Damage specific DNA binding protein 1 | Histone modification write | H2A |
Fig. 6Significantly enriched KEGG pathways based on DAVID functional analysis using genes that are highly expressed in the individual datasets. Bars represent the number of highly expressed genes in the pathway for each data set