| Literature DB >> 30514206 |
Shicheng Yu1,2, Chaoran Zheng3, Fan Zhou4, David L Baillie5, Ann M Rose6, Zixin Deng7, Jeffrey Shih-Chieh Chu8.
Abstract
BACKGROUND: Essential genes are required for an organism's viability and their functions can vary greatly, spreading across many pathways. Due to the importance of essential genes, large scale efforts have been undertaken to identify the complete set of essential genes and to understand their function. Studies of genome architecture and organization have found that genes are not randomly disturbed in the genome.Entities:
Keywords: Essential gene; Functional characterization; Genetic balancer; Lethal; Whole genome sequencing (WGS)
Mesh:
Substances:
Year: 2018 PMID: 30514206 PMCID: PMC6278001 DOI: 10.1186/s12864-018-5251-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Biological functions of the identified 44 essential genes
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
|
|
| h191 | C- > T | R- > X | Phosphatidylinositol 3- and 4-kinase | Replication, recombination and repair | I,F,M,N |
|
|
| h500 | G- > A | R- > K | GNAT acetyltransferase 2 | General function prediction only | I,F,M,N |
|
|
| h449,h839,h222 | C- > T | Q- > X | Nucleoporin Nup120/160 | Unkown | I,F,M,N |
|
|
| h695 | C- > T | P- > S | Homodimerisation region of STAR domain protein | RNA processing and modification | I,F,M,N |
|
|
| h779,h123 | G- > A | W- > X | non-SMC mitotic condensation complex subunit 1 | Function unknown | I,F,M,N |
|
|
| h744,h282,h89 | G- > A | W- > X | Peptidase family M41 | Posttranslational modification, protein turnover, chaperones | I,F,M,N |
|
|
| h483 | C- > T | Q- > X | Peptidase family C50 | Cell cycle control, cell division. Chromosome partitioning | F,M,N |
|
|
| h440 | T- > A | Y- > X | DEAD/DEAH box helicase | RNA processing and modification | I,F,M,N |
|
|
| h212 | C- > T | Q- > X | Innexin | Unkown | I,F,N |
|
|
| h506 | G- > A | D- > N | Ribosomal protein L4/L1 family | RNA processing and modification | I,F,M,N |
|
|
| h696,h778 | C- > T | R- > X | Myb DNA-binding like | Transcription | I,F,M,N |
|
|
| h781 | G- > A | D- > N | Unkown | Unkown | I,N,M,F |
|
|
| h529 | C- > T | Q- > X | Ribosomal protein L13e | Translation, ribosomal structure and biogenesis | I,F,M,N |
|
|
| h501 | G- > A | G- > R | ATPase family associated with various cellular activities (AAA) | Cell cycle control, cell division. Chromosome partitioning;Replication, recombination and repair | I,F,M,N |
|
|
| h426 | C- > T | R- > X | Elongation factor Tu GTP binding domain | Translation, ribosomal structure and biogenesis | I,F,M,N |
|
|
| h253 | 465 + 1G > A | None | 50S ribosome-binding GTPase | General function prediction only | I,F,M,N |
|
|
| h257 | C- > T | Q- > X | GGL domain | Signal transduction mechanisms | I,F,M,N |
|
|
| h348,h681 | C- > T | H- > Y | AMP-binding enzyme | General function prediction only | I,F,M,N |
|
|
| h491,h811 | G- > A | G- > R | RecF/RecN/SMC N terminal domain | Cell cycle control, cell division. Chromosome partitioning | I,N,M,F |
|
|
| h512 | G- > A | W- > X | Nucleoporin FG repeat region | Intracellular trafficking, secretion, and vesicular transport;Nuclear structure | I,F,M,N |
|
|
| h792,h692 | G- > A | G- > R | DEAD/DEAH box helicase | RNA processing and modification | I,F,M,N |
|
|
| h147 | C- > T | S- > L | RecQ mediated genome instability protein | Function unknown | F,N |
|
|
| h726 | G- > A | A- > T | Protein-tyrosine phosphatase | Signal transduction mechanisms | I,F,M,N |
|
|
| h1012 | G- > A | W- > X | Cytochrome b5-like Heme/Steroid binding domain | Energy production and conversion | I,F,M,N |
|
|
| h755 | 262-1G > A | None | RNA polymerase Rpc34 subunit | Transcription | I,F,M,N |
|
|
| h268 | G- > A | A- > T | HMG (high mobility group) box | Transcription | I,F,M,N |
|
|
| h767 | C- > T | S- > F | ATP synthase alpha/beta family, nucleotide-binding domain | Energy production and conversion | I,F,M,N |
|
|
| h513 | G- > A | G- > E | Nup93/Nic96 | Cell cycle control, cell division. Chromosome partitioning | I,F,M,N |
|
|
| h347 | G- > A | G- > R | Initiation factor eIF2 gamma, C terminal | Translation, ribosomal structure and biogenesis | I,F,M,N |
|
|
| h461 | C- > T | R- > X | Innexin | Unkown | I,F,N |
|
|
| h460 | G- > A | G- > E | Golgi phosphoprotein 3 (GPP34) | Intracellular trafficking, secretion, and vesicular transport | I,F,M,N |
|
|
| h740 | 1355-1G > A | None | Zinc carboxypeptidase | Function unknown | I,F,M,N |
|
|
| h132 | 1539 + 1G > A | None | Fragile site-associated protein C-terminus | Unkown | I,F,M,N |
|
|
| h227 | G- > A | W- > X | Cse1 | Intracellular trafficking, secretion, and vesicular transport;Nuclear structure | I,F,M,N |
|
|
| h810,h197 | C- > T | Q- > X | DNA replication factor CDT1 like | Unkown | I,F,M,N |
|
|
| h773 | G- > A | R- > Q | Unkown | Unkown | F,N |
|
|
| h247 | C- > T | T- > I | Ribosomal protein L1p/L10e family | Translation, ribosomal structure and biogenesis | I,F,M,N |
|
|
| h182 | C- > T | Q- > X | Actin | Cytoskeleton | I,F,M,N |
|
|
| h413,h518,h738 | G- > A | G- > R | Sec34-like family | Intracellular trafficking, secretion, and vesicular transport | I,F,M,N |
|
|
| h503 | G- > A | E- > K | Unkown | Unkown | F,M,N |
|
|
| h356,h393 | C- > T | Q- > X | Talin, middle domain | Cytoskeleton | I,F,M,N |
|
|
| h817 | C- > T | Q- > X | GDP-mannose 4,6 dehydratase | Unkown | F,N |
|
|
| h120,h122 | G- > A | G- > E | Protein kinase domain | General function prediction only | F,M,N |
|
|
| h251 | G- > A | W- > X | Unkown | Unkown | I,F,M,N |
Table includes 44 identified essential genes in this study with the information of let-x name, Alleles, Allele mutation, biological functions, and evolutionary conservation. N-Nematodes, I-Invertebrates (Drosophila), M-Mammals (mouse, human), F-Fungi (Saccharomycetaceae)
Fig. 1The heat map analysis for the significant conserved gene functions based on PANTHER Overrepresentation Test. The hierarchal cluster diagram was constructed by using pheatmap clustering in R. The P-values of each annotation data set (a) Molecular Function, (b) Biological Process, (c) Cellular Component, and (d) PANTHER Protein Class) are calculated with the Bonferroni correction for multiple testing by each functional group, which reflect the significance of the difference for the enrichment value between the essential and nonessential genes. The red boxes represent that the functional group are overrepresentation, while the blue boxes represent the opposite case. Conversion P-value to -log10 (x), and get the heatmap of the converted p-value, the conversion value is thought to be 0 while the P-value is greater than 0.05, the conversion value is thought to be 10 while the P-value is less than 0.0000000001
Fig. 2The cluster frequency of different group genes in mixed-stage embryos C. elegans. Box plots of each group genes for gene interactions frequency. The numbers on the right side of the yellow block represent the average interactions frequency of genes in each group. The P-values were obtained from the Mann-Whitney U test / the Wilcoxon rank sum test after the Levene’s test
Fig. 3The percentage of different essentiality genes locate in TAD boundaries. The four groups are labelled in black, red, green, and blue, respectively. Statistical difference was calculated for G1 vs. G2, G3, and G4 individually by using Fisher’s exact test (* P-value < 0.05, ** P-value < 0.01, *** P-value < 0.001)
Fig. 4The Gene expression: This figure represents the normalized transcript level (read number per coding length per million reads) for each gene across the developmental stages including 18 embryo stages, four larval stages (L1-L4), and young adult. To facilitate comparison, we subtract each gene expression values in different periods from the average value of the gene expression in different periods. The heatmap represents normalized transcript level from high (pink) to low (skyblue). Eight distinct modules that are based on their expression pattern are shown by colored modules. Yellow, Turquoise, Red, Purple, Blue, and Black: early-embryonic; Magenta: early- and mid-embryonic; Tan and Brown,: mid-embryonic; Green: late-embryonic; Greenyellow: early-, mid- and late-embryonic; Pink: larval
Fig. 5The protein interaction frequency of different group genes. Interaction number distribution of each group with whole genome protein interactions in C. elegans. Box plots of each group for protein interaction frequency. The numbers on the right side of the yellow block represent the average interaction frequency of proteins in each group. The P-values were obtained from the Mann-Whitney U test / the Wilcoxon rank sum test after the Levene’s test