| Literature DB >> 33084874 |
Sanathoi Gurumayum1, Puzi Jiang1, Xiaowen Hao1, Tulio L Campos2,3, Neil D Young2, Pasi K Korhonen2, Robin B Gasser2, Peer Bork4,5,6,7, Xing-Ming Zhao8,9, Li-Jie He10, Wei-Hua Chen1,11.
Abstract
OGEE is an Online GEne Essentiality database. Gene essentiality is not a static and binary property, rather a context-dependent and evolvable property in all forms of life. In OGEE we collect not only experimentally tested essential and non-essential genes, but also associated gene properties that contributes to gene essentiality. We tagged conditionally essential genes that show variable essentiality statuses across datasets to highlight complex interplays between gene functions and environmental/experimental perturbations. OGEE v3 contains gene essentiality datasets for 91 species; almost doubled from 48 species in previous version. To accommodate recent advances on human cancer essential genes (as known as tumor dependency genes) that could serve as targets for cancer treatment and/or drug development, we expanded the collection of human essential genes from 16 cell lines in previous to 581. These human cancer cell lines were tested with high-throughput experiments such as CRISPR-Cas9 and RNAi; in total, 150 of which were tested by both techniques. We also included factors known to contribute to gene essentiality for these cell lines, such as genomic mutation, methylation and gene expression, along with extensive graphical visualizations for ease of understanding of these factors. OGEE v3 can be accessible freely at https://v3.ogee.info.Entities:
Year: 2021 PMID: 33084874 PMCID: PMC7779042 DOI: 10.1093/nar/gkaa884
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Statistics on 581 human cell lines collected in OGEE v3. (A) The 581 cell lines were related to 25 different tissues by their histological origins. (B) CRISPR-based method often identified more essential genes than RNAi-based method. Each point indicates a cell line, and the x-axis and y-axis mean the number of identified essential genes of the cell line in CRISPR–Cas9 and RNAi datasets respectively. The colors (blue, green, red, yellow) illustrate the number of overlapping essential genes between CRISPR–Cas9 and RNAi data. The red line denotes the fitted regression line between numbers of essential genes in RNAi and CRISPR-Cas9 data. In total 150 cell lines were tested by both techniques.
Figure 2.Extensive annotations on human essential genes provided by OGEE v3. Shown here are graphical visualizations taken from the OGEE v3 (https://v3.ogee.info/#/cellline/large%20intestine/RKO/summary). (A) gene essentiality determined using different experimental methods. (B) Correlational co-essentiality chart of AAMP gene from central nervous system tissue tested by CRISPR-Cas9 technique. The orange edges denote negative correlation while the blue edges denote positive correlation among the connected genes. (C) Gene essentiality as function of mutation: mutation outcomes predicted using FATHMM were used to group all tested genes into three categories, the percentage of essential in each group was then calculated (please consult ‘Help’ page for more details); left panel: gene essentiality tested using RNAi, right panel: gene essentiality tested using CRISPR–Cas9. (D) Gene essentiality as function of methylation: genes are grouped based on methylation sites and calculate percentage of essential for each sites; red color gradient is used to denote the methylation score, increasing gradually from 0 to 1; users can select any methylations from the drop down menu. (E) Gene essentiality as function of gene expression: all 0 rpkm genes are in the first bin and rest of the genes are assigned into equal size nine bins; lastly, percentage of essential is calculated for each bin.
Figure 3.Comparing gene essentiality between human-mouse orthologs. Here only genes with one-to-one human-mouse orthologous relationships are included. Mouse essential genes were obtained from IMPC (blue) and MGI (green) databases. The two human essential gene dataset, RNAi and CRISPR–Cas9 were generated by selecting those that were essential in over 50% the cell lines of a tissue; tissues with less than six cell lines were excluded from this analysis.