| Literature DB >> 27799467 |
Wei-Hua Chen1, Guanting Lu2, Xiao Chen3, Xing-Ming Zhao3, Peer Bork4,5,6,7.
Abstract
OGEE is an Online GEne Essentiality database. To enhance our understanding of the essentiality of genes, in OGEE we collected experimentally tested essential and non-essential genes, as well as associated gene properties known to contribute to gene essentiality. We focus on large-scale experiments, and complement our data with text-mining results. We organized tested genes into data sets according to their sources, and tagged those with variable essentiality statuses across data sets as conditionally essential genes, intending to highlight the complex interplay between gene functions and environments/experimental perturbations. Developments since the last public release include increased numbers of species and gene essentiality data sets, inclusion of non-coding essential sequences and genes with intermediate essentiality statuses. In addition, we included 16 essentiality data sets from cancer cell lines, corresponding to 9 human cancers; with OGEE, users can easily explore the shared and differentially essential genes within and between cancer types. These genes, especially those derived from cell lines that are similar to tumor samples, could reveal the oncogenic drivers, paralogous gene expression pattern and chromosomal structure of the corresponding cancer types, and can be further screened to identify targets for cancer therapy and/or new drug development. OGEE is freely available at http://ogee.medgenius.info.Entities:
Mesh:
Year: 2016 PMID: 27799467 PMCID: PMC5210522 DOI: 10.1093/nar/gkw1013
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Statistics on conditionally essential genes in selected species with least 200 genes covered by multiple data sets in OGEE
| Species | data sets | tested genes | essential genes | genes covered by multiple data sets | conditionally essential genes |
|---|---|---|---|---|---|
| Homo sapiens | 18 | 21 556 | 7168 | 18 855 | 6985 (37.0%) |
| Schizosaccharomyces pombe | 7 | 5509 | 1571 | 2522 | 279 (11.1%) |
| Drosophila melanogaster | 2 | 13 781 | 408 | 437 | 141 (32.3%) |
| Pseudomonas aeruginosa UCBPP-PA14 | 4 | 5966 | 1842 | 5300 | 1455 (27.5%) |
| Escherichia coli K12 | 4 | 4322 | 740 | 4066 | 509 (12.5%) |
| Mycobacterium tuberculosis H37Rv | 5 | 4008 | 1028 | 4002 | 1388 (34.7%) |
| Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344 | 4 | 3774 | 1514 | 2715 | 1130 (41.6%) |
| Staphylococcus aureus subsp. aureus NCTC 8325 | 2 | 2899 | 557 | 2713 | 250 (9.2%) |
| Haemophilus influenzae Rd KW20 | 4 | 1750 | 847 | 1634 | 617 (37.8%) |
| Mycoplasma pneumoniae M129 | 2 | 1203 | 508 | 1203 | 196 (16.3%) |
Species having at least 200 genes covered by multiple data sets are listed here; the species are ordered first by the kingdom they are in (the 1st column) and then by the number of genes covered by multiple data sets (the 5th column). Essential genes are those that are essential in any collected data sets, i.e. genes that are essential in one data set but non-essential in others are also counted. The proportion of conditionally essential genes (PCEG, percentage in parentheses of the last column) is calculated as the ratio between the ‘conditionally essential genes’ (the last column) and the ‘genes covered by multiple data sets’ (the 5th column). Please note that text-mining results, if available, will be counted as one data set in a species; please consult the ‘Browse’ page of the database for a complete and interactive version of the table.
Figure 1.Screenshots taken from the ‘Analyze’ page. With integrated tools, users can easily analyze the collected data and visualize the results. Shown here are the proportion of essential genes (PE) as a function of involvement in development (developmental versus non-developmental genes, panel (A) and duplication statuses (duplicates versus singlets, panel (B)) in mouse.
Summary of the 16 gene essentiality data sets from 9 human cancers collected in OGEE
| Cancer | Data sets | Essential genes | Uniquely essential genes |
|---|---|---|---|
| breast | 1 | 146 | 67 |
| Burkitt's lymphoma | 2 | 1897 | 198 |
| CML | 4 | 3210 | 1324 |
| colon | 2 | 1394 | 899 |
| esophageal squamous | 1 | 41 | 34 |
| GBM | 1 | 21 | 14 |
| NSCLC | 1 | 28 | 20 |
| ovarian | 2 | 130 | 87 |
| pancreatic | 2 | 199 | 126 |
‘Essential genes’ (the 3rd column) are genes that are essential in any of the data set(s) of a particular cancer type; ‘Uniquely essential genes’ (the last column) are genes that are subset of ‘Essential genes’ but non-essential in any other collected human data sets. An up-to-date version of this table can be found at http://ogee.medgenius.info/cancer/.