| Literature DB >> 29518040 |
Wei Liu1, Li Li2, Xuhe Long3, Weixin You4, Yuexian Zhong5, Menglin Wang6, Huan Tao7, Shoukai Lin8, Huaqin He9.
Abstract
Network-based systems biology has become an important method for analyzing high-throughput gene expression data and gene function mining. Escherichia coli (E. coli) has long been a popular model organism for basic biological research. In this paper, weighted gene co-expression network analysis (WGCNA) algorithm was applied to construct gene co-expression networks in E. coli. Thirty-one gene co-expression modules were detected from 1391 microarrays of E. coli data. Further characterization of these modules with the database for annotation, visualization, and integrated discovery (DAVID) tool showed that these modules are associated with several kinds of biological processes, such as carbohydrate catabolism, fatty acid metabolism, amino acid metabolism, transportation, translation, and ncRNA metabolism. Hub genes were also screened by intra-modular connectivity. Genes with unknown functions were annotated by guilt-by-association. Comparison with a previous prediction tool, EcoliNet, suggests that our dataset can expand gene predictions. In summary, 31 functional modules were identified in E. coli, 24 of which were functionally annotated. The analysis provides a resource for future gene discovery.Entities:
Keywords: Escherichia coli; annotation; gene co-expression network
Year: 2018 PMID: 29518040 PMCID: PMC5870351 DOI: 10.3390/cells7030019
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Figure 1A representative network view of the constructed Escherichia coli (E. coli) gene co-expression network. Each node represents a probe and each line denotes the gene expression correlation between the two nodes.
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotation of the 24 gene co-expression modules identified in E. coli.
| Module | GO Term | KEGG (Adjusted | ||
|---|---|---|---|---|
| Biological Process | Cellular Component | Molecular Function | ||
| Black | Carbohydrate catabolic process | Plasma membrane | Cation binding | Two-component system |
| Blue | Biological adhesion | External encapsulation structure part | — | Bacterial secretion system |
| Brown4 | Siderophore metabolic process | — | Iron transmembrane transporter activity | Lysine degradation |
| Cyan | Oxidation-reduction | Anchored to membrane | — | — |
| Darkgreen | — | — | Purine nucleoside binding | Lipopolysaccharide biosynthesis |
| Darkgrey | Lipopolysaccharide biosynthetic process | Organelle inner membrane | Cell surface antigen activity, host-interacting | Lipopolysaccharide biosynthesis |
| Darkmagenta | — | Viral capsid | — | — |
| Darkolivegreen | Sulfate metabolic process | — | Sulfate transmembrane transporter activity | Sulfur metabolism |
| Darkorange | Pathogenesis | — | — | Pathogenic |
| Darkorange2 | Phosphonate transport | Peptidoglycan-based cell wall | — | ABC transporters |
| Darkred | Behavior | Flagellum | Motor activity | Flagellar assembly |
| Darkturquoise | Cellular amino acid biosynthetic process | External encapsulating structure | Acetolactate synthase activity | Valine, leucine, and isoleucine biosynthesis |
| Green | Translation | Ribosome | Structural constituent of ribosome | Ribosome |
| Greenyellow | peptidoglycan-based cell wall | |||
| Grey60 | Iron ion transport | — | Nucleoside binding | ABC transporters |
| Ivory | Transition metal ion transport | — | Iron ion binding | Biosynthesis of siderophore group nonribosomal peptides |
| Lightcyan1 | Protein folding | Zinc ion binding | — | |
| Lightyellow | Fatty acid oxidation | Peptidoglycan-based cell wall | Acyl carrier activity | Benzoate degradation via CoA ligation |
| Magenta | Amine biosynthetic process | Peptidoglycan-based cell wall | Nucleoside binding | Glycine, serine and threonine metabolism |
| Orangered4 | Nitrogen compound biosynthetic process | — | Anthranilate synthase activity | Thiamine metabolism |
| Paleturquoise | Glycerol metabolic process | — | Glycerol-3-phosphate dehydrogenase activity | Glycerophospholipid metabolism |
| Plum1 | Anaerobic respiration | Nitrate reductase complex | — | — |
| Sienna3 | Anaerobic respiration | — | Metal cluster binding | Butanoate metabolism |
| Tan | ncRNA metabolic process | Plasma membrane | RNA methyltransferase activity | Mismatch repair |
— represents no significant GO or KEGG terms detected.
Figure 2Correlation of intramodule connectivity for each module after sampling 1000 times (mean ± SD). Connectivity was calculated by randomly sampling half of the 1391 samples 1000 times.
Hub genes and their encoding proteins of the gene co-expression modules in E. coli.
| Module | Gene | Encoding Protein |
|---|---|---|
| Black | Z4148 | Hypothetical protein |
| Blue | ECs1057 | Hypothetical protein |
| Brown4 | fhuF | Ferric iron reductase protein |
| Cyan | ECs2033 | Hypothetical protein |
| Darkgreen | elaA | Hypothetical protein |
| Darkgrey | rfbB | dTDP-glucose 4,6 dehydratase, NAD(P)-binding |
| Darkmagenta | c1590 | Tail component of prophage |
| Darkolivegreen | ssuC | Alkanesulfonate transporter permease |
| Darkorange | ECs4574 | SepD |
| Darkorange2 | c4484 | Aldolase |
| Darkred | ECs2660 | Flagella biosynthesis protein FliZ |
| Darkturquoise | ECs5231 | Ornithine carbamoyltransferase subunit I |
| Green | rpsN | 30S ribosomal protein S14 |
| Greenyellow | c0944 | Hypothetical protein |
| Grey60 | ECs1840 | Hypothetical protein |
| Ivory | fecR | Anti-sigma transmembrane signal transducer for ferric citrate transport; periplasmic FecA-bound ferric citrate sensor and cytoplasmic FecI ECF sigma factor activator |
| Lightcyan1 | ECs3950 | RNA polymerase sigma factor RpoD |
| Lightyellow | pepN | Aminopeptidase |
| Magenta | usg | Semialdehyde dehydrogenase |
| Orangered4 | ECs1854 | OMP decarboxylase |
| Paleturquoise | yagF | CP4-6 prophage |
| Plum1 | yebT | MCE domain protein |
| Sienna3 | ECs2379 | Hypothetical protein |
| Tan | ECs4128 | Acetyl-CoA carboxylase biotin carboxylase subunit |
Figure 3Module hub probes visualized in the modules Darkorange (a) and Darkred (b). The top 200 highly connected probe pairs for the two modules were visualized by Cytoscape.
Figure 4Gene expression variation analysis in E. coli modules. Variation was calculated as the average relative standard deviation for each gene in a module.
Figure 5Intermodule connectivity proportion in E. coli. The intermodule connectivity/total connectivity value was calculated for every gene; then, the average value for each module was visualized.