| Literature DB >> 29166914 |
Olesya I Klimchuk1, Kirill A Konovalov2, Vadim V Perekhvatov3, Konstantin V Skulachev3, Daria V Dibrova4, Armen Y Mulkidjanian5,6,7.
Abstract
BACKGROUND: In prokaryotic genomes, functionally coupled genes can be organized in conserved gene clusters enabling their coordinated regulation. Such clusters could contain one or several operons, which are groups of co-transcribed genes. Those genes that evolved from a common ancestral gene by speciation (i.e. orthologs) are expected to have similar genomic neighborhoods in different organisms, whereas those copies of the gene that are responsible for dissimilar functions (i.e. paralogs) could be found in dissimilar genomic contexts. Comparative analysis of genomic neighborhoods facilitates the prediction of co-regulated genes and helps to discern different functions in large protein families. AIM: We intended, building on the attribution of gene sequences to the clusters of orthologous groups of proteins (COGs), to provide a method for visualization and comparative analysis of genomic neighborhoods of evolutionary related genes, as well as a respective web server.Entities:
Keywords: Clusters of orthologous groups of proteins; Comparative genomics; NADH:Quinone oxidoreductase of type 1; Operon; Orthologs; Paralogs; Phylogenomic analysis
Mesh:
Year: 2017 PMID: 29166914 PMCID: PMC5700660 DOI: 10.1186/s13062-017-0196-z
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Fig. 1A scheme of obtaining annotations of COGs and Pfam domains for proteins from 711 prokaryotic genomes (see the main text for details)
Fig. 2A screenshot of the COGNAT main window running in the COG mode. (a), Request field and options panel; (b), two high-level taxonomy units for each nucleotide record; (c), genetic neighborhoods where genes are colored according to the panel (d); (d), the color code and domain annotation for the panel (c); (e), a detailed annotation for a clicked gene
Fig. 3Genomic neighborhoods for the clades of predicted proteins belonging to the COG1009 (NADH:ubiquinone oxidoreductase of type 1, NuoL subunit, and Na+/H+ antiporter, MnhA subunit). Typical genomic neighborhoods are shown for the top and bottom clades of the tree that contain sequences of well-studied proteins. A set of 177 proteins belonging to a reduced but representative set of 124 genomes was aligned with the MUSCLE software [5]. The multiple alignment was verified manually and conserved blocks (total 238 positions) were used for the phylogenetic tree construction with the MEGA 7 software [17]. Bootstrap support values calculated from 100 iterations are shown on the branches. Red arrows correspond to the proteins that belong to COG3002