| Literature DB >> 31729432 |
Abstract
The eutherian connexins were characterized as protein constituents of gap junctions implicated in cell-cell communications between adjoining cells in multiple cell types, regulation of major physiological processes and disease pathogeneses. However, conventional connexin gene and protein classifications could be regarded as unsuitable in descriptions of comprehensive eutherian connexin gene data sets, due to ambiguities and inconsistencies in connexin gene and protein nomenclatures. Using eutherian comparative genomic analysis protocol and 35 public eutherian reference genomic sequence data sets, the present analysis attempted to update and revise comprehensive eutherian connexin gene data sets, and address and resolve major discrepancies in their descriptions. Among 631 potential coding sequences, the tests of reliability of eutherian public genomic sequences annotated, in aggregate, 349 connexin complete coding sequences. The most comprehensive curated eutherian connexin gene data set described 21 major gene clusters, 4 of which included evidence of differential gene expansions. For example, the present gene annotations initially described human CXNK1 gene and annotated 22 human connexin genes. Phylogenetic tree calculations and calculations of pairwise nucleotide sequence identity patterns proposed revised and updated phylogenetic classification of eutherian connexin genes. Therefore, the present study integrating gene annotations, phylogenetic analysis and protein molecular evolution analysis proposed new nomenclature of eutherian connexin genes and proteins.Entities:
Year: 2019 PMID: 31729432 PMCID: PMC6858305 DOI: 10.1038/s41598-019-53458-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Phylogenetic analysis of eutherian connexin genes. The minimum evolution phylogenetic tree was calculated using maximum composite likelihood method. The bootstrap estimates higher than 50% were shown after 1000 replicates. The 21 major gene clusters CXNA-CXNU were indicated.
Figure 2Major landmarks in eutherian connexin protein sequence alignments. The black squares labelled common cysteine amino acid residues and white squares labelled common N-glycosylation sites. The connexin amino acid signature common cysteine amino acid residues that were implicated in disulfide bonding were labelled by stars. The numbers indicated numbers of amino acid residues. The human substitutions of common cysteine amino acid residues were also indicated.
Figure 3Tests of protein molecular evolution of eutherian connexins. (A) Relative synonymous codon usage statistics of eutherian CXN gene data set. The not preferable amino acid codons were indicated by white letters on red backgrounds. Counts, observed amino acid codon counts; R, relative synonymous codon usage statistics; &, stop codons. (B) Reference human CXNA protein amino acid sequence. Using white letters on violet backgrounds, the 15 invariant amino acid sites were shown. The 2 forward amino acid sites were indicated by arrows and shown using white letters on red backgrounds. The connexin amino acid signature common cysteine amino acid residues that were implicated in disulfide bonding were labelled by stars. The N-terminal and C-terminal boundaries of transmembrane α-helices 1–4 were described according to Nicholson[10] and Sosinsky and Nicholson[11]. (C) Distribution of invariant and forward amino acid sites in human CXNA protein regions. The 15 invariant amino acid sites were shown using violet squares, and 2 forward amino acid sites were shown using red squares. The common cysteine amino acid residues that were implicated in disulfide bonding were connected by lines. C, C-terminal domain; E1 and E2, extracellular connexin regions 1 and 2; L, cytoplasmic loop; M1–M4, transmembrane α-helices 1–4; N, N-terminus.