| Literature DB >> 34450623 |
Xiaoqing Zhang1,2, Hai Chi1, Gang Li1, David M Irwin3, Shuyi Zhang2, Stephen J Rossiter4, Yang Liu1,5.
Abstract
Lysozyme enzymes provide classic examples of molecular adaptation and parallel evolution, however, nearly all insights to date come from chicken-type (c-type) lysozymes. Goose-type (g-type) lysozymes occur in diverse vertebrates, with multiple independent duplications reported. Most mammals possess two g-type lysozyme genes (Lyg1 and Lyg2), the result of an early duplication, although some lineages are known to have subsequently lost one copy. Here we examine g-type lysozyme evolution across >250 mammals and reveal widespread losses of either Lyg1 or Lyg2 in several divergent taxa across the mammal tree of life. At the same time, we report strong evidence of extensive losses of both gene copies in cetaceans and sirenians, with an additional putative case of parallel loss in the tarsier. To validate these findings, we inspected published short-read data and confirmed the presence of loss of function mutations. Despite these losses, comparisons of selection pressures between intact g- and c-type lysozyme genes showed stronger purifying selection in the former, indicative of conserved function. Although the reasons for the evolutionary loss of g-type lysozymes in fully aquatic mammals are not known, we suggest that this is likely to at least partially relate to their hairlessness. Indeed, although Lyg1 does not show tissue-specific expression, recent studies have linked Lyg2 expression to anagen hair follicle development and hair loss. Such a role for g-type lysozyme would explain why the Lyg2 gene became obsolete when these taxa lost their body hair.Entities:
Keywords: Cetacea; Chiroptera; Lysozyme g; Sirenia; parallelism; pseudogenization
Mesh:
Substances:
Year: 2021 PMID: 34450623 PMCID: PMC8449827 DOI: 10.1093/gbe/evab201
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.Loss of Lyg genes in aquatic mammals based on published sequences. (A) Species trees for Cetacea and Sirenia (blue clades), and their close relatives (black clades), with divergence time shown (Upham et al. 2019). (B) Coding regions of the Lyg1 and Lyg2 genes from each species is shown, which are located between the Txndc9 and Mrpl30 genes. For aquatic mammals, exons for the coding regions of Lyg1 and Lyg2 are shown in blue, with untranslated regions in gray. The two flanking genes are represented by hollow rectangles, with the dotted one showing a missing gene. Only exons are drawn to scale, with introns indicated by horizontal lines. The zig-zag line represents a gap in the genomic sequence within a scaffold, whereas the dashed line indicates a gap as the sequences are from different scaffolds. Arrows above the genes indicate the direction of gene transcription. Frameshift indels and premature stop codons are indicated in red. Indels with lengths that are multiples of 3, but not more than nine bases long (three amino acids), are not marked in the exons. (C) Numbering of the key catalytic amino acid residues 73, 86, and 97 are based on the goose Lyg positions, with the substitutions shaded in either black (for site 73) or gray (for both 86 and 97).
. 2.Loss of Lyg2 genes in bats based on published sequences. (A) Clades of different colors represent the different superfamilies of bats (Upham et al. 2019). For species marked with an asterisk, putative duplicated exons may exist. (B) Exons with coding regions for Lyg2 and its two flanking genes are displayed. The coding and untranslated regions are indicated in black and gray rectangles, respectively. Flanking genes (Lyg1 and Mrpl30) are indicated by hollow rectangles. Only exons are shown to scale, with introns indicated by horizontal lines. Arrows above the genes indicate the direction of gene transcription. Inactivating mutations in Lyg2 are marked with red numbers and asterisks. Lyg1 genes that do not have a complete open reading frame are marked with a red strike. Flanking genes that are not on the same scaffold and those that are in the same scaffold but whose sequence is not continuous with Lyg2 are indicated by the dotted and zig-zag lines, respectively. Flanking genes marked by question marks indicate that their positions are not the expected pattern. (C) Display of the key catalytic amino acid residue sites in Lyg1 and Lyg2. Numbering is based on goose Lyg. Black shading indicates that the amino acid change might have a large impact on the function of g-type lysozyme (site 73), whereas amino acid changes in gray shading might not affect the function (sites 86 and 97).
. 3.The Bayesian phylogenetic tree reconstructed based on intact Lyg coding sequences. The Lyg1 (brown) and Lyg2 (green) clades are highly supported by Bayesian posterior probabilities. The amniote LygA genes from Alligator sinensis and Gallus gallus are used as outgroups.
. 4.The Lyg1 (outer circles) and Lyg2 (inner circles) genes of mammals, with pseudogenes validated. Color of circles represent different states of the genes, with black indicating complete open reading frames, gray being uncertain, and red being pseudogenes. Silhouettes are used to highlight species that have lost both of their Lyg1 and Lyg2 genes, with the extinct Steller’s sea cow indicated by a fading pattern, and also bats, showing extensive losses of Lyg2. Fully aquatic species are indicated by the bold blue branches. Abbreviations for the orders are: DER, Dermoptera; SCA, Scandentia; PIL, Pilosa; CIN, Cingulata; SIR, Sirenia; HYR, Hyracoidea; PRO, Proboscidea; TUB, Tubulidentata; MAC, Macroscelidea; AFR, Afrosoricida; DID, Didelphimorphia; DAS, Dasyuromorphia; DIP, Diprotodontia; and Mon, Monotremata.
Selection Intensity for Pseudogenes in Whales, Sirenians and Bats
| Gene | Focal Clades | Backgrounds | Selection Intensity | |
|---|---|---|---|---|
|
| Cetartiodactylan pseudogenes | Intact | 0 | <0.001 |
| Sirenian pseudogenes | 0.31 | 0.011 | ||
|
| Cetartiodactylan pseudogenes | Intact | 0.04 | <0.001 |
| Sirenian pseudogenes | 0.11 | 0.008 | ||
| Chiropteran pseudogenes | 0.13 | <0.001 |
Selective Pressure on Mature Protein Region of Mammalian Lysozyme Genes
| Gene | Model |
| Parameters | Site(s) under Positive Selection | |
|---|---|---|---|---|---|
|
| M1a | −6139.11 |
| ||
| M2a | −6138.76 | 0.702 |
| 106, 118 | |
| M8a | −6115.27 |
| |||
| M8 | −6113.02 | 0.034 |
| 40, 71, 81, 99, 106, 116, 118, 133, 139 | |
|
| M1a | −6311 |
| ||
| M2a | −6311 | 1 |
| 190 | |
| M8a | −6286.36 |
| |||
| M8 | −6286.16 | 0.531 |
| 52, 113, 187, 190 | |
|
| M1a | −4878.91 |
| ||
| M2a | −4847.18 | 0 |
| ||
| M8a | −4866.97 |
| |||
| M8 | −4840.66 | 0 |
|
P < 0.05,
P < 0.01.
Positively selected sites (probability > 0.99) shown in bold with underlines.