| Literature DB >> 32068833 |
Balaji Chattopadhyay1, Kritika M Garg1, Rajasri Ray2,3, Ian H Mendenhall4, Frank E Rheindt1.
Abstract
Major novel physiological or phenotypic adaptations often require accompanying modifications at the genic level. Conversely, the detection of considerable contractions and/or expansions of gene families can be an indicator of fundamental but unrecognized physiological change. We sequenced a novel fruit bat genome (Cynopterus brachyotis) and adopted a comparative approach to reconstruct the evolution of fruit bats, mapping contractions and expansions of gene families along their evolutionary history. Despite a radical change in life history as compared with other bats (e.g., loss of echolocation, large size, and frugivory), fruit bats have undergone surprisingly limited change in their genic composition, perhaps apart from a potentially novel gene family expansion relating to telomere protection and longevity. In sharp contrast, within fruit bats, the new Cynopterus genome bears the signal of unusual gene loss and gene family contraction, despite its similar morphology and lifestyle to two other major fruit bat lineages. Most missing genes are regulatory, immune-related, and olfactory in nature, illustrating the diversity of genomic strategies employed by bats to contend with responses to viral infection and olfactory requirements. Our results underscore that significant fluctuations in gene family composition are not always associated with obvious examples of novel physiological and phenotypic adaptations but may often relate to less-obvious shifts in immune strategies.Entities:
Keywords: gene family evolution; histones; immunity; lesser short-nosed fruit bat; olfactory
Mesh:
Year: 2020 PMID: 32068833 PMCID: PMC7151552 DOI: 10.1093/gbe/evaa030
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Comparison of Four Genome Assemblies Using QUAST
| Parameter | 63-mer SOAP Assembly | 127-mer SOAP Assembly | CLC and SOAP Hybrid Assembly | PLATANUS Assembly |
|---|---|---|---|---|
| Number of scaffolds | 365,196 | 720,696 | 298,912 | 48,012 |
| N50 | 17.08 kb | 3.52 kb | 16.46 kb | 251.28 kb |
| N75 | 5.39 kb | 1.91 kb | 5.23 kb | 109.30 kb |
| L50 | 28,676 | 126,156 | 22,166 | 1,873 |
| L75 | 82,804 | 335,486 | 80,691 | 4,545 |
| Length of largest contig | 1.33 Mb | 473.87 kb | 837.12 kb | 4.47 Mb |
| GC content | 39.53% | 39.43% | 39.38% | 38.98% |
Note.—All statistics are based on scaffolds ≥1,000 bp. Abbreviations: kb, kilobase (=1,000 bp); Mb, megabase (=1,000,000 bp).
Percentages of Different Repeat Elements in the Cynopterus brachyotis Genome
| Repeat Element | Percentage of the Genome |
|---|---|
| SINEs | 0.93 |
| LINEs | 10.18 |
| LTR elements | 0.68 |
| DNA elements | 0.68 |
| Unclassified | 9.37 |
| Simple repeats | 2.4 |
| Low complexity repeats | 0.27 |
Note.—SINE, short-interspersed nuclear elements; LINE, long-interspersed nuclear elements; LTR, long-terminal repeats.
Details of Gene Annotation from AUGUSTUS
| Gene Annotation | Number of Genes Identified | Number of Genes with InterProScan ID | Number of Genes with Gene Ontology ID |
|---|---|---|---|
| Without cDNA hints | 21,822 | 19,371 | 11,636 |
| Using cDNA hints from the | 23,727 | 22,070 | 14,390 |
. 1.—Phylogenetic reconstructions showing evolutionary relationships of Cynopterus brachyotis with other mammalian taxa included in this study. (A) Maximum likelihood tree generated using concatenated data in RAxML based on 473,499 amino acids, (B) maximum likelihood tree generated using concatenated data in RAxML based on 9,353,867 bp of DNA sequence, (C) species tree reconstruction in MP-EST based on 298 single copy protein sequences, and (D) species tree reconstruction in MP-EST based on 3,194 DNA loci. Nodal values represent bootstrap support. Time of divergence is denoted in millions of years by a scale bar below the tree.
Point Estimate of Age of the Most Recent Common Ancestor Computed by r8s for the Four Different Starting Phylogenetic Trees
| Most Recent Common Ancestor of | Tree Topology | |||
|---|---|---|---|---|
| RAxML Tree Based on Protein Sequences | RAxML Tree Based on DNA Sequences | MP-EST Tree Based on Protein Sequences | MP-EST Tree Based on DNA Sequences | |
| Fruit bats | 31.42 | 25.76 | 34.05 | 40.27 |
| Bats | 73.82 | 65.98 | 60.20 | 58.51 |
Note.—Ages are given in millions of years.
Number of Gene Families Exhibiting Significant Expansion or Contraction in Cynopterus brachyotis, Fruit Bats, and Bats for the Three Different Phylogenies Tested
| Phylogenetic Tree |
| Fruit Bats | Bats | |||
|---|---|---|---|---|---|---|
| Number of Gene Families Expanding | Number of Gene Families Contracting | Number of Gene Families Expanding | Number of Gene Families Contracting | Number of Gene Families Expanding | Number of Gene Families Contracting | |
| RAxML tree using protein sequences | 19 | 32 | 1 | 0 | 196 | 17 |
| RAxML tree using DNA sequences | 19 | 27 | 3 | 0 | 207 | 15 |
| MP-EST tree using DNA sequences | 14 | 17 | 0 | 0 | 0 | 0 |
Comparison of Number of Gene Copies in Cynopterus brachyotis, Fruit Bats (excluding C. brachyotis), Bats (excluding C. brachyotis), and Mammals (excluding bats) for (A) Gene Families Exhibiting Expansion within C. brachyotis and (B) Gene Families Exhibiting Contraction within C. brachyotis
| Gene Family ID | Example Protein within the Gene Family | Number of Genes Identified in | Average Number of Genes Identified across Fruit Bats Excluding | Average Number of Genes Identified across Bats Excluding | Average Number of Genes Identified across Mammals Excluding Bats |
|---|---|---|---|---|---|
| (A) | |||||
| OG0000313 | Microtubule-actin crosslinking factor 1 | 12 | 2 | 8.71 | 1.73 |
| OG0000353 | Ribosomal protein L23a | 9 | 1.5 | 2 | 5.91 |
| OG0000372 | RNA-binding motif protein 23 | 14 | 5 | 7.43 | 1.82 |
| OG0000374 | LDL receptor-related protein 1B | 14 | 4.5 | 4.86 | 3.45 |
| OG0000648 | Ferritin light chain | 7 | 1 | 3.29 | 3.64 |
| OG0001063 | Hemicentin 1 | 10 | 2.5 | 2.71 | 2.64 |
| OG0001804 | Proline-rich coiled-coil 2C | 9 | 1 | 3.86 | 1 |
| OG0002597 | ATP-binding cassette, subfamily D (ALD), member 4 | 8 | 1.5 | 2.86 | 1 |
| OG0006526 | Sodium voltage-gated channel alpha subunit 7 | 8 | 0 | 0.86 | 1 |
| OG0007974 | Tubulin alpha 4a | 6 | 0 | 1.14 | 0.82 |
| (B) | |||||
| OG0000007 | Olfactory receptor family 6 subfamily C member 74 | 0 | 21 | 23.14 | 34 |
| OG0000011 | Histone cluster 1 H2B family member a | 2 | 19 | 17.14 | 21.91 |
| OG0000012 | Histone cluster 1 H2A family member a | 2 | 18.5 | 16.71 | 19.64 |
| OG0000020 | Major histocompatibility complex, class I, A | 6 | 29.5 | 27.43 | 4.27 |
| OG0000024 | Lymphoid enhancer-binding factor 1 | 2 | 8.5 | 9.57 | 14.09 |
| OG0000025 | Histone cluster 2 H3 family member d | 0 | 5 | 14.43 | 11 |
| OG0000027 | Leukocyte immunoglobulin-like receptor B5 | 3 | 27 | 24.29 | 4.18 |
| OG0000029 | IKAROS family zinc finger 3 | 2 | 9 | 8.43 | 14.18 |
| OG0000043 | Olfactory receptor family 4 subfamily F member 16 | 0 | 7.5 | 5.71 | 13.45 |
| OG0000080 | Olfactory receptor family 8 subfamily K member 1 | 0 | 8.5 | 6.14 | 9.91 |
| OG0000110 | Olfactory receptor family 2 subfamily A member 4 | 0 | 8 | 5.43 | 8.73 |
| OG0000156 | Abl interactor 1 | 1 | 12 | 12.86 | 2.36 |
| OG0000185 | Killer cell lectin-like receptor C4 | 0 | 21 | 10.86 | 3.18 |
. 2.—Scatter plot of GO terms based on semantic similarity identified in bats: (A) exhibiting significant expansion and (B) exhibiting significant contraction.
. 3.—(A and E) Quaternary fluctuations in effective population size in Cynopterus brachyotis and Pteropus vampyrus based on complete sequence data (dark red), with bootstraps depicted in light red, assuming a generation time of 8 years and a mutation rate of 2.2×10−9 per base pair per year. Colored highlights refer to: early Holocene (light orange; 10,000–12,000 years ago), last glacial period (light blue; ∼12,000–110,000 years ago), last glacial maximum (dark blue line; ∼22,000 years ago), and last interglacial (light green; 110,000–130,000 years ago). (B–G) Ecological niche models of C. brachyotis and P. vampyrus for different time periods: (B and F) mid-Holocene (∼6,000 years ago), (C and G) last glacial maximum (∼22,000 years ago), and (D and H) last interglacial period (∼120,000–140,000 years ago). The following colors designate the probability of presence in ecological niche model maps: (0–0.1) pale yellow, (0.1–0.3) light green, (0.3–0.5) pale blue, (0.5–0.7) light blue, and (0.7–1) dark blue.