| Literature DB >> 30517748 |
Rebekah L Rogers1, Long Zhou2,3, Chong Chu4, Roberto Márquez5, Ammon Corl6, Tyler Linderoth6, Layla Freeborn7, Matthew D MacManes8,9, Zijun Xiong2, Jiao Zheng2, Chunxue Guo2, Xu Xun2, Marcus R Kronforst5, Kyle Summers10, Yufeng Wu11, Huanming Yang2,12, Corinne L Richards-Zawacki7, Guojie Zhang2,3,13, Rasmus Nielsen6.
Abstract
We sequenced the genome of the strawberry poison frog, Oophaga pumilio, at a depth of 127.5× using variable insert size libraries. The total genome size is estimated to be 6.76 Gb, of which 4.76 Gb are from high copy number repetitive elements with low differentiation across copies. These repeats encompass DNA transposons, RNA transposons, and LTR retrotransposons, including at least 0.4 and 1.0 Gb of Mariner/Tc1 and Gypsy elements, respectively. Expression data indicate high levels of gypsy and Mariner/Tc1 expression in ova of O. pumilio compared with Xenopus laevis. We further observe phylogenetic evidence for horizontal transfer (HT) of Mariner elements, possibly between fish and frogs. The elements affected by HT are present in high copy number and are highly expressed, suggesting ongoing proliferation after HT. Our results suggest that the large amphibian genome sizes, at least partially, can be explained by a process of repeated invasion of new transposable elements that are not yet suppressed in the germline. We also find changes in the spliceosome that we hypothesize are related to permissiveness of O. pumilio to increases in intron length due to transposon proliferation. Finally, we identify the complement of ion channels in the first genomic sequenced poison frog and discuss its relation to the evolution of autoresistance to toxins sequestered in the skin.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30517748 PMCID: PMC6278860 DOI: 10.1093/molbev/msy185
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Sequence Data Used to Assemble the O. pumilio Reference Genome.
| Paired-End Libraries | Insert Size (bp) | Reads Length (bp) | Total Size (G) | High-Quality Size (G) | Sequence Coverage (X) | High-Quality Coverage (X) |
|---|---|---|---|---|---|---|
| Solexa reads | 250 | 150 | 186.75 | 149.28 | 27.87 | 22.28 |
| 500 | 150 | 133.09 | 94.45 | 19.86 | 14.10 | |
| 800 | 150 | 91.28 | 53.75 | 13.62 | 8.02 | |
| 2k | 49 | 224.25 | 140.50 | 33.47 | 20.97 | |
| 5k | 49 | 88.10 | 42.04 | 13.15 | 6.27 | |
| 10k | 49 | 66.32 | 34.22 | 9.90 | 5.11 | |
| 20k | 49 | 64.80 | 16.93 | 9.67 | 2.53 | |
| Total | 854.59 | 531.16 | 127.55 | 79.28 |
Notes: 250, 500, and 800 bp belong to the short insert size paired-end libraries and 150 bp sequencing was performed for both ends. The remaining libraries belong to the long insert size paired-end libraries and 49 bp sequencing length was conducted for both ends with which were used for the scaffold building.
Genome Assembly with and without Repetitive Reads.
| All Data | Repeats Masked | |
|---|---|---|
| Scaffold N50 | 72,788 bp | 61,257 bp |
| Contig N50 | 385 bp | 399 bp |
| Assembled genome | 5.0 Gb | 2.7 Gb |
. 1.Bayesian gene trees reconstructed for annotation of O. pumilio genes in the nACHR (A) and SCNA (B) gene families, and nucleotide sequences of the CHRNE gene. Nodal support and branch labeling/coloring follow in fig. 2.
. 2.Bayesian gene tree of the voltage-gated sodium channel alpha subunit (SCNA) gene family and amino acid replacements identified as possibly associated with toxin autoresistance in O. pumilio (A), with their positions on a schematic voltage-gated sodium channel (B). Branches are colored and labeled based on the closest X. tropicalis ortholog, and nomenclature follows ENSEMBL annotations. Asterisks on internodes denote Bayesian posterior probabilities and aBayes support values above 0.95. Clades resulting from amniote-specific gene family expansions were collapsed to improve visualization.
. 3.Total sequence content of TEs as classified using a tblastx against the RepBase database. Among low differentiated TE families, we identified 1 Gb of Gypsy elements, 298 Mb of Copia elements, 255 Mb of hAT sequence, 197 Mb of Mariner elements, and 181 Mb of Tc1 elements in O. pumilio.
. 4.Bimodal distribution of nucleotide similarity for repeat contigs identified using RepDeNovo compared with Mariner-4_DR consensus sequence from D. rerio. The presence of two peaks is consistent with two waves of proliferation after HT of an element between fish and frogs. Danio and Oophaga diverged 480MYA. The distribution of nucleotide similarity is not consistent with vertical transmission from a common ancestor.
. 5.Phylogenetic relationships between horizontally transmitted repetitive elements in O. pumilio (black) and annotated TE Tc1-3_SSa (red), a salmon TE in RepBase. Similar patterns are observed in other TE families. For each of the most diverse TEs that are candidates for HT, the RepBase element lies within the diversity of TEs in O. pumilio. All of these diverse and rapidly proliferating horizontally transferred elements are a close match with a TE from freshwater fish.
. 6.Total amount of expressed sequence for TE classes in O. pumilio and X. laevis. Gypsy and Mariner elements are expressed at significantly higher levels in O. pumilio, but Tc1 elements are not. We observe an excess of germline expression of Gypsy (t = 7.007; P = 0.006393) and Mariner sequences (t = 7.1938; P = 0.005313) in O. pumilio compared with Xenopus.
Most Common Horizontally Transmitted TE Families.
| Family | Species | Number of Contigs | Mean Length (bp) | Total Copy Number | Total Content (Mb) |
|---|---|---|---|---|---|
| 1,441 | 1245 | 46,112 | 52.3 | ||
| 1,191 | 1092 | 39,303 | 86.3 | ||
| 144 | 647 | 4,032 | 25.4 | ||
| 139 | 1110 | 6,255 | 15.3 | ||
| 17 | 692 | 9,214 | 9.8 | ||
| 155 | 653 | 5,425 | 9.5 | ||
| 45 | 308 | 6,390 | 5.2 | ||
| 28 | 384 | 19,852 | 5.0 | ||
| 10 | 291 | 910 | 4.2 | ||
| 12 | 976 | 804 | 2.4 | ||
| 17 | 365 | 2,686 | 2.6 | ||
| 16 | 1264 | 928 | 1.9 | ||
| 15 | 710 | 495 | 0.9 | ||
| 116 | 170 | 1,856 | 0.4 | ||
| 10 | 145 | 200 | 0.04 |
.7.Expression level for transcripts is positively correlated with nucleotide similarity with Mariner-4_DR (Danio rerio) consensus sequence in RepBase. Such a result is expected if recently transferred transposons invade genomes that lack repressors for the new TEs (F = 3.956; df = 5; P = 0.002128).