| Literature DB >> 28562605 |
Marc Tollis1, Dale F DeNardo1, John A Cornelius1, Greer A Dolby1, Taylor Edwards2, Brian T Henen3, Alice E Karl4, Robert W Murphy5, Kenro Kusumi1.
Abstract
Agassiz's desert tortoise (Gopherus agassizii) is a long-lived species native to the Mojave Desert and is listed as threatened under the US Endangered Species Act. To aid conservation efforts for preserving the genetic diversity of this species, we generated a whole genome reference sequence with an annotation based on deep transcriptome sequences of adult skeletal muscle, lung, brain, and blood. The draft genome assembly for G. agassizii has a scaffold N50 length of 252 kbp and a total length of 2.4 Gbp. Genome annotation reveals 20,172 protein-coding genes in the G. agassizii assembly, and that gene structure is more similar to chicken than other turtles. We provide a series of comparative analyses demonstrating (1) that turtles are among the slowest-evolving genome-enabled reptiles, (2) amino acid changes in genes controlling desert tortoise traits such as shell development, longevity and osmoregulation, and (3) fixed variants across the Gopherus species complex in genes related to desert adaptations, including circadian rhythm and innate immune response. This G. agassizii genome reference and annotation is the first such resource for any tortoise, and will serve as a foundation for future analysis of the genetic basis of adaptations to the desert environment, allow for investigation into genomic factors affecting tortoise health, disease and longevity, and serve as a valuable resource for additional studies in this species complex.Entities:
Mesh:
Year: 2017 PMID: 28562605 PMCID: PMC5451010 DOI: 10.1371/journal.pone.0177708
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Chelonian species with whole genome sequences.
| Scientific Name | Family | Common Name | Citation |
|---|---|---|---|
| Testudinae | Agassiz’s desert tortoise | Current study | |
| Emydidae | Western painted turtle | [ | |
| Cheloniidae | Green sea turtle | [ | |
| Trionynichidae | Chinese softshell turtle | [ |
Sequences used for the assembly of the Agassiz’s desert tortoise (Gopherus agassizii) genome.
| Targeted Library Size | Number of Reads | Estimated Coverage |
|---|---|---|
| 200 bp paired-end | 1,222,612,586 | 54X |
| 300 bp paired-end | 1,186,573,102 | 53X |
| 1 kbp paired-end | 468,101,028 | 21X |
| 2 kbp mate pair | 147,436,368 | 6X |
| 4 kbp mate pair | 182,973,840 | 8X |
| 10 kbp mate-pair | 112,067,096 | 5X |
| Total | 3,319,764,020 | 147X |
1Based on an estimated total genome size of 2.4 Gbp, see Results and discussion.
Fig 1Assembly statistics for the genome of Gopherus agassizii and comparison with Python bivittatus and Alligator mississippiensis.
(A) Two k-mer frequency peaks exist for G. agassizii: a peak at 35X coverage, representing heterozygous regions and a peak at 71X coverage, representing homozygous regions. (B) Gene-based scaffolding greatly increased scaffold N50 from 169.2 kbp to 251.6 kbp, which places it within the range of recently published genomes of reptiles including the Burmese python [71] and the American alligator [72].
Assembly statistics for the genome of Gopherus agassizii.
| Contigs | Scaffolds | |
|---|---|---|
| N50 | 42,722 bp | 251,635 bp |
| L50 (Number) | 15,759 | 2,592 |
| Longest | 436,052 bp | 2,046,553 bp |
| Total Length | 2,363,296,958 bp | 2,399,952,228 bp |
| Number ≥ 1 kbp | 106,825 | 42,911 |
| Number ≥ 100 kbp | 2,672 | 6,264 |
| Proportion gaps | 0% | 1.55% |
Fig 2Frequency distribution (proportion) of GC content calculated in 5 kbp non-overlapping windows for Gopherus agassizii and other reptilian genomes.
GC content of G. agassizii closely matches that of the turtles Chrysemys picta bellii and Pelodiscus sinensis [17,18], and overlaps the American alligator (Alligator mississippiensis) [72], the green anole (Anolis carolinensis) [73], and the chicken (Gallus gallus) [70].
Fig 3Gene structure of Gopherus agassizii compared to other reptiles.
Frequency distribution of genes or exons relative to (A) coding DNA sequence (CDS) lengths, (B) exon length and (C) intron length of Agassiz’s desert tortoise, chicken [74], western painted turtle [18], and Chinese softshell turtle [17]. CDS is for genes less than 10 kb.
Repetitive content of the Agassiz’s desert tortoise (Gopherus agassizii) and the western painted turtle (Chrysemys picta bellii) genomes, estimated with libraries of known repeats (RepBase) and de novo repeat identification (RepeatModeler).
| Repeat Type | Length (bp) | % genome | Length (bp) | % genome | Length (bp) | % genome |
|---|---|---|---|---|---|---|
| SINEs | 42,775,342 | 1.78 | 44,092,705 | 1.84 | 44,277,662 | 1.87 |
| LINEs | 263,416,736 | 10.98 | 276,159,275 | 11.51 | 248,848,977 | 10.52 |
| LTR | 165,658,037 | 6.90 | 184,823,905 | 7.70 | 123,030,173 | 5.20 |
| DNA Transposons | 244,173,277 | 10.17 | 297,537,719 | 12.4 | 18,952,044 | 9.80 |
| Unclassified | 18,348,096 | 0.76 | 203,226,010 | 8.47 | 20,076,666 | 0.80 |
| Small RNA | 8,799,494 | 0.37 | 9,451,148 | 0.39 | 8,527,612 | 0.36 |
| Satellites | 519,850 | 0.02 | 1,402,596 | 0.06 | 267,675 | 0.01 |
| Simple Repeats | 11,038,402 | 0.46 | 19,005,379 | 0.79 | 11,395,517 | 0.48 |
| Low Complexity | 2,145,620 | 0.09 | 1,714,833 | 0.07 | 2,244,030 | 0.09 |
1SINE, short interspersed nuclear element; LINE, long interspersed nuclear element; LTR, long terminal repeat retrotransposons.
Fig 4Phylogeny of 10 sauropsids including Gopherus agassizii.
Phylogeny shows the relationships of sauropsids, including four chelonians, with complete genome sequences. Branch lengths are given in terms of DNA substitutions per site derived from fourfold degenerate sites.
Fig 5Comparative genomics of chelonians.
Representation of Gene Ontology terms for biological processes shared by genes with pairwise Ka/Ks > 1 in comparisons of Gopherus agassizii to (A) Chrysemys picta bellii and (B) Chelonia mydas. Treemap boxes are both sized and colored by uniqueness. Abbreviations in (A): RB, receptor biosynthesis, LCFACB, long chain fatty acyl-CoA biosynthesis, MTMD, membrane to membrane docking.
Gene Ontology categories most enriched for fixed unique SNPs in G. agassizii compared with G. morafkai and G. evgoodei.
These results are for Biological Processes, for a complete listing see S6–S8 Tables.
| Gene Ontology (GO) category | GO ID | Number of enriched genes | p-value |
|---|---|---|---|
| RNA processing | GO:0006396 | 59 | 1.23E-07 |
| macromolecule modification | GO:0043412 | 193 | 2.66E-04 |
| cellular macromolecule metabolic process | GO:0044260 | 371 | 4.43E-04 |
| primary metabolic process | GO:0044238 | 446 | 9.94E-04 |
| cellular protein metabolic process | GO:0044267 | 226 | 1.07E-03 |
| cellular nitrogen compound metabolic process | GO:0034641 | 280 | 1.59E-03 |
| cellular metabolic process | GO:0044237 | 433 | 2.34E-03 |
| protein modification process | GO:0036211 | 182 | 2.55E-03 |
| cellular protein modification process | GO:0006464 | 182 | 2.55E-03 |
| heterocycle metabolic process | GO:0046483 | 258 | 2.90E-03 |
| nucleobase-containing compound metabolic process | GO:0006139 | 253 | 3.21E-03 |
| macromolecule metabolic process | GO:0043170 | 393 | 4.47E-03 |
| mRNA metabolic process | GO:0016071 | 32 | 6.38E-03 |
| organic substance metabolic process | GO:0071704 | 454 | 6.46E-03 |
| chromatin modification | GO:0016568 | 42 | 6.96E-03 |
| cellular aromatic compound metabolic process | GO:0006725 | 256 | 7.53E-03 |
Overlapping genes in the two most enriched GO categories for fixed unique SNPs in G. agassizii compared with key functional categories.
No overlap was identified with GO:0002456 T cell mediated immunity.
| Overlapping GO category | Gene symbol | Description | |
|---|---|---|---|
| GO:0006396 RNA processing | GO:0045087 innate immune response | SRPK1 | Serine and arginine rich splicing factors protein kinase 1 |
| GO:0043412 macromolecule modification | GO: 0007568 aging | ABL1 | Tyrosine-protein kinase ABL1 |
| CARM1 | Histone-arginine methyltransferase CARM1 | ||
| HYAL2 | Hyaluronidase-2 | ||
| PRKCD | Protein kinase C delta type | ||
| RPN2 | Dolichyl-diphosphooligosaccharide—protein glycosyltransferase subunit 2 | ||
| TGFB1 | Transforming growth factor beta-1 | ||
| GO:0007623 circadian rhythm | CRY1 | Cryptochrome-1 | |
| CSNK1D | Casein kinase I isoform delta | ||
| GO:0009411 response to UV | ERCC6 | DNA excision repair protein ERCC-6 | |
| HYAL2 | Hyaluronidase-2 | ||
| PPID | Peptidyl-prolyl cis-trans isomerase D | ||
| UBE2B | Ubiquitin-conjugating enzyme E2 B | ||
| USP28 | Ubiquitin carboxyl-terminal hydrolase 28 | ||
| GO:0019724 B cell mediated immunity | PAXIP1 | PAX-interacting protein 1 | |
| PRKCD | Protein kinase C delta type | ||
| TGFB1 | Transforming growth factor beta-1 | ||
| GO:0045087 innate immune response | ABL1 | Tyrosine-protein kinase ABL1 | |
| CDC37 | Hsp90 co-chaperone Cdc37 | ||
| IFIH1 | Interferon-induced helicase C domain-containing protein 1 | ||
| NOD1 | Nucleotide-binding oligomerization domain-containing protein 1 | ||
| PELI1 | E3 ubiquitin-protein ligase pellino homolog 1 | ||
| PRKCD | Protein kinase C delta type | ||
| SRPK1 | SRSF protein kinase 1 | ||
| TAB1 | TGF-beta-activated kinase 1 and MAP3K7-binding protein 1 | ||
| TGFB1 | Transforming growth factor beta-1 | ||
| GO:0035809 regulation of urine volume | HYAL2 | Hyaluronidase-2 |
Genes under accelerated evolution across chelonians and genes with fixed differences between species in the Gopherus desert tortoise species complex according to this study.
| Gene | Gene symbol | Locus |
|---|---|---|
| Hematopoietic progenitor cell antigen | gopAga1_00002251 | |
| Cystatin-A | scaffold10781 (XM_005291323) | |
| Short Chain Dehydrogenase/Reductase Family 16c Member 5 | gopAga1_00014840 | |
| Selenoprotein S | gopAga1_00004579 | |
| Superoxide dismutase 1 | scaffold518 (XM_005283906) | |
| Free fatty acid receptor 4 | gopAga1_00009948 | |
| C-x-c motif chemokine ligand 11 | gopAga1_00011528 | |
| Interleukin 10 | gopAga1_00002900 | |
| Nfat activating protein with itam motif 1 | gopAga1_00016407 | |
| Tumor necrosis factor superfamily member 4 | scaffold21701 (XM_007061249) | |
| Interferon gamma | gopAga1_00016782 | |
| Peptidoglycan recognition protein 1 | gopAga1_00017264 | |
| Bcl2 related protein a1 | gopAga1_00011769 | |
| Granzyme A | gopAga1_00003655 | |
| Cryptochrome-1 | gopAga1_00014628 | |
| Casein kinase i isoform delta | gopAga1_00008116 | |
| DNA excision repair protein ercc-6 | gopAga1_00012364 | |
| Hyaluronidase-2 | gopAga1_00019510 | |
| Peptidyl-prolyl cis-trans isomerase d | gopAga1_00006436 | |
| Ubiquitin-conjugating enzyme e2 b | gopAga1_00015001 | |
| Ubiquitin carboxyl-terminal hydrolase 28 | gopAga1_00008255 | |
| Tyrosine-protein kinase abl1 | gopAga1_00009847 | |
| Histone-arginine 8 methyltransferase carm1 | gopAga1_00007934 | |
| Protein kinase c delta type | gopAga1_00004289 | |
| Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit 2 | gopAga1_00010670 | |
| Transforming growth factor beta-1 | gopAga1_00017303 | |
| Casein kinase I isoform delta | gopAga1_00008116 | |
*Identification number of annotated Gopherus agassizii protein, otherwise G. agassizii scaffold and RefSeq accession number of mapped locus from LASTZ (see Methods).