| Literature DB >> 36251273 |
F Gözde Çilingir1, Luke A'Bear2, Dennis Hansen3,4, Leyla R Davis5, Nancy Bunbury2,6, Arpat Ozgul1, Daniel Croll7, Christine Grossen1.
Abstract
BACKGROUND: The Aldabra giant tortoise (Aldabrachelys gigantea) is one of only two giant tortoise species left in the world. The species is endemic to Aldabra Atoll in Seychelles and is listed as Vulnerable on the International Union for Conservation of Nature Red List (v2.3) due to its limited distribution and threats posed by climate change. Genomic resources for A. gigantea are lacking, hampering conservation efforts for both wild and ex situpopulations. A high-quality genome would also open avenues to investigate the genetic basis of the species' exceptionally long life span.Entities:
Keywords: Aldabrachelys gigantea; Hi-C sequencing; HiFi sequencing; conservation management; genome assembly; reference genome; rewilding
Mesh:
Year: 2022 PMID: 36251273 PMCID: PMC9553416 DOI: 10.1093/gigascience/giac090
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 7.658
Figure 1:(A) A female Aldabrachelys gigantea resting at La Vanille Nature Park, Mauritius. (B) World map showing the location of Aldabra Atoll. (C) Hi-C contact map of the chromosome-level assembled A. gigantea reference genome. Blue boxes represent assembled pseudo-chromosomes and green boxes represent assembled scaffolds that constitute pseudo-chromosomes. (D) BUSCO completeness scores for the Sauropsida dataset and (E) assembly metrics (including length of the longest scaffold N50 and N90) and sequence composition (GC content) of the chromosome-level A. gigantea genome.
Summary of the genomic data produced in this study.
|
| |
| NCBI BioProject | PRJNA822095 |
|
| |
| PacBio SMRT II HiFi data (Gb) | 192 |
| HiFi reads NCBI SRA Accession | SRR18672579 |
|
| |
| Illumina NovaSeq 6000 data (Gb) | 196 |
| Hi-C reads SRA Accession | SRR18673000 |
|
| |
| Assembled genome size (Gb) | 2.37 |
| Scaffold N50 (Mb) | 148.6 |
| No. of scaffolds | 719 |
| Contig N50 (Mb) | 61.5 |
| No. of contigs | 422 |
| BUSCO completeness (sauropsida_odb10) | 97.3% complete, 0.4% fragmented, 2.3% missing |
|
| |
| Assembled genome size (bp) | 16,467 |
|
| |
| PacBio SMRT IsoSeq data (Gb) | 1.1 |
| IsoSeq reads NCBI SRA Accession | SRR18674283 |
| No. of predicted protein-coding genes | 23,953 |
| No. of functionally annotated genes | 22,554 |
| Mean gene length (bp) | 39,458 |
| BUSCO completeness (sauropsida_odb10) | 91.9% complete, 2.3% fragmented, 5.8% missing |
| DOI for annotations | |
|
| |
| Illumina NovaSeq 6000 data (Gb) | 202 |
| NCBI SRA Accessions | SRR14611971-SRR18674101 |
Contiguity and completeness statistics of all available chromosome-level assembled chelonian genomes.
| Species name (Accession No.) | Family | Genome size (Gbp) | Contig N50 (Mbp) | Scaffold N50 (Mbp) | BUSCO completeness* |
|---|---|---|---|---|---|
|
| Testudinidae | 2.374 | 61.5 | 148.6 | 97.3% [S: 96.2%, D: 1.1%], F: 0.4%, M: 2.3% |
|
| Testudinidae | 2.299 | 13.027 | 147.4 | 97% [S: 95.9, D: 1.1%], F: 0.5%, M: 2.5% |
|
| Cheloniidae | 2.134 | 39.416 | 134.4 | 97.3% [S: 96.3%, D: 1.0%], F: 0.4%, M: 2.3% |
|
| Dermochelyidae | 2.165 | 7.03 | 137.6 | 96.3% [S: 95.3%, D: 1.0%], F: 0.6%, M: 3.1% |
|
| Emydidae | 2.481 | 0.021 | 16 | 96.6% [S: 95.7%, D: 0.9%], F: 1.1%, M: 2.3% |
|
| Emydidae | 2.126 | 0.205 | 140.4 | 95.0% [S: 94.0%, D: 1.0%], F: 1.2%, M: 3.8% |
|
| Geoemydidae | 2.484 | 15.011 | 135 | 97.3%[S: 95.2%, D: 2.1%], F: 0.5%, M: 2.2% |
|
| Geoemydidae | 2.368 | 33.353 | 130.5 | 97.5% [S: 95.9%, D: 1.6%], F: 0.4%, M: 2.1% |
|
| Trionychidae | 2.238 | 30.964 | 132 | 96.4% [S: 95.3%, D: 1.1%], F: 0.6%, M: 3.0% |
BUSCO score generated from the sauropsid (sauropsida_odb10) database. BUSCO statistics: C, complete; D, duplicated; F, fragmented; M, missing; S, single copy.
Figure 2:Synteny analysis of 10 chromosomes in Aldabrachelys gigantea (blue horizontal lines), Gopherus evgoodei (orange horizontal lines), Mauremys mutica (green horizontal lines), and Trachemys scripta elegans (red horizontal lines) genome assemblies shows high levels of conservation between distantly related chelonian taxa. Gray, yellow, green, and light blue lines between assemblies indicate syntenic regions, inversions, translocations, and duplications, respectively. The four compared assemblies represent all chelonian families (see cladogram on the left, split times from [104]) except Platysternidae within the chelonian superfamily Testudinoidea, which includes families of Emydidae (terrapins), Geoemydidae, and Testudinidae (land-dwelling tortoises). Trachemys scripta elegans is from Emydidae, Mauremys mutica is from Geoemydidae, and Gopherus evgoodei and Aldabrachelys gigantea are from Testudinidae.
Figure 3:(A) The location of the Aldabra Atoll in the Western Indian Ocean and sampling locations of 30 individuals across the atoll. Every colored mark on the map represents a sampled tortoise. (B) Principal component analysis (PCA) plot of 30 Aldabrachelys gigantea individuals from the islands of Grande Terre (East, light blue circles; South & West, green circles) and Malabar (East, dark blue triangles; West, purple triangles) and two individuals from the Zurich Zoo (Hermania, black diamond; Maleika, light pink square). Principal components 1 and 2 account for 14.2% and 3.64% of the overall genetic variability, respectively. (C) Admixture proportions of all the individuals for ancestral populations (k) varied from 2 to 5. Each bar represents one individual and shows its admixture proportions. Colors used for k = 4 correspond to the colors used in the PCA in panel B. (D) Picture of zoo-housed individual Hermania, originating from the Aldabra Atoll and whose sample contributed to this genome assembly. Photo courtesy of Leyla Davis, Zoo Zürich.