| Literature DB >> 32806747 |
Ling Sze Lee1, Beatriz M Navarro-Domínguez1, Zhiqiang Wu1, Eugenia E Montiel1, Daleen Badenhorst1, Basanta Bista1, Thea B Gessler1, Nicole Valenzuela1.
Abstract
Recent sequencing and software enhancements have advanced our understanding of the evolution of genomic structure and function, especially addressing novel evolutionary biology questions. Yet fragmentary turtle genome assemblies remain a challenge to fully decipher the genetic architecture of adaptive evolution. Here, we use optical mapping to improve the contiguity of the painted turtle (Chrysemys picta) genome assembly and use de novo fluorescent in situ hybridization (FISH) of bacterial artificial chromosome (BAC) clones, BAC-FISH, to physically map the genomes of the painted and slider turtles (Trachemys scripta elegans). Optical mapping increased C. picta's N50 by ~242% compared to the previous assembly. Physical mapping permitted anchoring ~45% of the genome assembly, spanning 5544 genes (including 20 genes related to the sex determination network of turtles and vertebrates). BAC-FISH data revealed assembly errors in C. picta and T. s. elegans assemblies, highlighting the importance of molecular cytogenetic data to complement bioinformatic approaches. We also compared C. picta's anchored scaffolds to the genomes of other chelonians, chicken, lizards, and snake. Results revealed a mostly one-to-one correspondence between chromosomes of painted and slider turtles, and high homology among large syntenic blocks shared with other turtles and sauropsids. Yet, numerous chromosomal rearrangements were also evident across chelonians, between turtles and squamates, and between avian and non-avian reptiles.Entities:
Keywords: BioNano optical genome mapping; Chrysemys picta and Trachemys scripta; avian, squamate, and chelonian vertebrates; genome alignments; genome and chromosome evolution; karyotype evolution; phylogenomics; physical molecular cytogenetic BAC clone mapping; turtle, lizard and snake non-avian reptiles
Mesh:
Year: 2020 PMID: 32806747 PMCID: PMC7464131 DOI: 10.3390/genes11080928
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Bacterial artificial chromosome (BAC) clones mapped to C. picta chromosomes (indicated by numbers 1-25). Blue and red text denotes BAC clones mapped in the present study. Red text denotes BAC clones with discrepancies that reveal assembly error detailed in Table 2. Pink text denotes sexual development genes newly identified on improved hybrid scaffolds.
De novo bacterial artificial chromosome (BAC) clone mapping data to C. picta (CPI) chromosomes collected in present study.
| BAC ID | BAC Size | ID BioNano Scaffold | ID NCBI Scaffold | Scaffold Size (bp) | CPI Chromosome |
|---|---|---|---|---|---|
| 424K2 | 163,122 | 203 | ML621420.1 | 15,883,268 | 1p |
| 318H23 | 145,864 | 217 | ML621429.1 | 8,721,052 | 1p |
| 537D7 | 154,142 | 60 | ML621305.1 | 21,088,589 | 1q |
| 168O10 | 162,311 | 153 | ML621386.1 | 8,440,838 | 1q |
| 470O22 | 158,668 | 331 | ML621495.1 | 5,731,011 | 1q |
| 225M17 | 149,235 | 179 | ML621405.1 | 57,448,544 | 2p |
| 106D24 | 132,359 | 23 | ML621269.1 | 31,217,720 | 2p |
| 470J13 | 136,853 | 304 | ML621480.1 | 3,056,537 | 2p |
| 380G19 | 160,060 | 141 | ML621375.1 | 5,625,148 | 4q |
| 318A2 | 138,653 | 60 | ML621305.1 | 21,088,589 | 4q |
| 500C11 | 176,356 | 29,527 | ML621534.1 | 1,293,745 | 5p |
| 105C15 | 151,304 | 87 | ML621330.1 | 5,648,650 | 5q |
| 318A22 | 134,050 | 163 | ML621393.1 | 21,409,697 | 5q |
| 225A1 | 149,859 | 178 | ML621404.1 | 40,323,178 | 6 |
| 424G4 | 146,986 | 75 | ML621319.1 | 13,166,690 | 7 |
| 423F21 | 152,254 | 75 | ML621319.1 | 13,166,690 | 7 |
| 125H10 | 146,876 | 18 | ML621264.1 | 1,390,666 | 8p |
| 318N9 | 153,773 | 76 | ML621320.1 | 21,927,555 | 8p |
| 431J7 | 125,453 | 309 | ML621483.1 | 5,421,227 | 8q |
| 443C11 | 136,412 | 194 | ML621413.1 | 18,639,614 | 9p |
| 318D7 | 137,214 | 14 | ML621260.1 | 13,251,999 | 9q |
| 457H22 | 146,712 | 126 | ML621362.1 | 19,176,785 | 13 |
| 380O21 | 149,406 | 120 | ML621356.1 | 1,852,121 | 15 |
| 428I15 | 135,794 | 145 | ML621378.1 | 23,325,489 | 17 |
| 421D7 | 172,285 | 56 | ML621301.1 | 6,383,505 | 18 |
| 146H23 | 170,310 | 177 | ML621403.1 | 1,209,793 | 19 |
Chromosome arms are denoted by p = short-arm, q = long-arm. NCBI = National Center for Biotechnology Information.
Figure 2Examples of chromosomal locations of BAC clones on C. picta chromosome spreads detected by fluorescent in situ hybridization (FISH) that reveal assembly errors as listed in Table 2. Top panels (A–D) show FISH results from the present study. Bottom panels (E–H) contain unpublished images showing FISH results from our previous study [12]. Green denotes probes labeled with biotin-16-dUTP and highlighted by green arrows. Red denotes probes labeled with digoxigenin-11-dUTP and highlighted by red arrows. The red hybridization signal in panel F is subtler than in other panels and zooming in may be needed for better visualization.
Assembly errors identified in BioNano scaffolds by BAC clone FISH to C. picta chromosomes.
| ID BioNano Scaffold | ID NCBI Scaffold | Scaffold Size (bp) | BAC ID | Chromosome |
|---|---|---|---|---|
| 60 | ML621305.1 | 21,088,589 |
|
|
| 106H12 | 4 | |||
| 153 | ML621386.1 | 8,440,838 |
|
|
| 116H12 | 22 | |||
| 163 | ML621393.1 | 21,409,697 |
|
|
| 38H12 | 8 | |||
| 178 | ML621404.1 | 40,323,178 |
|
|
| 63H12 | 4 | |||
| 15 | ML621261.1 | 59,628,869 | 15H12 | 5 |
| 72H12, 35H18 | 6 | |||
| 70 | ML621314.1 | 11,495,585 | 147L13 | 1 |
| 27H12 | 11 | |||
| 207 | ML621424.1 | 27,887,944 | 45H12, 55A6 | 1 |
| 26H12 | 13 |
Errors were identified when two or more BAC clones painted to two different chromosomes but aligned to a single scaffold in the painted turtle genome assembly. Gray cells with bold font denote BAC-FISH data collected de novo in this study, whereas BACs in clear cells denote previously published BAC-FISH data [12].
Genes in the sex determination network of turtles and vertebrates that mapped to BioNano hybrid scaffolds, along with their chromosomal location and gene order in C. picta (CPI) and chicken (GGA) chromosomes.
| ID BioNano Scaffold | ID NCBI Scaffold | Sex-Related Genes | CPI Chromosome | GGA Chromosome | Gene Order |
|---|---|---|---|---|---|
| 289 | ML621472.1 |
| 1 | 1 |
|
| 70 | ML621314.1 |
| 1 | 1 | |
| 207 | ML621424.1 |
| 1 | 1 | |
| 339 | ML621502.1 |
| 2 | 2 | |
| 65 | ML621310.1 |
| 3 | 18 |
|
| 65 | ML621310.1 |
| 3 | 3 | |
| 135 | ML621369.1 |
| 3 | 3 | |
| 330 | ML621494.1 |
| 4 | 5 | |
| 59 | ML621304.1 |
| 5 | 12 |
|
| 15 | ML621261.1 |
| 5 | 6 | |
| 75 | ML621319.1 |
| 7 | Z |
|
| 128 | ML621363.1 |
| 7 | Z | |
| 163 | ML621393.1 |
| 8 | 8 |
|
| 309 | ML621483.1 |
| 8 | 8 | |
| 4 | ML621250.1 |
| 9 | 9 | |
| 145 | ML621378.1 |
| 17 | 15 | |
| 198 | ML621416.1 |
| 18 | 23 | |
| 83 | ML621327.1 |
| 20 | 17 |
Chromosomal information for chicken was obtained from NCBI (accession number GCF_000002315.5). Grey cells denote new hybrid scaffolds generated in the present study. Genes in bold text denote newly identified genes on improved hybrid scaffolds. Regular font denotes previously mapped genes [10,12].
Figure 3Exemplar FISH results using C. picta BACs to hybridize onto T. scripta chromosomes (A–D), and broad-level chromosome homology between these species based on BAC-FISH data (E). Numbers on the left of the ideograms in panel (E) denote the ID and position of the BACs hybridized. White chromosomes in panel (E) denote T. scripta chromosomes without mapped BACs to date, whose homology remains unknown. The red hybridization signal in panels (A), (B) and (D) is subtle such that zooming in may be needed for better visualization.
Chromosome comparison between the published chromosome-level genome assembly of T. s. elegans (TSE) [13] and de novo BAC-FISH data from T. scripta (TSC) (present study).
| BAC ID | TSE Assembly Scaffold | Assembly TSE Chromosome Assignment | BAC-FISH-Based TSC Chromosome (True Location) |
|---|---|---|---|
| 68H12 | CM023056.1 | TSE 1 | TSC 1 |
| 7H12 | CM023056.1 | TSE 1 | TSC 1 |
| 67H12 | CM023058.1 | TSE 3 | TSC 1 |
| O34H12 | CM023056.1 | TSE 1 | TSC 1 |
| 61H12 | CM023056.1 | TSE 1 | TSC 1 |
| 45H12 | CM023056.1 | TSE 1 | TSC 1 |
| 113H12 | CM023056.1 | TSE 1 | TSC 1 |
| 14H12 | CM023057.1 | TSE 2 | TSC 2 |
| 99H12 | CM023057.1 | TSE 2 | TSC 2 |
| 5H12 | CM023057.1 | TSE 2 | TSC 2 |
| 25H12 | CM023057.1 | TSE 2 | TSC 2 |
| 96H12 | CM023057.1 | TSE 2 | TSC 2 |
| 4H12 | CM023057.1 | TSE 2 | TSC 2 |
| 125H12 | CM023058.1 | TSE 3 | TSC 3 |
| 53H12 | CM023058.1 | TSE 3 | TSC 3 |
| 82H12 | CM023058.1 | TSE 3 | TSC 3 |
| 104H12 | CM023058.1 | TSE 3 | TSC 3 |
| 31H12 | CM023058.1 | TSE 3 | TSC 3 |
| 94H12 | CM023059.1 | TSE 4 | TSC 4 |
| 85H12 | CM023059.1 | TSE 4 | TSC 4 |
| 88H12 | CM023059.1 | TSE 4 | TSC 4 |
| 6H12 | CM023059.1 | TSE 4 | TSC 4 |
| 63H12 | CM023059.1 | TSE 4 | TSC 4 |
| 106H12 | CM023059.1 | TSE 4 | TSC 4 |
| 120H12 | CM023060.1 | TSE 5 | TSC 5 |
| 33H12 | CM023060.1 | TSE 5 | TSC 5 |
| 72H12 | CM023060.1 | TSE 5 | TSC 5 |
| 9H12 | CM023058.1 | TSE 3 | TSC 6 |
| 122H12 | CM023061.1 | TSE 6 | TSC 6 |
| 114H12 | CM023062.1 | TSE 7 | TSC 7 |
| 60H12 | CM023062.1 | TSE 7 | TSC 7 |
| 15H12 | CM023062.1 | TSE 7 | TSC 7 |
| 3H12 | CM023063.1 | TSE 8 | TSC 8 |
| 54H12 | CM023063.1 | TSE 8 | TSC 8 |
| 40H12 | CM023063.1 | TSE 8 | TSC 8 |
| 38H12 | CM023063.1 | TSE 8 | TSC 8 |
| 105H12 | CM023063.1 | TSE 8 | TSC 8 |
| 123H12 | CM023063.1 | TSE 8 | TSC 8 |
| 78H12 | CM023064.1 | TSE 9 | TSC 9 |
| 89H12 | CM023064.1 | TSE 9 | TSC 9 |
| 29H12 | CM023065.1 | TSE 10 | TSC 10 |
| 27H12 | CM023066.1 | TSE 11 | TSC 11 |
| 100H12 | CM023069.1 | TSE 14 | TSC 13 |
| 26H12 | CM023069.1 | TSE 14 | TSC 13 |
| 118H12 | CM023068.1 | TSE 13 | TSC 15 |
| 121H12 | CM023075.1 | TSE 20 | TSC 18 |
| 12H12 | CM023058.1 | TSE 3 | TSC 19 |
| 28H12 | CM023074.1 | TSE 19 | TSC 24 |
| 52H12 | CM023071.1 | TSE 16 | TSC 21 |
| 116H12 | CM023077.1 | TSE 22 | TSC 22 |
| 16H12 | CM023071.1 | TSE 16 | TSC 25 |
Grey cells denote discrepancies detected.
Figure 4Phylogenetic relationships (A) and Circos plots (B,C) showing chromosomal homology and synteny blocks identified between C. picta turtle (CPI) and selected sauropsid genomes. Panel A: ‘26′ denotes the chromosome number of Gopherus evgoodei inferred by Simison et al. (2019) from that of close relatives [13]. The short branch length between LAG and (ACA+TEL) should not be confused with a polytomy. Panels B and C: Colored blocks represent C. picta turtle scaffolds within an individual chromosome. Black and grey blocks represent individual chromosome in the each sauropsid vertebrate. CPI = Chrysemys picta, TSE = Trachemys scripta elegans, GEV = Gopherus evgoodei, DCO = Dermochelys coriacea, GGA = Gallus gallus, LAG = Lacerta agilis, ACA = Anolis carolinensis, TEL = Thamnophis elegans. Phylogenetic tree modified from Valenzuela (2018) [34]. Zooming in Panel C permits visualization of the detailed circos plots (see also Figure S2 for enlarged circos plots).