| Literature DB >> 30355328 |
Rebecca E O'Connor1, Marta Farré2, Sunitha Joseph1, Joana Damas2, Lucas Kiazim1, Rebecca Jennings1, Sophie Bennett1, Eden A Slack2, Emily Allanson2, Denis M Larkin2, Darren K Griffin3.
Abstract
BACKGROUND: The number of de novo genome sequence assemblies is increasing exponentially; however, relatively few contain one scaffold/contig per chromosome. Such assemblies are essential for studies of genotype-to-phenotype association, gross genomic evolution, and speciation. Inter-species differences can arise from chromosomal changes fixed during evolution, and we previously hypothesized that a higher fraction of elements under negative selection contributed to avian-specific phenotypes and avian genome organization stability. The objective of this study is to generate chromosome-level assemblies of three avian species (saker falcon, budgerigar, and ostrich) previously reported as karyotypically rearranged compared to most birds. We also test the hypothesis that the density of conserved non-coding elements is associated with the positions of evolutionary breakpoint regions.Entities:
Keywords: CNE; Chromosome-level genome assembly; EBR; Genome evolution
Mesh:
Substances:
Year: 2018 PMID: 30355328 PMCID: PMC6201548 DOI: 10.1186/s13059-018-1550-x
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Statistics for the scaffold split regions tested by PCR
| Statistics | Saker falcon | Ostrich | Budgerigar |
|---|---|---|---|
| Pair-end read physical coverage within tested scaffolds | 135–524 | 2–604 | 0–631 |
| No. split SF adjacencies by RACA (default param.) | 61 | 69 | 154 |
| No. tested scaffold split regions | 22 (100%) | 49 (100%) | 43 (100%) |
| No. amplified split regions (confirmed SF joints) | 11 (50%) | 32 (65%) | 20 (46%) |
| No. non-amplified split regions | 11 (50%) | 17 (35%) | 23 (54%) |
| No. tested RACA-suggested adjacencies | 11 | 8 | 18 |
| No. amplified adjacencies (chimeric SF joints) | 5 | 7 | 11 |
| Final no. ambiguous SF joints from tested split regions | 6 | 10 | 12 |
| Selected pair-end read spanning threshold | 379 | 239 | 216 |
Fig. 1BAC clones hybridized to budgerigar chromosome two (MUN2). The green (FITC labeled) signal represents TGMCBA-375I5 (GGA17 homolog) and maps to PCF 17, and the Texas red labeled signal represents CH261-169K18 (GGA3 homolog) and maps to PCF 3c_5a
Assembly statistics from original NGS genome to RACA assembly and combined RACA and FISH assembly
| Original assembly | |||
| Stats | Budgie | Ostrich | Saker falcon |
| No. scaffolds longer 10 kbp | 1138 | 1179 | 731 |
| Total length (Gbp) | 1.08 | 1.22 | 1.17 |
| N50 (Mbp) | 11.41 | 3.64 | 4.16 |
| Default RACA assembly | |||
| Stats | Budgerigar PCFs | Ostrich PCFs | Saker falcon PCFs |
| No. PCFs | 84 | 100 | 95 |
| Total length (Gbp) | 1.04 | 1.17 | 1.14 |
| N50 (Mbp) | 46.54 | 37.95 | 39.38 |
| No. chimeric scaffolds | 80 (31%) | 58 (10%) | 50 (10%) |
| No. used scaffolds | 254 | 588 | 458 |
| % original assembly | 96.29 | 95.90 | 97.26 |
| RACA + PCR assembly | |||
| Stats | Budgerigar PCFs | Ostrich PCFs | Saker falcon PCFs |
| No. PCFs | 95 | 136 | 103 |
| Total length (Gbp) | 1.04 | 1.17 | 1.14 |
| N50 (Mbp) | 37.96 | 28.09 | 22.28 |
| No. chimeric scaffolds | 55 (21%) | 31 (5%) | 25 (5%) |
| No. used scaffolds | 254 | 588 | 458 |
| % original assembly | 96.29 | 96.02 | 97.26 |
| RACA + FISH assembly | |||
| Stats | Budgerigar chromosomes | Ostrich chromosomes | Saker falcon chromosomes |
| No. PCFs placed | 46 | 53 | 64 |
| No. PCFs oriented | 28 | 37 | 37 |
| Disagreements RACA-FISH | 4 | 0 | 0 |
| Length placed (bp) | 1,013,720,408 | 969,537,146 | 1,055,312,481 |
| Length oriented (bp) | 844,433,024 | 869,521,333 | 790,725,803 |
| % original assembly placed | 93.56 | 79.45 | 90.12 |
| % original assembly oriented | 77.93 | 71.26 | 67.52 |
Fig. 2Chromosomes homologous to chicken (ancestral) chromosome 3 with mapped BACs, scaffolds, PCFs, and zebra finch homologies shown. SCA3 = ostrich chromosome 3, FCH6 = saker falcon chromosome 6, MUN6 = budgerigar chromosome 2. The full dataset can be found on the interactive browser Evolution Highway at the following link: http://eh-demo.ncsa.uiuc.edu/birds/
Fig. 3Ideogram representation of the gross genomic structure of the ostrich (Struthio camelus—SCA) with chicken homologies per chromosome. Each GGA (chicken) homolog is represented as a different color—randomly assigned. Intrachromosomal differences are not shown here but listed in Additional file 1: Table S1
Fig. 4Ideogram representation of the gross genomic structure of the budgerigar (Melopsittacus undulatus—MUN) with chicken homologies per chromosome. Each GGA (chicken) homolog is represented as a different color as assigned in Fig. 3. Intrachromosomal differences are not shown here but listed Additional file 1: Table S2
Fig. 5Ideogram representation of the gross genomic structure of the saker falcon (Falco cherrug—FCH) with chicken homologies per chromosome. Each GGA (chicken) homolog is represented as a different color as assigned in Fig. 3. Intrachromosomal differences are not shown here but listed in Additional file 1: Table S3
Patterns of fusion and fission revealed in the budgerigar, saker falcon, and the ostrich using the chicken genome as a reference
| Ancestral chromosome (numbered according to chicken) | Budgerigar | Saker falcon | Ostrich | Chicken | Chicken-ostrich differences | ||
|---|---|---|---|---|---|---|---|
| Inter- | Intra- | Inter- | Intra- | Inter- | Inter- | Intra- | |
| 1 | Fission | 1 | Fission | 4 | 0 | 0 | 0 |
| 2 | – | 2 | Fission and fusion to GGA21 and 28 | 0 | 0 | 0 | 0 |
| 3 | Fusion to GGA17 | 3 | Fission | 4 | 0 | 0 | 3 |
| 4a | – | 0 | – | 2 | 0 | Fusion | 1 |
| 4b | Fusion to GGA9 | 0 | Fusion to GGA15 | 4 | 0 | 2 | |
| 5 | Fission and fusion to GGA6 | 1 | Fission and fusion to GGA10 and 20 | 2 | 0 | 0 | 1 |
| 6 | Fusion to GGA5 | 0 | Fusion to GGA17 | 3 | 0 | 0 | 0 |
| 7 | Fission and fusion to GGA6 and 5 | 1 | Fusion to GGA13 | 2 | 0 | 0 | 3 |
| 8 | Fusion to GGA9 | 0 | – | 0 | 0 | 0 | 1 |
| 9 | Fusion to GGA8 | 1 | – | 0 | 0 | 0 | 0 |
| 10 | Fusion to GGA12 | 0 | Fusion to GGA5 | 0 | 0 | 0 | 0 |
| 11 | Fusion to GGA4q | 0 | – | 0 | 0 | 0 | 0 |
| 12 | Fusion to GGA10 | 1 | Fusion to GGA14 | 0 | 0 | 0 | 0 |
| 13 | Fusion to GGA20 | 0 | Fusion to GGA7 | 0 | 0 | 0 | 0 |
| 14 | Fusion to GGA5 | 1 | Fusion to GGA12 and 28 | 1 | 0 | 0 | 1 |
| 15 | – | 2 | Fusion to GGA4q and 19 | 2 | 0 | 0 | 0 |
| 16 | No data | No data | No data | No data | No data | No data | No data |
| 17 | Fusion to GGA3 | 0 | Fusion to GGA6 | 1 | 0 | 0 | 0 |
| 18 | – | 1 | Fusion to GGA19 | 2 | 0 | 0 | 1 |
| 19 | – | 0 | Fusion to GGA15 and 18 | 0 | 0 | 0 | 0 |
| 20 | Fusion to GGA13 | 0 | Fusion to GGA5 | 0 | 0 | 0 | 0 |
| 21 | – | 0 | Fusion to GGA2 and 23 | 0 | 0 | 0 | 0 |
| 22 | – | 0 | – | 2 | 0 | 0 | 1 |
| 23 | – | 2 | Fusion to GGA21 | 2 | 0 | 0 | 0 |
| 24 | – | 0 | – | 1 | 0 | 0 | 0 |
| 25 | No data | No data | No data | No data | No data | No data | No data |
| 26 | – | 0 | – | 2 | 0 | 0 | 0 |
| 27 | – | 0 | – | 1 | 0 | 0 | 0 |
| 28 | – | 0 | Fusion to GGA2 and 14 | 1 | 0 | 0 | 0 |
| Z | – | 0 | – | 0 | 0 | 0 | 0 |
The left-hand column represents the ancestral avian chromosome, with the subsequent columns indicating the number of inter- and intrachromosomal changes detected that have led to each extant species. For the intrachromosomal differences between ostrich and chicken, in the absence of an outgroup, the direction of change cannot be determined and thus only differences between the two species is noted
Fig. 6Phylogenetic tree highlighting the relationship of species analyzed here and in our previous study [15] demonstrating the number of inter- and intrachromosomal rearrangements that have occurred relative to the avian ancestor (interchromosomal) and the chicken (intrachromosomal). Species investigated using our approach are highlighted in yellow with other species (chicken, duck, and zebra finch) represented for context. Phylogeny is based on Burleigh et al. [70]
Statistics for CNE density in 1-kb windows for avian EBRs, msHSBs, and genomewide
| Average no. CNE bases | Average density of CNE bases | |
|---|---|---|
| Genome | 86.85 | 0.087 |
| msHSB | 106.81 | 0.107 |
| Intra | 26.14 | 0.026 |
| Fusion | 13.76 | 0.013 |
| Fission | 5.48 | 0.005 |
| EBR* | 23.76 | 0.024 |
*Fission, fusion, and intrachromosomal EBRs combined