| Literature DB >> 29519939 |
Carson Holt1,2, Michael Campbell1, David A Keays3, Nathaniel Edelman3, Aurélie Kapusta1,2, Emily Maclary4, Eric T Domyan4,5, Alexander Suh6, Wesley C Warren7, Mark Yandell1,2, M Thomas P Gilbert8,9, Michael D Shapiro10.
Abstract
The domestic rock pigeon (Columba livia) is among the most widely distributed and phenotypically diverse avian species. C. livia is broadly studied in ecology, genetics, physiology, behavior, and evolutionary biology, and has recently emerged as a model for understanding the molecular basis of anatomical diversity, the magnetic sense, and other key aspects of avian biology. Here we report an update to the C. livia genome reference assembly and gene annotation dataset. Greatly increased scaffold lengths in the updated reference assembly, along with an updated annotation set, provide improved tools for evolutionary and functional genetic studies of the pigeon, and for comparative avian genomics in general.Entities:
Keywords: Columba livia; Genome Report; HiRise assembly; MAKER annotation; rock pigeon
Mesh:
Substances:
Year: 2018 PMID: 29519939 PMCID: PMC5940132 DOI: 10.1534/g3.117.300443
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Assembly statistics for Cliv_2.1
| Parameter | Value |
|---|---|
| Estimated Physical Coverage | 389.7x |
| Total Length | 1,108,534,737 bp |
| Total scaffolds | 15,057 |
| Total scaffolds >1kb | 4,062 |
| Total scaffolds >10kb | 848 |
Assembly version comparison
| Cliv_1.0 | Cliv_2.1 | |
|---|---|---|
| Total Length | 1110.8 Mb | 1110.9 Mb |
| N50 Length | 3.15 Mb and 82 scaffolds | 14.3 Mb and 17 scaffolds |
| N90 Length | 0.618 Mb and 394 scaffolds | 1.56 Mb and 113 scaffolds |
| Completeness Estimate | 72.3–86.4% | 72.9–86.2% |
Figure 1Assembly scaffolding contiguity and scaffolding library insert size distributions. (a) Scaffolding comparison between Cliv_1.0 (input scaffolds) and Cliv_2.1 (final scaffolds) assemblies. (b) Distribution of Dovetail Genomics “Chicago” library inserts.
Figure 2Temporal landscape of transposable elements. The amounts of DNA in each TE class were split into bins of 1 My, shown on the x axis (see Methods). We note that the lower detection of older elements (right of the graph) comes from a combination of lack of detection and TE removal, and that the amount of DNA corresponding to recent elements may be underestimated (recent copies are often collapsed in assemblies). The “Others” category primarily includes unclassified repeats.
Transcriptome assembly summary
| SRA accession | Tissue | Breed | # assembled transcripts |
|---|---|---|---|
| SRR521357 | Heart | Danish tumbler | 79,473 |
| SRR521358 | Liver | Danish tumbler | 35,691 |
| SRR521359 | Heart | Oriental frill | 71,078 |
| SRR521360 | Liver | Oriental frill | 74,180 |
| SRR521361 | Heart | racing homer | 80,034 |
| SRR521362 | Liver | racing homer | 80,642 |
| SRR5878849 | Embryo | racing homer | 208,682 |
| SRR5878850 | Embryo | parlor roller | 344,735 |
| SRR5878851 | Spleen | racing homer | 156,415 |
| SRR5878852 | Olfactory epithelium | racing homer | 112,632 |
| SRR5878853 | Subepidermis | racing homer | 185,484 |
| SRR5878854 | Cochlear duct | racing homer | 189,438 |
| SRR5878855 | Brain | racing homer | 131,999 |
| SRR5878856 | Retina | racing homer | 186,060 |
Annotation statistics for Cliv_2.1
| Genes | Transcripts | |
|---|---|---|
| Total | 15,392 | 18,966 |
| match | 14,898 | 18,472 |
| new | 494 | 494 |
Count that match Cliv_1.0 annotations with a value of at least 90% (match is calculated as % identity multiplied by % end-to-end coverage).
Annotation version comparison
| Cliv_1.0 | Cliv_2.1 | |
|---|---|---|
| Total Gene Models | 15,724 | 15,392 |
| 15,022 | 14,683 | |
| 702 | 709 | |
| Total Transcripts | 19,585 | 18,966 |
| 18,569 | 18,148 | |
| 1016 | 818 |
Figure 3Evidence support comparison of annotation sets. Annotation edit distance (AED) support for gene models in Cliv_2.1 (blue line) is improved over Cliv_1.0 (NCBI Gnomon annotation, red line).
Figure 4Dot plot of syntenic regions between the Cliv_2.1 and colLiv2 assemblies of the C. livia genome. Each segment of the X axis represents a single colLiv2 scaffold ordered from largest (left) to smallest (right), while each segment of the Y axis represents a scaffold of the Cliv_2.1 assembly, ordered from largest (bottom) to smallest (top). Green dots indicate aligned regions of synteny.
Summary of Cliv_2.1 alignment to colLiv2 chromosome-level scaffolds. Overall, colLiv2 appears to exclude 1,184, or approximately 7.7%, of the 15,392 annotated genes from the Cliv_2.1 assembly; this is consistent with the overall decrease in genome size
| Cliv_2.1 scaffold representation | # of scaffolds | Scaffold length range | Scaffolds with genes | # of genes | Genes in LAST alignment to colLiv2 | Genes missing from LAST alignment to colLiv2 |
|---|---|---|---|---|---|---|
| Missing | 14,189 | 200-393,647 | 147 | 164 | NA | 164 |
| ≤50% aligned | 251 | 318-2,545,801 | 183 | 506 | 369 | 137 |
| 50–75% aligned | 183 | 581-5,717,624 | 251 | 638 | 550 | 88 |
| ≥75% aligned | 434 | 259-94,473,889 | 434 | 14,084 | 13,289 | 795 |
Figure 5Correspondence between genotyping data and marker order in colLiv2 and Cliv_2.1 assemblies. (a) Representative plot of pairwise recombination fractions for GBS markers, ordered based on best alignment to colLiv2 assembly, for chromosomes CM007527.1, CM007528.1, and CM007529.1. X and Y axes show individual markers, ordered as they map to the colLiv2 chromosomes CM007527.1, CM007528.1, and CM007529.1. White lines mark the boundaries between chromosomes. Yellow indicates low pairwise recombination fraction (linked markers), while purple indicates high pairwise recombination fraction (unlinked markers). Red arrows highlight two markers, one mapped to chromosome CM007527.1 and one mapped to CM007529.1, for which recombination fractions suggest that these markers should instead be located on chromosome CM007528.1. A white bracket indicates a region on chromosome CM007528.1 where portions of the chromosome appear to be assembled in the wrong order. (b) Plot of pairwise recombination fractions for the Cliv_2.1 scaffolds that make up linkage groups 3, 4, and 5. In (a), colLiv2 CM007527.1 largely corresponds to linkage group 3, CM007528.1 to linkage group 4, and CM007529.1 to linkage group 5. White lines mark the boundaries between individual scaffolds, with scaffold IDs indicated on the right side.