| Literature DB >> 24759626 |
Jacqueline M Doyle1, Todd E Katzner2, Peter H Bloom3, Yanzhu Ji1, Bhagya K Wijayawardena4, J Andrew DeWoody4.
Abstract
Biologists routinely use molecular markers to identify conservation units, to quantify genetic connectivity, to estimate population sizes, and to identify targets of selection. Many imperiled eagle populations require such efforts and would benefit from enhanced genomic resources. We sequenced, assembled, and annotated the first eagle genome using DNA from a male golden eagle (Aquila chrysaetos) captured in western North America. We constructed genomic libraries that were sequenced using Illumina technology and assembled the high-quality data to a depth of ∼40x coverage. The genome assembly includes 2,552 scaffolds >10 Kb and 415 scaffolds >1.2 Mb. We annotated 16,571 genes that are involved in myriad biological processes, including such disparate traits as beak formation and color vision. We also identified repetitive regions spanning 92 Mb (∼6% of the assembly), including LINES, SINES, LTR-RTs and DNA transposons. The mitochondrial genome encompasses 17,332 bp and is ∼91% identical to the Mountain Hawk-Eagle (Nisaetus nipalensis). Finally, the data reveal that several anonymous microsatellites commonly used for population studies are embedded within protein-coding genes and thus may not have evolved in a neutral fashion. Because the genome sequence includes ∼800,000 novel polymorphisms, markers can now be chosen based on their proximity to functional genes involved in migration, carnivory, and other biological processes.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24759626 PMCID: PMC3997482 DOI: 10.1371/journal.pone.0095599
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Assembled avian nuclear genomes in NCBI as of 12 September 2013.
| Species | Order | Assemblysize (Mb) | Estimatedgenomesize | Estimated#genes | Meangenelength | Meanexonspergene | Meanexonlength | Meanintronlength |
|
|
|
|
|
|
|
|
|
|
|
| Psittaciformes | 1175.4 |
| - | - | - | - | - |
|
| Anseriformes | 1105.1 | 1.26 | 19,144 | 20574 | 8.2 | 164 | 2664 |
|
| Psittaciformes | 1204.7 |
| 14,405 | - | - | - | - |
|
| Falconiformes | 1172.0 | 1.22 | 16,263 | 20646 | 8.9 | 173 | 2395 |
|
| Falconiformes | 1174.8 | 1.19 | 16,204 | 19314 | 8.8 | 173 | 2250 |
|
| Passeriformes | 1118.3 | 1.31 | 18,649 | - | - | - | - |
|
| Galliformes | 1046.9 |
| 17,040 | 16702 | 8 | 166 | 2203 |
|
| Passeriformes | 1065.3 | 1.25 | 13291 | - | - | - | - |
|
| Galliformes | 1061.8 |
| 15,704 | - | - | - | - |
|
| Psittaciformes | 1117.4 |
| - | - | - | - | - |
|
| Passeriformes | 1043.0 | 1.08 | 17,520 | 19840 | 9.27 | 170 | 2208 |
|
| Passeriformes | 1232.1 |
| 14527 | - | - | - | - |
|
| Columbiformes | 1108.0 | 1.3 | 17,300 | 18364 | 8.5 | 166 | 2271 |
|
| 1143.0 |
Genome sizes in plain text were estimated by the kmer method (citations can be found in the “species” column). Genome sizes in italics were estimated by other methods (e.g., Feulgen absorption microspectrophotometry; citations can be found in the “estimated genome size” column).
Figure 1Movements of the captured male golden eagle.
Movements of the golden eagle (USFWS Band #0679-02608) whose genome sequence is presented herein. GPS data were collected by a CTT-11060 telemetry unit at 15-minute intervals from capture date (6 December 2012) through 07 March 2013. Home range size during this period was 1068 km2 (95% KDE).
Figure 2A. chrysaetos mitochondrial genome map.
Cox1, cox2 and cox3 indicate cytochrome oxidase subunits 1–3; cob indicates cytochrome b; atp6 and atp8 indicate ATPase subunits 6 and 8; nad1–nad6 indicate NADH dehydrogenase subunits 1–6. Transfer RNA genes are designated by single-letter amino acid codes.
Figure 3Depth of sequencing of the A. chrysaetos genome.
Sequencing depth is on the x-axis while the y-axis shows the percentage of total bases at a given depth. Reads were aligned to the genome using bowtie2.
Repetitive elements in the A. chrysaetos genome. Numbers indicate repeat size in bp and percentage of genome assembly (in parenthesis).
| Total | RepeatProteinMask | RepeatMasker | RepeatModeler | trf | |
| repeat size: bp (%) | repeat size: bp (%) | repeat size: bp (%) | repeat size: bp (%) | repeat size: bp (%) | |
| SINEs | 2,063,865 (0.13%) | NA | 1,664,482 (0.11%) | 773,136 (0.05%) | NA |
| LINEs | 39,834,388 (2.57%) | 22,041,715 (1.42%) | 35,622,475 (2.30%) | 28,613,532 (1.85%) | NA |
| LTRs | 21,717,448 (1.40%) | 2,619,141 (0.17%) | 19,036,431 (1.23%) | 17,744,753 (1.15%) | NA |
| DNAs | 8,382,378 (0.54%) | 256,301 (0.02%) | 7,635,911 (0.49%) | 1,412,454 (0.09%) | NA |
| Unknown | 7,837,457 (0.51%) | 0 (0.00%) | 844,327 (0.05%) | 6,993,212 (0.45%) | NA |
| Tandem repeats | 14,577,786 (0.94%) | NA | 588,197 (0.04%) | 244,508 (0.02%) | 14,109,713 (0.91%) |
| Total | 92,021,614 (5.94%) | 24,908,961 (1.61%) | 64,751,314 (4.18%) | 56,079,698 (3.62%) | 14,109,713 (0.91%) |
trf, Tandem Repeat Finder [46].
Proximity of anonymous microsatellites [51] to annotated A. chrysaetos genes.
| Locus | Repeat motif | Scaffold length(nt) | Number of geneswithin | Notes | Ontologies of genes within 20 kb | ||
| 1 kb | 10 kb | 20 kb | |||||
| Aa43 | (AC)14 | 385553 | 0 | 1 | 1 | Exostosin | |
| Aa15 | (CA)13 | 201766 | 1 | 2 | 3 | Microsatellite within gene | Metabolic process |
| Aa26 | (AC)14 | 498563 | 0 | 0 | 0 | ||
| Hal10 | (CA)12 | 897486 | 0 | 0 | 0 | Nearest gene within 430 kb | |
| IEAAAG09 | (RAAG)18 | 750407 | 0 | 0 | 0 | Nearest gene within 306 kb | |
| Aa11 | (CA)11 | 211691 | 0 | 0 | 0 | Nearest gene within 63 kb | |
| Aa36 | (AC)16 | 173020 | 0 | 0 | 0 | Nearest gene within 215 kb | |
| Hal13 | (GT)17 | 147430 | 0 | 0 | 0 | ||
| Aa12 | (GT)12 | 109279 | 0 | 0 | 0 | Nearest gene within 302 kb | |
| Aa27 | (CA)11 | 71173 | 0 | 0 | 0 | Nearest gene within 72 kb | |
| Aa39 | (AC)13 | 24219 | 0 | 0 | 0 | Nearest gene within 40 kb | |
| IEAAAG04 | (AAAG)6(AAAC)4(AAAG)6 | 134664 | 0 | 0 | 0 | Nearest gene within 140 kb | |
| IEAAAG13 | (AAAG)3(RAAG)13(AAAG)16 | 302707 | 0 | 0 | 0 | Nearest gene within 77 kb | |
| IEAAAG14 | (AAAG)18 | 363834 | 0 | 0 | 0 | Nearest gene within 165 kb | |
| IEAAAG15 | (AAAG)7 | 464074 | 1 | 1 | 1 | Microsatellite within gene | Tumor necrosis factor |