| Literature DB >> 29850797 |
Joseph D Manthey1,2, Robert G Moyle3, Stéphane Boissinot1.
Abstract
The small and conserved genomes of birds are likely a result of flight-related metabolic constraints. Recombination-driven deletions and minimal transposable element (TE) expansions have led to continually shrinking genomes during evolution of many lineages of volant birds. Despite constraints of genome size in birds, we identified multiple waves of amplification of TEs in Piciformes (woodpeckers, honeyguides, toucans, and barbets). Relative to other bird species' genomic TE abundance (< 10% of genome), we found ∼17-30% TE content in multiple clades within Piciformes. Several families of the retrotransposon superfamily chicken repeat 1 (CR1) expanded in at least three different waves of activity. The most recent CR1 expansions (∼4-7% of genome) preceded bursts of diversification in the woodpecker clade and in the American barbets + toucans clade. Additionally, we identified several thousand polymorphic CR1 insertions (hundreds per individual) in three closely related woodpecker species. Woodpecker CR1 insertion polymorphisms are maintained at lower frequencies than single nucleotide polymorphisms indicating that purifying selection is acting against additional CR1 copies and that these elements impose a fitness cost on their host. These findings provide evidence of large scale and ongoing TE activity in avian genomes despite continual constraint on genome size.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29850797 PMCID: PMC6007501 DOI: 10.1093/gbe/evy105
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Summary of Piciformes genomic transposable element content. (a) Phylogeny of samples in our study [based on phylogeny of Jetz et al. (2012)]. (b) Summary of TE superfamily content. Stars indicate estimates from fully assembled genomes (Jarvis et al. 2014) and show that our estimates are conservative and underestimate genomic TE content. (c) Summary of select TE families genomic content. (d) Divergence curves of select CR1 families based on BLASTing raw TE matches to a portion of the 3′ end of species- and TE-specific consensus sequences (see supplementary table S1, Supplementary Material online). The y axis is the relative frequency of percent divergence, and the families are on different scales to make them all legible. (e) Genome size estimates of all sampled clades (Gregory et al. 2007). Gray boxes highlight the Picidae clade (woodpeckers) and the clade including barbets and toucans.
Sampling Information
| Common Name | Species Name | Sample # | Set | # Reads | Prop. Reads |
|---|---|---|---|---|---|
| Northern Carmine Bee-eater | SRR958514 | Phy | 12,000,000 | 0.51 | |
| Bluish-fronted Jacamar | KU 24566 | Phy | 14,193,044 | 0.77 | |
| Green-eared Barbet | KU 33324 | Phy | 15,599,647 | 0.61 | |
| Vieillot’s Barbet | KU 15540 | Phy | 19,029,009 | 0.64 | |
| Green-billed Toucan | KU 3649 | Phy | 18,531,397 | 0.64 | |
| Gilded Barbet | KU 18855 | Phy | 12,855,219 | 0.74 | |
| Spotted Honeyguide | KU 29101 | Phy | 17,445,057 | 0.78 | |
| Rufous Piculet | KU 24421 | Phy | 16,261,827 | 0.69 | |
| Antillean Piculet | KU 8153 | Phy | 15,253,876 | 0.77 | |
| Greater Flameback | KU 25777 | Phy | 11,283,426 | 0.72 | |
| Gilded Flicker | KU 30078 | Phy | 16,480,947 | 0.84 | |
| Yellow-bellied Sapsucker | KU 21911 | Phy | 17,981,465 | 0.82 | |
| Eurasian Three-toed Woodpecker | KU 30447 | Phy | 10,515,156 | 0.67 | |
| Brown-backed Woodpecker | KU 20039 | Phy | 9,945,685 | 0.69 | |
| Downy Woodpecker | SRR949789 | Phy | 12,000,000 | 0.66 | |
| Nuttall’s Woodpecker | KU 29815 | Pol | 39,814,228 | 0.94 | |
| Nuttall’s Woodpecker | KU 29816 | Pol | 59,121,354 | 0.96 | |
| Ladder-backed Woodpecker | KU 29797 | Pol | 74,403,829 | 0.97 | |
| Ladder-backed Woodpecker | KU 30061 | Pol | 53,177,745 | 0.96 | |
| Downy Woodpecker | KU 11987 | Pol | 57,308,585 | 0.96 | |
| Downy Woodpecker | KU 15939 | Pol | 60,567,542 | 0.97 |
Note.—KU samples from University of Kansas Biodiversity Institute and SRR numbers from NCBI sequence read archive.
Phy, phylogenetic data set; Pol, polymorphisms data set; # reads, number raw reads in FASTQ files; Prop. Reads, proportion of raw reads retained after quality trimming.
. 2.—Patterns of CR1 polymorphisms in three closely related woodpeckers (genus Dryobates). (a) Patterns of CR1 J3_Pass insertion polymorphisms. Numbers on branches indicate fixed differences in that lineage. (b) Variation of insertion detection using four sensitivity settings in the MELT analyses. (c) Pairwise comparisons of polymorphisms* in four CR1 families relative to single nucleotide polymorphisms (SNPs). * = [FixedBETWEEN/(FixedBETWEEN + PolymorphicWITHIN)].
. 3.—Workflow to detect genomic TE content using low-coverage shotgun sequencing of Piciformes genomes. Detailed information for all steps is described in Materials and Methods. Green shaded ovals indicate output used for comparisons and are summarized in figure 1.
. 4.—Neighbor-joining phylogeny of consensus TE sequences from CR1 families with large expansions in Piciformes genomes. The scale bar indicates estimated sequence divergence across the phylogeny.