| Literature DB >> 23359205 |
Daniel R Schrider1, Fabio C P Navarro, Pedro A F Galante, Raphael B Parmigiani, Anamaria A Camargo, Matthew W Hahn, Sandro J de Souza.
Abstract
The era of whole-genome sequencing has revealed that gene copy-number changes caused by duplication and deletion events have important evolutionary, functional, and phenotypic consequences. Recent studies have therefore focused on revealing the extent of variation in copy-number within natural populations of humans and other species. These studies have found a large number of copy-number variants (CNVs) in humans, many of which have been shown to have clinical or evolutionary importance. For the most part, these studies have failed to detect an important class of gene copy-number polymorphism: gene duplications caused by retrotransposition, which result in a new intron-less copy of the parental gene being inserted into a random location in the genome. Here we describe a computational approach leveraging next-generation sequence data to detect gene copy-number variants caused by retrotransposition (retroCNVs), and we report the first genome-wide analysis of these variants in humans. We find that retroCNVs account for a substantial fraction of gene copy-number differences between any two individuals. Moreover, we show that these variants may often result in expressed chimeric transcripts, underscoring their potential for the evolution of novel gene functions. By locating the insertion sites of these duplicates, we are able to show that retroCNVs have had an important role in recent human adaptation, and we also uncover evidence that positive selection may currently be driving multiple retroCNVs toward fixation. Together these findings imply that retroCNVs are an especially important class of polymorphism, and that future studies of copy-number variation should search for these variants in order to illuminate their potential evolutionary and functional relevance.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23359205 PMCID: PMC3554589 DOI: 10.1371/journal.pgen.1003242
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Detecting retroCNVs using sequence reads.
a) RetroCNVs present in the reference genome are detected by searching for retrocopies in the reference that are absent from a sequenced individual, as revealed by paired-end reads spanning the location of the retroCNV and mapping too far apart from one another. b) RetroCNVs absent from the reference genome are detected by using paired-end reads to detect retroCNV insertion sites, and c) using reads that span exon-exon junctions but do not map to the reference genome.
RetroCNVs versus fixed retrogenes moving from an autosome to an autosome (A→A) from the X chromosome to the X (X→X), from the X to the autosomes (X→A), or vice-versa (A→X).
| RetroCNVs | Fixed retrogenes | |
| A→A or X→X | 36 | 70 |
| A→X or X→A | 3 | 29 |
Data from Emerson et al. [29].
RetroCNVs versus fixed retrocopies inserted in intronic versus intergenic sequence.
| RetroCNVs | Fixed retrocopies | |
| Intronic insertions | 19 | 2,492 |
| Intergenic insertions | 19 | 5,339 |
Figure 2Estimated derived allele frequencies of retroCNVs segregating in three human subpopulations.
Allele frequencies were calculated as described in the Materials and Methods. RetroCNVs fixed in or absent from a given subpopulation are not shown.
Figure 3Reduced nucleotide diversity on chromosome 18 among chromosomes containing the DHFR retroCNV in CEU.
π is shown in 10 kilobase windows for chromosomes containing the DHFR retroCNV (red) and those lacking this retroCNV (black). The location of the retroCNV insertion is marked by an arrow. While there is little difference in nucleotide diversity distal to the retroCNV, there is a recombination hotspot in that region (data from ref. [65]).