| Literature DB >> 27687565 |
Justin B Lack1, Jeremy D Lange1, Alison D Tang2, Russell B Corbett-Detig2, John E Pool3.
Abstract
The Drosophila Genome Nexus is a population genomic resource that provides D. melanogaster genomes from multiple sources. To facilitate comparisons across data sets, genomes are aligned using a common reference alignment pipeline which involves two rounds of mapping. Regions of residual heterozygosity, identity-by-descent, and recent population admixture are annotated to enable data filtering based on the user's needs. Here, we present a significant expansion of the Drosophila Genome Nexus, which brings the current data object to a total of 1,121 wild-derived genomes. New additions include 305 previously unpublished genomes from inbred lines representing six population samples in Egypt, Ethiopia, France, and South Africa, along with another 193 genomes added from recently-published data sets. We also provide an aligned D. simulans genome to facilitate divergence comparisons. This improved resource will broaden the range of population genomic questions that can addressed from multi-population allele frequencies and haplotypes in this model species. The larger set of genomes will also enhance the discovery of functionally relevant natural variation that exists within and between populations.Entities:
Keywords: Drosophila melanogaster; data resource.; genomic variation; reference alignment
Mesh:
Year: 2016 PMID: 27687565 PMCID: PMC5100052 DOI: 10.1093/molbev/msw195
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Genomic Data Sets Present in the Drosophila Genome Nexus Are Summarized.
| DGN Set | Data reference | Genomes | Populations | Geographic focus | Genome type |
|---|---|---|---|---|---|
| DGRP | 205 | 1 | North America | Inbred | |
| DPGP3 | 197 | 1 | Africa (Zambia) | Haploid | |
| DPGP2 | 150 | 22 | Mostly Africa | Mostly Haploid | |
| AGES | 53 | 12 | Africa | Haploid | |
| DSPR | 18 | 18 | Worldwide | Inbred | |
| POOL | (present study) | 305 | 6 | Africa/Europe | Inbred |
| CLARK | 85 | 7 | Worldwide | Inbred | |
| NUZHDIN | 58 | 13 | North America | Inbred | |
| BERGMAN | 50 | 3 | Africa/Eur./N. Am. | Isofemale | |
Note.—Further details concerning the population samples and individual genomes represented in these data sets are given in Table S1 and Table S2, respectively.
Fig. 1Geographic locations of selected population samples are shown, with the largest samples in bold print. These populations have at least three sequenced genomes with DGN consensus sequences available.
Fig. 2The extent of genomic data annotated for masking due to heterozygosity, relatedness, and admixture is shown per 119 Mb genome (filtered in that order).
Fig. 3Average values of nucleotide diversity (π) within populations (on the diagonal), average pairwise distance between populations (D, above the diagnonal), and F between populations (below the diagonal) are shown. Values are averaged across chromosome arms X, 2L, 2R, 3L, and 3R, each of which was analyzed using inversion-free genomes only.