| Literature DB >> 22432011 |
Minako Izutsu1, Jun Zhou, Yuzo Sugiyama, Osamu Nishimura, Tomoyuki Aizu, Atsushi Toyoda, Asao Fujiyama, Kiyokazu Agata, Naoyuki Fuse.
Abstract
Organisms are remarkably adapted to diverse environments by specialized metabolisms, morphology, or behaviors. To address the molecular mechanisms underlying environmental adaptation, we have utilized a Drosophila melanogaster line, termed "Dark-fly", which has been maintained in constant dark conditions for 57 years (1400 generations). We found that Dark-fly exhibited higher fecundity in dark than in light conditions, indicating that Dark-fly possesses some traits advantageous in darkness. Using next-generation sequencing technology, we determined the whole genome sequence of Dark-fly and identified approximately 220,000 single nucleotide polymorphisms (SNPs) and 4,700 insertions or deletions (InDels) in the Dark-fly genome compared to the genome of the Oregon-R-S strain, a control strain. 1.8% of SNPs were classified as non-synonymous SNPs (nsSNPs: i.e., they alter the amino acid sequence of gene products). Among them, we detected 28 nonsense mutations (i.e., they produce a stop codon in the protein sequence) in the Dark-fly genome. These included genes encoding an olfactory receptor and a light receptor. We also searched runs of homozygosity (ROH) regions as putative regions selected during the population history, and found 21 ROH regions in the Dark-fly genome. We identified 241 genes carrying nsSNPs or InDels in the ROH regions. These include a cluster of alpha-esterase genes that are involved in detoxification processes. Furthermore, analysis of structural variants in the Dark-fly genome showed the deletion of a gene related to fatty acid metabolism. Our results revealed unique features of the Dark-fly genome and provided a list of potential candidate genes involved in environmental adaptation.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22432011 PMCID: PMC3303825 DOI: 10.1371/journal.pone.0033288
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1History of Dark-fly.
In 1954, a fly population derived from one pair of Oregon-R-S flies was divided into 6 populations. Three of them (aL, bL and cL populations) were reared in normal light-dark cycling conditions and the remaining three populations (dD, eD, and fD populations) were reared in constant dark conditions. Unfortunately, all of the L lines were lost by 2002. The dD and eD lines were lost in 1965 and 1967, and only the fD line has been maintained until now. In 2008, we started to rear the fD line and designated it “Dark-fly”. We have maintained Dark-fly in a minimum medium as done before (black lines), and in a standard cornmeal medium (white lines) in parallel. The population size of Dark-fly has not been controlled but has usually been about 100 flies each in several culture vials.
Figure 2Fecundity of Dark-fly and Oregon-R-S.
(A) Three-day fecundity (offspring/female) of Dark-fly and Oregon-R-S in LL, LD and DD conditions are shown by box plots. Boxes and median lines represent inter-quartile range and median values of data, and vertical lines represent minimum and maximum values of data within 1.5-fold of the inter-quartile range. Circles indicate values of outliers. * indicates FDR-adjusted p-value<0.05, Welch t-test. n = 10 (total 100 females). (B) Lifetime fecundity (offspring/female) of Dark-fly and Oregon-R-S in LD and DD conditions are shown by box plots in a similar manner to (A). ** indicates p-value<0.01, Welch t-test. n = 10 (total 100 females).
Figure 3Survival curves of Dark-fly and Oregon-R-S.
The viability of male flies (A) and female flies (B) reared together is plotted versus time (days). Dark-fly (red lines) and Oregon-R-S (blue lines) were reared under LD (dotted lines) and DD (solid lines) conditions. The viability of virgin females (C) was also measured in a similar manner. n = 92–100 flies. Oregon-R-S virgin females showed longer longevity than the mated ones, whereas Dark-fly virgin females showed shorter longevity than the mated ones.
Summary of genome sequencing.
| fly line | read length | read number | mapped read number | mapped read % | total read bases | mean depth |
| Dark-fly | 36 | 66,855,594 | 64,422,374 | 96.4 | 2,319,205,464 | 13.7 |
| Oregon-R-S | 36, 39, 48 | 87,101,330 | 78,109,114 | 89.7 | 3,307,906,716 | 19.6 |
The results of genome sequencing using an Illumina Genome Analyzer II are summarized. Flybase Dmel 5.22 genome (168,736,537 bases) was used as a reference genome.
SNP, InDel and CNV analyses.
| fly line | Dark-fly | Oregon-R-S |
| total fixed SNPs | 415,626 | 415,668 |
| SNP frequency (bases/SNP) | 406 | 406 |
| line-specific SNPs | 217,340 | 217,382 |
| total SNPs | ||
| total SNP-effects | 1,435,028 | 1,424,012 |
| intergenic | 826,111 | 824,781 |
| UTR and intron | 486,090 | 499,604 |
| synonymous coding (sSNP) | 96,674 | 78,152 |
| non-synonymous coding (nsSNP) | 25,514 | 20,840 |
| others | 639 | 635 |
| line-specific SNPs | ||
| nsSNPs without redundancy | 9,695 | 6,521 |
| genes carrying nsSNPs | 4,323 | 3,039 |
| genes carrying nonsense mutations | 28 | 23 |
| total fixed InDels | 5,322 | 5,461 |
| InDel frequency (bases/InDel) | 31,705 | 30,898 |
| line-specific InDels | 4,660 | 4,799 |
| total InDels | ||
| total InDel-effects | 16,726 | 17,507 |
| intergenic | 8,790 | 9,767 |
| UTR and intron | 7,790 | 7,674 |
| coding region (cInDel) | 144 | 66 |
| others | 2 | 0 |
| line-specific InDels | ||
| cInDels without redundancy | 52 | 27 |
| genes carrying cInDels | 50 | 27 |
| genes showing increased CNVs | 122 | ND |
| genes showing decreased CNVs | 133 | ND |
These data represent a summary of our analyses of SNPs, InDels and CNVs for the Dark-fly and Oregon-R-S genomes. ND means not determined.
Identification of ROH regions.
| fly line | Dark-fly | Oregon-R-S |
| homo and hetero SNPs (0.4 = <freq) | 477,816 | 486,013 |
| homo SNPs (0.9 = <freq) | 449,684 | 453,646 |
| hetero SNPs (0.4 = <freq<0.9) | 28,132 | 32,367 |
| homo SNP fraction in total (%) | 94.1 | 93.3 |
| number of ROH regions | 24 | 128 |
| total length of ROHs (kb) | 5,934 | 43,868 |
| fraction of ROHs in genome (%) | 4.99 | 36.85 |
| average length of ROH (mean ± SD, kb) | 230±70 | 342±155 |
| average SNP number in ROH (mean ± SD) | 981±449 | 1621±989 |
| average homo SNP fraction in ROH(mean ± SD, %) | 97.5±0.9 | 98.2±0.5 |
| number of ROH regions with significantly high homozygosity | 21 | ND |
| genes carrying nsSNPs and cInDels in ROH regions | 241 | ND |
These data represent a summary of our analyses of ROH regions. Homo and hetero SNPs were identified using Samtools and Vcftools functions. The number of homo SNPs was slightly different from that of the fixed SNPs identified using VarScan functions (Table 2), due to the difference of data filtering. ROH regions were identified using PLINK software (Tables S5 and S6). The Dark-fly ROH regions showing significantly high homozygosity were determined by statistical analyses (Tables S7 and S8). Genes carrying nsSNPs and cInDels in 21 ROH regions were counted. ND means not determined.
Figure 4Homozygosity and ROH regions.
Mean homozygosity of SNPs in a sliding window (200-kb window at 100-kb steps) was plotted versus the location on 2L (A), 2R (B), 3L (C), 3R (D) and X (E) chromosomes. The Oregon-R-S genome (blue lines) displayed higher homozygosity than the Dark-fly genome (red lines) in most of the regions. Thick horizontal bars represent ROH regions identified by PLINK software for Oregon-R-S (blue bars) and Dark-fly (red bars) and are plotted above the graph without homozygosity values.
Genes carrying nsSNPs and cInDels in the Dark-fly ROH regions.
| ROH ID# | Chr | positionstart | positionend | lengthbases | genes carrying nsSNPs or cInDels |
| ROH1 | 2L | 3353705 | 3669168 | 315463 | CG8838, CG34394, Ptpa, CG34175, CG31952, CG3238, CG31776, Sr-CIV, Spindly |
| ROH2 | 2L | 6535198 | 6782752 | 247554 | CG9596, CG11319, CG11320, CG34345, Oatp26F, Tango1, CG31633, CG11070, CG13771, Nhe3, CG11327, GRHR, CG11188, homer, TTLL3A, CG31910, CG11221, CG11322, CG11321, CG17378 |
| ROH3 | 2L | 8847085 | 9109796 | 262711 | CG32986, CG34398, CG9510, CG31886, CG32985, CG32984, CG18088, CG9541, CG9555, CG17906, CG18661, CG9568, CG9582, Toll-4 |
| ROH4 | 2L | 10278630 | 10524864 | 246234 | CG34043, CG5604, CG13138, CG5384, CG4972, GATAd, CG34367, CG5367, Cand1, pim, CG5056, rho-5, CG33303, gny, CG5168, CG5188, CG6232, CG5322, CG6206, RluA-1, RluA-2, CG7456, CG13144, Myo31DF, CG7384, Fatp |
| ROH5 | 2L | 13521459 | 13806482 | 285023 | CG33641, CG33644, CG33645, CG16853, CG18507, CG7311, CG31814, CG9014, CR31845, CG31731, sec71 |
| ROH6 | 2L | 13806743 | 14034237 | 227494 | CG16865, Sos, b, tam, Orc5, mRpS23, CG33307, CG33306, CG8997, cenG1A, Ance-2, CG16886, CG16884, nimB1, nimB3, nimB5, He, nimC1, rk, bgm, CG18095 |
| ROH7 | 2L | 15628469 | 15854613 | 226144 | CG7631, CG18480, CG4587, CycE, Ku80, CG18109, CG18518 |
| ROH9 | 2R | 2722221 | 2975600 | 253379 | CG15236, Spn42Db, Spn42De, CG3358, CheB42b, CheB42c, ppk25, mim, Cyp6u1, CG30157, vimar, Tsp42Ee, Tsp42Eh, Tsp42Ei, CG12831 |
| ROH10 | 2R | 12738094 | 13006423 | 268329 | Fen1, CG8910, Pkc53E, CG15614, mute, CG6665, ste24b, CG6796, CG8963, Ark, RhoGEF2, CG9640, CG9642, CG9646, CG8950, CG6967, CG30460, CG30456, CG15611 |
| ROH11 | 3L | 3118085 | 3327625 | 209540 | CG14963, CG32284, CG32277, CG12034, CG11505, CG12009 |
| ROH13 | 3L | 14059399 | 14275678 | 216279 | pex1, CG8100, Fbp1, Sox21b, nuf, CG34244 |
| ROH14 | 3L | 15737620 | 15945049 | 207429 | CG13445, CG12713, CG32150, CG12486, pHCl, sff, Pka-C3 |
| ROH15 | 3L | 18793182 | 19024297 | 231115 | CG14073, CR32027, CG14074, dysb, CG11637, Ir75d, CG14077, CG3819, CG14075, CG11619, CG18135, CG3808, CG18136, nkd |
| ROH16 | 3L | 20560665 | 20819130 | 258465 | CG13251, CG34260, CG13252, CG4074, Pitslre, Spc105R |
| ROH17 | 3L | 22471441 | 22725139 | 253698 | CG14459, CG14453, CG11370, CG6838, CG32454 |
| ROH19 | 3R | 2862778 | 3085343 | 222565 | CG1988, CG1105, CG1965, CG1943, CG1091, CG31248, MAGE, lap, CG14605, CG1227 |
| ROH20 | 3R | 3257401 | 3475620 | 218219 | CG14598, alpha-Est10, alpha-Est9, alpha-Est8, alpha-Est7, alpha-Est6, alpha-Est5, alpha-Est3, alpha-Est2, CG34127 |
| ROH21 | 3R | 8358059 | 8659641 | 301582 | Octbeta2R, CG11608, Cyp313a4, CG14391, mus308, Men, CG5724, CG5999 |
| ROH22 | 3R | 9912039 | 10141059 | 229020 | CCHa1, Or88a, Kif19A, 140up, CG14356, CG42500, CG31533, CG31327, DopR, CG9649, CG9631, Aats-met, trx, CG3259, su(Hw), CG31321 |
| ROH23 | 3R | 12540659 | 12771162 | 230503 | Ubx, Glut3, Abd-B |
| ROH24 | 3R | 22056540 | 22307403 | 250863 | CG14239, Hex-t1, CG5455, CG6490 |
The chromosomal position and length of the Dark-fly ROH regions showing significantly high homozygosity are listed. Genes carrying nsSNPs and InDels in each ROH region are shown. Details regarding nsSNPs and cInDels are presented in File S1.
Figure 5Alignment of read sequences around CG4594 gene.
A view of Integrated Genomics Viewer around the CG4594 gene. The numerous small gray bars represent reads of genome sequencing. A region of about 500 bases in the CG4594 gene (red thick bar) was not covered by any read sequences of the Dark-fly genome (upper), but was fully covered by the sequences of the Oregon-R-S genome (lower). Numbers on a horizontal line indicate nucleotide position on chromosome 2L. Numbers on vertical alignment indicate read depth.