| Literature DB >> 26463411 |
Wenbo Chen1, Daniel K Hasegawa2,3, Kathiravetpillai Arumuganathan4, Alvin M Simmons3, William M Wintermantel5, Zhangjun Fei6,7, Kai-Shu Ling8.
Abstract
Whiteflies of the Bemisia tabaci (Hemiptera: Aleyrodidae) cryptic species complex are among the most important agricultural insect pests in the world. These phloem-feeding insects can colonize over 1000 species of plants worldwide and inflict severe economic losses to crops, mainly through the transmission of pathogenic viruses. Surprisingly, there is very little genomic information about whiteflies. As a starting point to genome sequencing, we report a new estimation of the genome size of the B. tabaci B biotype or Middle East-Asia Minor 1 (MEAM1) population. Using an isogenic whitefly colony with over 6500 haploid male individuals for genomic DNA, three paired-end genomic libraries with insert sizes of ~300 bp, 500 bp and 1 Kb were constructed and sequenced on an Illumina HiSeq 2500 system. A total of ~50 billion base pairs of sequences were obtained from each library. K-mer analysis using these sequences revealed that the genome size of the whitefly was ~682.3 Mb. In addition, the flow cytometric analysis estimated the haploid genome size of the whitefly to be ~690 Mb. Considering the congruency between both estimation methods, we predict the haploid genome size of B. tabaci MEAM1 to be ~680-690 Mb. Our data provide a baseline for ongoing efforts to assemble and annotate the B. tabaci genome.Entities:
Keywords: Bemisia tabaci; flow cytometry; k-mer analysis; next-generation sequencing
Year: 2015 PMID: 26463411 PMCID: PMC4598660 DOI: 10.3390/insects6030704
Source DB: PubMed Journal: Insects ISSN: 2075-4450 Impact factor: 2.769
Bemisia tabaci Middle East-Asia Minor 1 (MEAM1) genome size estimation by k-mer analysis.
| Library | Total High-Quality Cleaned Bases | Total Number of Corrected 27-mers | Peak Value of 27-mer Depth | Estimated Genome Size (bp) b |
|---|---|---|---|---|
| 300 bp R1 a | 28,576,003,381 | 23,334,888,827 | 35 | 666,711,109 |
| 300 bp R2 a | 27,654,048,868 | 22,412,934,314 | 33 | 679,179,828 |
| 500 bp | 48,793,881,921 | 39,521,025,705 | 58 | 681,396,995 |
| 1 Kb | 49,456,289,287 | 40,032,313,271 | 57 | 702,321,285 |
a R1: left paired-end reads; R2: right paired-end reads. b Estimated genome size (bp) = total number of k-mer/peak value of k-mer depth distribution.
Figure 1Distribution of unique k-mer depth. The depth of k-mers (size of 27) was plotted against the frequency at which they occurred. Unique k-mers were identified from left (R1) and right (R2) paired-end reads from the library with insert size of ~300 bp, and from all paired-end reads from libraries with insert sizes of 500 bp and 1 Kb, respectively. The peak of the k-mer depth distribution was 35, 33, 58 and 57, respectively. The left smaller peaks indicated a certain degree of heterogeneity of the materials used for genome sequencing of B. tabaci MEAM1.
Flow cytometric estimation of nuclear DNA content of Bemisia tabaci MEAM1.
| Replicate | Sample | Standard a | DNA Content (pg) | |
|---|---|---|---|---|
| Male (haploid) | 1 | 100.80 | 343.51 | 0.73 |
| 2 | 103.20 | 366.74 | 0.70 | |
| 3 | 112.17 | 396.05 | 0.71 | |
| 4 | 116.46 | 419.92 | 0.69 | |
| Mean ± SD | 0.7075 ± 0.0171 | |||
| Coefficient of variation | 0.024 | |||
| Female (diploid) | 1 | 190.21 | 343.96 | 1.38 |
| 2 | 205.07 | 370.81 | 1.38 | |
| 3 | 230.07 | 399.54 | 1.44 | |
| 4 | 243.95 | 423.25 | 1.44 | |
| Mean ± SD | 1.41 ± 0.0346 | |||
| Coefficient of variation | 0.025 |
a Sample and Standard values represent the mean of G0 + G1, with chicken red blood cells used as the internal standard (2.5 pg/2C).
Figure 2Flow cytometric estimation of the nuclear DNA content of haploid male and diploid female B. tabaci MEAM1. Histograms represent relative fluorescence of stained nuclei of male (A) and female (B) whiteflies relative to the internal standard, chicken red blood cells (CRBC). X-axis = relative nuclear DNA content; Y-axis = number of nuclei.