| Literature DB >> 22919272 |
Brad S Coates1, Analiza P Alves, Haichuan Wang, Kimberly K O Walden, B Wade French, Nicholas J Miller, Craig A Abel, Hugh M Robertson, Thomas W Sappington, Blair D Siegfried.
Abstract
Feeding damage caused by the western corn rootworm, Diabrotica virgifera virgifera, is destructive to corn plants in North America and Europe where control remains challenging due to evolution of resistance to chemical and transgenic toxins. A BAC library, DvvBAC1, containing 109,486 clones with 104 ± 34.5 kb inserts was created, which has an ~4.56X genome coverage based upon a 2.58 Gb (2.80 pg) flow cytometry-estimated haploid genome size. Paired end sequencing of 1037 BAC inserts produced 1.17 Mb of data (~0.05% genome coverage) and indicated ~9.4 and 16.0% of reads encode, respectively, endogenous genes and transposable elements (TEs). Sequencing genes within BAC full inserts demonstrated that TE densities are high within intergenic and intron regions and contribute to the increased gene size. Comparison of homologous genome regions cloned within different BAC clones indicated that TE movement may cause haplotype variation within the inbred strain. The data presented here indicate that the D. virgifera virgifera genome is large in size and contains a high proportion of repetitive sequence. These BAC sequencing methods that are applicable for characterization of genomes prior to sequencing may likely be valuable resources for genome annotation as well as scaffolding.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22919272 PMCID: PMC3420361 DOI: 10.1155/2012/604076
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
Figure 1Flow cytometry estimate of the D. virgifera virgifera genome size compared to internal standards of Zea mays (inbred line B73; 2.500 Gb) and Glycine max (isoline Williams 83; 1.115 Gb).
Figure 2Estimated BAC genomic insert sizes using contour-clamped homogeneous electric field (CHEF) electrophoresis. DNA preparations were digested with NotI prior to separation on a 0.9% agarose gel in 0.5X TAE buffer for 24 h at 4°C.
The number of InterPro accessions obtained during annotation of D. v. virgifera BAC end sequences.
| InterPro entry | Number | InterPro functional description(s) |
|---|---|---|
| IPR015706 | 87 | RNA-directed DNA polymerase (reverse transcriptase) |
| IPR000477 | 67 | Reverse transcriptase |
| IPR009072 | 18 | Histone-fold |
| IPR012337 | 17 | Ribonuclease H-like |
| IPR007125 | 16 | Histone core |
| IPR000558 | 14 | Histone H2B |
| IPR005135 | 12 | Endonuclease/exonuclease/phosphatase |
| IPR011991 | 8 | Winged helix-turn-helix transcription repressor |
| IPR001878 | 8 | Zinc finger, CCHC-type |
| IPR005819 | 5 | Histone H5 |
| IPR001584 | 5 | Integrase, catalytic core |
| IPR002156 | 5 | Ribonuclease H domain |
| IPR005818 | 5 | Histone H1/H5 |
Summary of contigs per BAC that were assembled from Roche-454 sequencing data.
| Mid-tag library | BAC clone | Marker | Raw data (reads/kb) | Assembled data (reads/kb) | GenBank accession | Contig size (kb) |
|---|---|---|---|---|---|---|
| RL001 | 40F02 | 1304 | 23,300/10,298 | 21,525/97.8 | JQ581035 | 19.6 ± 22.2 |
| RL002 | 89B10 | 1224 | 8,701/7,408 | 7,277/118.7 | JQ581036 | 29.7 ± 20.6 |
| RL003 | 142B02 | 1203 | 29,444/12,894 | 19,447/30.5 | JQ581037 | 4.3 ± 3.6 |
| RL005 | 191G22 | 1125 | 64,410/28,909 | 8,348/104.0 | JQ581038 | 17.3 ± 18.8 |
| RL006 | 163F14 | 1304 | 9,495/3,523 | 22,530/101.7 | JQ581039 | 25.4 ± 7.2 |
| RL007 | 156M20 | 1203 | 16,427/7,179 | 9,435/24.6 | JQ581040 | 4.1 ± 2.8 |
| RL008 | 222P02 | 1345 | 43,702/20,431 | 0/0.0 | FAILED | NA |
| RL009 | 213A05 | 1411 | 25,196/11,345 | 18,868/74.5 | JQ581041 | 74.5 ± 0.0 |
| RL010 | 188M01 | 1300 | 19,912/9,238 | 18,478/90.6 | JQ581042 | 15.1 ± 9.7 |
|
| ||||||
| Total | 240,587/111,225 | 125,908 (15,738 ± 6286)/642.3 (80.3 ± 34.9) | 13.6 ± 20.1 | |||
NA: not applicable due to DNA sequencing failure.
Figure 3Comparison of haplotypes between assembled full BAC insert sequences of clones 142B02 and 156 M20. Homologous regions are aligned and representative read depths are indicated above for 142B02 and below for 156M20. Annotated genes (dark green), expressed sequence tag (EST; light green), and repetitive element sequences (orange) are indicated. Microsatellite repeat motifs are shown as (∣∣∣∣).
Figure 4Identification of putative miniature inverted repeat transposable elements (MITEs) (blue rectangles indicating direction), and de novo mapped repetitive elements identified from BAC end sequences (BES REs) and BAC full insert sequences (BAC REs), within the gene protein coding sequence (CDS) and transcript sequence (RNA).