| Literature DB >> 21645357 |
Patricia Faivre Rampant1, Isabelle Lesur, Clément Boussardon, Frédérique Bitton, Marie-Laure Martin-Magniette, Catherine Bodénès, Grégoire Le Provost, Hélène Bergès, Sylvia Fluch, Antoine Kremer, Christophe Plomion.
Abstract
BACKGROUND: One of the key goals of oak genomics research is to identify genes of adaptive significance. This information may help to improve the conservation of adaptive genetic variation and the management of forests to increase their health and productivity. Deep-coverage large-insert genomic libraries are a crucial tool for attaining this objective. We report herein the construction of a BAC library for Quercus robur, its characterization and an analysis of BAC end sequences.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21645357 PMCID: PMC3132169 DOI: 10.1186/1471-2164-12-292
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Estimation of mean insert size in the oak BAC library. A - Pulsed-field gel electrophoresis of 38 BAC clones DNA digested with NotI (Lanes 1-38) The 7.5 kb band is from the cloning vector. Lane 39 molecular marker 1 kb+ Invitrogen, Lane 40 size standard PFGE marker (Lambda Ladder PFG marker New England Biolabs). B - Insert size distribution of clones from the Quercus robur BAC library. The x-axis shows the size of the insert in kb. The y-axis indicates the number of clones.
Characteristics of the oak BAC library
| Cloning vector | Pindigo BAC 536 |
| Partial digest enzyme | |
| Number of clones | 92160 |
| Number of 384-well plates | 240 |
| Missed wells | ~7% |
| Mean insert size | 135 kb |
| Minimum insert size | 50 kb |
| Maximum insert size | 205 kb |
| Chloroplast contamination | 2.20% |
| Mitochondrial contamination | - |
| Number of genome equivalents | 12× |
Summary of BAC end sequencing
| No. of good-quality BAC end sequences | 20,056 |
| Total base count | 12,018,238 |
| Minimum length | 100 bp |
| Maximum length | 967 bp |
| Mean length | 599 bp |
| GC content | 35.33% |
| Oak | 2.60% |
| Poplar | 1.20% |
| Grapevine | 1.20% |
| Grapevine | 0.60% |
Classification and distribution of known plant repeats in the BAC end sequences
| Class | Number of elements | % of nucleotides | Length (bp) |
|---|---|---|---|
| RTE/Bov-B | 18 | 0.04 | 4,649 |
| L1/CIN4 | 297 | 0.61 | 73,545 |
| Ty1/Copia | 1,155 | 3.03 | 364,184 |
| Gypsy/DIRS1 | 701 | 1.8 | 216,428 |
| - | |||
a - Numbers indicate the percentages of BESs displaying similarity to a repeat from the indicated category.
Figure 2Estimation of redundancy within oak BESs. Distribution of the number of masked BESs with at least one significant alignment with another BES in the dataset. BESs were masked for repetitive DNA and low-complexity sequences identified with RepeatMasker software, using the Viridiplantae section of the RepBase database. Values on the y-axis represent the number of BESs matching the number of BESs listed on the x-axis (between one and 352 BESs).
Figure 3Distribution of SSR motifs. A - Distribution of the most abundant SSR motifs in oak BESs. The values on the y-axis indicate the fraction of SSRs displaying the motifs listed on the x-axis. SSR motifs were identified with MREPS 2.5. All the dinucleotide motifs are represented. Only the five most abundant tri-, tetra-, penta- and hexanucleotide motifs are listed. B - Distribution of di-, tri-, tetra-, penta- and hexanucleotide motifs identified by MREPS, using the same parameters in 10 BAC end sequences libraries published in the NCBI dbGSS database, normalized for cross-species comparisons. C- Distribution of the most abundant SSR motifs found in oak BES in 10 other BES datasets: Carita papaya (50,661 BESs), Prunus persica (47,311 BESs), Citrus clementina (46,000 BESs), Populus trichocarpa (13,249 BESs), Brassica oleracea (85,416 BESs), Glycine max (325,952 BESs), Solanum lycopersicum (399,114 BESs), Brassica rapa (198,490 BESs), Solanum tuberosum (140,540 BESs), Cucumis sativus (63,935 BESs). The values on the y-axis indicate the fraction of SSRs displaying the motifs listed on the x-axis. SSR motifs were identified with MREPS 2.5, using the same parameters as for oak BESs. The values have been normalized for cross-species comparisons.
Figure 4Gene ontology classification of the 1,823 oak BESs significantly aligned with . A - Masked BESs were annotated as related to Molecular Function, Biological Process and Cellular Component categories. B - Molecular Function annotation of the BESs grouped into 9 higher level terms of the Gene Ontology. C - Biological Process annotation of the BESs grouped into 12 higher level terms of the Gene Ontology.
Figure 5Comparison of the distribution of functional annotations of oak BESs with the functional annotations in the Gene Ontology database. Distribution of Gene Ontology functional annotations in the oak BES dataset (blue) and in the Gene Ontology database (pink) belonging to A - the Molecular Function category (GO:0003674) and B - the Biological Process category (GO:0008150). We identified the functional categories (in bold) with frequencies in our BES dataset different from those in the Gene Ontology dataset, through chi2 tests (P < 0.05).
BlastN hits between oak BESs and the Vitis vinifera, Populus trichocarpa and Arabidopsis thaliana genomes
| Hit | No. hits | Single end (BESs) | Co-localized (pairs) | Gapped (pairs) | Non-gapped (pairs) | Collinear (pairs) | Rearranged (pairs) | ||
|---|---|---|---|---|---|---|---|---|---|
| 176 BESs | 19880 | 136 | 20 | 8 | 7 | 1 | 1 | 0 | |
| 81 BESs | 19975 | 75 | 3 | 2 | 2 | 0 | 0 | 0 | |
| 134 BESs | 19922 | 102 | 16 | 16 | 0 | 16 | 8 | 8 |
1 pair = BES pair
2 alig. = significant alignment