| Literature DB >> 19656416 |
Björn Hamberger1, Dawn Hall, Mack Yuen, Claire Oddy, Britta Hamberger, Christopher I Keeling, Carol Ritland, Kermit Ritland, Jörg Bohlmann.
Abstract
BACKGROUND: Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19656416 PMCID: PMC2729077 DOI: 10.1186/1471-2229-9-106
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1Structure of white spruce genomic DNA of BAC clones PGB02 and PGB04. The position of the target genes 3CAR and CYP720B4 is indicated. Red and yellow bars represent repeated segments and segments with similarity to DNA transposons, respectively. Transposable elements were identified with the RepeatMasker using the viridiplantae section of the RepBase Update database. EcIS10, E. coli individual insertion sequence (IS) of the bacterial transposon Tn10; CSRE, conifer specific repeat element; LB/RB left and right border of pINDIGO; arrows in PGB04 indicate local putative segment duplications. The scale bar represents 10 kbp. (p) pseudogene, based on the accumulation of deleterious mutations and the absence of transcript with >90% identity.
General features of the gDNA sequences of the white spruce BAC clones PGB02 and PGB04 as compared to the genome sequence features of Arabidopsis, rice, poplar and grapevine.
| Genome Size (Mbp) | Predicted genes | Avg Gene length (bp) | Gene density (kbp per gene) | % TE | GC content (%) | |
|---|---|---|---|---|---|---|
| 115 | 25,498 | 1,992 | 4.5 | 14.0 | 36.0 | |
| 389 | 37,544 | 2,699 | 9.9 | 34.8 | 43.6 | |
| 485 | 45,555 | 2,392 | 10.6 | 42.0 | 33.7 | |
| 487 | 30,434 | 3,399 | 16.0 | 41.4 | 34.6 | |
| PGB025 | 0.172 | 1 | 3,138 | 172 | 36.0 | 38.0 |
| PGB045 | 0.094 | 1 | 3,131 | 93.6 | 41.6 | 37.0 |
1-4 [30-33]
5BAC insert size
Figure 2Gene structure of white spruce 3CAR (A) and CYP720B4 (B) and comparison of 3CAR with the grand fir (. Exons of the 3CAR and CYP720B4 genes matching the cDNA sequences are shown with grey arrows separated by introns. The UTRs are shown with grey lines. ATG, start codon. Putative cis-acting elements were identified using the PlantCare database and positions are highlighted in blue (not to scale): wun-box, wound-responsive element (Brassica oleracea); W-box, fungal elicitor responsive element (Petroselinum crispum); TCRR, TC-rich repeats, cis-acting element involved in defence and stress responsivenes (Nicotiana tabacum); TCA, cis-acting element involved in salicylic acid responsiveness (Brassica oleracea); TGACG, cis-acting regulatory element involved in the MeJA-responsiveness (Hordeum vulgare). LIM, AF326518; PIN, AF326517; roman numbers in part C indicate conserved exons in 3CAR, LIM and PIN; the scale bar represents 1 kbp.
High complexity repeats in the white spruce gDNA of PGB02 and PGB04.
| BAC | Repetitive sequences with similarities in angiosperms1 | TEs detected with RepeatMasker2 | Total repeat content3 | Similarity to EST4 |
|---|---|---|---|---|
| PGB02 | 3.7% | 21.9% | 36.0% | 14.7% |
| PGB04 | 3.0% | 17.6% | 41.6% | 17.1% |
1Portion of the white spruce gDNA sequences of PGB02 and PGB04 with similarity to repeat regions identified in the genomes of Arabidopsis, rice, poplar and grapevine (cut-off E-value < 10-5); this excludes the coding regions of 3CAR and CYP720B4.
2Percentage of PGB02 and PGB04 sequences consisting of TEs as detected by the RepeatMasker using the viridiplantae section of the RepBase Update.
3Percentage of PGB02 and PGB04 sequences consisting of high complexity repeats as detected by pairwise comparisons of the two gDNA sequences.
4Fraction of the PGB02 and PGB04 sequences with similarity (at least 80 - 90% nucleotide sequence identity) to white spruce ESTs; this excludes the coding regions of 3CAR and CYP720B4; no EST hits were detected outside repeat regions.