| Literature DB >> 21208460 |
Chia-Chi Hsu1, Yu-Lin Chung, Tien-Chih Chen, Yu-Ling Lee, Yi-Tzu Kuo, Wen-Chieh Tsai, Yu-Yun Hsiao, Yun-Wen Chen, Wen-Luan Wu, Hong-Hwa Chen.
Abstract
BACKGROUND: Phalaenopsis orchids are popular floral crops, and development of new cultivars is economically important to floricultural industries worldwide. Analysis of orchid genes could facilitate orchid improvement. Bacterial artificial chromosome (BAC) end sequences (BESs) can provide the first glimpses into the sequence composition of a novel genome and can yield molecular markers for use in genetic mapping and breeding.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21208460 PMCID: PMC3027094 DOI: 10.1186/1471-2229-11-3
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Statistical analysis of Phalaenopsis equestris bacterial artificial chromosome (BAC) end sequences (BESs)
| Total number of BESs | 5,535 |
| No. of paired BESs | 5,360 |
| No. of non-paired BESs | 175 |
| Total length (bp) | 4,544,250 |
| Minimum length (bp) | 123 |
| Maximum length (bp) | 1,397 |
| Average length (bp) | 821 |
| GC content | 35.95% |
| Sequence composition | |
| Potential transposable elements (%) | 1,272 (23.0) |
| Simple sequence repeats (%) | 950 (17.2) |
| Protein coding regions (%) | 641 (11.6) |
| Chloroplast sequences (%) | 29 (0.5) |
| Unknown genomic sequences (%) | 2,643 (47.7) |
Figure 1Size distribution of . The trimmed sequences ranged from 123 to 1,397 bp in length and had an average edited read length of 821 bp.
Number of bacterial artificial chromosome (BAC) end sequences (BESs) containing repetitive DNA
| Class, subclass, group | No. of BESs | % of BESs with repetitive DNA |
|---|---|---|
| Class I retrotransposons | 963 | 75.7 |
| Ty1-copia | 163 | 12.8 |
| Ty3-gypsy | 533 | 41.9 |
| LINE | 95 | 7.5 |
| SINE | 0 | 0.0 |
| Unclassified retrotransposons | 172 | 13.5 |
| Class II DNA transposons | 137 | 10.8 |
| Ac/Ds | 12 | 0.9 |
| CACTA, En/Spm | 67 | 5.3 |
| Mutator (MULE) | 15 | 1.2 |
| Tourist/Harbinger/Helitron/Mariner | 10 | 0.8 |
| Unclassified Transposons | 33 | 2.6 |
| Miniature inverted-repeat transposable elements | 2 | 0.2 |
| Centromere | 19 | 1.5 |
| rRNA | 33 | 2.6 |
| Unclassified | 118 | 9.3 |
| Total | 1272 | 100 |
Figure 2Number of . Among the protein matches, the top BLASTX match was to a protein of V. vinifera.
Figure 3Gene Ontology (GO) analysis of the . Among the 641 BESs containing protein-encoding regions, 321 were annotated to the cellular components category, 182 to the molecular functions category, and 329 to the biological processes category.
Distribution of simple sequence repeats in P. equestris bacterial artificial chromosome (BAC) end sequences
| Type | Type | ||
|---|---|---|---|
| A/T | 264 | AACTC/AGTTG | 1 |
| C/G | 20 | AACTT/AATTG | 1 |
| AC/GT | 24 | AAGAG/CTCTT | 3 |
| AG/CT | 47 | AAGGG/CCCTT | 1 |
| AT/AT | 253 | AAGTT/AATTC | 1 |
| CG/CG | 2 | AAGCC/CGGTT | 1 |
| AAC/GTT | 22 | AATAT/ATATT | 4 |
| AAG/CTT | 12 | AATCC/AGGTT | 1 |
| AAT/ATT | 51 | AATGC/ACGTT | 1 |
| ACC/GGT | 3 | ACACG/CTGTG | 1 |
| ACT/ATG | 2 | ACCTC/AGTGG | 1 |
| AGC/CGT | 1 | ACGAG/CTCTG | 1 |
| AGG/CCT | 6 | ACTAT/ATATG | 1 |
| AGT/ATC | 5 | AGAGG/CCTCT | 4 |
| CCG/CGG | 2 | AGGAT/ATCCT | 1 |
| AAAG/CTTT | 2 | AGGGC/CCCGT | 1 |
| AAAT/ATTT | 20 | AGGGG/CCCCT | 1 |
| AACT/ATTG | 1 | AAAAAG/CTTTTT | 11 |
| AATT/AATT | 4 | AAAAAT/ATTTTT | 10 |
| ACAG/CTGT | 1 | AAAACC/GGTTTT | 15 |
| ACAT/ATGT | 12 | AAAATT/AATTTT | 1 |
| ACCT/ATGG | 4 | AAAATC/AGTTTT | 2 |
| ACTC/AGTG | 1 | AAACCT/ATTTGG | 1 |
| AGAT/ATCT | 2 | AAACCC/GGGTTT | 1 |
| AGGG/CCCT | 2 | AAAGTC/AGTTTC | 1 |
| ACGT/ATGC | 1 | AAGGGC/CCCGTT | 1 |
| AAAAC/GTTTT | 10 | AACGTG/ACTTGC | 4 |
| AAAAG/CTTTT | 20 | AAGATC/AGTTCT | 1 |
| AAAAT/ATTTT | 34 | AATATT/AATTAT | 1 |
| AAACT/ATTTG | 2 | ACATAG/ATCTGT | 1 |
| AAAGT/ATTTC | 2 | ACATAT/ATATGT | 2 |
| AAAGG/CCTTT | 1 | ACCTGC/ACGTGG | 2 |
| AAATC/AGTTT | 2 | AACCCT/ATTGGG | 1 |
| AAATG/ACTTT | 1 | ACACCC/GGGTGT | 1 |
| AAATT/AATTT | 30 | ACCCCT/ATGGGG | 1 |
| AACAC/GTGTT | 1 | AGAGGC/CCGTCT | 1 |
Distribution and frequency of simple sequence repeats (SSRs) detected in different plant species
| No. of BESs | 26,068 | 88,825 | 6,376 | 78,427 | 5,506 | 31,907 | 54,960 |
| Total sequence length (bp) | 13,987,589 | 39,551,595 | 4,517,901 | 69,423,321 | 4,520,220 | 18,117,956 | 37,410,959 |
| Mononucleotides | 40.3 (878) | 7.5 (320) | 0.8 (6) | 9.1 (696) | 29.9a (284)b | 6.0 (188) | 7.2 (167) |
| Dinucleotides | 13.1 (285) | 36.2 (1,551) | 47.7 (350) | 19.9 (1,531) | 34.3 (326) | 28.0 (881) | 15.4 (358) |
| Trinucleotides | 21.2 (462) | 20.4 (876) | 20.6 (151) | 28.9 (2,219) | 11.0 (104) | 18.7 (586) | 35.7 (831) |
| Tetranucleotides | 3.9 (84) | 6.9 (294) | 9.0 (66) | 10.2 (783) | 5.3 (50) | 14.9 (467) | 8.3 (193) |
| Pentanucleotides | 15.3 (333) | 21.2 (911) | 13.1 (96) | 21.4 (1,642) | 13.5 (128) | 21.4 (673) | 21.6 (504) |
| Hexanucleotides | 6.3 (137) | 7.9 (338) | 8.9 (65) | 10.5 (804) | 6.1 (58) | 11.0 (347) | 11.9 (276) |
| Total SSRs | 2,179 | 4,290 | 734 | 7,675 | 950 | 3,142 | 2,329 |
| SSR frequencyc | 6.4 | 9.2 | 6.2 | 9 | 4.8 | 5.8 | 16.1 |
| Most frequent SSR motif | A/T | AT/TA | AT/TA | CCG/CGG | A/T | AT/TA | AGC/GCT |
a Percentage of SSRs in each category
b Number of SSRs in each category
c Average estimated distance between SSRs (kb)
Figure 4Frequency of AT-rich repeat motifs in the nuclear genomes of .
Microsynteny between Phalaenopsis and A. thaliana, O. sativa, P. trichocarpa and V. vinifera
| No. of hits | 94 | 83 | 142 | 123 |
| Pair ends | 12 | 6 | 14 | 12 |
| Same chromosome | 11 | 6 | 12 | 10 |
| 50- to 300-kb sequence | 0 | 0 | 1 | 1a |
a The whole-genome sequence data of V. vinifera shows that each individual chromosome contains several contigs. Thus, we were unable to determine the exact distances between the BAC pair ends that mapped together on the various chromosomes of the grape genome.
Number of bacterial artificial chromosome (BAC) end sequences (BESs) containing Phalaenopsis chloroplast DNA with hits to the nuclear and chloroplast DNA of A. thaliana, O. sativa, P. trichocarpa and V. vinifera
| Nuclear DNA | 2 | 24 | 21 | 25 |
| Chloroplast DNA | 25 | 24 | 27 | 28 |