| Literature DB >> 16928278 |
Charlotte Lindqvist1, Anne-Cathrine Scheen, Mi-Jeong Yoo, Paris Grey, David G Oppenheimer, James H Leebens-Mack, Douglas E Soltis, Pamela S Soltis, Victor A Albert.
Abstract
BACKGROUND: The endemic Hawaiian mints represent a major island radiation that likely originated from hybridization between two North American polyploid lineages. In contrast with the extensive morphological and ecological diversity among taxa, ribosomal DNA sequence variation has been found to be remarkably low. In the past few years, expressed sequence tag (EST) projects on plant species have generated a vast amount of publicly available sequence data that can be mined for simple sequence repeats (SSRs). However, these EST projects have largely focused on crop or otherwise economically important plants, and so far only few studies have been published on the use of intragenic SSRs in natural plant populations. We constructed an EST library from developing fleshy nutlets of Stenogyne rugosa principally to identify genetic markers for the Hawaiian endemic mints.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16928278 PMCID: PMC1560379 DOI: 10.1186/1471-2229-6-16
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1Simplified phylogenetic tree of the Hawaiian mint relationships. Phylogenetic analyses based on (a) 5S-NTS nuclear ribosomal DNA (nrDNA) and (b) chloroplast DNA (cpDNA) show the position of the endemic Hawaiian mints within the Stachys lineage [3]. The cross, likely indicative of an interlineage reticulation event [3], shows the different positions of Mesoamerican and temperate North American Stachys species with respect to one another and the Hawaiian mints. Taxa marked with an asterix (*) are those with fleshy nutlets (excluding the Hawaiian genus Haplostachys). These, including the monotypic genus Bostrychanthera, are the only taxa with fleshy fruits in the entire subfamily Lamioideae.
Figure 2Flowers and fruits of Hawaiian mints. (a) Stenogyne rugosa flowers, and (b) fruits of Phyllostegia ambigua comprising four fleshy nutlets.
Stenogyne rugosa EST library and sequencing statistics.
| Primary titre (pfu) | 1,5 × 106 |
| Amplified titre (pfu/ml) | 2,4 × 106 |
| Avg cDNA insert size | 1500 bp |
| Avg good sequence length | 494 bp |
| Total sequences | 1273 |
| Sequences passed quality check | 942 (74%) |
| unique gene sequences (unigenes) | 628 |
| Avg unigene length | 653 bp |
| Observed redundancya | 50% |
aObserved redundancy: (EST# after quality check – Unigene #)/Unigene#.
Figure 3Gene Ontology (GO) classification of the . The relative frequencies of GO hits for Stenogyne rugosa unigenes assigned to the GO functional categories Biological Process, Molecular Function, and Cellular Component, as defined for the Arabidopsis proteome.
Figure 4Comparison of GO classification frequences for . Relative frequences assigned to the GO categories Biological Processes and Molecular Function are shown.
Stenogyne (Ste) unigenes and best BLAST hits to Arabidopsis (Ath) floral developmental and flowering time genes.
| AFO, ABNORMAL FLORAL ORGANS/YABBY1 | At2g45190 | 1 | 5 |
| ANL2, ANTHOCYANINLESS2 (Homeobox-leucine zipper) | At4g00730 | 1 | 17 |
| ATCa, ARABIDOPSIS THALIANA CENTRORADIALIS | At2g27550 | 1 | 6 |
| ARF4, AUXIN RESPONSE FACTOR 4 | At5g60450 | 1 | 23 |
| ATEXPA10, ARABIDOPSIS THALIANA EXPANSIN A10 | At1g26770 | 1 | 30 |
| DCL3, DICER-LIKE 3 (CAF-like DEAD Box) | At3g43920 | 1 | 6 |
| FCAa, FCA-like RNA binding | At2g47310 | 1 | 3 |
| GIa, GIGANTEA | At1g22770 | 1 | 1 |
| LDa, LUMINIDEPENDENS-like Homeodomain | At4g02560 | 1 | 1 |
| NAM, NO APICAL MERISTEM (similar to NAC2) | At3g10500 | 1 | 83 |
| PAT1, PHYTOCHROME A SIGNAL TRANSDUCTION 1 (member of GRAS family) | At5g48150 | 1 | 27 |
a Flowering time genes.
b Gene family size is represented by the number of Arabidopsis (Ath) genes in each medium stringency tribe [see 11].
A list of 44 microsatellite markers identified in a total of 42 unigenes showing the motif of the repeat unit and the annotation of the unigenes as defined by best-matched Arabidopsis protein (BLASTX E-value < 1e-10).
| 260553 | GGCGGCGGCGGCGG | At3g19760.1 – eukaryotic translation initiation factor 4A/DEAD box RNA helicase, putative |
| 260567 | ACAACACAACACAACACAA | no hit |
| 260570 | AGCAGCAGCAGCAGCAGCAG | At4g16280.1 – flowering time control protein/FCA gamma (FCA) |
| 260579 | TGCTGCTGCTGCT | At2g41900.1 – zinc finger (CCCH-type) family protein |
| 260619 | ATATATATATA | At3g48860.2 – expressed protein |
| 260625 | GATGATGATGATG | At4g16630.1 – DEAD/DEAH box helicase, putative (RH28) |
| 260641 | TCATCATCATCA | no hit |
| 260657 | CTGCTGCTGCTGCTGCT | no hit |
| 260658 | TCGTCGTCGTCG | At5g27540.1 – GTP-binding protein-related |
| 260658 | GCCGCCGCCGCCG | At5g27540.1 – GTP-binding protein-related |
| 260664 | TTATTATTATTAT | At3g04770.1 – 40S ribosomal protein SA (RPSaB) |
| 260670 | ACCACCACCACC | At2g26510.1 – xanthine/uracil permease family protein |
| 260691 | TATATATATATAT | no hit |
| 260708 | AGAAGAAGAAGAAGAAGAA | no hit |
| 260711 | TTCTTCTTCTTCTTC | At2g38410.1 – VHS domain-containing protein/GAT domain-containing protein |
| 260714 | AAATAAATAAATAAA | no hit |
| 260715 | TGCTTGCTTGCTT | At1g73660.1 – protein kinase family protein |
| 260720 | ATACATACATACA | At5g49720.1 – endo-1,4-beta-glucanase KORRIGAN (KOR)/cellulase (OR16pep) |
| 260754 | ATTATTATTATT | At1g48600.1 – phosphoethanolamine N-methyltransferase 2, putative (NMT2) |
| 260776 | ATATATATATATATATATATATATATATATATATAT | At5g08380.1 – alpha-galactosidase, putative |
| 260779 | CGACCCGACCCGAC | no hit |
| 260781 | TTATTATTATTA | no hit |
| 260802 | ACAACACAACACAACACAACACAACACAA | no hit |
| 260810 | ATCATCATCATCA | At5g56350.1 – pyruvate kinase, putative |
| 260811 | CTCTCTCTCTC | At5g48150.2 – phytochrome A signal transduction 1 (PAT1) |
| 260853 | CCGCCGCCGCCG | At5g13440.1 – ubiquinol-cytochrome C reductase iron-sulfur subunit |
| 260883 | CCTCCCCTCCCCTC | At5g46250.1 – RNA recognition motif (RRM)-containing protein |
| 260903 | CACCACCACCAC | At3g11400.1 – eukaryotic translation initiation factor 3G/eIF3g |
| 260907 | GCTGCTGCTGCT | At5g17920.1 – 5-methyltetrahydropteroyltriglutamate – homocysteine methyltransferase |
| 260915 | CCAAACCAAACCAAACCAAACCAAA | At3g12020.1 – kinesin motor protein-related |
| 260925 | GCAGCAGCAGCA | At1g03880.1 – 12S seed storage protein (CRB) |
| 260939 | GCTGCTGCTGCTGC | At1g80490.2 – WD-40 repeat family protein |
| 260940 | ACGAACGAACGAA | no hit |
| 260971 | GGCGGCGGCGGC | no hit |
| 260986 | AGAAGAAGAAGAAGAA | At1g10040.1 – expressed protein |
| 260996 | GCTGCTGCTGCT | At5g17920.1 – 5-ethyltetrahydropteroyltriglutamate – homocysteine methyltransferase |
| 261018 | ATTTTATTTTATTTT | At3g29160.3 – Snf1-related protein kinase (KIN11) |
| 261039 | TTAATTAATTAATT | At2g45190.1 – axial regulator YABBY1/AFO |
| 261062 | GCTGCTGCTGCTGCTGCTG | At2g28000.1 – RuBisCO subunit binding-protein alpha subunit/CHAPERONIN-60ALPHA |
| 261064 | ATACATACATACA | At4g23400.1 – major intrinsic family protein/MIP family protein |
| 261094 | GAGAGAGAGAG | At1g48410.2 – argonaute protein (AGO1) |
| 261109 | ATTCATTCATTCAT | At3g15880.1 – WD-40 repeat family protein |
| 261109 | ACCACCACCACC | At3g15880.1 – WD-40 repeat family protein |
| 261131 | TAATATAATATAATA | no hit |
A list of primer sets for 44 EST microsatellite markers in the Stenogyne EST library. 35 primer sets were tested initially with Stenogyne rugosa (unigene IDs with bold text), 30 of which gave clear PCR products. Of these 30 primer pairs, 24 (marked with an asterix) gave positive amplification for four Hawaiian mint accessions.
| CCCTCCGACGACACTAAA | CGACAATCTGGCAAACAG | 322 | |
| GAAAGTGAAAGCAGCAGATAC | ATGTGGAGGAATGATGACAC | 275 | |
| GTGGGAGAAACCTGAAGAG | TGTATTGACTGCCCTGATG | 109 | |
| TTTCTCTTCTCTCACTCACTTTC | TGATTACACCACTTCTATTGTT | 226 | |
| ATCTTCCCTTCCTCAAATC | TCATCATCGTCGTCTTCTTC | 325 | |
| GGAAGTAGAAGCAGAAGAAGG | GAGTTAGCATTGAAGGAAACA | 345 | |
| CAAGAATCAAACAGACCAGAC | CAGGGAAGGTAAAGCAATC | 375 | |
| TAAGTCTCATAGCAAGCGAAC | TATTCTACATCCCTCACACCC | 128 | |
| 260658 | CACACCAAATCGTCAAATC | CTTCCCTCTATTCTCCAAACT | 239 |
| 260658 | CACACCAAATCGTCAAATC | CTTCCCTCTATTCTCCAAACT | 239 |
| AACATTCACCAACCACCTT | GTCCCAGAGCACCTTCTT | 186 | |
| AAGAAAGAGAAGAAGGATGGT | AGAAGGGTTTGAGTGGATG | 169 | |
| TCCGAAATCATACTCATAAA | GGGCGATGTGCTTAGTTC | 258 | |
| ACCAGGAGGAAGATTGGA | TATGATTGGAGAGGTGAGATG | 217 | |
| GCACGAGGCAGTAATCTTT | GCAAATAATCACCACAGTTCTT | 395 | |
| AATAGAGATTGGAGAGAAGCA | CGACACAAACGAAACATC | 321 | |
| AAATCTGGGCTACTCAAACA | GGTCAGTCTTCTGTGAACCA | 190 | |
| ACGAGGTTTAGAAGCAAAGT | GTTGGAGTATCTGGGAAGAAG | 159 | |
| CACGAGGATTTACTGCTTCA | ACAAGGACATAGTTGGAGGTT | 314 | |
| ACTGTTGTTGATGTTCCC | TCATTCTTCCACCATTTGA | 257 | |
| 260779 | CCTTCACCTGCTCTTCATAC | TCTTTACTCTTCTCTTTCTGACC | 219 |
| GCACGAGGACAAGAAATAGA | CGCTGCCTGAAGATGAGA | 226 | |
| GAAAGTGAAAGCAGCAGATAC | GATGTGGAGGATGATGACAC | 270 | |
| GCTTCCTTCTCTTCCCAA | ACTGGTTCCGATTCTGTATG | 220 | |
| CAGTTGGGAAAGATAAACAGA | GGAACACAAGGAGAGGATAAA | 380 | |
| CGGATTGCTGATTTACTGA | GAGATTCAAACGAACACGA | 174 | |
| AGAAGGAGCGGGAGGTTT | TAATAATCTTGCGTCGGAG | 278 | |
| ACGAGAATGGGATAAAGAAAC | ATCGGTCGGTTGAGTAAGA | 362 | |
| 260907 | CATTGGCTGGTCATAAGG | TAGTAGTGTTTGGGAGGATTG | 210 |
| CATCGGTTAGAAACACAAGAA | CAAACAAAGAGAAGCCTACC | 305 | |
| ATAAATCTCTCCCTCCTCTCC | TTCTTCGTCGTTCCTCCT | 400 | |
| CGAATGTAGAAAGATAAAGGAG | CTGGACAAGTAAACGGTAGG | 221 | |
| GTTACATACCGTTCCCACC | TACCCTCTCTCATCCTCAAC | 394 | |
| TGTTGGAGTGTAAAGTTGGTT | GAGTTGTTGTTGCTGACGA | 380 | |
| AGAGACTGTGAGTGAGCAGG | CCATAACATCAATACCATCCA | 374 | |
| 260996 | CCAGGGTGAACAATGAAG | TTCTACGGTTTGAGGGAAG | 165 |
| ACTGGAGGAAGAAAGCAATAC | TTTACCAAATGAACCAATCC | 236 | |
| CATAGTAACAGATGAACCCTAA | ATCAACCCGTGAAGAAACC | 220 | |
| 261062 | TTCTCTCACTCAATCCTGAAA | TACAACGCAATGACCGATAC | 299 |
| CAACGCACACACTATGCTTC | TATTCTGGGTAGGACCATTC | 228 | |
| 261094 | GCAGTGATGGCACAGTTAG | CGAGAAGTCGGCAGTTCA | 317 |
| 261109 | ATTCATTCATTCCGCTACAC | CAGAGGGCTGCTAAGATG | 157 |
| 261109 | ATTCATTCCGCTACACAAA | GGTTTATCCGCTGGTGGT | 366 |
| ACGGATGGACTTAGGACTTT | TTTGAGATTTGAGTAGTTTGAGA | 241 |
Results from five EST SSR loci tested for variation among a total of 88 individual accessions, including 84 Hawaiian mint accessions and four North American relatives in the genus Stachys.
| 260553 | coding region (5' of gene) | 315–331 | 6 | 1–3 | 1.3 | 0.334 |
| 260708 | no hit | 173–259 | 25 | 1–8 | 2.5 | 0.844 |
| 260715 | 3'-UTR | 176–188 | 4 | 1–4 | 2.0 | 0.458 |
| 261039 | 3'-UTR | 312–331 | 5 | 1–4 | 2.9 | 0.773 |
| 261064 | 3'-UTR | 221–263 | 12 | 1–6 | 2.9 | 0.872 |
Figure 5Allele frequency distributions of five EST SSR markers. SSR markers (a) 260708, (b) 261064, (c) 261039, (d) 260715, and (e) 260553. See also Tables 3–5 for more information on these loci.