| Literature DB >> 15799777 |
Victor A Albert1, Douglas E Soltis, John E Carlson, William G Farmerie, P Kerr Wall, Daniel C Ilut, Teri M Solow, Lukas A Mueller, Lena L Landherr, Yi Hu, Matyas Buzgo, Sangtae Kim, Mi-Jeong Yoo, Michael W Frohlich, Rafael Perl-Treves, Scott E Schlarbaum, Barbara J Bliss, Xiaohong Zhang, Steven D Tanksley, David G Oppenheimer, Pamela S Soltis, Hong Ma, Claude W DePamphilis, James H Leebens-Mack.
Abstract
BACKGROUND: The Floral Genome Project was initiated to bridge the genomic gap between the most broadly studied plant model systems. Arabidopsis and rice, although now completely sequenced and under intensive comparative genomic investigation, are separated by at least 125 million years of evolutionary time, and cannot in isolation provide a comprehensive perspective on structural and functional aspects of flowering plant genome dynamics. Here we discuss new genomic resources available to the scientific community, comprising cDNA libraries and Expressed Sequence Tag (EST) sequences for a suite of phylogenetically basal angiosperms specifically selected to bridge the evolutionary gaps between model plants and provide insights into gene content and genome structure in the earliest flowering plants.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15799777 PMCID: PMC1083416 DOI: 10.1186/1471-2229-5-5
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1Well- supported evolutionary relationships among FGP species and other genomic models are shown in this phylogenic tree of seed plants. Red taxon names indicate those species for which we aim to sequence 10,000 ESTs, green taxon names indicate species for which we are sequencing 2000 ESTs, and blue taxon names indicate species for which large EST sets are already available in public databases [24, 25, 28, 29, 61] or will soon become available [70].
Current (12/01/04) statistics for Floral Genome Project cDNA libraries, EST sequences and unigene builds. We will perform 10,000 EST sequencing reactions for each taxon listed in the top portion of the table and 2000 ESTs for each taxon in the bottom portion of the table.
| 3.25E+05 | 6.00E+08 | 1382 | 3732 | 2771 | 34.7% | |
| 2.24E+06 | 1.37E+10 | 1611 | 4047 | 6099 | 41.5% | |
| 4.98E+06 | 1.40E+10 | 1031 | 4442 | |||
| Nuphar advena ( | 2.00E+06 | 3.20E+10 | 1134 | 8442 | 6205 | 36.1% |
| 2.80E+06 | 6.00E+09 | 1083 | 5883 | 3067 | 28.8% | |
| 1.50E+06 | 1.20E+10 | 1468 | 5188 | 4560 | 61.5% | |
| 1.30E+06 | 1.40E+10 | 1200 | 2174 | |||
| 2.74E+06 | 2.57E+10 | 1349 | 8735 | 5314 | 41.3% | |
| 3.00E+06 | 2.00E+10 | 1346 | 9531 | 6520 | 46.2% | |
| 1.97E+06 | 1.67E+10 | 1587 | 3230 | 2631 | 21.8% | |
| 7.00E+06 | 1.68E+11 | 1702 | 9079 | 6015 | 46.18% | |
| ------ | ------ | ------ | 1107 | 1648 | 23.5% | |
| ------ | ------ | ------ | 928 | |||
| Ribes americanum | 2.58E+06 | 2.25E+10 | 1200 | 2238 | 1791 | 25.0% |
| ------ | ------ | ------ | 1758 | 1549 | 13.5% | |
| 70,514 | 48,170 |
m indicates library constructed from male tissues; f for female tissues.
Unigene numbers are shown in the first line for taxa with multiple libraries.
bObserved redundancy was measured for each taxon as (EST# – Unigene #)/Unigene#.
c5188 ESTs from the male Asparagus library were sequenced in collaboration with Mike Havey (University of Wisconsin) and Chris Town (TIGR).
dMale and female Cucumis flower bud libraries described by Perl-Treves et al [71]
eVaccinium young inflorescence library was provided by Jeannine Rowland (USDA) [72].
Number of FGP unigenes that are best BLAST hits to Arabidopsis floral developmental regulation genes, with corresponding tribe ID number and number of Arabidopsis and rice genes in these tribes. Species IDs: Aam, Acorus americanus; Aof, Asparagus officinalis; Ath, Arabidopsis thaliana; Atr, Amborella trichopoda; Eca, Eschscholzia californica; Ltu, Liriodendron tulipifera; Nad, Nuphar advena; Osa, Oryza savita; Pam, Persea americanus; She, Saruma henryi; Wmi, Welwitschia mirabilis.
| At2g45190 | AFO, YABBY1 | 1010 | 4 | 7 | 3 | 1 | 1 | 3 | 2 | 12 | ||||
| At4g18960 | AG, AGAMOUS | 65 | 46 | 51 | 2 | 1 | 1 | 4 | ||||||
| At4g09960 | AGL11, MADS-box protein | 65 | 46 | 51 | 1 | 1 | ||||||||
| At2g45660 | AGL20, SOC | 65 | 46 | 51 | 1 | 1 | 2 | |||||||
| At4g24540 | AGL24, MADS-box protein | 65 | 46 | 51 | ||||||||||
| At2g03710 | AGL3, MADS-box protein | 65 | 46 | 51 | 1 | 1 | 2 | |||||||
| At2g45650 | AGL6, MADS-box protein | 65 | 46 | 51 | 1 | 1 | 1 | 2 | 1 | 2 | 9 | |||
| At4g37750 | ANT, AINTEGUMENTA | 123 | 18 | 38 | 2 | 1 | 1 | 4 | ||||||
| At1g69120 | AP1, APETALA 1 | 65 | 46 | 51 | 1 | 1 | 3 | |||||||
| At4g36920 | AP2, APELATA 2 (FL1, FLOWER1) | 123 | 18 | 38 | 1 | 1 | 1 | 3 | 8 | |||||
| At3g54340 | AP3, APETALA 3 | 65 | 46 | 51 | 1 | 2 | 1 | 2 | 2 | 1 | 9 | |||
| At1g75950 | ASK1 | 122 | 19 | 38 | 1 | 1 | 1 | 3 | ||||||
| At5g42190 | ASK2 | 122 | 19 | 38 | 3 | 2 | 7 | 1 | 16 | |||||
| At4g02570 | AXR6, AUXIN RESISTANT 6 | 324 | 10 | 16 | 2 | 2 | 1 | 7 | 1 | 2 | 1 | 17 | ||
| At1g01040 | CAF, CARPEL FACTORY (SUS1) | 446 | 8 | 13 | 2 | 1 | 3 | |||||||
| At1g26310 | CAL, CAULIFLOWER | 65 | 46 | 51 | 1 | 1 | ||||||||
| At1g75820 | CLV1, CLAVATA 1 (FASCIATA 3) | 8 | 194 | 478 | 2 | 2 | ||||||||
| At1g65380 | CLV2, CLAVATA 2 | 8 | 194 | 478 | 1 | 1 | ||||||||
| At2g27250 | CLV3, CLAVATA 3 | 10933 | 1 | 0 | ||||||||||
| At1g69180 | CRC, CRABS CLAW | 1010 | 4 | 7 | ||||||||||
| At3g61850 | DAG1, DOF AFFECTING GERMINATION 1 | 93 | 36 | 36 | 1 | 1 | ||||||||
| At2g33860 | ETT, ETTIN | 117 | 26 | 34 | 1 | 3 | ||||||||
| At3g59380 | FTA, FARNESYLTRANSFERASE A | 2266 | 1 | 5 | 1 | 1 | ||||||||
| At3g30260 | FUL, FRUITFULL (AGL8) | 65 | 46 | 51 | ||||||||||
| At4g20910 | HEN1, HUA ENHANCER 1 | 3162 | 2 | 2 | 1 | 2 | 3 | |||||||
| At2g06990 | HEN2, HUA ENHANCER 2 | 1435 | 4 | 4 | 1 | 1 | 1 | 1 | 4 | |||||
| At5g64390 | HEN4, HUA ENHANCER 4 | 601 | 10 | 7 | 2 | 1 | 1 | 1 | 6 | |||||
| At3g12680 | HUA1, ENHANCER OF AG-4 1 | 469 | 10 | 9 | 1 | 2 | 1 | 4 | ||||||
| At5g23150 | HUA2, ENHANCER OF AG-4 2 | 1582 | 4 | 4 | ||||||||||
| At1g23420 | INO, INNER NO OUTER | 1010 | 4 | 7 | ||||||||||
| At5g16560 | KAN, KANADI | 100 | 25 | 43 | 1 | 1 | ||||||||
| At5g61850 | LFY, LEAFY | 7107 | 1 | 1 | ||||||||||
| At4g32551 | LUG, LEUNIG | 1572 | 2 | 6 | 1 | 1 | 1 | 2 | 1 | 1 | 7 | |||
| At4g10350 | NAM, NO APICAL MERISTEM | 30 | 82 | 105 | ||||||||||
| At1g69490 | NAP, NAC-LIKE, ACTIVATED BY AP3/PI | 30 | 82 | 105 | 1 | 1 | ||||||||
| At1g68640 | PAN, PERIANTHIA | 253 | 10 | 21 | 1 | 1 | ||||||||
| At5g20240 | PI, PISTILLATA | 65 | 46 | 51 | 1 | 1 | 2 | 1 | 5 | |||||
| At2g34650 | PID, PINOID | 87 | 36 | 42 | 1 | 1 | 3 | |||||||
| At2g28610 | PRS, PRESSED FLOWER | 626 | 8 | 8 | ||||||||||
| At5g35770 | SAP, STERILE APETALA | 9932 | 1 | 0 | ||||||||||
| At5g15800 | SEP1, SEPALLATA1 (AGL2) | 65 | 46 | 51 | 3 | 4 | 1 | 1 | 9 | |||||
| At3g02310 | SEP2, SEPALLATA2 (AGL4) | 65 | 46 | 51 | 1 | 1 | 1 | 3 | ||||||
| At1g24260 | SEP3, SEPALLATA3 (AGL9) | 65 | 46 | 51 | 3 | 3 | 1 | 7 | ||||||
| At1g43850 | SEU, SEUSS | 1762 | 4 | 3 | 2 | 2 | ||||||||
| At3g58780 | SHP1, SHATTERPROOF 1 (AGL1) | 65 | 46 | 51 | 1 | 1 | ||||||||
| At2g42830 | SHP2, SHATTERPROOF 2 (AGL5) | 65 | 46 | 51 | 1 | 2 | 3 | |||||||
| At1g02065 | SPL8, SQUAMOSA PROMOTER BINDING PROTEIN-LIKE 8 | 284 | 15 | 13 | 1 | 2 | 1 | 5 | ||||||
| At3g23130 | SUP, SUPERMAN | 1084 | 5 | 5 | ||||||||||
| At5g03840 | TFL1 TERMINAL FLOWER 1 | 397 | 6 | 17 | ||||||||||
| At3g22780 | TSO1, | 1120 | 4 | 6 | 1 | 1 | 2 | |||||||
| At1g30950 | UFO, UNUSUAL FLORAL ORGANS | 4059 | 1 | 3 | ||||||||||
| At2g17950 | WUS, WUSCHEL | 626 | 8 | 8 | ||||||||||
| At4g00180 | YABBY3 | 1010 | 4 | 7 | 1 | 1 | 2 | 1 | 1 | 6 |
aTribe ID's are reference numbers for the PlantTribes database [33].
bGene family size is represented by the number of rice (osa) and Arabidopsis (ath) genes in each medium stringency tribe.
Number of FGP unigenes that are best BLAST hits to Arabidopsis flowering time genes, with corresponding tribe ID number and number of Arabidopsis and rice genes in these tribes (species abbreviations as in Table 2)
| At2g45660 | AGL20, SOC | 65 | 46 | 51 | 1 | 1 | 2 | |||||||
| At4g24540 | AGL24, MADS-box protein | 65 | 46 | 51 | ||||||||||
| At2g46830 | CCA1, CIRCADIAN CLOCK ASSOCIATED 1 | 3546 | 2 | 1 | 1 | 2 | ||||||||
| At2g25920 | ELF3 | 8701 | 1 | 1 | ||||||||||
| At5g11530 | EMF1, EMBRYONIC FLOWER 1 | 10070 | 1 | 0 | ||||||||||
| At5g51230 | EMF2, EMBRYONIC FLOWER 2 | 1026 | 5 | 6 | 1 | 3 | 4 | |||||||
| At4g15880 | ESD4, EARLY IN SHORT DAYS 4 | 10325 | 1 | 0 | ||||||||||
| At4g16280 | FCA, FCA | 2386 | 2 | 0 | ||||||||||
| At4g35900 | FD, FD | 6153 | 2 | 0 | ||||||||||
| At1g04400 | FHA (CYR2, CRYPTOCHROME 2) | 2549 | 2 | 3 | 1 | 1 | ||||||||
| At1g68050 | FKF1, FLAVIN-BINDING KELCH DOMAIN F BOX PROTEIN | 1047 | 6 | 5 | 1 | 1 | ||||||||
| At5g10140 | FLC, FLOWERING LOCUS F | 65 | 46 | 51 | ||||||||||
| At2g43410 | FPA, FPA | 2343 | 3 | 3 | 2 | 2 | ||||||||
| At5g24860 | FPF1, FLOWERING PROMOTING FACTOR 1 | 311 | 11 | 16 | 1 | 1 | ||||||||
| At4g00650 | FRI, FRIGIDA | 6545 | 1 | 1 | 1 | 1 | ||||||||
| At1g65480 | FT FLOWERING LOCUS T | 397 | 6 | 17 | ||||||||||
| At3g59380 | FTA, FARNESYLTRANSFERASE A | 2266 | 1 | 5 | 1 | 1 | ||||||||
| At3g30260 | FUL, FRUITFULL, AGL8 | 65 | 46 | 51 | ||||||||||
| At4g25530 | FWA, FWA | 230 | 18 | 15 | ||||||||||
| At5g13480 | FY, FY | 7822 | 1 | 1 | 2 | 2 | ||||||||
| At1g14920 | GAI, GA INSENSITIVE | 74 | 27 | 62 | ||||||||||
| At1g22770 | GI, GIGANTEA | 8967 | 1 | 1 | 1 | 3 | 2 | 2 | 1 | 10 | ||||
| At4g08920 | HY4, ELONGATED HYPOCOTYL 4 (CRY1) | 2549 | 2 | 3 | 1 | 3 | 5 | |||||||
| At2g23380 | ICU1, INCURVATA 1 | 2735 | 3 | 2 | ||||||||||
| At4g02560 | LD, LUMINIDEPENDENS | 8840 | 1 | 1 | 2 | 2 | ||||||||
| At5g61850 | LFY, LEAFY | 7107 | 1 | 1 | ||||||||||
| At1g01060 | LHY, LATE ELONGATED HYPOCOTYL | 3546 | 2 | 1 | 1 | 1 | 2 | |||||||
| At1g77080 | MAF1, MADS AFFECTING FLOWERING 1 | 65 | 46 | 51 | ||||||||||
| At5g65050 | MAF2.4, MADS AFFECTING FLOWERING2 | 65 | 46 | 51 | ||||||||||
| At5g65060 | MAF3, MADS AFFECTING FLOWERING 3 | 65 | 46 | 51 | ||||||||||
| At5g65070 | MAF4.5, MADS AFFECTING FLOWERING 4 VARIANT V | 65 | 46 | 51 | ||||||||||
| At5g65080 | MAF5.2, MADS AFFECTING FLOWERING 5 VARIANT II | 65 | 46 | 51 | ||||||||||
| At2g19520 | NFC4, FVE | 1299 | 6 | 3 | 2 | 1 | 1 | 1 | 3 | 8 | ||||
| At1g09570 | PHYA, FAR RED ELONGATED 1 | 1254 | 5 | 4 | 2 | 1 | 3 | 1 | 7 | |||||
| At2g18790 | PHYB, PHYTOCHROME B | 1254 | 5 | 4 | 1 | 1 | 2 | 1 | 5 | |||||
| At5g35840 | PHYC, PHYTOCHROME DEFECTIVE C | 1254 | 5 | 4 | 1 | 1 | 2 | |||||||
| At4g16250 | PHYD, PHYTOCHROME DEFECTIVE D | 1254 | 5 | 4 | ||||||||||
| At4g18130 | PHYE, PHYTOCHROME DEFECTIVE E | 1254 | 5 | 4 | ||||||||||
| At1g73590 | PIN1, PIN-FORMED 1 | 405 | 8 | 14 | 2 | 2 | ||||||||
| At2g01570 | RGA1, REPRESSOR OF GA1-3 1 | 74 | 27 | 62 | 1 | 1 | ||||||||
| At1g02065 | SPL8, SQUAMOSA PROMOTER BINDING PROTEIN-LIKE 8 | 284 | 15 | 13 | 1 | 2 | 1 | 5 | ||||||
| At3g11540 | SPY, SPINDLY | 6916 | 1 | 1 | 1 | 1 | 2 | |||||||
| At2g22540 | SVP, SHORT VEGETATIVE PHASE | 65 | 46 | 51 | 1 | 1 | 1 | 3 | ||||||
| At2g28290 | SYD, SPLAYED | 135 | 20 | 30 | 1 | 1 | ||||||||
| At2g26670 | TED4, ELONGATED HYPOCOTYL 1 | 2358 | 4 | 2 | 1 | 1 | ||||||||
| At5g03840 | TFL1, TERMINAL FLOWER 1 | 397 | 6 | 17 | ||||||||||
| At5g17690 | TFL2, | 5181 | 1 | 2 | 1 | 1 | 1 | 4 | ||||||
| At5g61380 | TOC1, PSEUDO-RESPONSE REGULATOR 1 | 769 | 6 | 8 | 1 | 2 | 2 | 5 | ||||||
| At5g61150 | VIP4, VERNALIZATION INDEPENDENCE 4 | 2513 | 1 | 4 | 2 | 2 | ||||||||
| At3g18990 | VRN1, REDUCED VERNALIZATION RESPONSE 1 | 4792 | 2 | 1 | 1 | 1 | ||||||||
| At4g16845 | VRN2, REDUCED VERNALIZATION RESPONSE 2 | 1026 | 5 | 6 | ||||||||||
| At1g80730 | ZFP1, ZFP1 | 511 | 6 | 13 | ||||||||||
| At5g57360 | ZTL, ZEITLUPE | 1047 | 2 | 1 | 3 |
Figure 2The relative frequencies of ESTs assigned to GO Biological process classes are quite similar across our study taxa. Class frequencies are shown for ten EST sets, the inferred Arabidopsis proteome, and Arabidopsis genes with moderate-to-high expression in young inflorescences (stage 3).
Figure 3TribeMCL gene clusters (Tribes) with floral development genes vary in size and tend to include similar numbers of rice and Arabidopsis genes (left). These gene families are well represented in our EST sets (right and Tables 2 and 3). *The unigene counts (right) for the CLAVATA gene family have been halved.
Distribution of best matches to floral organ-specific Arabidopsis genes [40] among seven unigene sets (species abbreviations as in Table 2). Shown in bold are two DUF642-domain genes with differential expression in petals.
| carpel | At5g44635 | similar to putative CDC21 protein | 549 | 8 | 10 | 1 | 1 | 1 | 3 | ||||||
| carpel | At1g71691 | hypothetical protein with GDSL-like motif | 40 | 84 | 79 | 1 | 1 | 1 | 2 | 1 | 6 | ||||
| carpel | At5g53120 | spermidine synthase | 2331 | 3 | 3 | 1 | 1 | 3 | 1 | 6 | |||||
| carpel | At5g07280 | receptor-like protein kinase-like protein | 8 | 194 | 478 | 2 | 1 | 2 | 1 | 7 | |||||
| carpel | At3g51860 | Ca2+/H+-exchanging protein-like | 823 | 6 | 7 | 1 | 1 | 1 | 2 | 2 | 1 | 8 | |||
| carpel | At5g06860 | polygalacturonase inhibiting protein (PGIP1) | 8 | 194 | 478 | 1 | 1 | 1 | 1 | 1 | 8 | ||||
| carpel | At5g02540 | putative protein | 348 | 11 | 14 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 9 | |
| carpel | At5g59320 | nonspecific lipid-transfer protein precursor – like | 277 | 11 | 18 | 1 | 1 | 1 | 2 | 2 | 2 | 1 | 10 | ||
| petal | At3g62700 | glutathione-conjugate transporter, putative | 192 | 16 | 22 | 1 | 1 | 2 | 1 | 5 | |||||
| sepal | At1g69120 | floral homeotic gene APETALA1 | 65 | 46 | 51 | 1 | 1 | 3 | |||||||
| stamen | At5g14780 | formate dehydrogenase (FDH) | 4097 | 1 | 3 | 1 | 1 | 1 | 1 | 3 | 1 | 8 | |||
| stamen | At1g52570 | phospholipase D, putative | 300 | 10 | 17 | 4 | 1 | 1 | 2 | 1 | 9 | ||||
| stamen | At3g09390 | metallothionein-like protein | 6057 | 2 | 0 | 1 | 2 | 1 | 2 | 2 | 10 | ||||
| stamen | At3g03080 | putative NADP-dependent oxidoreductase | 315 | 14 | 13 | 1 | 2 | 1 | 1 | 1 | 3 | 11 | |||
| stamen | At3g62290 | ADP-ribosylation factor-like protein | 197 | 19 | 17 | 4 | 3 | 2 | 1 | 1 | 2 | 1 | 14 | ||
| stamen | At5g43330 | cytosolic malate dehydrogenase | 1443 | 4 | 4 | 3 | 2 | 1 | 1 | 3 | 1 | 3 | 14 | ||
| stamen | At1g13950 | initiation factor 5A-4 | 1208 | 3 | 7 | 1 | 3 | 1 | 3 | 2 | 5 | 18 | |||
| stamen | At5g45775 | Expressed protein | 923 | 4 | 7 | 2 | 1 | 7 | 2 | 1 | 3 | 1 | 1 | 18 | |
| stamen | At5g14670 | ADP-ribosylation factor – like protein | 197 | 19 | 17 | 3 | 2 | 6 | 3 | 3 | 4 | 2 | 24 |
Figure 4A phylogeny for the DUF642-domain gene family indicates that two Arabidopsis genes with differential expression in petals [40] are not the products of a recent duplication event. Genes with the plant-specific but functionally uncharacterized domain family DUF642 form three clades (A, B, and C). In addition to the Arabidopsis genes with petal-specific expression patterns, Clade A includes asterid, Vitis, monocot and magnoliid genes, indicating that the clade predates the divergence of these lineages (Fig. 1). The genomes of basal-most angiosperm lineages (Amborella and Nymphaeales) and gymnosperms may also contain unsampled Clade A genes. Clade B genes were sampled from Nuphar (Nymphaeales) and Clade C genes were identified from Amborella and Pinus. Bootstrap support values (>50%) are shown above each branch. Abbreviated taxon names associated with some gene sequences: Os, Oryza sativa (rice); At, Arabidopsis thaliana; Gossyp, Gossypium spp. (cotton); G.max, Glycine max (soybean); L.japo, Lotus japonicus; M.trun, Medicago truncatula; V.vini, Vitis vinifera (grape); L.sati Lactuca sativa (lettuce); S.tube, Solanum tuberosum (potato); S.escu, Solanum esculentum (tomato); M.crys, Mesembryanthemum crystallinum (ice plant); A.cepa, Allium cepa (onion); T.aest, Triticum aestivum (wheat); H.vulg, Hordeum vulgare (barley); S.bico, Sorghum bicolor; S.offi Saccharum officinarum (sugarcane); Z.mays, Zea mays (maize); L.tuli, Liriodendron tulipifera (tulip poplar); P.amer, Persea americana (avocado); N.adve Nuphar advena; A.tric, Amborella trichopoda.