| Literature DB >> 23180799 |
Tae-Ho Lee1, Haibao Tang, Xiyin Wang, Andrew H Paterson.
Abstract
Genome duplication (GD) has permanently shaped the architecture and function of many higher eukaryotic genomes. The angiosperms (flowering plants) are outstanding models in which to elucidate consequences of GD for higher eukaryotes, owing to their propensity for chromosomal duplication or even triplication in a few cases. Duplicated genome structures often require both intra- and inter-genome alignments to unravel their evolutionary history, also providing the means to deduce both obvious and otherwise-cryptic orthology, paralogy and other relationships among genes. The burgeoning sets of angiosperm genome sequences provide the foundation for a host of investigations into the functional and evolutionary consequences of gene and GD. To provide genome alignments from a single resource based on uniform standards that have been validated by empirical studies, we built the Plant Genome Duplication Database (PGDD; freely available at http://chibba.agtec.uga.edu/duplication/), a web service providing synteny information in terms of colinearity between chromosomes. At present, PGDD contains data for 26 plants including bryophytes and chlorophyta, as well as angiosperms with draft genome sequences. In addition to the inclusion of new genomes as they become available, we are preparing new functions to enhance PGDD.Entities:
Mesh:
Year: 2012 PMID: 23180799 PMCID: PMC3531184 DOI: 10.1093/nar/gks1104
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
List of 26 plants currently served by PGDD
| Species name | Common name | Release version | Gene number |
|---|---|---|---|
| Lyrate rockcress | Version 1.0 (April 2011) | 32 670 | |
| Arabidopsis | TAIR 9.0 (June 2009) | 27 379 | |
| Purple false brome | Phytozome v6.0 | 32 255 | |
| Chinese cabbage | Version 1.1 | 22 285 | |
| Pigeonpea | November 2011 | 48 680 | |
| Papaya | December 2007 | 25 536 | |
| Green algae | Version 4.2 | 16 036 | |
| Cucumber | Phytozome v6.0 | 21 491 | |
| Strawberry | December 2010 | 34 809 | |
| Soybean | Release 1 (December 2008) | 66 153 | |
| Lotus | Release 2.5 | 42 399 | |
| Apple | August 2010 | 57 386 | |
| Barrel medic | Mt 3.5.1 (December 2010) | 45 108 | |
| Banana | July 2012 | 36 542 | |
| Rice | RAP 2.0 (November 2007) | 30 192 | |
| Moss | Version 1.6 (January 2008) | 32 272 | |
| Peach | Version 1.0 | 27 864 | |
| Western poplar | Phytozome 2.0 (February 2010) | 45 778 | |
| Castor bean | Release 0.1 (May 2008) | 38 613 | |
| Sorghum | Sbi 1.4 (December 2007) | 34 496 | |
| Tomato | Release 2.3 | 34 727 | |
| Potato | Version 3.4 | 39 031 | |
| Spikemoss | Version 1.0 (December 2007) | 22 273 | |
| Cacao | Release 0.9 (September 2010) | 28 798 | |
| Grape vine | Genoscope (August 2007) | 26 346 | |
| Maize | Release 5a (November 2010) | 32 540 |
aUnpublished genome data temporarily restricted for downloading (in accordance with the understandings in the Fort Lauderdale meeting and NHGRI policy statement).
Figure 1.Diagram of current PGDD server. (A) Diagram of pipeline to update PGDD with new genome data (in blue box). The boxes in the diagram represent four major steps of the pipeline, consisting of 18 in-house scripts. Insets in some boxes contain the name of a major program in the process. (B) Layers diagram of PGDD structure (in green polygon).
Figure 2.Homepage and examples of three major functions of PGDD. (A) The homepage of PGDD and functions supported by the database. (B) Web page of Dot plot function and a plot applying a Ks filter of 0.4–0.7 between rice and sorghum as an example, representing colinear blocks between the plants. (C) Example of Locus-search result for AT1G25460 loci in Arabidopsis. A blue line in alignment image represents same orientations of paired genes, whereas the red line represents opposite orientations of the genes. (D) Example of Map-view function. The grey vertical bars represent chromosomes, and green arrows on the bars represent the position of locus, which are similar to input sequences. The detailed loci information page, the inset, shows protein and nucleotide sequences of gene in the loci and a description of the gene.