| Literature DB >> 18284697 |
Haining Lin1, Shu Ouyang, Amy Egan, Kan Nobuta, Brian J Haas, Wei Zhu, Xun Gu, Joana C Silva, Blake C Meyers, C Robin Buell.
Abstract
BACKGROUND: High gene numbers in plant genomes reflect polyploidy and major gene duplication events. Oryza sativa, cultivated rice, is a diploid monocotyledonous species with a ~390 Mb genome that has undergone segmental duplication of a substantial portion of its genome. This, coupled with other genetic events such as tandem duplications, has resulted in a substantial number of its genes, and resulting proteins, occurring in paralogous families.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18284697 PMCID: PMC2275729 DOI: 10.1186/1471-2229-8-18
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1Size distribution of paralogous protein families in rice and Arabidopsis. The exact number of families is listed above the bars.
Figure 2Functional classification of paralogous family and singleton proteins in rice and Arabidopsis.
Figure 3GOSlim assignment of A) rice paralogous families and singletons, B) Arabidopsis paralogous families and singletons. The paralogous protein families are further classified by family size.
Two-sample binomial tests for GOSlim assignments of paralogous family and singleton proteins in rice
| Binding, otherb | 3.3 | 6.5 | <1e-5 |
| Carbohydrate bindingc | 2.7 | 0.6 | <1e-5 |
| DNA bindingb | 4.8 | 8.0 | <1e-5 |
| Hydrolase activityb | 7.8 | 12.7 | <1e-5 |
| Kinase activityc | 16.0 | 6.2 | <1e-5 |
| Nucleotide bindingc | 13.4 | 4.2 | <1e-5 |
| Protein binding, otherc | 14.2 | 9.5 | <1e-5 |
| Receptor activityc | 2.3 | 0.4 | <1e-5 |
| Transcription factor activityb | 4.3 | 9.3 | <1e-5 |
| Catalytic activity, otherb | 8.7 | 12.2 | <1e-5 |
| Structural molecule activityb | 0.8 | 2.2 | <1e-5 |
| Oxygen bindingb | 0.7 | 1.9 | <1e-5 |
| Transcription regulator activityb | 1.1 | 2.3 | <1e-5 |
| Transporter activityb | 5.0 | 7.0 | <1e-5 |
| Lipid bindingb | 0.4 | 1.1 | <1e-5 |
| Molecular function, otherb | 0.1 | 0.4 | 0.001 |
| Enzyme regulator activityb | 0.5 | 0.9 | 0.008 |
| Motor activity | 0.5 | 0.3 | 0.051 |
| Transferase activity | 7.0 | 7.7 | 0.095 |
| Receptor binding | 0.0 | 0.1 | 0.137 |
| RNA binding | 1.8 | 2.1 | 0.369 |
| Translation factor activity, nucleic acid binding | 0.5 | 0.7 | 0.353 |
| Signal transducer activity | 1.0 | 0.9 | 0.43 |
| Chromatin binding | 0.3 | 0.2 | 0.465 |
| Nucleic acid binding, other | 1.8 | 1.9 | 0.882 |
| Nuclease activity | 0.8 | 0.8 | 0.888 |
a GoSlim assignment classifications were performed as described in the Materials and Methods.
b Enrichment of GOSlim annotations in paralogous protein families compared to singletons.
c Reduction of GOSlim annotations in paralogous protein families compared to singletons.
d Benjamini and Hochberg correction for multiple testing.
Two-sample binomial tests for GOSlim assignments of paralogous family and singleton proteins in Arabidopsis
| Hydrolase activityb | 7.5 | 12.6 | <1e-5 |
| Kinase activityc | 10.4 | 5.5 | <1e-5 |
| Nucleotide bindingc | 10.2 | 4.6 | <1e-5 |
| Protein binding, otherc | 12.9 | 8.2 | <1e-5 |
| Transcription factor activityb | 4.2 | 9.0 | <1e-5 |
| Receptor activityc | 1.9 | 0.7 | <1e-5 |
| DNA bindingb | 4.1 | 7.2 | <1e-5 |
| Oxygen bindingb | 0.1 | 1.4 | <1e-5 |
| Receptor bindingc | 0.5 | 0.1 | <1e-5 |
| Carbohydrate bindingc | 0.7 | 0.3 | <1e-3 |
| Lipid bindingb | 0.3 | 0.8 | 0.001 |
| Structural molecule activityb | 1.6 | 2.5 | 0.002 |
| Enzyme regulator activityb | 0.7 | 1.4 | 0.005 |
| Molecular function, otherb | 1.8 | 2.5 | 0.011 |
| Transporter activityb | 5.0 | 6.0 | 0.019 |
| Nucleic acid binding, otherc | 2.6 | 2.0 | 0.027 |
| Motor activityb | 0.2 | 0.5 | 0.03 |
| Transferase activity | 5.3 | 6.1 | 0.053 |
| RNA binding | 1.5 | 1.9 | 0.099 |
| Binding, other | 12.3 | 11.3 | 0.102 |
| Signal transducer activity | 1.0 | 0.8 | 0.132 |
| Catalytic activity, other | 12.4 | 11.7 | 0.244 |
| Transcription regulator activity | 1.3 | 1.5 | 0.743 |
| Chromatin binding | 0.2 | 0.1 | 0.803 |
| Translation factor activity, nucleic acid binding | 0.6 | 0.6 | 1 |
| Nuclease activity | 0.7 | 0.8 | 1 |
a GoSlim assignment classifications were performed as described in the Materials and Methods.
b Enrichment of GOSlim annotations in paralogous protein families compared to singletons.
c Reduction of GOSlim annotations in paralogous protein families compared to singletons.
d Benjamini and Hochberg correction for multiple testing.
Figure 4Histogram of Pearson's Correlation Coefficients of expression (r) of rice paralogous protein families with exactly two MPSS-qualifying genes.