| Literature DB >> 26089837 |
Bryan Musungu1, Deepak Bhatnagar2, Robert L Brown2, Ahmad M Fakhoury3, Matt Geisler1.
Abstract
Interactomes are genome-wide roadmaps of protein-protein interactions. They have been produced for humans, yeast, the fruit fly, and Arabidopsis thaliana and have become invaluable tools for generating and testing hypotheses. A predicted interactome for Zea mays (PiZeaM) is presented here as an aid to the research community for this valuable crop species. PiZeaM was built using a proven method of interologs (interacting orthologs) that were identified using both one-to-one and many-to-many orthology between genomes of maize and reference species. Where both maize orthologs occurred for an experimentally determined interaction in the reference species, we predicted a likely interaction in maize. A total of 49,026 unique interactions for 6004 maize proteins were predicted. These interactions are enriched for processes that are evolutionarily conserved, but include many otherwise poorly annotated proteins in maize. The predicted maize interactions were further analyzed by comparing annotation of interacting proteins, including different layers of ontology. A map of pairwise gene co-expression was also generated and compared to predicted interactions. Two global subnetworks were constructed for highly conserved interactions. These subnetworks showed clear clustering of proteins by function. Another subnetwork was created for disease response using a bait and prey strategy to capture interacting partners for proteins that respond to other organisms. Closer examination of this subnetwork revealed the connectivity between biotic and abiotic hormone stress pathways. We believe PiZeaM will provide a useful tool for the prediction of protein function and analysis of pathways for Z. mays researchers and is presented in this paper as a reference tool for the exploration of protein interactions in maize.Entities:
Keywords: Zea mays; disease resistance; interactome; network; protein-protein interactions; tool
Year: 2015 PMID: 26089837 PMCID: PMC4454876 DOI: 10.3389/fgene.2015.00201
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Reference organism data used for maize interolog prediction.
| 33602 | 4289 | 7331 | 1.709256237 | 11620 | |
| 7126 | 42250 | 51873 | 1.227763314 | 94123 | |
| 15246 | 1508 | 2232 | 1.480106101 | 3740 | |
| 29516 | 22 | 78 | 3.545454545 | 100 | |
| 37991 | 105 | 269 | 2.561904762 | 374 | |
| 7031 | 2766 | 3121 | 1.128344179 | 5887 | |
| 57945 | 2735 | 5005 | 1.829981718 | 7740 | |
| 4371 | 32 | 32 | 1 | 64 | |
| 20447 | 1 | 1 | 1 | 2 | |
| 3673 | 60 | 923 | 15.38333333 | 983 | |
| 3264 | 161 | 501 | 3.111801242 | 662 | |
| 58058 | 220 | 248 | 1.127272727 | 468 | |
| 4618 | 2496 | 2444 | 0.979166667 | 4940 | |
| Total | 130703 |
Only orthologs used for interologs are listed.
Predicted maize interactions.
| Combined Total (One to One/Many to Many) | 6004 | 130703 |
| One to One Total | 4843 | 56645 |
| Many to Many Total | 5673 | 74058 |
| Combined Unique (One to One/Many to Many) | 6004 | 49026 |
| One to One Unique | 331 | 4737 |
| Many to Many Unique | 1161 | 14919 |
| Overlap between Unique (One to One/Many to Many) | 4512 | 29370 |
The most confident hetero-interactions in the predicted maize interactome.
| 008327_P01 | 061287_P01 | Cell division control protein 2 homolog | Cyclin superfamily protein, putative | 12960 |
| 056075_P01 | 110212_P01 | DNA mismatch repair protein MSH2 | MUS2 protein | 10752 |
| AC149818.2_FGP004 | 008327_P01 | Cyclin-dependent kinases regulatory subunit | Cell division control protein 2 homolog | 9450 |
| 045314_P01 | 064732_P01 | Guanine nucleotide-binding protein subunit beta | G protein alpha subunit 1 | 6720 |
| 008327_P01 | 017081_P01 | Cell division control protein 2 homolog | Uncharacterized protein | 6120 |
| 033626_P02 | 061745_P01 | 26S proteasome non-ATPase regulatory subunit 14 | Uncharacterized protein | 5880 |
| 093050_P01 | 174757_P01 | Eukaryotic translation initiation factor 3 subunit A | Uncharacterized protein | 5772 |
| 040152_P02 | 088162_P01 | Unknown | Putative uncharacterized protein | 5760 |
| 047774_P01 | 105409_P02 | Uncharacterized protein | Unknown | 5616 |
| 028313_P03 | 452026_P01 | Putative translation elongation/initiation factor | Eukaryotic peptide chain release factor subunit 1-1 | 5124 |
| 071518_P01 | 085970_P01 | Spc97/Spc98 family of spindle pole body (SBP) component | Tubulin gamma-2 chain | 5040 |
| 046021_P01 | 177974_P02 | Histone acetyltransferase of the GNAT family 1 | Uncharacterized protein | 4320 |
| 043484_P01 | 148924_P02 | Histone-lysine N-methyltransferase EZ3 | Polycomb group protein FIE2 | 3696 |
| 161418_P02 | 168096_P01 | TATA-box-binding protein 2 | DNA binding | 3640 |
| 021069_P01 | 066101_P02 | Minichromosome maintenance protein | Unknown | 3584 |
| 133952_P01 | 356935_P01 | MUTL-homolog 1 | Uncharacterized protein | 3432 |
| AC234199.1_FGP003 | 048067_P01 | Unknown | Uncharacterized protein | 3360 |
| 152328_P01 | 556845_P01 | Actin-1 | Unknown | 3315 |
| 054225_P01 | 402295_P01 | DNA-directed RNA polymerase | Unknown | 3312 |
| 042371_P01 | 100872_P01 | Elongator protein 2 | Uncharacterized protein | 3024 |
Unique one-to-one interactome interactions with the highest confidence values.
GRMZM2G prefix for identifiers removed for space.
The most connected maize proteins.
| GRMZM2G118637_P01 | Putative ubiquitin family protein | 797 |
| GRMZM2G012280_P01 | Uncharacterized protein | 523 |
| GRMZM2G033626_P02 | 26S proteasome non-ATPase regulatory subunit 14 | 496 |
| GRMZM2G012631_P01 | HSP protein | 432 |
| GRMZM2G327635_P01 | Ubiquitin carboxyl-terminal hydrolase | 399 |
| GRMZM2G147671_P01 | 26S proteasome non-ATPase regulatory subunit 4 | 348 |
| GRMZM2G416120_P01 | Chaperonin CPN60-2l | 334 |
| GRMZM2G155384_P02 | Uncharacterized protein | 324 |
| GRMZM2G075719_P02 | Uncharacterized protein | 308 |
| GRMZM2G046021_P01 | histone acetyltransferase of the GNAT family 1 | 281 |
| GRMZM2G152328_P01 | Actin-1 | 274 |
| GRMZM2G008410_P01 | Transcribed sequence 1087 protein | 250 |
| GRMZM2G038964_P02 | Uncharacterized protein | 248 |
| GRMZM2G153815_P01 | Heat shock 70 kDa protein | 248 |
| GRMZM2G171080_P01 | Uncharacterized protein | 232 |
| GRMZM2G008327_P01 | Cell division control protein 2 homolog | 230 |
| GRMZM2G027282_P01 | Uncharacterized protein | 230 |
| GRMZM2G303752_P01 | ATP-dependent rRNA helicase spb4 | 215 |
| GRMZM2G014676_P01 | Prefoldin 5 | 199 |
| GRMZM2G107540_P01 | Core histone H2A/H2B/H3/H4 domain | 195 |
Number of predicted interacting protein partners for the unique interactome.
Figure 1PiZeaM toplogy and co-expression. All unique interactions in PiZeaM (merged one-to-one and many-to-many orthology) were analyzed. (A) Highly conserved interactions with interologs from four or more different reference species interactomes were compared to interactions with greater than or equal to a confidence value (CV) 100. The minimum CV of an interaction with 4 reference species is 16. Larger CVs indicate additional support from different experimental methodologies or replication. (B) Frequency distribution of proteins with increasing numbers of unique interacting partners (including self-interaction) is plotted on a log-linear graph. (C) Frequency distribution of gene transcript expression (Pearson correlation of M-values from 68 microarrays) for interacting protein pairs (red) and random pairs of proteins from those found in the interactome (blue). Random interactions are statistically significantly different from nonrandom interactions (P-value of 9.53644 × 10−39 in a student t-test), although the average correlation coefficient of both distributions is similar, observed correlations were more widely spread at both high positive and negative ends.
Figure 2Conserved subnetworks. (A) A sub-network consisting of interactions identified by matching interologs in at least 4 (out of 13) reference species. The majority of proteins were found in a single connected network, with several smaller unconnected subnetworks. (B) A more stringent subnetwork of interactions with at least 5 interolog reference species. Clear clusters were concentrated in an organic layout by having more inter-group connections than between-group connections. These clusters consisted primarily of proteins with the same biological role, indicated by dotted circles. One such cluster (C) was an unconnected subnetwork consisting entirely of members of the COP9 signalosome. EIF3, elongation initation factor complex 3; U2, U2 splicosome; MCM, mini-chromosome maintenance proteins complex.
Figure 3Disease response and hormone subnetworks. (A) Interaction between all proteins in PiZeaM with the gene ontology annotation “response to other organism,” which includes many disease response proteins, and their first neighbors from ROS and stress hormone signaling and metabolic pathways. (B) A nested subnetwork taken from (A) of MAPK signaling proteins and first neighbors. (C) A nested subnetwork taken from (A) of S-adenosyl methionine synthase (ethylene biosynthesis) and first neighbors.
| N: | the number of times that an interaction appeared in PiZeaM. |
| E: | the number of times an experiment appeared with an interaction in the mined interactomes. |
| S: | the number of times an interaction appeared in the reference species. |
| N: | the number of expression samples. |
| X: | the expression level of gene X in ith sample. |
| Y: | the expression level of gene Y in the ith sample. |