| Literature DB >> 17064408 |
Etsuko N Moriyama1, Pooja K Strope, Stephen O Opiyo, Zhongying Chen, Alan M Jones.
Abstract
To identify divergent seven-transmembrane receptor (7TMR) candidates from the Arabidopsis thaliana genome, multiple protein classification methods were combined, including both alignment-based and alignment-free classifiers. This resolved problems in optimally training individual classifiers using limited and divergent samples, and increased stringency for candidate proteins. We identified 394 proteins as 7TMR candidates and highlighted 54 with corresponding expression patterns for further investigation.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17064408 PMCID: PMC1794564 DOI: 10.1186/gb-2006-7-10-r96
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Numbers of 7TMpR candidates identified by various methods from the A. thaliana genome
| Methods | Number of 7TMpR candidates* |
| HMMTOP | |
| 7TMs† | 236 (201) |
| 6-8 TM† | 633 (545) |
| 5-9 TMs† | 1,091 (957) |
| 5-10 TMs† | 1,343 (1,179) |
| SAM | 16 (15) |
| LDA | 3,211 (2,935) |
| QDA | 2,006 (1,820) |
| LOG | 2,626 (2,394) |
| KNN ( | 3,125 (2,839) |
| KNN ( | 3,202 (2,906) |
| KNN ( | 3,298 (3,004) |
| KNN ( | 3,347 (3,043) |
| SVM-AA | 2,263 (2,043) |
| SVM-di | 2,004 (1,807) |
| PLS-ACC | 2,671 (2,466) |
*The numbers in parentheses show 7TMpR candidates after removing proteins derived from alternative splicing. †The numbers of TM regions predicted by HMMTOP.
Figure 1Distribution of transmembrane numbers predicted by HMMTOP (black bars) and TMHMM (gray bars) from the 500 7TMR sample sequences. Proportions (%) of the proteins predicted to have six to eight and five to nine TM regions by HMMTOP are shown at the top. The percentages shown in parentheses were obtained from the entire 7,674 7TMR dataset in GPCRDB. The numbers shown on the top of black bars are the number of previously predicted 22 Arabidopsis 7TMpR proteins.
Summary of the 54 7TMpR candidates identified in this study1
| Groups* | TAIR locus IDs |
| Multiple members from gene families | |
| Nodulin MtN3 family proteins (8/17) | At1g21460, At3g16690, At3g28007, At3g48740, At4g25010, At5g13170, At5g23660, At5g50800 |
| MLO proteins (7/15) | At1g11000 (MLO4), At1g26700 (MLO14), At1g42560 (MLO9), At2g33670 (MLO5), At2g44110 (MLO15), At4g24250 (MLO13), At5g53760 (MLO11) |
| Expressed protein family 1 (2/6) | At1g77220, At4g21570 |
| GNS1/SUR4 membrane family proteins (3/4) | At1g75000, At3g06470, At4g36830 |
| Perl1-like family protein (2/2) | At1g16560, At5g62130 |
| TOM3 family proteins (3/3) | At1g14530, At2g02180, At4g21790 |
| Expressed protein family 2 (3/5) | At1g10660, At2g47115, At5g62960 |
| Expressed protein family 3 (2/4) | At3g09570, At5g42090 |
| Expressed protein family 4 (2/5) | At1g49470, At5g19870 |
| Expressed protein family 5 (2/5) | At3g63310, At4g02690 |
| Single copy genes (8) | At1g48270 (GCR1), At1g57680, At2g41610, At2g31440, At3g04970, At3g26090 (RGS1), At3g59090, At4g20310 |
| Single member from small gene families (8) | At2g01070, At3g19260, At2g35710, At2g16970, At1g15620, At1g63110, At4g36850, At5g27210 |
| Single member from big gene families (4) | At1g71960, At3g01550, At5g23990, At5g37310 |
*The number of candidates identified in this study belonging to each group is shown in parentheses (the number of all proteins in each group is given after '/'). More detailed information is given in Additional data file 2.
Figure 2Expression patterns of Arabidopsis genes encoding 7TMpR candidates and G-protein subunits among tissues. The figure was modified from an output of the Meta-Analyzer of Genevestigator (last updated in November 2005), which illustrates expression levels of each gene in different organs. Relative expression levels of a gene in different organs/tissues are given as heat maps in blue-scale coding that reflects absolute signal values, where darker colors represent stronger expression. All gene-level profiles are normalized for coloring such that, for each gene, the highest signal intensity obtains a value 100% (shown in the darkest blue and marked with an asterisk) and absence of signal obtains a value 0% (shown in white). All GeneChip data was processed using Affymetrix MAS5.0. Special precaution is required for gene expression in certain cell types (for example, pollen), since difference in normalization may achieve different results. Probe-sets of five 7TMpR candidates (At1g15620. At1g75000, At4g21570, At4g36850, and At5g23990) were not present in the 22K chip, and, therefore, their tissue-specific expression could not be assessed. For At2g35710, two probe-sets (265797_ata and 265841_atb) were designed on the chip. Gene names for those belonging to the MtN3 family are shown in boldface and marked with an asterisk. Genes encoding G-protein subunits (AGB1, GPA1, AGG1, and AGG2) as well as two reported 7TMpRs (RGS1 and GCR1) are labeled accordingly in boldface.