Literature DB >> 27327960

Genome-Wide Characterization of Major Intrinsic Proteins in Four Grass Plants and Their Non-Aqua Transport Selectivity Profiles with Comparative Perspective.

Abul Kalam Azad1, Jahed Ahmed1, Md Asraful Alum2, Md Mahbub Hasan3, Takahiro Ishikawa4, Yoshihiro Sawa4, Maki Katsuhara5.   

Abstract

Major intrinsic proteins (MIPs), commonly known as aquaporins, transport not only water in plants but also other substrates of physiological significance and heavy metals. In most of the higher plants, MIPs are divided into five subfamilies (PIPs, TIPs, NIPs, SIPs and XIPs). Herein, we identified 68, 42, 38 and 28 full-length MIPs, respectively in the genomes of four monocot grass plants, specifically Panicum virgatum, Setaria italica, Sorghum bicolor and Brachypodium distachyon. Phylogenetic analysis showed that the grass plants had only four MIP subfamilies including PIPs, TIPs, NIPs and SIPs without XIPs. Based on structural analysis of the homology models and comparing the primary selectivity-related motifs [two NPA regions, aromatic/arginine (ar/R) selectivity filter and Froger's positions (FPs)] of all plant MIPs that have been experimentally proven to transport non-aqua substrates, we predicted the transport profiles of all MIPs in the four grass plants and also in eight other plants. Groups of MIP subfamilies based on ar/R selectivity filter and FPs were linked to the non-aqua transport profiles. We further deciphered the substrate selectivity profiles of the MIPs in the four grass plants and compared them with their counterparts in rice, maize, soybean, poplar, cotton, Arabidopsis thaliana, Physcomitrella patens and Selaginella moellendorffii. In addition to two NPA regions, ar/R filter and FPs, certain residues, especially in loops B and C, contribute to the functional distinctiveness of MIP groups. Expression analysis of transcripts in different organs indicated that non-aqua transport was related to expression of MIPs since most of the unexpressed MIPs were not predicted to facilitate the transport of non-aqua molecules. Among all MIPs in every plant, TIP (BdTIP1;1, SiTIP1;2, SbTIP2;1 and PvTIP1;2) had the overall highest mean expression. Our study generates significant information for understanding the diversity, evolution, non-aqua transport profiles and insight into comparative transport selectivity of plant MIPs, and provides tools for the development of transgenic plants.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27327960      PMCID: PMC4915720          DOI: 10.1371/journal.pone.0157735

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Aquaporins (AQPs), water channel proteins, are channel-forming integral membrane proteins that are found in all living organisms [1,2]. Plant AQPs are involved in many physiological processes such as motor cell movement, root and leaf hydraulic conductance, diurnal regulation of leaf movements, rapid internode elongation, responses to numerous abiotic stresses, temperature-dependent petal movement and petal development [1,3,4,5,6,7,8,9]. AQPs belong to the ancient major intrinsic proteins (MIPs) super family. Although 13 different AQPs have been identified in mammals [10], the genomes of plants encode 2–5 folds more AQP homologues [11,12,13,14,15,16,17,18,19]. On the basis of sequence homology and cellular localization, plant AQPs are classified into four subfamilies: (1) plasma membrane intrinsic proteins (PIPs), which are usually localized in the plasma membrane (PM); (2) tonoplast intrinsic proteins (TIPs), which are generally localized in the vacuolar membranes; (3) nodulin-26-like intrinsic proteins (NIPs); and (4) small basic intrinsic proteins (SIPs) [2]. Recently, a fifth subfamily of uncharacterised X intrinsic proteins (XIPs) [20] has been reported in the PM [21]. Plant AQPs have been reported recently to transport not only water but also a wide range of substrates such as ammonia, antimony, arsenite, boron, carbon dioxide, glycerol, hydrogen peroxide, silicon, urea etc [2,22,23,24,25]. Almost all of these molecules are important for plant growth and development, plant nutrition, photosynthesis, structures of biological membranes and cell walls, tolerance to biotic and abiotic stresses, stomatal movement and senescence [26,27,28,29]. These physiological roles as well as the chance of heavy metalloids such as arsenic and antimony to enter into the food chain through plant AQPs suggest that it is important to understand their transport selectivity profiles. Despite the discovery of more than 400 AQPs in plants, very few studies have been done to compare their transport profiles and the molecular determinants for the substrate selectivity. AQPs consist of six transmembrane (TM) α-helices (helix H1–H6) and five loops (loops A–E). The N- and C-termini are located on the cytoplasmic side of the membrane. In the pore of the channel, two regions of constriction have been proposed to specify the transport selectivity profile. The first constriction is formed at the centre of the pore by oppositely juxtaposing two Asn-Pro-Ala (NPA) motifs in loops B and E [30]. This constriction is supposed to be involved in proton exclusion [31]. Consensus sequences are suggested for the first (SGXHXNPAVT) [32] and second (GXXXNPAR(S/D)XG) [33] NPA motifs. The second constriction known as the aromatic/arginine (ar/R) selectivity filter is formed at the extracellular mouth of the pore by four residues from H2, H5, and loop E (LE1 and LE2), respectively [34,35]. Variability at the ar/R selectivity filter is thought to form the basis of the broad spectrum of substrate conductance in plant AQPs [11,14,36,37]. Up to five relatively conserved amino acid residues known as the Froger’s positions (FPs) and those designated P1-P5 play roles in substrate selectivity [32,38]. Recently, some specificity-determining positions have been suggested by analyzing the protein sequences of MIPs transporting non-aqua substrates in wet-lab experiments [23]. Identification and characterization of the MIP gene family is the first step in investigating the role of MIPs in plant water relationships or transporting physiologically important small molecules. Grasses, plants of the Poaceae family, the largest plant family in the world, afford the bulk of human nutrition, and highly productive grasses are potential sources of sustainable biofuels [39,40]. Phytozome (www.phytozome.net), which facilitates comparative genomic studies among green plants, provides access to six grass plants. The MIPs in rice and maize, among these six grass plants, have been reported [12,14,17]. There has been no study for MIPs in the remaining four grass plants namely switchgrass (Panicum virgatum), foxtail millet (Setaria italica), sorghum (Sorghum bicolor) and Brachypodium distachyon. P. virgatum, which exists at multiple ploidies, is a drought tolerant plant and has been intensively studied as a source of lignocellulosic biomass to produce renewable energy [41,42]. S. italica is closely related to P. virgatum. It is a small diploid C4 panicoid crop species and a more tractable experimental model because of its small genome [43]. S. bicolor, related to sugar cane and maize, is grown for food, feed, fibre and biofuels [44]. B. distachyon, related to rice, maize, wheat, barley, sorghum and millet, has several advantages as an experimental model organism for understanding genetic, cellular and molecular biology of temperate grasses [40]. In the study reported herein, we identified MIP genes in the genomes of P. virgatum, S. italica, S. bicolor and B. distachyon. We investigated the phylogeny, structural properties, in silico subcellular localization and expression profiles of MIPs in these plants. Based on structural analysis of the homology models and comparing the primary selectivity-related motifs, we further deciphered the non-aqua transport profiles (ammonia, antimony, arsenic, boron, CO2, H2O2, silicon and urea) and molecular determinants for substrate selectivity of the MIPs in the four grass plants and compared them with their counterparts in two grass plants such as rice (OsMIP) and maize (ZmMIP) and six non-grass plants such as soybean (GmMIP), poplar (PtMIP), cotton (GhMIP), Arabidopsis thaliana (AtMIP), Selaginella moellendorffii (SmMIP) and Physcomitrella patens (PpMIP).

Materials and Methods

Identification of PvMIP, SiMIP, SbMIP and BdMIP genes

The genomes of P. virgatum (JGI v1.1), S. italica (JGI v2.1), S. bicolor (v2.1) and B. distachyon (v1.2), available at Phytozome, were searched for MIPs using TBLASTN and BLASTp tools with the protein sequences of the complete set of 55 MIPs from P. trichocarpa and 22 MIPs from P. patens as queries. PvMIPs, SiMIPs, SbMIPs and BdMIPs were included until no more MIPs could be found from P. virgatum, S. italica, S. bicolor and B. distachyon, respectively. Every sequence from each plant was individually compared with functional annotations by browsing the Phytozome databases of P. virgatum, S. italica, S. bicolor and B. distachyon to indentify the maximum number of MIPs for further analyses. The genomic regions containing MIP genes were further used to determine the gene structure using the program GeneMark.hmm ES-3.0 [45] (http://exon.gatech.edu/GeneMark), a self-training based algorithm for prediction of genes from novel eukaryotic genomes, and Arabidopsis was chosen as a model organism in GeneMark for gene prediction in P. virgatum, S. italica, S. bicolor and B. distachyon. When short genes were found, their sequences with 1000 base flanking regions were subjected to Genetyx_SV_RC_version 7 to investigate their protein sequences.

Phylogenetic and domain analysis of PvMIPs, SiMIPs, SbMIPs and BdMIPs

PvMIPs, SiMIPs, SbMIPs or BdMIPs were separately aligned with PtMIPs using the Clustal Omega program (http://www.ebi.ac.uk/Tools/msa/clustalo/) and a phylogenetic tree was constructed using Molecular Evolution Genetic Analysis (MEGA), version 5.0 [46]. The evolutionary history was inferred using the Neighbor-Joining method and the genetic distance was estimated by the p-distance method. To identify the total number of subfamilies present in PvMIPs, SiMIPs, SbMIPs and BdMIPs, phylogenetic analysis was also conducted with PpMIPs that have seven subfamilies [20], whereas PtMIPs have five subfamilies. The identified PvMIPs, SiMIPs, SbMIPs and BdMIPs were classified into different subfamilies and groups by their phylogenetic relationship with PtMIPs. To investigate the different subfamilies and groups, we further analyzed phylogeny separately with AtMIPs, ZmMIPs, OsMIPs and GmMIPs. PvMIPs, SiMIPs SbMIPs and BdMIPs were named according to the best similarities from the trees generated by phylogeny analysis. To construct the phylogenetic tree with the MIPs in the four grass plants, all of their MIPs were aligned as above. The TM α-helices were predicted by SOSUI (http://bp.nuap.nagoya-u.ac.jp/sosui/), TMpred (http://www.ch.embnet.org/software/TMPRED_form.html) and the tools of ExPASy (http://kr.expasy.org/tools/).

Homology modeling

Homology models were constructed using the Molecular Operating Environment software (MOE 2009.10; Chemical Computing Group, Quebec, Canada). The sequence of each MIP homologue was aligned with the open conformation of spinach PIP, SoPIP2;1 (PDB, Protein Data Bank ID: 2B5F) [47] using the MOE software as described previously [36]. The alignment of the MIP homologue was based on both sequence and structural homology with the structure of SoPIP2;1. The 3D structure models were formed using the MOE homology program and the stereochemical quality of the templates and the models was assessed, as we described previously [36].

Prediction of subcellular localization and computation of Ka/Ks value

The subcellular localizations of PvMIPs, SiMIPs, SbMIPs and BdMIPs were predicted in silico by using tools of WoLF PSORT (http://www.genscript.com/wolf-psort.html), TargetP (www.cbs.dtu.dk/Services/TargetP), Cello prediction system (http://cello.life.nctu.edu.tw/) and MultiLoc2 (www.abi.inf.uni-tuebingen.de/Services/MultiLoc2). Ka and Ks are the numbers of non-synonymous and synonymous substitutions per site, respectively on a protein-coding gene. The Ka/Ks values of the PvMIPs, SiMIPs, SbMIPs and BdMIPs were calculated using an online Ka/Ks calculation tool at http://services.cbu.uib.no/tools/kaks. A Ka/Ks value greater than one implies gene evolution under positive or Darwinian selection; less than one indicates purifying (stabilizing) selection and a Ka/Ks value of one suggests a lack of selection or possibly a combination of positive and purifying selections at different points within the gene that cancel each other out [18].

Expression analysis

For expression analysis, a compendium of RNA-seq data for the plants in the Phytozome was used. In the Phytozome, P. virgutam, S. bicolor and B. distachyon were selected separately and the phytozome accession number of a specific MIP was entered to search the gene. Transcript level as FPKM (Fragments per Kilobase of Transcript per Million Mapped Reads) values of a MIP gene was achieved from the gene view link. The FPKM values for each MIP gene of S. italica was retrieved from the InterMine interface of Phytozome (https://phytozome.jgi.doe.gov/phytomine/template.do?name=One_Gene_Expression&scope=global) using phytozome accession number or identifier. The FPKM values of individual MIP gene in leaf, root and shoot under diverse conditions were retrieved and put into the Microsoft Excel. The heatmap was generated using conditional formatting based on the FPKM values. The FPKM values <1 were treated as no expression of the respective gene.

Determination of pore diameter and pore lining residues

To analyze the MIP channels, the poreWalker server [48] (http://www.ebi.ac.uk/thornton-srv/software/PoreWalker/) was used. This is a fully automated method designed to detect and characterize transmembrane protein channels from their 3D structures. The 3D structure of a MIP in PDB format was uploaded to the server, which generated the specific pore characteristics, particularly the conformation and the regularity of the channel cavity, the corresponding pore lining residues and atoms, and the location of pore centers along the channel. From the PoreWalker outputs, the pore diameter profiles at different regions of a MIP channel were compiled. From the given pore diameter profile of a channel, continuous numerical data were constructed from the non-continuous numerical data through a customized statistical language R-script so that the precise pore diameter at a specific region particularly at the ar/R selectivity filter could be determined. The existing values of pore diameters generated by the PoreWalker were used as an input in the R-script to calculate the missing values of pore diameters to make a continuous pore diameter profile. Through the PoreWalker server, the pore lining residues, which are very important for the formation of a channel, were identified.

Results

Genome-wide identification of PvMIP, SiMIP, SbMIP and BdMIP genes

The whole genome shotgun sequence (WGS) of P. virgatum, S. italica, S. bicolor and B. distachyon available at Phytozome was searched for PvMIP, SiMIP, SbMIP and BdMIP genes using TBLASTN. The query PtMIP and PpMIP sequences from P. trichocarpa and P. patens resulted in 116, 51, 44 and 37 hits for PvMIPs, SiMIPs, SbMIPs and BdMIPs, respectively. We further analyzed the PvMIP, SiMIP, SbMIP and BdMIP sequences for domain identification. Out of 116 unique hits for PvMIPs, 48 were deemed to be pseudo MIP genes after manual inspection of their amino acid sequences, TM domains and homology models, and were discarded (S1 Table). Out of the 51 unique hits for SiMIPs, 9 were deemed to be pseudo MIP genes and were discarded (S1 Table). On the other hand, 6 and 9 unique hits for SbMIPs and BdMIPs, respectively were deemed to be pseudo MIP genes and discarded (S1 Table). We ultimately obtained 68, 42, 38 and 28 full-length PvMIP, SiMIP, SbMIP and BdMIP protein sequences from the WGS of P. virgatum, S. italica, S. bicolor and B. distachyon, respectively (Tables 1–4).
Table 1

MIP genes in P. Virgatum.

Gene NamePhytozome accessionsGenomic LocationPPL(aa)Maximum Identity with other MIP (%)xPSCLyKa/Ks value
PvPIP1;1Pavir.Gb01084.2Chr07b: 13931659–13933701288XP_002454508(98)aPLAS, CHLO0.095
PvPIP1;2Pavir.J11645.1contig141014: 535–2653288AAO86706(97)bPLAS, CHLO0.380
PvPIP1;3Pavir.Gb01084.3Chr07b: 13931659–13933701277AAO86706(97)bPLAS0.669
PvPIP1;4Pavir.Aa00868.1Chr01a: 10299092–10302435289XP_004953388(99)cPLAS, CHLO0
PvPIP1;5Pavir.J37677.1contig69730: 133–3636289XP_004953388(99)cPLAS, CHLO0.116
PvPIP1;6Pavir.Aa00075.1Chr01a: 810607–812141288NP_001105131(98)bPLAS, CHLO0.052
PvPIP1;7Pavir.Ab03380.1Chr01b: 55703303–55704564288NP_001105131(99)bPLAS, CHLO0.023
PvPIP2;1Pavir.Ab02356.1Chr01b: 44317427–44320839288NP_001105026(98)bPLAS, CHLO0.307
PvPIP2;2Pavir.Ab02356.2Chr01b: 44317427–44320300264ACG33001(98)bPLAS0.569
PvPIP2;3Pavir.Bb01320.1 Chr02b: 27409376–27413184363NP_001105024(98)bPLAS0.103
PvPIP2;4Pavir.Ga01149.1Chr07a: 14124713–14126981266XP_004976254(96)cPLAS0.463
PvPIP2;5Pavir.Gb00671.1Chr07b: 7857237–7859623277XP_004976254(99)cPLAS0.505
PvPIP2;6Pavir.Ba02483.2Chr02a: 37691323–37694961290XP_002461930(99)aPLAS, CHLO0.186
PvPIP2;7Pavir.Bb01320.2Chr02b: 27409376–27413188290XP_002461930(99)cPLAS, CHLO0.688
PvPIP2;8Pavir.Bb01841.1Chr02b: 46595867–46597386286XP_004956116(97)cPLAS, CHLO0.331
PvPIP2;9Pavir.Ba02478.1Chr02a: 37576388–37578287286XP_004956116(98)cPLAS, CHLO0.201
PvPIP2;10Pavir.Ib04237.1Chr09b: 67322496–67323750276XP_002489214(90)aPLAS0.111
PvPIP2;11Pavir.Ia02751.1Chr09a: 54199846–54200694282XP_002489214(89)aPLAS0.238
PvPIP2;12Pavir.Ib03181.1Chr09b: 51605207–51606895294XP_004986496(84)cPLAS0.476
PvPIP2;13Pavir.Ba01199.1Chr02a: 15220158–15221702284XP_004957505(85)cPLAS1.323
PvPIP2;14Pavir.J11644.1contig140997: 365–1545287XP_004957505(83)cPLAS0.327
PvTIP1;1Pavir.Ia04869.1Chr09a: 86386535–86388928250P50156(96)dVACU0
PvTIP1;2Pavir.Ib00275.1Chr09b: 2982020–2984273250P50156(96)dPLAS0.267
PvTIP1;3Pavir.Ea04152.1Chr05a: 63625246–63626300252XP_004971442(92)cVACU0
PvTIP1;4Pavir.Ea04152.2Chr05a: 63625246–63626533252XP_004971442(91)cPLAS0
PvTIP2;1Pavir.Gb01125.1Chr07b: 14244344–14245618248XP_004976439(98)cPLAS0.035
PvTIP2;2Pavir.Ga01087.1Chr07a: 12722976–12724266248XP_004976439(98)cPLAS0.076
PvTIP2;3Pavir.J30578.1contig357494: 1–1165249XP_004953349(98)cPLAS0.203
PvTIP2;4Pavir.Da01714.1Chr04a: 37554270–37555738248XP_002438430(97)aPLAS0.212
PvTIP2;5Pavir.Db01217.1Chr04b: 23834192–23835703248XP_002438430(97)aPLAS0.063
PvTIP3;1Pavir.Ia01749.1Chr09a: 21354884–21356273263NP_001105032(95)bMITO0.271
PvTIP3;2Pavir.Ib03520.1Chr09b: 57398634–57399928264NP_001105032(95)bMITO0.033
PvTIP3;3Pavir.Ga00845.1Chr07a: 10052517–10054500273XP_002446824(88)aCHLO0.456
PvTIP4;1Pavir.Ea00003.1Chr05a: 159662–160973250XP_004967395(94)cVACU0.682
PvTIP4;2Pavir.J30482.1contig355910: 206–1208256XP_004967395(91)cCYTO0.102
PvTIP4;3Pavir.J20433.1contig222165: 1187–2025239XP_004967395(88)cVACU0.819
PvTIP4;4Pavir.Eb00023.1Chr05b: 514689–516054259XP_004967394(92)cCYTO0.521
PvTIP4;5Pavir.Cb01832.1Chr03b: 43764796–43767586347XP_004960662(93)cCHLO0.154
PvTIP4;6Pavir.Ca00461.1Chr03a: 5397927–5400491318XP_004960662(91)cCYTO0.172
PvTIP5;1Pavir.Gb01126.1 Chr07b: 14245888–14247100270XP_004978166(82)cCHLO0.187
PvTIP5;2Pavir.Ga01088.1 Chr07a: 12724501–12725804266XP_004978166(78)cCHLO0.522
PvNIP1;1Pavir.Cb01700.1Chr03b: 42769884–42772044280XP_004960601(95)cPLAS0.289
PvNIP1;2Pavir.J36379.1contig59709: 2228–4657277XP_004960601(93)cPLAS0.288
PvNIP1;3Pavir.Eb00236.3Chr05b: 3774233–3776780290XP_002454982 (89)aPLAS0.137
PvNIP1;4Pavir.Ea00222.1Chr05a: 2686340–2692046287XP_002454982(89)aPLAS0.113
PvNIP1;5Pavir.Ab01231.1Chr01b: 18627382–18630325280XP_004951368(93)cPLAS0.436
PvNIP1;6Pavir.Db00851.1Chr04b: 11796860–11798059322XP_004967095(73)cPLAS0.328
PvNIP1;7Pavir.Da00802.1Chr04a: 12785576–12787025287XP_004967095(83)cPLAS0.327
PvNIP2;1Pavir.Ab02995.1Chr01b: 52353467–52357364296XP_004953867(97)cE.R0.116
PvNIP2;2Pavir.Aa00406.1Chr01a: 4613561–4619572313XP_004953867(76)cCHLO0.082
PvNIP2;3Pavir.Db01588.1Chr04b: 35916639–35920941295XP_004965042(97)cPLAS0.423
PvNIP2;4Pavir.Da01156.1Chr04a: 22506078–22510755296XP_004965042(97)cPLAS0.049
PvNIP3;1Pavir.J11993.1contig143579: 44–1260 286XP_004974441(80)cVACU0.460
PvNIP3;2Pavir.J04994.1contig07346: 8718–11701292XP_004974441(81)cPLAS0.255
PvNIP3;3Pavir.Fb00252.1Chr06b: 4491634–4492686288XP_004974441(84)cCHLO0.672
PvNIP3;4Pavir.Fa01950.2Chr06a: 45025793–45028094330XP_004974441(82)cCYTO0.153
PvNIP3;5Pavir.Fa01948.1Chr06a: 45006164–45007342298XP_004974438(87)cCYTO0.598
PvNIP3;6Pavir.Fa01949.1Chr06a: 45022689–45023983278XP_004974438(93)cPLAS0.560
PvNIP3;7Pavir.J17719.1contig194795: 1290–2237291XP_004974439(81)cPLAS0.315
PvNIP3;8Pavir.J35034.1contig50657: 4142–5117295XP_004974439(80)cCYTO0.398
PvNIP3;9Pavir.Ib03684.1Chr09b: 59774269–59780622301XP_004982621(98)cPLAS0.594
PvNIP3;10Pavir.Ia01421.1Chr09a: 15383290–15385609281XP_002464380(87)aCHLO0.561
PvNIP4;1Pavir.Ea00764.2Chr05a: 10523338–10525337 310XP_004971599(86)cPLAS1.429
PvNIP4;2Pavir.Ea00764.3Chr05a: 10523338–10525337308XP_004971599(85)cPLAS1.309
PvSIP1;1Pavir.J16825.1 contig18611: 427–3959 243XP_004962139(95)cPLAS0.107
PvSIP1;2Pavir.J10110.1 contig12910: 934–4304241XP_004962139(95)cPLAS0.060
PvSIP2;1Pavir.Ia03463.1Chr09a: 68891317–68893364242XP_004984561(97)cNUCL0.135
PvSIP2;2Pavir.J37350.1contig67361: 886–3091 242XP_004984561(97)cPLAS0.164

Where, Ka and Ks are numbers of non-synonymous and synonymous substitutions per site, respectively. PPL: polypeptide length, aa: amino acid, PSCL: predicted subcellular localization, PLAS: plasma membrane. VACU: vacuolar membrane, CYTO: cytosol, ER: endoplasmic reticulum, MITO: mitochondrion, NUCL: Nucleous and CHLO: chloroplast.

xA gene that shows the highest identity with MIP in other plants by BLASTp. Parenthesis indicates the percentage of identity at the amino acid level.

aSorghum bicolor

bZea mays

cSetaria italica and

dOryza sativa Japonica Group

yThe same abbreviations have been used in Tables 1–4.

Table 4

MIP genes in B. distachyon.

Gene NameAccession No.CLGenomic LocationPPL(aa)Maximum identity with other MIP (%)xPSCLyKa/Ks value
PhytozomeNCBI
BdPIP1;1Bradi5g18170.1XP_003580312521376355..21380359288AFV92901(97)gPLAS, CHLO0.203
BdPIP1;2Bradi3g56020.1XP_003570439355807156..55808872289ABJ98535(96)hPLAS, CHLO0.086
BdPIP2;1Bradi3g49360.1XP_003575410350482001..50485770288BAE02729(94)ePLAS, CHLO0.101
BdPIP2;2Bradi5g15970.1XP_003580150519545026..19547614287BAF33069(93)ePLAS, CHLO0.099
BdPIP2;3Bradi1g28760.1XP_003563177124115585..24118617290BAG06231(95)ePLAS, CHLO0.207
BdPIP2;4Bradi1g28780.1XP_003563179124143877..24145345289NP_001105027(92)aPLAS, CHLO0.351
BdPIP2;5Bradi4g36601.1XP_003578538441709704..41711325290ADW85675(89)ePLAS0.376
BdPIP2;6Bradi4g36610.1XP_003576780441713192..41714682297ADW85675(79)ePLAS1.118
BdPIP2;7Bradi1g00552.1-1440975..442208290EMT26209(73)fPLAS0.369
BdPIP2;8Bradi3g18460.1XP_003571557316901458..16902737295BAJ92749(76)ePLAS0.340
BdTIP1;1Bradi1g75290.1XP_003558815172464538..72466271250CAA56553(92)eVACU0.264
BdTIP1;2Bradi2g62520.1XP_003565186258924353..58925772252EMT32480(94)fCYTO0.473
BdTIP2;1Bradi3g50690.1XP_003570028351583379..51584778249BAI66435(96)ePLAS0.116
BdTIP2;2Bradi5g17690.1XP_003580281521052804..21053722248AAF90121(94)eCHLO0.157
BdTIP3;1Bradi3g29780.1XP_003574110331567966..31569631265BAI66441(93)eMITO0.375
BdTIP3;2Bradi5g16370.1XP_003580181519889474..19890848262BAK04817(82)eCHLO0.534
BdTIP4;1Bradi2g07830.1XP_00356552926185272..6187133252EMT15368(91)fCYTO0.455
BdTIP4;2Bradi2g31800.2XP_003568717231480793..31482685252BAI66438(91)eVACU0.448
BdTIP4;3Bradi2g07810.1XP_00356601026166394..6167158254ACG39579(78)aCHLO0.512
BdTIP5;1Bradi5g17680.1XP_003581502521051238..21052610263AAF90122(89)eCHLO1.016
BdNIP1;1Bradi3g08930.1XP_00357185737053864..7055984280BAI66443(93)ePLAS0.229
BdNIP1;2Bradi2g32890.1XP_003568755232572036..32574264282BAI66444(86)ePLAS0.315
BdNIP1;3Bradi1g38160.1XP_003560673134458353..34459897282EMT31551(78)fPLAS0.521
BdNIP2;1Bradi3g59390.1-358343770..58347486296BAH24163(88)eE.R0.421
BdNIP2;2Bradi1g45200.1XP_003564051143568241..43572469302BAH84977(97)eE.R0.327
BdNIP3;1Bradi3g30540.1XP_003574178332426723..32431422301EAY79189(89)iPLAS0.760
BdNIP4;1Bradi2g01095.1XP_0035652462672850..675385285BAK04446(69)ePLAS0.803
BdSIP1;1Bradi4g26870.1XP_003577906431780346..31782559246BAJ86223(88)eCHLO1.629

xA gene that shows the highest identity with MIP in other plants by BLASTp. Parenthesis indicates the percentage of identity at the amino acid level.

aSorghum bicolor

eHordeum vulgare

fAegilops tauschii

Lolium perenne

Stipa baicalensis and

Triticum urartu.

yThe same abbreviations have been used in Tables 1–4.

Where, Ka and Ks are numbers of non-synonymous and synonymous substitutions per site, respectively. PPL: polypeptide length, aa: amino acid, PSCL: predicted subcellular localization, PLAS: plasma membrane. VACU: vacuolar membrane, CYTO: cytosol, ER: endoplasmic reticulum, MITO: mitochondrion, NUCL: Nucleous and CHLO: chloroplast. xA gene that shows the highest identity with MIP in other plants by BLASTp. Parenthesis indicates the percentage of identity at the amino acid level. aSorghum bicolor bZea mays cSetaria italica and dOryza sativa Japonica Group yThe same abbreviations have been used in Tables 1–4. xA gene that shows the highest identity with MIP in other plants by BLASTp. Parenthesis indicates the percentage of identity at the amino acid level. aSorghum bicolor bZea mays dOryza sativa Japonica Group eHordeum vulgare and fAegilops tauschii yThe same abbreviations have been used in Tables 1–4. Where, CL: chromosome location, U: Unknown chromosomal location xA gene that shows the highest identity with MIP in other plants by BLASTp. Parenthesis indicates the percentage of identity at the amino acid level. bZea mays cSetaria italica and dOryza sativa Japonica Group yThe same abbreviations have been used in Tables 1–4. xA gene that shows the highest identity with MIP in other plants by BLASTp. Parenthesis indicates the percentage of identity at the amino acid level. aSorghum bicolor eHordeum vulgare fAegilops tauschii Lolium perenne Stipa baicalensis and Triticum urartu. yThe same abbreviations have been used in Tables 1–4. The Ka/Ks value was >1 for PvPIP2;13, PvNIP3;10, PvNIP4;1, SiPIP1;3, SiTIP2;5, SiTIP5;2, SbNIP3;4, SbSIP2;1, BdPIP2;6, BdTIP5;1 and BdSIP1;1 (Tables 1–4), indicating their positive or Darwinian selection. The remaining MIPs showed Ka/Ks values <1, demonstrating their purifying selection.

Nomenclature and predicted subcellular localization of PvMIPs, SiMIPs, SbMIPs and BdMIPs

The phylogenetic analysis showed that PvMIPs, SiMIPs, SbMIPs and BdMIPs were divided into four subfamilies. PIPs, TIPs, NIPs and SIPs of PvMIPs, SiMIPs SbMIPs and BdMIPs clustered with those subfamilies in the respective plant (Fig 1). However, no XIP was found. Sequences belonging to hybrid intrinsic proteins (HIPs) and a novel plant MIP (GIP, GlpF-like intrinsic protein) homologous to bacterial glycerol channel reported in the nonvascular moss P. patens [20] were not found. Fig 1 shows that most of the PIPs clustered either to PIP1s or PIP2s. However, some of the PIPs formed distinct clades from PIP1s and PIP2s. In contrast to PIP1s or PIP2s, they had no N- and C-terminal characteristic lengths [12], and in comparison with reference PIPs, they had the characteristic FPs (discussed later). The phylogenetic analysis with all PIPs from the 12 plants showed that these PIPs clustered with OsPIP2;7 and OsPIP2;8 (S1 Fig). Moreover, their percentage of identity at amino acid level with OsPIP2;7 and OsPIP2;8 (~65% to 80%) was higher than that with PpPIP3;1 (~54%). We therefore named these PIPs as PIP2s. The PvTIPs, SiTIPs and BdTIPs had five subgroups (TIP1 to TIP5) similar to TIPs in Arabidopsis, maize, poplar, rice and soybean. However, SbTIPs had four subgroups (SbTIP1 to SbTIP4). Four subgroups of NIPs were found in P. virgatum, S. bicolor and B. distachyon. Nevertheless, NIPs in S. italica had three subgroups. Although Arabidopsis and soybean have seven NIP subgroups [13,18], poplar, rice and maize have three to four NIP subgroups [11,12,17]. Similar to Arabidopsis, rice, maize, poplar and soybean, P. virgatum, S. italica and S. bicolor had two SIPs subgroups. However, B. distachyon had only one SIP of SIP1 subgroup.
Fig 1

Evolutionary relationship of MIPs in the four grass plants.

Phylogenetic analysis of all MIPs from the four grass plants is shown along with MIPs from poplar. The deduced amino acid sequences of MIPs were aligned using the Clustal Omega computer program and a phylogenetic tree was constructed using MEGA. The evolutionary history was inferred using the Bootstrap Neighbor-Joining (1000 replicates) method and the genetic distance was estimated by the p-distance method. PIPs, TIPs, NIPs and SIPs from the four plants clustered with the corresponding PtMIP subfamilies. Each MIP subfamily is shown with a specific background color to distinguish them from others.

Evolutionary relationship of MIPs in the four grass plants.

Phylogenetic analysis of all MIPs from the four grass plants is shown along with MIPs from poplar. The deduced amino acid sequences of MIPs were aligned using the Clustal Omega computer program and a phylogenetic tree was constructed using MEGA. The evolutionary history was inferred using the Bootstrap Neighbor-Joining (1000 replicates) method and the genetic distance was estimated by the p-distance method. PIPs, TIPs, NIPs and SIPs from the four plants clustered with the corresponding PtMIP subfamilies. Each MIP subfamily is shown with a specific background color to distinguish them from others. PvPIPs, SiPIPs, SbPIPs and BdPIPs were predicted to be localized in the PM or both in the PM and chloroplast (Tables 1–4). However, the predicted subcellular loclization of TIPs was diversed including vacuole, PM, mitochondria, chloroplast and cytosol. Most of the NIPs were predicted to be localizd in the PM. However, some of the NIPs were predicted to be localized in any of the endoplasmic reticulum, choroplast, vacuole or cytosol. The predicted subcellular localization of SIPs was either in the PM or in the chloroplast. However, 1 PvSIP and 1 SiSIP were predicted to be localized in the nucleus. The amino acid lengths of PvMIP, SiMIP, SbMIP and BdMIP homologues with their maximum sequence identity with MIP in other plants are tabulated in Tables 1–4.

Gene structure of MIPs in the four grass plants

All of the full-length MIP sequences found in P. virgatum, S. italica, S. bicolor and B. distachyon were analyzed for introns and exons. The introns in the MIPs of these plants were compared to OsMIPs and ZmMIPs of the two other grass plants as well as AtMIPs and PtMIPs of two non-grass plants (Fig 2). The number of introns varied from zero to five. However, apart from some disparities, the number and positions of introns were conserved within the subfamilies of MIPs in the grass plants. Nevertheless, major differences were observed when subfamilies from monocots were compared to those from dicots [11].
Fig 2

Gene structure of MIPs from grass plants, P. trichocarpa and A. thaliana.

Exon-intron organizations of The exon-intron pattern observed in the majority of MIPs within a subfamily is shown in gray background. In the parenthesis, the number of MIPs having that pattern is indicated for each plant species. For example, Pv (6/21) indicates that 6 out of 21 PvPIPs have the same gene structure. The members of homologue(s) are mentioned after the parenthesis. The six TM regions are shown in black bars and the loops B and E are shown in diamond shapes. The intron positions are indicated by inverted triangles.

Gene structure of MIPs from grass plants, P. trichocarpa and A. thaliana.

Exon-intron organizations of The exon-intron pattern observed in the majority of MIPs within a subfamily is shown in gray background. In the parenthesis, the number of MIPs having that pattern is indicated for each plant species. For example, Pv (6/21) indicates that 6 out of 21 PvPIPs have the same gene structure. The members of homologue(s) are mentioned after the parenthesis. The six TM regions are shown in black bars and the loops B and E are shown in diamond shapes. The intron positions are indicated by inverted triangles. A comparison of members of the PIP subfamily revealed that among the grass plants only PvPIP2;14 had four introns. Although the majority of AtPIPs and PtPIPs had three introns, only ~30% of PIPs in the six grass plants had three introns (Fig 2). The majority of PIPs in the grass plants had two introns because they lost one intron between helices H2 and H3; only PvPIP1;3 and SiPIP1;3 lost one intron between helices H5 and H6. Two PIP genes from each of P. virgatum, S. italica, S. bicolor, O. sativa and Z. mays had a single intron in the distal end of Loop E. The P. virgatum further had four PIP genes that carried a single intron between helices H4 and H5. Nonetheless, this intron position is conserved in all PIPs having more than one intron. Conversely, B. distachyon had no single intron bearing PIP gene. At least one PIP gene from each of S. italica, S. bicolor, O. sativa and Z. mays and two PIP genes from each of P. virgatum and B. distachyon had no intron. Members of the TIP subfamily showed the most stable gene structure in comparison with members of other subfamilies. The majority of TIPs in the grass plants including Arabidopsis and poplar had either two or one introns. Despite PvTIP1;4, TIPs with two introns had intron position at the end of helices H1 and H3. The position of the intron in TIPs having a single intron was at the end of helix H1. Two TIPs, each from P. virgatum and S. italica, had three introns. Similar to AtTIP1;3, only BdTIP4;3 had a gene structure without any intron. The gene structures of members of NIP subfamily in grass plants diverged from their counterparts in Arabidopsis and poplar (Fig 2). The majority of NIPs had four or three introns with highly variable introns organization. Similar to OsNIP1;5, the SiNIP2;3 had five introns, which was the highest intron number among the MIPs. However, the intron positions in SiNIP2;3 and OsNIP1;5 were different. The SiNIP3;4 possesed a unique gene structure without any intron. Similar to AtSIPs, all SIPs in the grass plants had two introns having highly conserved positions in helix H3 and loop E.

Grouping of MIPs based on the ar/R selectivity filter and Froger's position

To group the MIPs based on the ar/R selectivity filter and FPs, we constructed 3D models of all MIPs in P. virgatum, S. italica, S. bicolor and B. distachyon. The structure-based alignments and multiple sequence alignments of MIPs helped us to identify the four amino acid residues at the ar/R selectivity filter and the five residues in the FPs. The residues at the ar/R selectivity filter and in the FPs were considered to group MIPs and to compare these groups with those of the eight plants (Fig 3, S2 and S3 Figs). These groups were correlated with their expression and non-aqua transport profiles (discussed later).
Fig 3

Grouping of MIPs based on the ar/R selectivity filter and FPs in the four grass plants and their expression profiles in different organs.

The phylogenetic tree was generated as described in Fig 1. The residues in the ar/R selectivity filter and the FPs were selected from the 3D models as well as from the alignment shown in S2 and S3 Figs. The ar/R and FP groupings of PIPs (A), TIPs (B), NIPs (C), and SIPs (D), are indicated in the right side. # and * indicate the members of Group IB PIP and Group II TIP based on FPs, respectively. The non-aqua substrates predicted to be transported are mentioned. A, B, C, H, N, Sb, Si and U stand for arsenic, boron, CO2, H2O2, ammonia, antimony, silicon and urea, respectively. Expression heatmap in different organs are shown in the right side. Expression levels are given as the FPKM values.

Grouping of MIPs based on the ar/R selectivity filter and FPs in the four grass plants and their expression profiles in different organs.

The phylogenetic tree was generated as described in Fig 1. The residues in the ar/R selectivity filter and the FPs were selected from the 3D models as well as from the alignment shown in S2 and S3 Figs. The ar/R and FP groupings of PIPs (A), TIPs (B), NIPs (C), and SIPs (D), are indicated in the right side. # and * indicate the members of Group IB PIP and Group II TIP based on FPs, respectively. The non-aqua substrates predicted to be transported are mentioned. A, B, C, H, N, Sb, Si and U stand for arsenic, boron, CO2, H2O2, ammonia, antimony, silicon and urea, respectively. Expression heatmap in different organs are shown in the right side. Expression levels are given as the FPKM values. The ar/R selectivity filters in all PIPs of the four grass plants contained residues F, H, T and R in H2, H5, LE1 and LE2, respectively (Fig 3A) identical to those found in Arabidopsis, maize, rice and G. max, and hence there was no group in PIPs based on this filter. Based on ar/R selectivity filters, all TIPs in P. virgatum, S. italica, S. bicolor and B. distachyon were grouped into two, Groups I and II, with different subgroups in Group II (Fig 3B). All members in TIP1 and TIP2 were in Group I and Group IIA, respectively, with the ar/R selectivity filter composed of HIAV and HIGR, correspondingly except PvTIP1;3 and PvTIP1;4 in which H in helix H2 was substituted by Y. All TIP3s and six members of TIP4 were in Group IIB with the tetrad composed of H, V/I/M, A and R. All members of TIP5 and most members of TIP4 in P. virgatum, S. italica, S. bicolor were sub-grouped to Group IIC having the residues Q/H/N, S/V/T, A and R. TIP Groups I, IIA and IIB in this study corresponded to those in Arabidopsis and G. max. However, the ar/R selectivity filter of TIP Group III, which was reported in Arabidopsis, G. max and poplar (S5 Fig; [11,18,30]), was not found in grass plants or cotton. Based on the ar/R selectivity filters, all NIPs were grouped into four (Fig 3C). All members of NIP1, NIP2, NIP3 and NIP4 were grouped to Groups I, III, II and IV, respectively. The tetrad of the ar/R selectivity filters in Group I (W, V/A, A and R) and Group II (A, A/I, A/P/G and R) were similar to those of Groups I and II, respectively in Arabidopsis, rice, maize, soybean, poplar and cotton. The tetrad of the ar/R filter in Group III (G, S, G and R) was conserved in the six glass plants as well as in soybean and poplar but was absent in Arabidopsis, cotton, P. patens and S. moellendorffii (Fig 3 and S6 Fig). The ar/R selectivity filter in NIPs of Group IV (C/V, G, G and R) was found only in grass plants but completely absent in other six plants. The SIPs were grouped into Group I and II based on the tetrad of the ar/R selectivity filter (Fig 3D). All SIP1s in the grass plants were clustered together with the ar/R filter composed of L/V, V/I, P and N which was fully conserved in some of the SIP1 members in other plants. All SIP2s were clustered into Group II with the conserved ar/R selectivity filter composed of S, H, G and S. Based on the FPs, all PIPs from the four grass plants were clustered into two groups (Fig 3A). The P2-P5 positions were conserved in PIPs of both groups (Fig 3 and S3 Fig). While Gln was conserved in the P1 position in all members of Group I, the corresponding position in the homologues of Group II was substituted by H/V/T/M/N/E. The P3-P5 positions in all TIPs conserved the residues A, Y and W, respectively (Fig 3B). Based on the disparities in P1 and P2 positions, all TIPs could be divided into two groups. Despite three members of TIP3, all members of TIP1, TIP2, TIP3 and TIP4 were in Group I in which the P1 and P2 positions conserved T and S/V/A, respectively. All TIP5 members and a few members of TIP3 were in Group II in which the P1 and P2 positions conserved S and S/A, correspondingly. Similar FPs of Groups I and II TIPs were observed in rice, maize and other plants (S5 Fig). Based on the FPs, NIPs were clustered into four groups (Fig 3C). All NIP1 and NIP2 members were in Groups I and II, respectively, whereas all members of NIP3 and NIP4 clustered to Groups III and VI, individually. In all NIPs, P3 and P4 positions were conserved with A and Y, correspondingly. NIPs of rice and maize as well as other plants also followed this grouping (S6 Fig). Based on the FPs, all SIP1s and SIP2s clustered to Groups I and II, respectively, with the residues in P1-P5 positions correspondingly M, A, A, Y, W and F/L, A, A, Y, W (Fig 3D). However, the P2 position in other than grass plants was substituted by V.

MIPs with unusual NPA motifs

Like their counterparts in other plants, all PIPs, TIPs, NIP1s and NIP2s in the four grass plants had dual conserved NPA motifs in loops B and E, respectively. In the NIPs with unusual NPA motifs, A of the NPA in Loop B was substituted by S and that in Loop E was substituted by V or I, as was found in poplar and other plants (Table 5). However, substitution of A with I in LE of PvNIP4;1–2 and SbNIP4;1 has not so far been reported although it is found in XIPs [11]. The NIPs with unusual NPA motifs in which A in loop B and that in loop E were substituted by S and V, respectively, had a characteristic Arg-rich C-termini (Table 5). In all SIPs in the grass plants, substitution of A by T (in SIP1s) or L (in SIP2s) in the NPA motif of Loop B was in agreement with other plants. The SIPs in all plants had the conserved NPA motif in Loop E with a unique characteristic Lys-rich C-termini (Table 5) which is a potential endoplasmic reticulum retention signal [1,49].
Table 5

NIPs and SIPs with unusual NPA motifs and the characteristic C-termini.

PlantsMIPsNPA in LB*NPA in LE*C-terminal region
NIPs
P. virgatumPvNIP3;9NPSNPV-GETPRTQRSFRR
PvNIP3;10NPSNPV-GETPRAQRSFRR
PvNIP4;1NPANPI-PHAIGAVASQQF
PvNIP4;2NPANPI-PHAIGAVASQQF
S. italicaSiNIP3;5NPSNPV-GETPRTQRSFRR
S. bicolorSbNIP3;4NPSNPV-GEAPRPQRSFRR
SbNIP4;1NPANPI-RAVGSLASSPHY
B. distachyonBdNIP3;1NPSNPV-GEAPRPQRSFRR
BdNIP4;1NPANPV-GRGGAAARSGSN
O. setivaOsNIP3;1NPSNPV-GETPRPQRSFRR
Z. maysZmNIP3;1NPSNPV-GETPRTQRSFRR
A. thalianaAtNIP1;2NPANPG-SFLKTVRNGSSR
AtNIP5;1NPSNPV-TDPPRPVRSFRR
AtNIP6;1NPANPV-DEAPKERRSFRR
AtNIP7;1NPLNPA-SPVSPSVSSLLR
P. trichocarpaPtNIP3;1NPSNPV-NEKTSAARSFRR
PtNIP3;2NPSNPV-NEKTSATRSFRR
PtNIP3;3NPSNPV-ADPPRQVRSFRR
PtNIP3;4NPSNPV-TDPPRPVRSFRR
G. maxGmNIP5;1NPSNPV-AEPPRQVRSFRR
GmNIP6;2NPANPV-AKAKTSISSFRR
G. hirsutumGhNIP6;1NPANPV-ILGSPCGCRTYT
P. patensPpNIP3;1NPANPV-DPPRLPVRVFHR
PpNIP6;1NPANPM-LAGTWTHTMLQI
S. moellendorffiiSmNIP3;2NPANPI-LGAGFYTLIRSS
SmNIP6;2NPSNPA-KPKKWGRNELLQ
SmNIP5;4NPANPC-FKELERPKSFRR
SmNIP7;2NPSNPA-VLEGKEDSQNSM
SIPs
P. virgatumPvSIP1;1NPTNPA-LAPPPKPKAKKA
PvSIP1;2NPTNPA-LAPPPKPKAKKA
PvSIP2;1NPLNPA-TFLTKPKKIKEQ
PvSIP2;2NPLNPA-TFLTKPKKIKEQ
S. italicaSiSIP1;1NPTNPA-LAPPPKPKAKKA
SiSIP2;1NPLNPA-EQEADENKTKKE
S. bicolorSbSIP1;1NPTNPA-LPPAPKPKTKKA
SbSIP1;2NPLNPA-LAPPPKPKAKKA
SbSIP2;1NPLNPA-EQEADENKTKKE
B. distachyonBdSIP1;1NPTNPA-PPPAPKPKAKKA
O. sativaOsSIP1;1NPTNPA-PPPAPKPKAKKA
OsSIP2;1NPLNPA-EEEADESKTKKE
Z. maysZmSIP1;1NPTNPA-LPPAPKPKTKKA
ZmSIP1;2NPTNPA-LTPPPKPKAKKA
ZmSIP2;1NPLNPA-EQKVDENKIKKE
A. thalianaAtSIP1;1NPTNPA-PPRPQKKKQKKA
AtSIP1;2NPCNPA-APPLVQKKQKKA
AtSIP2;1NPLNPA-TEEQEKPKAKSE
P. trichocarpaPtSIP1;1NPTNPA-VFPPPAPKQKKT
PtSIP1;2NPTNPA-VFPPPAPKQKKA
PtSIP2;1NPLNPA-QDEKEKLKGKTE
PtSIP2;2NPLNPA-QDEKEKLKGKTD
G. maxGmSIP1;1NPTNPA-PPAPRVVKQKKA
GmSIP1;2NPTNPA-VFPPRVVKQKKA
GmSIP1;3NPTNPA-PPPPPEVKQKKA
GmSIP1;4NPTNPA-PPSPPEVKQKKA
GmSIP1;5NPSNPA-SMFMPPIKQKKA
GmSIP1;6NPSNPA-SMFMPPIKQKKA
G. hirsutumGhSIP1;2NPTNPA-KKAKKTRKPKRA
GhSIP1;3NPTNPA-FSPSSSIKEKKA
P. patensPpSIP1;1NPTNPA-STGNAGDKMKAS
PpSIP1;2NPTNPA-LSENAAGKVKAS
S. moellendorffiiSmSIP1;2NPTNPA-MFALGQNKEKTA

*LB and LE indicates loops B and E, respectively.

*LB and LE indicates loops B and E, respectively.

Substrate-specific signature sequences or specificity-determining positions and non-aqua transport profiles of plant MIPs

The 3D models and the multiple sequence alignments of plant MIPs that have been shown experimentally to facilitate the transport of physiologically important non-aqua molecules such as ammonia, boron, CO2, H2O2, silicon and urea as well as toxic heavy metals arsenic and antimony [22,23,24,25,26,27,28,50,51,52,53,54,55] were analyzed for predicting substrate-specific signature sequences (SSSS) or specificity-determining positions (SDPs) in NPA regions, ar/R filter and FPs. The predicted SSSS or SDPs in these three constrictions in the experimentally proven MIPs are summarized in Table 6. All of the MIPs in each of the 12 plant genomes were subjected to ScanProsite tool (http://prosite.expasy.org/scanprosite/) to identify the SSSS or SDPs, and thereby the non-aqua transporters MIPs were predicted. Only the common homologues supported by all the characteristic SSSS or SDPs in the three constrictions (two NPA regions, ar/R selectivity filter and FPs) were listed as the transporter of the specific non-aqua molecule (Fig 3 and S4–S6 Figs).
Table 6

Substrate-specific signature sequences (SSSS) or specificity determining positions (SDPs) in MIPs transporting non-aqua substrates.

SubstrateaSub-familySignature sequencesReferencesc
Ar/RNPA in Loop BNPA in Loop EFPs
Ammonia (3.26 Å)TIPHI(G/A)RSGGH(V/L)NPAVTG(G/A)SMNPARSFGTSAYW[24]
NIPWVARSGGH(L/F)NPAVTG(G/A)SMNPARSLGFSAYL
Antimonite (3.70 Å)NIP(G/A/T)(S/I/V/A)(G/A)RSG(A/C)H(L/M)NP(S/A)(V/I/T)(T/S)(G/S)(G/A)SMNP(V/A)R(T/S)L(G/A)(L/F/Y/I)(T/S)AY(L/M/F)[57]
Arsenic (4.00 Å)NIP(G/W/A)(V/S/I)(G/A)(R/V)SGAH(L/M/I/V/)NP(A/S)(V/I)T(G/S)(A/G)SMNP(A/V)R(T/S)(L/I)G(L/F/Y)(T/S)AY(F/L/M)[24]
SIP*SHGSGGASYNPLT(I/V)GG(I/V)MNPASAFA(F/L)AAYW
Boron (2.57 Å)NIP(A/G)(I/S)GRSGAH(M/L/I)NP(A/S)(V/L)T(G/S)(G/A)SMNP(A/V)R(S/T)LG(F/I)TAY(F/L)[24]
bCO2 (3.00 Å)PIPFHTRSGGHINPAVTGTGINPARSLG(Q/M)SAFW[24]
H2O2 (3.20 Å)PIPFHTRSGGH(I/L/V/)NPAVTGT(G/S)INPARS(L/F)G(Q/F)SAFW[24]
TIPHI(A/G)(R/V)SGGH(V/L/I/)NPAVTG(A/G)SMNPA(R/V)SFGTSAYW
NIPWVARSGAH(F/L/I/V)NPAVTG(A/G)SMNPARSLGFSAY(I/L)
SIP*SHGSGGASYNPLT(I/V)GG(I/V)MNPASAFA(F/L)AAYW
bSilicon (4.38 Å)NIPGSGRSGAHMNPA(V/L)TGGSMNPARTL(G/A)(L/I)TAYF[69]
Urea (2.62 Å)TIP(H/G/N)(I/V)(A/G)(R/V/C)SGGH(V/I/L/M)NPAVTG(A/G)SMNPA(R/V/C)SFGT(S/A)AYW[24]
NIP(G/A)(S/I)ARSGAH (M/ V/I/L/)NPAVT(G/S)(A/G)SMNP(A/V)R(T/S)LG(L/F/M/V/I)TAY(F/L)
SIP*(L/V/I/A)(V/I/F/M/T)P(NF/I)G(G/S)(V/A)(S/T)(F/W)NP(S/C/T/A)(T/A/G/D)(S/T/N/L/V/I/F)(G/R)P(S/A)MNPA(N/F/I)A(F/Y)(M/I)AAYW

a The diameter of the molecule is shown in the parenthesis.

b SSSS or SDPs in two NPA regions, ar/R selectivity filter and FPs were determined by analyzing the MIPs that have been shown experimentally to transport CO2 and silicon [24] which synchronized with the report of Hove and Bhave [23].

c The SSSS or SDPs were determined in this study by analyzing the experimental MIP homologues mentioned in the references within the parenthesis.

*SSSS and SDPs were not based on the experimental SIPs. SIPs that were predicted as arsenic, H2O2 and urea transporter based on the FPs of experimental PIPs, TIPs and NIPs, were used to predict the SSSS or SDPs in NPA regions, ar/R selectivity filter and FPs.

a The diameter of the molecule is shown in the parenthesis. b SSSS or SDPs in two NPA regions, ar/R selectivity filter and FPs were determined by analyzing the MIPs that have been shown experimentally to transport CO2 and silicon [24] which synchronized with the report of Hove and Bhave [23]. c The SSSS or SDPs were determined in this study by analyzing the experimental MIP homologues mentioned in the references within the parenthesis. *SSSS and SDPs were not based on the experimental SIPs. SIPs that were predicted as arsenic, H2O2 and urea transporter based on the FPs of experimental PIPs, TIPs and NIPs, were used to predict the SSSS or SDPs in NPA regions, ar/R selectivity filter and FPs. Our analysis showed that the predicted ammonia transporter MIPs were distributed to TIPs (TIP2s and TIP4s) (Fig 3 and S5 Fig), which was in agreement with experimental evidence [24]. This result indicated that ammonia transport through TIPs might be a conserved and ancient feature in higher plants since early branched plants such as P. patens and S. moellendorffii have no ammonia transporter. At least 5 MIPs from the four grass plants and 12 MIPs of the other plants were predicted to transport boron and were distributed only to members of NIP3, NIP5 and NIP6 except OsNIP2;1 (Fig 3 and S6 Fig). Boron transport in plants could be an ancestral feature as each of the 12 plants except S. moellendorffii had at least one NIP homologue predicted to be boron transporter. Our data showed that 36 PIPs in the four grass plants and 55 PIPs in the other 8 plants were predicted to be CO2 transporters with the highest and lowest numbers in cotton and S. moellendorffii, respectively (Fig 3 and S4 Fig). Despite AtPIP1;2 in Arabidopsis, no homologue in these 12 plants has experimental evidence, hence it would be interesting to test the CO2 permeability of these predicted PIPs in higher and lower plants. However, the plant MIPs especially in Arabidopsis, barley and tobacco, which have been experimentally proven to transport CO2, are dispersed to PIPs [51,52,56]. Including a total of 72 MIPs in the four species, more than 139 MIP homologues in the 12 plants were predicted to facilitate the transport of H2O2 (Fig 3 and S4 and S5 Figs). These MIPs were mostly of PIPs and TIPs; the members of PIPs were of group I based on FPs (Fig 3 and S4 Fig, [24]). However, a few NIPs of group I from rice, poplar and Arabidopsis, and two HIPs each from P. patens and S. moellendorffii were predicted to be H2O2 transporters (S6 Fig). Data showed that all of the six grass plants had more than one silicon transporter and all were members of NIP2s (Fig 3 and S6 Fig). Furthermore, except PtNIP2;1, no silicon transporter was predicted in the other 5 plants. This result indicated that silicon transport might not be an ancestral characteristic and may be inherited based on the plant species. Each of the 12 plants had multiple urea transporters that were distributed to TIPs and NIPs (Fig 3 and S5 and S6 Figs). This result indicated that urea transport might be an ancestral characteristic of plants. Phytotoxic antimony and arsenic transported through MIPs in the form of antimonite and arsenite, respectively can enter the food chain [25,57]. Our analysis predicted that the antimony and arsenic transporters were distributed only among the NIPs (either Group II or III NIPs based on the ar/R filter) in all grass plants including other higher plants (Fig 3 and S6 Fig). The antimony and arsenic transporter MIPs so far reported based on wet lab experiments are NIPs [25,57]. Therefore, antimony and arsenic transport through NIPs is a conserved and prehistoric characteristic. It was predicted that 24 MIPs from the 12 plants were arsenic transporters; of them 9 homologous were from the four grass plants (Fig 3 and S6 Fig), and among the six grass plants, P. virgatum, O. sativa had the highest number of arsenic transporters. However, A few PIP homologues in rice have been reported to have arsenic permeability [58]. Therefore, SSSS or SDPs prediction based on only a few PIP homologues might not be significant, and hence, PIPs were not considered in the analysis to predict arsenic transport. Very few studies have examined the functions of SIPs. However, at least one of the two AtSIPs showed water channel activity when they were expressed in yeast [59]. Our analysis based on the SSSS or SDPs in the NPA regions, ar/R filter and FPs determined from the experimental PIPs, TIPs and NIPs did not detect their non-aqua transport. However, based on only the FPs, almost all SIP1s were predicted as urea transporters and SIP2s in the grass plants were predicted as transporters of arsenic and H2O2 in addition to urea (Fig 3D).

MIPs predicted with multi, dual and single molecule transport activity

We defined a multichannel MIP when one MIP homologue was predicted to facilitate the transport of three or more than three non-aqua substrates. The total number of such MIPs in the four grass and in the other 6 higher plants was 18 and 37, respectively (Fig 3 and S5 and S6 Figs). However, this types of multichannel MIPs were not predicted in the lower plants, P. patens and S. moellendorffii. This result indicated that the multichannel MIPs were members of TIP2s and NIP2s. The 12 plants had a total of 136 MIP homologues that were predicted to transport two non-aqua substrates; 54 homologous were predicted in the four grass species (Fig 3 and S4–S6 Figs). A total of 78 MIP homologues in the 12 plants were predicted to transport only one non-aqua substrate.

Expression of MIP genes in roots, shoots and leaves

The FPKM values obtained from the Phytozome could be assigned to 176 MIP genes of the four species. A heatmap showing their transcript levels in roots, shoots and leaves of the four plants was generated (Fig 3A–3D). The percentage of MIP genes in P. virgatum, S. italica, S. bicolor and B. distachyon expressed in at least one organ analyzed was 70, 76, 75 and 89, respectively, and that of MIPs in those plants expressed in all organs analyzed was 47, 59, 50 and 34, respectively. Among the MIPs, PvTIP1;2 (FPKM = 411.5), SiTIP1;1 (FPKM = 846.5), SbTIP2;1 (FPKM = 941.5) and BdTIP1;1 (FPKM = 1076) showed the highest expression in roots and these TIP homologues were ubiquitously expressed in all organs analyzed.

Discussion

We identified and characterized a total of 176 MIP homologues from the genomes of four grass plants, P. virgatum, S. italica, S. bicolor and B. distachyon to predict and compare their structural properties and non-aqua transport functions to those in other two grass plants, rice and maize, as well as at least six non-grass plants comprising higher and lower plants. The genomes of all twelve plants included a total of 487 full-length MIP homologues. Therefore, this study provides a comparative particulars in context of their genome-wide number of homologues, subclasses or groups, non-aqua transport profile and structure-function relationships or non-aqua transport selectivity.

The genome of P. virgatum has the largest number of MIP homologues

Although the number of MIP homologues varies from plant to plant, dicot plants comparatively have more homologues than monocot plants. Before our report, the highest known number of 66 full-length MIP homologues was shown in the genome of the dicot species G. max [18]. However, in the present study, we identified 68 full-length MIP homologues in a monocot species, P. virgatum (Table 1). This is the largest number of MIP homologues in a plant genome reported to date. It can be speculated that the polyploidy nature of P. virgatum resulted in duplication of these genes along the genome [33,42]. The large numbers of MIPs reflects wide diversity in substrate specificity, subcellular localization, transcriptional and post-translational regulation.

Grass plants have the least number of MIP subfamilies

Similar to Arabidopsis [13], MIPs of grass plants comprise only four subfamilies, namely PIPs, TIPs, NIPs and SIPs (Fig 1), whereas MIPs of other higher plants with dicotyledon such as poplar, soybean, tomato and cotton have one more subfamily, XIPs [11,16,18,19]. The early-branched land plants, P. patens or mosses, possesses additional MIP subfamilies adding up to seven including GIPs and HIPs [20]. The PtMIPs and PpMIPs were chosen as queries so that XIPs, GIPs or HIPs could be detected if they were encoded in the genomes of grass plants. Occurrence of gene duplication as well as horizontal gene transfer during evolution is an important consideration for diversification of MIPs [33]. HIPs and GIPs might have been lost between the ancestor of early-branched vascular and seed plants and XIPs might have been lost between the ancestor and grass plants including Arabidopsis. Interestingly, although all higher plants have both SIP1s and SIP2s, B. distachyon possesses only SIP1 homologue as was found in lower plants, P. patens and S. moellendorffii [20,60]. This indicated that either SIP2s were present in the early-branched land plants but were subsequently lost in B. distachyon. It might be because of rapid divergence of SIP2s from SIP1 in B. distachyon as was suggested for P. patens and S. moellendorffii.

Sub-cellular localizations and expression of plant MIPs are likely to be connected to their transport profiles

The sub-cellular localizations of plant MIPs are diversified, which might be connected to their functions. It was speculated that the same PIP localized in the PM and chloroplast might be responsible for transporting water and CO2, respectively [51,56]. Dual or multiple localizations might be coherent with the dual or multi channel activities of MIPs (Tables 1–4, Fig 3 and S4–S6 Figs). We guess that the PIPs predicted to transport CO2 are localized in the chloroplast in addition to PM. However, the score for localization in the chloroplast is lower than that in the PM. This is also applicable to the AtPIP1;2 in Arabidopsis, HvPIP2;1 in barley and NtAQP1 in Nicotina tabaccum (data not shown), which were shown experimentally to localize in PM and chloroplast and to transport CO2 [51,52,56]. Again PIP1 is localized in the PM when it is coexpressed with PIP2; if it is expressed alone, then it remains in the ER [61,62]. TIPs and NIPs exhibit multiple sub-cellular localizations and high functional diversity with transport of water, glycerol, H2O2, NH3, urea or metalloids such as arsenic, antimony, boron and silicon (Tables 1–4, Fig 3 and S5 and S6 Figs; [55,63]). The multiple sub-cellular localizations and diversified transport activities of MIPs are associated with osmoregulation and transcellular water transport, cell elongation, cell signaling, detoxification of excess urea, NH3 and H2O2 [3,27,36,55,64]. Data in the present study revealed that out of the 36 CO2 transporter PIPs, 32 were expressed in the leaves (Fig 3A). All predicted arsenic, silicon, boron, ammonia and H2O2 transporters were expressed in the roots. Nevertheless, most of these MIPs were also expressed in shoots and leaves. Similarly, almost all MIPs predicted to transport other non-aqua substrates such as antimony and urea were also ubiquitously expressed in the three organs roots, shoots and leaves. Interestingly, most of the unexpressed MIPs were not predicted to have non-aqua transport activity (Fig 3A–3D). These results indicate that the predicted non-aqua transport profiles of MIPs have a close relation with their expression. Again higher level of expression of some PIPs, TIPs and NIPs suggest that they have central physiological role in regulating water homeostasis, cell growth and cell expansion [4,36]. Therefore, the prediction of sub-cellular localization and expression profiles of MIPs in this study may be a nice direction for wet lab experiments to validate the relationship among the multiple sub-cellular localizations, expression and functional diversity.

Non-aqua transport selectivity profile might be MIP group-specific

The non-aqua transport activities are mostly related to the phylogenetic framework of MIPs (Fig 3 and S4–S6 Figs). Group IA PIPs (based on FPs) from every plant were predicted to transport dual substrate CO2 and H2O2 (Fig 3A and S4 Fig). Because all the PIPs conserve the NPA motifs and the ar/R selectivity filter, their non-aqua transport selectivity profiles might be rendered by FPs. Group I PIPs usually differ from Group II PIPs by P1 position among the pore lining and their neighboring residues (S7 Fig). The variety of hydogen bonding interaction of Gln and the substituted amino acid residue at P1 position (S8 Fig) might be a reason for the different conformation and thereby transport selectivity between group I and group II PIPs. The NH2 of polar Gln at the P1 position of group I PIPs may further influence the permeate molecules. Mutagenesis studies might be interesting to validate this hypothesis. However, the pore diameter and the transport profile might be regulated by post translational modification [5,6,47] and/or by heteromerization through physical interaction [62,65]. Since all the TIPs conserve the NPA motifs and also the FPs except some disparities (Fig 3B and S5 Fig), their non-aqua transport selectivity profiles might be rendered by the ar/R selectivity filter. The substitution of the Arg in the LE2 position by the smaller Val present in group I TIPs results in wider pore diameter (data not shown). We thus support other reports [36,66,67] that the wider pore apertures in group I TIPs might have facilitated the transport of larger non-aqua susbstrates such as urea and H2O2. However, ammonia transporters clustered only to group IIA in grass plants and also to group IIB in other plants (Fig 3B and S5 Fig), most of which had smaller pore diameter than the diameter of the ammonia molecule (data not shown). This indicates that pore diameter alone is not a determinant for selectivity of all non-aqua solutes. The regulatory events, biochemical properties of the filters and elsewhere or SDPs [23] might have effects on the transport selectivity profiles of TIPs. The TIP2s and TIP4s that have been predicted to be ammonia transporters have two motifs, G-L-x-y-G-G and P-x-H in loops B and C, respectively (S2 Fig and S9 Fig). The hydrophobic pore-linning conserved Leu, Pro and basic His with imidazole ring in these motifs might have imparted a more hydrophobic channel above the ar/R selectivity filter. The greater hydrophobicity of the channel might have aided the transport of ammonia [68]. Further studies such as mutagenesis are required to test the relevance of these motifs to ammonia transport. The divergent NPA motifs, ar/R filter and FPs individually and/or collectively may play important roles in the substrate transport selectivity profiles of NIPs which were particularly predicted for metalloids such as arsenic, antimony, silicon and boron transporters in addition to H2O2 and urea (Fig 3C and S8 Fig). Conserved NPA motifs in silicon transporter and silicon non-transporter NIPs might have a limited role in the selectivity for silicon and also urea [37,69]. The ar/R filter in silicon transporter NIP2s characterized by the conserved G-S-G-R made the constriction wider [70]. The wider pore diameter of NIP2s might be one of the reasons to facilitate the transport of the bulkiest silicon molecule as well as urea, antimony and arsenic (Fig 3C and S6 Fig). In addition to NPA motifs, ar/R filter and FPs, the pore-lining highly conserved His in F/L-x-H-F-P motif in loop B may also influence the transport selectivity of metalloids in NIPs (S10 Fig). Hydrophobic Leu and Phe in the first position of this motif would be one of the determinants for boron and other metals such as arsenic, silicon and antimony, respectively. Interestingly, all of the predicted boron transporters conserved two unusual NPA motifs (NPS and NPV in Loops B and E, respectively) and Arg-enrich (R-x-x-R-S-F-R-R) C-termini (Table 5) that may play roles respectively in transport selectivity profiles and structural stabilization of the tetramers [49,71]. Furthermore, the highly conserved pore-lining SGGVTVP motifs in loop C of boron transporter NIPs might have important roles in the transport selectivity profile (data not shown). The substitutions of corresponding positions of E14, H66, I187 and F200 of GlpF were focused to affect the width of the pore and the hydrophilic-hydrophobic pattern inside the channel in SIPs [72]. However, most of the substitutions were found to be SIP group-specific in the present study (Fig 3D, S11 Fig and S2 Table). Thus, the substrate selectivity profiles of SIP1s and SIP2s, notably both the width of the pore and the interior properties of the channels, are likely to differ. Comparison of the primary sequences of SIPs with GlpF and AQP1 suggests that SIPs are likely to transport solutes which are noble, hydrophobic and large in size [72]. It is usually supposed that MIPs with unusual NPA motifs may not transport water. However, water transport activity has been demonstrated in two AtSIP1s but not in AtSIP2;1 and the latter is supposed to have non-aqua transport activity [62]. Expression profiles of SIPs further indicated to have their transport activity. Nevertheless, wet-lab experiments are necessary to determine the intracellular localization, expression pattern and transport activities of SIPs.

Conclusions

Analysis of genome sequences in four monocot grass plants revealed a new highest number of MIP homologues in P. virgatum without the recently discovered XIP subfamily in the grass plants. Further sequence and homology models analysis indicated that the signatures for substrate selectivity are group-specific, and like the ar/R selectivity filter, FPs can be an important basis for phylogenetic and functional groupings of MIP subfamilies. While the amino acid residue at the P1 position of FPs is one of the critical molecular determinants of the transport selectivity profiles of PIPs, residues at the ar/R filter and FPs are critical for substrate selectivity in TIPs and NIPs. Besides, the ar/R filter and FPs appear to work in coordination with pore-lining residues, particularly in loops B and C. Comparison of the predicted transport profiles with the expression profiles of MIPs in the four grass plants elucidateed a close correlation. The signature sequences or residues identified in the present study are important for predicting the transport profiles of uncharacterized MIPs. Prediction of the transport profiles and substrate selectivity of MIPs in the present study will provide an inroad to develop genetically modified plants that are tolerant to toxicity of heavy metals such as arsenic and antimony or deficiency of microelements and nutritionally better or healthier. However, the computational analysis-aided prediction for transport profiles, substrate selectivity and subcellular localization based on the critical primary sequence motifs and tertiary structural models of MIPs need to be validated by wet lab experiments.

Phylogenetic relationships of all PIPs from the 12 plants.

The description of figure legend is as for Fig 1. (TIF) Click here for additional data file. Homology models (green) of PvPIP2;1, PvTIP2;1, SiNIP3;5 and SiSIP1;1 superimposed with the models (red) of OsPIP2;1 (A), OsTIP2;1 (B), OsNIP3;1 (C) and OsSIP1;1 (D), respectively. A and D, the top views into the pore of PvPIP2;1 and SiSIP1;1, respectively, and B and C, the side views of PvTIP2;1 and SiNIP3;5, correspondingly. The 3D models of MIPs of the four grass plants were first constructed separately on the basis of the experimental structure of spinach PIP, SoPIP2;1(PDB ID:2B5F). Each of the 3D models of MIPs of the four grass plants was then superimposed on the MIP of other plants (only the representatives are shown). The residues that form the NPA box, ar/R filter and the FPs are shown as sticks. The residues of NPA, ar/R and FPs in PvPIP2;1, PvTIP2;1, SiNIP3;5 and SiSIP1;1 are shown in blue, green and yellow, respectively and those in OsMIPs are shown in black, red and pink, correspondingly and labeled. The TM α-helices and the loops to which they belong are indicated. The center of the pore is indicated as a black ball (A and D) and the path of the channel is indicated as the chain of red balls (B and C). The conserved pore-lining Leu in loop B and P-x-H in loop C found in predicted ammonia transporters TIP2s and TIP4s (B) and L-x-H-F-P in loop B and SGGVTVP found in predicted boron transporters NIP3s (C) are magenta; the same residues in the corresponding positions in OsTIP2;1 (B) and OsNIP3;1 (C) are cyan. The regions of NPA and ar/R selectivity filter and the conserved pore-lining residues in loops B and C in ammonia and boron transporters are boxed (B and C) and indicated by open arrows. The hydrogen bonding interaction between Pro and Val in loop C is shown by black dots. (TIF) Click here for additional data file.

NPA motifs, tetrad of ar/R filter and FPs in the MIPs of four grass plants.

The amino acid sequences were aligned using the Clustal Omega sequence alignment program. From the multiple alignment, only structurally significant regions containing the NPA motifs, tetrad residues of ar/R filter and FPs are shown. The two conserved NPA motifs are bold, the residues at H2, H5, LE1, and LE2 of the ar/R filter are bold and underlined, FPs (P1-P5) are italic and underlined, conserved residues are shaded with grey. (PDF) Click here for additional data file.

Grouping of PIPs based on the FPs in Arabidopsis (At), rice (Os), maize (Zm), poplar (Pt), soybean (Gm), cotton (Gh) and moss (Pp).

The description of the figure legend is as for Fig 3. Here, # and * indicate the members of group I and group II, respectively. (TIF) Click here for additional data file.

Grouping of TIPs based on the ar/R selectivity filter and FPs in Arabidopsis, rice, maize, poplar, soybean, cotton and moss.

The description of the figure legend is as for Fig 3. Here, Ϯ and * indicate the members of group IIB of ar/R filter and group I of FPs, respectively. (TIF) Click here for additional data file.

Grouping of NIPs based on the ar/R selectivity filter and FPs in Arabidopsis, rice, maize, poplar, soybean, cotton and moss.

The description of the figure legend is as for Fig 3. Here, * and # indicates the members of group I of FPs and Group IV of ar/R filter, respectively. (TIF) Click here for additional data file. Multiple sequence alignment of groups I (A) and II (B) PIPs of the twelve plants. The amino acid sequences were aligned using the Clustal Omega program. The transmembrane helices and the dual NPA motifs are shown as gray and yellow, respectively. The residue (Q) at P1 position is shown as cyan. The pore-lining residues are indicated by arrows above the alignment and the conserved residues are indicated by stars (*) at the bottom of the alignment. (PDF) Click here for additional data file. Intramolecular hydrogen-bonding interaction of the amino acid residue at the P1 position (A and B) and its possible role in pore conformation (C and D) in PIPs of Groups I and II. The Gln (Q) in P1 position of a Group I PIP is shown in magenta and its hydrogen bonding interactions with at least five amino acid residues are shown as black dashes (A). The hydrogen-bonding interaction of a substituted amino acid residue (magenta) at the corresponding position in a Group II PIP is shown as black dashes (B). The pore conformation (indicated by an open arrow) in the ar/R selectivity filter region (space-filling residues) of the same 3D models in (A) and (B) are shown in (C) and (D), respectively. (TIF) Click here for additional data file. Multiple sequence alignment of ammonia transporter TIP2s and TIP4s (A) and ammonia non-transporters (B) of the twelve plants. The conserved pore lining hydrophobic Leu in loop B and P-x-H in loop C are shown in the blue boxes. The description of the figure legend is as for Fig S9. (PDF) Click here for additional data file. Multiple sequence alignment of silicon transporter (A) and silicon non-transporter (B) NIPs of the twelve plants. The conserved pore lining F/L-x-H-F-P motif in loop B is shown in the blue boxes. The description of the figure legend is as for S9 Fig. (PDF) Click here for additional data file.

Multiple sequence alignment of SIPs with GlpF and AQP1.

The amino acid sequences were aligned using the Clustal Omega sequence alignment program. Two NPA motifs, the residues at H2, H5, LE1, and LE2 of the ar/R filter and FPs (P1-P5) are yellow, green and cyan, respectively. The SIP group-specific residues corresponding to structurally important residues in GlpF shown by Fu et al. (2000) are in open boxes. The group-specific residues at TM5, LE and TM6, which may also have structural and/or functional roles, are shown in blue boxes. The star (*) at the bottom of the alignment indicates the conserved residues. (PDF) Click here for additional data file.

MIPs discarded from the four grass plants.

(PDF) Click here for additional data file.

Structurally important SIP group-specific amino acids and the role of residues in the corresponding positions in the structure of GlpF and AQP1 (or both).

(PDF) Click here for additional data file.
Table 2

MIP genes in S. italica.

Gene NameAccession No.Genomic locationPPL(aa)Maximum Identity with other MIP (%)xPSCLyKa/Ks value
PhytozomeNCBI
SiPIP1;1Seita.1G264900XP_004953388scaffold_1:33821741..33825147289XP_002454508(99) aPLAS, CHLO0
SiPIP1;2Seita.7G196700XP_004976483scaffold_7:26986155..26988583288XP_002446929 (96)aPLAS, CHLO0.003
SiPIP1;3Seita.1G372300AET81042scaffold_1:41718834..41720554288NP_001105131(97)bPLAS, CHLO CHLO0.077
SiPIP1;4Seita.4G089800XP_004964964scaffold_4:7477219..7478601299XP_002438067(90)aPLAS0.433
SiPIP2;1Seita.2G123300XP_004956116scaffold_2:13956110..13957549286XP_002461936(97)aPLAS, CHLO0.111
SiPIP2;2Seita.2G123200XP_004956115scaffold_2:13928862..13930357286XP_002461936 (96)aPLAS, CHLO0.061
SiPIP2;3Seita.7G170200XP_004976254scaffold_7:25196250..25198921290NP_001105616(96)bPLAS, CHLO0.396
SiPIP2;4Seita.1G241900XP_004953172scaffold_1:31952120..31955237288NP_001105026(96)bPLAS, CHLO0.155
SiPIP2;5Seita.2G123000XP_004956113scaffold_2:13905407..13908718289NP_001105024(97)bPLAS, CHLO0.161
SiPIP2;6Seita.9G219400XP_004986496scaffold_9:16160701..16162490294NP_001105024(74)bPLAS0.443
SiPIP2;7Seita.9G268100-scaffold_9:22654024..22655606284AFW68878(89)bPLAS0.647
SiPIP2;8Seita.2G291500XP_004957505scaffold_2:38725943..38727530285ADW85675(84)ePLAS0.628
SiTIP1;1Seita.5G469800-scaffold_5:47173769..47175771268NP_001045562(90)dCYTO0.444
SiTIP1;2Seita.9G541300XP_004985722scaffold_9:56280555..56282371249P50156(94)dVACU0.350
SiTIP2;1Seita.5G452400XP_004971257scaffold_5:46290359..46291626243NP_001047632(90)dPLAS0.906
SiTIP2;2Seita.1G259900XP_004953349scaffold_1:33381748..33382889249NP_001047632(94)aPLAS0.154
SiTIP2;3Seita.7G189600XP_004976439scaffold_7:26563113..26564278248DAA36542(98)bPLAS0.810
SiTIP2;4Seita.4G160700-scaffold_4:24091200..24092502252XP_002438430(96)aCYTO0.141
SiTIP2;5Seita.7G175600XP_004965462scaffold_4: 24091083–24092571248XP_002438430(98)aCYTO1.013
SiTIP3;1Seita.9G571600XP_004986028scaffold_9:58297296..58298733257NP_001146930(80)aCHLO0.557
SiTIP3;2Seita.9G208400XP_004982756scaffold_9:14971858..14973146262NP_001105032(95)bMITO0.284
SiTIP4;1Seita.5G007300XP_004967394scaffold_5:510443..513569246NP_001105035(88)bPLAS0.324
SiTIP4;2Seita.5G007400XP_004967392scaffold_5:517674..519546246DAA53302(84)bCYTO0.225
SiTIP4;3Seita.5G007500XP_004967395scaffold_5:526125..527419250XP_002457071(88)aVACU0.487
SiTIP4;4Seita.3G082100-scaffold_3:5238397..5241760377NP_001105034(86)bCHLO0.625
SiTIP5;1Seita.1G259800XP_004953348scaffold_1:33380410..33381506259AAF90122(71)eCHLO0.785
SiTIP5;2Seita.7G189500-scaffold_7:26561818..26562859259EMT13969(77)fCHLO1.152
SiNIP1;1Seita.1G025100XP_004951368scaffold_1:2256093..2258861278NP_001105721(93)aPLAS0.420
SiNIP1;2Seita.3G073300XP_004960601scaffold_3:4665483..4668824281XP_002440774(92)aPLAS0.393
SiNIP1;3Seita.4G180100XP_004967095scaffold_4:29058134..29059299286AFW86958(75)bCYTO0.728
SiNIP2;1Seita.6G063400-scaffold_4: 8475698–8479630282XP_002438105(90)aPLAS0.083
SiNIP2;2Seita.4G098700XP_004965042scaffold_4:8475700..8479629297XP_002438105(95)aPLAS0.060
SiNIP2;3Seita.1G318800XP_004953867scaffold_1:37930981..37934840341NP_001105637(93)bPLAS0.271
SiNIP3;1Seita.6G062200XP_004974438scaffold_6:5185326..5186805296XP_002443852(90)aVACU0.297
SiNIP3;2Seita.6G062300XP_004972846scaffold_6:5190608..5192100286XP_002445047(71)aVACU0.211
SiNIP3;3Seita.6G063300XP_004974441scaffold_6:5255676..5256858277NP_001150784(75)bCYTO0.419
SiNIP3;4Seita.6G062400XP_004974439scaffold_6:5192809..5193684291XP_002445042(69)aCYTO0.654
SiNIP3;5Seita.9G193500XP_004982621scaffold_9:13788219..13792953299XP_002464380(98)aPLAS0.949
SiNIP4;1Seita.5G076000XP_004971599scaffold_5:6518742..6520453298ACL53915.1 (79)bPLAS0.845
SiSIP1;1Seita.3G248900XP_004962139scaffold_3:21344768..21347980243XP_002441068(92)aPLAS0.719
SiSIP1;2Seita.8G085300XP_004979029scaffold_8:10243492..10260237251XP_002449310.1(89)aPLAS0.721
SiSIP2;1Seita.9G422800XP_004984561scaffold_9:47875350..47877444252NP_001105640(93)bNUCL0.541

xA gene that shows the highest identity with MIP in other plants by BLASTp. Parenthesis indicates the percentage of identity at the amino acid level.

aSorghum bicolor

bZea mays

dOryza sativa Japonica Group

eHordeum vulgare and

fAegilops tauschii

yThe same abbreviations have been used in Tables 1–4.

Table 3

MIP genes in S. biocolor.

Gene NameAccession No.CLGenomic locationPPL (aa)Maximum identity with other MIP (%)xPSCLyKa/Ks value
PhytozomeNCBI
SbPIP1;1Sobic.006G176700.1XP_002446929653192023..53194107288AAO86706(98)bPLAS, CHLO0.102
SbPIP1;2Sobic.004G288700.1XP_002454508463023013..63027206289ACF84511(99)bPLAS, CHLO0.039
SbPIP1;3Sobic.004G351200.1XP_002453072467981023..67983212290NP_001105131(96)bPLAS, CHLO0.156
SbPIP1;4Sobic.010G087900.1XP_002438067107521029..7522397296NP_001105023(94)bPLAS0.415
SbPIP2;1Sobic.002G125700.2XP_002461936216980280..16982926286NP_001105027(98)bPLAS, CHLO0.050
SbPIP2;2Sobic.002G125300.1XP_002461933216906986..16908387286NP_001105027 (97)bPLAS, CHLO0.048
SbPIP2;3Sobic.002G125000.1XP_002461931216883369..16884816296NP_001105027(96)bPLAS, CHLO0.080
SbPIP2;4Sobic.002G125200.1XP_002461932216897836..16899264286NP_001105027(96)bPLAS, CHLO0.135
SbPIP2;5Sobic.004G222000.1XP_002452483457220820..57224296289NP_001105026(98)bPLAS, CHLO0.156
SbPIP2;6Sobic.006G150100.1XP_002446796651145123..51147729292NP_001105616(95)bPLAS, CHLO0.134
SbPIP2;7Sobic.002G124700.1XP_002461930216844700..16848362290NP_001105024(99)bPLAS, CHLO0.260
SbPIP2;8Sobic.K007000.1XP_002489214U2606152..2607000282AFW68878(94)bPLAS0.325
SbPIP2;9Sobic.002G281000.2-266275305..66276988289XP_004957505(84) cPLAS, CHLO0.456
SbTIP1;1Sobic.001G505100.1XP_002465859177324938..77327995250NP_001104896(94)bPLAS0.376
SbTIP1;2Sobic.003G445300.2XP_002459183374316138..74319335258ACF78734(91)bCYTO0.362
SbTIP2;1Sobic.004G295100.1XP_0024528084Sobic.004G295100.1249NP_001105030(90)bPLAS0.144
SbTIP2;2Sobic.006G170600.1XP_002448289652722392..52723580249XP_004976439(96)cPLAS0.130
SbTIP2;3Sobic.010G146100.1XP_0024384301041392271..41394011248EAZ00793 (96) dCYTO0.652
SbTIP3;1Sobic.001G208500.1XP_002467022119088973..19090440266NP_001105032(94)bMITO0.361
SbTIP3;2Sobic.006G155300.1XP_002446824651467369..51469051268DAA36836(89)bPLAS0.852
SbTIP3;3Sobic.001G535900.2XP_002468661179929261..79930715271NP_001146930(88)bPLAS0.510
SbTIP4;1Sobic.003G007200.1XP_0024570713622245..623367252ACG39579(95)bCYTO0.591
SbTIP4;2Sobic.009G085900.1XP_002439483914570383..14573259314ACG46456(92)bVACU0.567
SbTIP4;3Sobic.003G006600.1XP_0024570683572753..574763318DAA53302(88)bVACU0.550
SbNIP1;1Sobic.003G026400.1XP_00245498232231972..2234369271AFW77428(77)bCYTO0.393
SbNIP1;2Sobic.009G075900.1XP_00244077499905084..9909435283NP_001151947(92)bPLAS0.253
SbNIP1;3Sobic.004G102200.1XP_00245357349450179..9453286287NP_001105721(94)bCYSK0.453
SbNIP1;4Sobic.010G164100.1-1048401814..48403745291XP_004967095(80)cCYTO0.453
SbNIP2;1Sobic.004G238100.1XP_002454286458614722..58618581297NP_001105637(97)bPLAS0.343
SbNIP2;2Sobic.010G092600.1XP_002438105108195416..8200323295NP_001105020(98)bPLAS0.512
SbNIP3;1Sobic.007G039600.1XP_00244385273826797..3828613297Q7EYH7(77) dVACU0.777
SbNIP3;2Sobic.007G039500.1XP_00244504773812660..3815360289AFW61239 (77)bVACU0.343
SbNIP3;3Sobic.007G038500.1XP_00244504273735702..3736780297AFW57375(70) bCYTO0.512
SbNIP3;4Sobic.001G195800.1XP_002464380117588588..17593923301ACN36318(95) bPLAS1.053
SbNIP4;1Sobic.003G098100.1XP_00245531138668414..8671066289ACL53915(85)bPLAS0.844
SbSIP1;1Sobic.005G091600.1XP_002449310513565974..13569467246NP_001105514(92)bPLAS0.324
SbSIP1;2Sobic.009G131500.1XP_002441068948499291..48503602243NP_001105028(96)bPLAS0.319
SbSIP2;1Sobic.001G389900.1XP_002465351167642857..67645670249NP_001105640(94)bCHLO1.058

Where, CL: chromosome location, U: Unknown chromosomal location

xA gene that shows the highest identity with MIP in other plants by BLASTp. Parenthesis indicates the percentage of identity at the amino acid level.

bZea mays

cSetaria italica and

dOryza sativa Japonica Group

yThe same abbreviations have been used in Tables 1–4.

  70 in total

1.  Plant aquaporins with non-aqua functions: deciphering the signature sequences.

Authors:  Runyararo Memory Hove; Mrinal Bhave
Journal:  Plant Mol Biol       Date:  2011-02-10       Impact factor: 4.076

2.  The Arabidopsis thaliana aquaporin AtPIP1;2 is a physiologically relevant CO₂ transport facilitator.

Authors:  Marlies Heckwolf; Dianne Pater; David T Hanson; Ralf Kaldenhoff
Journal:  Plant J       Date:  2011-06-21       Impact factor: 6.417

3.  The Arabidopsis major intrinsic protein NIP5;1 is essential for efficient boron uptake and plant development under boron limitation.

Authors:  Junpei Takano; Motoko Wada; Uwe Ludewig; Gabriel Schaaf; Nicolaus von Wirén; Toru Fujiwara
Journal:  Plant Cell       Date:  2006-05-05       Impact factor: 11.277

4.  Solanaceae XIPs are plasma membrane aquaporins that facilitate the transport of many uncharged substrates.

Authors:  Gerd Patrick Bienert; Manuela Désirée Bienert; Thomas Paul Jahn; Marc Boutry; François Chaumont
Journal:  Plant J       Date:  2011-03-01       Impact factor: 6.417

5.  NIP6;1 is a boric acid channel for preferential transport of boron to growing shoot tissues in Arabidopsis.

Authors:  Mayuki Tanaka; Ian S Wallace; Junpei Takano; Daniel M Roberts; Toru Fujiwara
Journal:  Plant Cell       Date:  2008-10-24       Impact factor: 11.277

6.  Characterization of four plasma membrane aquaporins in tulip petals: a putative homolog is regulated by phosphorylation.

Authors:  Abul Kalam Azad; Maki Katsuhara; Yoshihiro Sawa; Takahiro Ishikawa; Hitoshi Shibata
Journal:  Plant Cell Physiol       Date:  2008-06-20       Impact factor: 4.927

7.  Plant plasma membrane water channels conduct the signalling molecule H2O2.

Authors:  Marek Dynowski; Gabriel Schaaf; Dominique Loque; Oscar Moran; Uwe Ludewig
Journal:  Biochem J       Date:  2008-08-15       Impact factor: 3.857

8.  Gene identification in novel eukaryotic genomes by self-training algorithm.

Authors:  Alexandre Lomsadze; Vardges Ter-Hovhannisyan; Yury O Chernoff; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2005-11-28       Impact factor: 16.971

9.  CO2 transport by PIP2 aquaporins of barley.

Authors:  Izumi C Mori; Jiye Rhee; Mineo Shibasaka; Shizuka Sasano; Toshiyuki Kaneko; Tomoaki Horie; Maki Katsuhara
Journal:  Plant Cell Physiol       Date:  2014-01-08       Impact factor: 4.927

10.  Genome-wide analysis of major intrinsic proteins in the tree plant Populus trichocarpa: characterization of XIP subfamily of aquaporins from evolutionary perspective.

Authors:  Anjali Bansal Gupta; Ramasubbu Sankararamakrishnan
Journal:  BMC Plant Biol       Date:  2009-11-20       Impact factor: 4.215

View more
  16 in total

1.  The Eucalyptus Tonoplast Intrinsic Protein (TIP) Gene Subfamily: Genomic Organization, Structural Features, and Expression Profiles.

Authors:  Marcela I Rodrigues; Agnes A S Takeda; Juliana P Bravo; Ivan G Maia
Journal:  Front Plant Sci       Date:  2016-11-30       Impact factor: 5.753

2.  Computational Analysis of Damaging Single-Nucleotide Polymorphisms and Their Structural and Functional Impact on the Insulin Receptor.

Authors:  Zabed Mahmud; Syeda Umme Fahmida Malik; Jahed Ahmed; Abul Kalam Azad
Journal:  Biomed Res Int       Date:  2016-10-20       Impact factor: 3.411

3.  Roles of Aquaporins in Setaria viridis Stem Development and Sugar Storage.

Authors:  Samantha A McGaughey; Hannah L Osborn; Lily Chen; Joseph L Pegler; Stephen D Tyerman; Robert T Furbank; Caitlin S Byrt; Christopher P L Grof
Journal:  Front Plant Sci       Date:  2016-12-01       Impact factor: 5.753

4.  Evolutionary and Predictive Functional Insights into the Aquaporin Gene Family in the Allotetraploid Plant Nicotiana tabacum.

Authors:  Jahed Ahmed; Sébastien Mercx; Marc Boutry; François Chaumont
Journal:  Int J Mol Sci       Date:  2020-07-03       Impact factor: 5.923

5.  Genome-Wide Identification and Transcriptional Regulation of Aquaporin Genes in Bread Wheat (Triticum aestivum L.) under Water Stress.

Authors:  José Madrid-Espinoza; Nidia Brunel-Saldias; Fernando P Guerra; Adelina Gutiérrez; Alejandro Del Pozo
Journal:  Genes (Basel)       Date:  2018-10-15       Impact factor: 4.096

6.  Genome-Wide Identification and Characterization of Aquaporins and Their Role in the Flower Opening Processes in Carnation (Dianthus caryophyllus).

Authors:  Weilong Kong; Mohammed Bendahmane; Xiaopeng Fu
Journal:  Molecules       Date:  2018-07-29       Impact factor: 4.411

Review 7.  Plant and Mammal Aquaporins: Same but Different.

Authors:  Timothée Laloux; Bruna Junqueira; Laurie C Maistriaux; Jahed Ahmed; Agnieszka Jurkiewicz; François Chaumont
Journal:  Int J Mol Sci       Date:  2018-02-08       Impact factor: 5.923

8.  Identification and substrate prediction of new Fragaria x ananassa aquaporins and expression in different tissues and during strawberry fruit development.

Authors:  Britt Merlaen; Ellen De Keyser; Marie-Christine Van Labeke
Journal:  Hortic Res       Date:  2018-04-01       Impact factor: 6.793

9.  Production and partial characterization of dehairing alkaline protease from Bacillus subtilis AKAL7 and Exiguobacterium indicum AKAL11 by using organic municipal solid wastes.

Authors:  Al Hakim; Farhana Rumzum Bhuiyan; Asif Iqbal; Tanvir Hossain Emon; Jahed Ahmed; Abul Kalam Azad
Journal:  Heliyon       Date:  2018-06-07

10.  Genome Wild Analysis and Molecular Understanding of the Aquaporin Diversity in Olive Trees (Olea Europaea L.).

Authors:  Mohamed Faize; Boris Fumanal; Francisco Luque; Jorge A Ramírez-Tejero; Zhi Zou; Xueying Qiao; Lydia Faize; Aurélie Gousset-Dupont; Patricia Roeckel-Drevet; Philippe Label; Jean-Stéphane Venisse
Journal:  Int J Mol Sci       Date:  2020-06-11       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.