| Literature DB >> 25018760 |
Ronald M Clouse1, Nicola Carraro2.
Abstract
The PIN and ACO gene families present interesting questions about the evolution of plant physiology, including testing hypotheses about the ecological drivers of their diversification and whether unrelated genes have been recruited for similar functions. The PIN-formed proteins contribute to the polar transport of auxin, a hormone which regulates plant growth and development. PIN loci are categorized into groups according to their protein length and structure, as well as subcellular localization. An interesting question with PIN genes is the nature of the ancestral form and location. ACOs are members of a superfamily of oxygenases and oxidases that catalyze the last step of ethylene synthesis, which regulates many aspects of the plant life cycle. We used publicly available PIN and ACO sequences to conduct phylogenetic analyses. Third codon positions of these genes in monocots have a high GC content, which could be historical but is more likely due to a mutational bias. Thus, we developed methods to extract phylogenetic information from nucleotide sequences while avoiding this convergent feature. One method consisted in using only A-T transformations, and another used only the first and second codon positions for serine, which can only take A or T and G or C, respectively. We also conducted tree-searches for both gene families using unaligned amino acid sequences and dynamic homology. PIN genes appear to have diversified earlier than ACOs, with monocot and dicot copies more mixed in the phylogeny. However, gymnosperm PINs appear to be derived and not closely related to those from primitive plants. We find strong support for a long PIN gene ancestor with short forms subsequently evolving one or more times. ACO genes appear to have diversified mostly since the dicot-monocot split, as most genes cluster into a small number of monocot and dicot clades when the tree is rooted by genes from mosses. Gymnosperm ACOs were recovered as closely related and derived.Entities:
Keywords: 1-aminocyclopropane-1-carboxylate oxidase; GC mutational bias; PIN-formed; ancestral reconstruction; auxin; ethylene synthesis
Year: 2014 PMID: 25018760 PMCID: PMC4071234 DOI: 10.3389/fpls.2014.00296
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
.
| AcoGoldSmith_v1.001931m.g | Aco001931 | 654 | 9 | Long | 1 | |
| AcoGoldSmith_v1.007499m.g | Aco007499 | 356 | 9 | Short | 2 | |
| AcoGoldSmith_v1.016169m.g | Aco016169 | 620 | 9 | Long | 1 | |
| AcoGoldSmith_v1.018139m.g | Aco018139 | 641 | 9 | Long | 1 | |
| AcoGoldSmith_v1.018694m.g | Aco018694 | 612 | 5 | Long | 1 | |
| AT1G73590 | AtPIN1 | 622 | 9 | Long | 1 | |
| AT5G57090 | AtPIN2 | 647 | 9 | Long | 1 | |
| AT1G70940 | AtPIN3 | 640 | 9 | Long | 1 | |
| AT2G01420 | AtPIN4 | 616 | 10 | Long | 1 | |
| AT5G16530 | AtPIN5 | 351 | 9 | Short | 2 | |
| AT1G77110 | AtPIN6 | 570 | 9 | Reduced | 3 | |
| AT1G23080 | AtPIN7 | 619 | 9 | Long | 1 | |
| AT5G15100 | AtPIN8 | 367 | 8 | Short | 2 | |
| Orange1.1g006199m.g | Csi_g006199 | 657 | 10 | Long | 1 | |
| Orange1.1g007420m.g | Csi_g007420 | 604 | 8 | Long | 1 | |
| Orange1.1g007826m.g | Csi_g007826 | 588 | 8 | Long | 1 | |
| Orange1.1g018360m.g | Csi_g018360 | 357 | 8 | Short | 2 | |
| Orange1.1g019021m.g | Csi_g019021 | 347 | 9 | Short | 2 | |
| Orange1.1g035534m.g | Csi_g035534 | 291 | 5 | N-terminal TMD only | 5 | |
| Orange1.1g036474m.g | Csi_g036474 | 646 | 9 | Long | 1 | |
| Orange1.1g041301m.g | Csi_g041301 | 291 | 7 | Short | 2 | |
| Orange1.1g048649m.g | Csi_g048649 | 256 | 5 | N-terminal TMD only | 5 | |
| Eucgr.A02229 | EgrA02229_1 | 599 | 8 | Long | 1 | |
| Eucgr.B00948 | EgrB00948_1 | 587 | 9 | Long | 1 | |
| Eucgr.B01403 | EgrB01403_1 | 365 | 9 | Short | 2 | |
| Eucgr.B01405 | EgrB01405_1 | 364 | 9 | Short | 2 | |
| Eucgr.B01406 | EgrB01406_1 | 285 | 7 | Short | 2 | |
| Eucgr.B02902 | EgrB02902_1 | 657 | 9 | Long | 1 | |
| Eucgr.C00078 | EgrC00078_1 | 626 | 9 | Long | 1 | |
| Eucgr.F04265 | EgrF04265_1 | 530 | 8 | Reduced | 3 | |
| Eucgr.G02187 | EgrG02187_1 | 652 | 9 | Long | 1 | |
| Eucgr.G02548 | EgrG02548_1 | 338 | 9 | Short | 2 | |
| Eucgr.G02549 | EgrG02549_1 | 360 | 9 | Short | 2 | |
| Eucgr.H01382 | EgrH01382_1 | 262 | 6 | Short | 2 | |
| Eucgr.H01390 | EgrH01390_1 | 519 | 7 | Long | 1 | |
| Eucgr.01919 | EgrI01919_1 | 356 | 8 | Short | 2 | |
| Eucgr.K02271 | EgrK02271_1 | 598 | 9 | Long | 1 | |
| Cassava4.1_003367m.g | Mes003367 | 646 | 9 | Long | 1 | |
| Cassava4.1_003794m.g | Mes003794 | 614 | 8 | Long | 1 | |
| Cassava4.1_003807m.g | Mes003807 | 614 | 8 | Long | 1 | |
| Cassava4.1_006998m.g | Mes006998 | 468 | 9 | Long | 1 | |
| Cassava4.1_010607m.g | Mes010607 | 357 | 9 | Short | 2 | |
| Cassava4.1_010688m.g | Mes010688 | 354 | 9 | Short | 2 | |
| Cassava4.1_026579m.g | Mes026579 | 598 | 7 | Long | 1 | |
| Cassava4.1_029063m.g | Mes029063 | 361 | 8 | Short | 2 | |
| Cassava4.1_029078m.g | Mes029078 | 626 | 9 | Long | 1 | |
| Cassava4.1_030090m.g | Mes030090 | 380 | 5 | N-terminal TMD only | 5 | |
| Cassava4.1_033391m.g | Mes033391 | 355 | 8 | Short | 2 | |
| Medtr2g043210 | Mtr2g043210 | 315 | 4 | N-terminal TMD only | 5 | |
| Medtr4g154810 | Mtr4g154810 | 524 | 8 | Long | 1 | |
| Medtr6g083450 | Mtr6g083450 | 659 | 10 | Long | 1 | |
| Medtr7g008720 | Mtr7g008720 | 357 | 8 | Short | 2 | |
| Medtr7g089430 | Mtr7g089430 | 363 | 9 | Short | 2 | |
| Medtr7g106430 | Mtr7g106430 | 591 | 8 | Long | 1 | |
| Medtr8g130020 | Mtr8g130020 | 625 | 9 | Long | 1 | |
| Medtr8g130040 | Mtr8g130040 | 568 | 10 | Long | 1 | |
| Medtr4g084870 | Mtr4g084870 | 659 | 10 | Long | 1 | |
| MtrAAT48627 | MtrAAT48627 | 527 | 9 | Long | 1 | |
| MtrAY115838 | MtrAY115838 | 621 | 10 | Long | 1 | |
| LOC_Os06g12610 | OsPIN1a | 595 | 10 | Long | 1 | |
| LOC_Os02g50960 | OsPIN1b | 554 | 9 | Reduced | 3 | |
| LOC_Os11g04190 | OsPIN1c | 592 | 10 | Long | 1 | |
| LOC_Os12g04000 | OsPIN1d | 390 | 4 | C-terminal TMD only | 4 | |
| LOC_Os06g44970 | OsPIN2 | 630 | 9 | Long | 1 | |
| LOC_Os01g45550 | OsPIN3a | 670 | 5 | Long | 1 | |
| LOC_Os05g50140 | OsPIN3b | 591 | 10 | Long | 1 | |
| LOC_Os01g69070 | OsPIN5a | 363 | 7 | Short | 2 | |
| LOC_Os08g41720 | OsPIN5b | 398 | 7 | Short | 2 | |
| LOC_Os09g32770 | OsPIN5c | 357 | 7 | Short | 2 | |
| LOC_Os01g51780 | OsPIN8 | 311 | 5 | Short | 2 | |
| LOC_Os01g58860 | OsPIN9 | 426 | 10 | Reuced | 3 | |
| Pp1s10_17V6.1 | PpPIN1A | 713 | 9 | Long | 1 | |
| Pp1s18_186V6.1 | PpPIN1B | 713 | 9 | Long | 1 | |
| Pp1s32_43V6.1 | PpPIN1C | 698 | 9 | Long | 1 | |
| FJ031883.2 | PaPIN1 | 699 | 10 | Long | 1 | |
| MA_61553g0010 | PaPIN2 | 426 | 3 | C-terminal TMD only | 4 | |
| MA_69724g0010 | PaPIN3 | 625 | 8 | Long | 1 | |
| AAP59843.1 | PtoPIN1 | 619 | 7 | Long | 1 | |
| AF190881.1 | PttPIN1 | 614 | 9 | Long | 1 | |
| AF515435.1 | PttPIN2 | 640 | 9 | Long | 1 | |
| AF515434.1 | PttPIN3 | 588 | 9 | Long | 1 | |
| POPTR_0015s04570 | PtrPIN1 | 614 | 9 | Long | 1 | |
| POPTR_0016s03450 | PtrPIN2 | 588 | 8 | Long | 1 | |
| POPTR_0010s12320 | PtrPIN3 | 645 | 9 | Long | 1 | |
| POPTR_0005s20990 | PtrPIN4 | 534 | 9 | Long | 1 | |
| POPTR_0002s07310 | PtrPIN5 | 532 | 8 | Long | 1 | |
| POPTR_0008s12830 | PtrPIN6 | 649 | 9 | Long | 1 | |
| POPTR_0012s04470 | PtrPIN7 | 609 | 9 | Long | 1 | |
| POPTR_0006s03540 | PtrPIN8 | 587 | 9 | Long | 1 | |
| POPTR_0018s13610 | PtrPIN9 | 633 | 9 | Long | 1 | |
| POPTR_0001s21230 | PtrPIN10 | 547 | 10 | Long | 1 | |
| POPTR_0013s08510 | PtrPIN11 | 346 | 9 | Short | 2 | |
| POPTR_0019s07990 | PtrPIN12 | 346 | 10 | Short | 2 | |
| POPTR_0004s12310 | PtrPIN13 | 355 | 8 | Short | 2 | |
| POPTR_0017s11440 | PtrPIN14 | 358 | 8 | Short | 2 | |
| POPTR_0014s14390 | PtrPIN15 | 370 | 8 | Short | 2 | |
| POPTR_0014s14390 | PtrPIN16 | 304 | 6 | Short | 2 | |
| ppa002528m.g | Ppe002528 | 662 | 10 | Long | 1 | |
| ppa002944m.g | Ppe002944 | 619 | 9 | Long | 1 | |
| ppa003159m.g | Ppe003159 | 597 | 8 | Long | 1 | |
| ppa007621m.g | Ppe007621 | 361 | 9 | Short | 2 | |
| ppa021573m.g | Ppe021573 | 357 | 7 | Short | 2 | |
| ppa022797m.g | Ppe022797 | 550 | 9 | Long | 1 | |
| ppa024134m.g | Ppe024134 | 649 | 9 | Long | 1 | |
| ppa025174m.g | Ppe025174 | 602 | 8 | Long | 1 | |
| Rco27985.t000045 | Rc27985 | 544 | 8 | Long | 1 | |
| Rco29662.t000026 | Rc29662 | 635 | 8 | Long | 1 | |
| Rco29816.t000014 | Rc29816 | 646 | 10 | Long | 1 | |
| Rco29822.t000149 | Rc29822 | 313 | 7 | Short | 2 | |
| Rco30128.t000486 | Rc30128 | 357 | 9 | Short | 2 | |
| Rco30180.t000054 | Rc30180 | 613 | 8 | Long | 1 | |
| 234325 | SmPIN1-1 | 625 | 9 | Long | 1 | |
| XM_002990455.1 | SmPIN1-2 | 617 | 9 | Long | 1 | |
| 102666 | SmPIN2-1 | 602 | 9 | Long | 1 | |
| XM_002977411.1 | SmPIN2-2 | 716 | 9 | Long | 1 | |
| 99301 | SmPIN3-1 | 669 | 9 | Long | 1 | |
| XM_002976656.1 | SmPIN3-2 | 672 | 9 | Long | 1 | |
| 119024 | SmPIN4-1 | 687 | 9 | Long | 1 | |
| 231064 | SmPIN5-1 | 636 | 9 | Long | 1 | |
| 268490 | SmPIN5-2 | 625 | 9 | Long | 1 | |
| Sb02g029210 | SbPIN1 | 371 | 9 | Short | 2 | |
| Sb03g029320 | SbPIN2 | 653 | 8 | Long | 1 | |
| Sb03g032850 | SbPIN3 | 362 | 7 | Short | 2 | |
| Sb03g037350 | SbPIN4 | 444 | 10 | Reduced | 3 | |
| Sb03g043960 | SbPIN5 | 336 | 7 | Short | 2 | |
| Sb04g028170 | SbPIN6 | 605 | 10 | Long | 1 | |
| Sb05g002150 | SbPIN7 | 583 | 9 | Long | 1 | |
| Sb07g026370 | SbPIN8 | 402 | 9 | Short | 2 | |
| Sb10g004430 | SbPIN9 | 600 | 10 | Long | 1 | |
| Sb10g008290 | SbPIN10 | 606 | 9 | Long | 1 | |
| Sb10g026300 | SbPIN11 | 626 | 9 | Long | 1 | |
| GSVIVG01025748001 | VvPIN1a | 604 | 8 | Long | 1 | |
| GSVIVG01025749001 | VvPIN1b | 591 | 8 | Long | 1 | |
| GSVIVG01029266001 | VvPIN2 | 630 | 10 | Long | 1 | |
| GSVIVG01019110001 | VvPIN5b | 361 | 9 | Short | 2 | |
| GSVIVG01019126001 | VvPIN5a | 356 | 9 | Short | 2 | |
| GSVIVG01010025001 | VvPIN6 | 532 | 9 | Reduced | 3 | |
| GSVIVG01033005001 | VvPIN8 | 357 | 8 | Short | 2 | |
| GSVIVG01031663001 | VvPIN9 | 463 | 8 | Reuced | 3 | |
| GRMZM2G098643 | ZmPIN1a | 601 | 9 | Long | 1 | |
| GRMZM2G074267 | ZmPIN1b | 595 | 8 | Long | 1 | |
| GRMZM2G149184 | ZmPIN1c | 597 | 8 | Long | 1 | |
| GRMZM2G171702_T01 | ZmPIN1d | 580 | 8 | Long | 1 | |
| JQ421085.1 | ZmPIN2 | 626 | 9 | Long | 1 | |
| GRMZM2G025742 | ZmPIN5a | 382 | 9 | Short | 2 | |
| GRMZM2G148648 | ZmPIN5b | 385 | 7 | Short | 2 | |
| GRMZM2G040911 | ZmPIN5c | 365 | 7 | Short | 2 | |
| GRMZM5G839411 | ZmPIN8 | 359 | 7 | Short | 2 | |
| GRMZM5G859099 | ZmPIN9 | 433 | 10 | Short | 2 | |
| GRMZM2G126260 | ZmPIN10a | 610 | 8 | Long | 1 | |
| GRMZM2G160496 | ZmPIN10b | 581 | 8 | Long | 1 |
These genes are distinct in GenBank but they retrieve the same entry in the phytozome database (www.phytozome.org).
.
| AT1G62380 | AtACO2 | |
| AT1G12010 | AtACO3 | |
| AT1G05010 | AtACO4 | |
| AT1G77330 | AtACO5 | |
| AT1G03400 | AtACO6 | |
| AT2G25450 | AtACO7 | |
| AT2G30830 | AtACO8 | |
| AT2G30840 | AtACO9 | |
| AT3G47190 | AtACO10 | |
| AT3G61400 | AtACO11 | |
| AT5G43440 | AtACO12 | |
| AT5G43450 | AtACO13 | |
| evm.model.supercontig_132.27 | CpACO1 | |
| evm.model.supercontig_64.148 | CpACO2 | |
| Eucgr.K00740 | EgK00740 | |
| Eucgr.K00750 | EgK00750 | |
| Eucgr.K00746 | EgK00746 | |
| Eucgr.K00749 | EgK00749 | |
| Eucgr.K00747 | EgK00747 | |
| Eucgr.C00906 | EgC00906 | |
| Eucgr.C03886 | EgC03886 | |
| Eucgr.F03839 | EgF03839 | |
| Glyma07g39420 | Gm07g39420 | |
| Glyma17g01330 | Gm17g01330 | |
| Glyma09g01110 | Gm09g01110 | |
| Glyma15g11930 | Gm15g11930 | |
| Glyma14g05390 | Gm14g05390 | |
| Glyma02g43560 | Gm02g43560 | |
| Glyma06g12340 | Gm06g12340 | |
| Glyma04g42460 | Gm04g42460 | |
| Glyma05g36310 | Gm05g36310 | |
| Glyma07g15480 | Gm07g15480 | |
| Glyma08g03310 | Gm08g03310 | |
| Gorai.010G184900 | GrACO1 | |
| Gorai.009G182300 | GrACO2 | |
| Gorai.004G062100 | GrACO3 | |
| Gorai.007G170100 | GrACO4 | |
| Gorai.001G096300 | GrACO5 | |
| Gorai.001G096400 | GrACO6 | |
| Gorai.001G011100 | GrACO7 | |
| Gorai.013G107500 | GrACO8 | |
| MDP0000195885 | MdACO1 | |
| MDP0000200737 | MdACO2 | |
| MDP0000725984 | MdACO3 | |
| MDP0000251295 | MdACO4 | |
| MDP0000453114 | MdACO5 | |
| MDP0000025650 | MdACO6 | |
| MDP0000200896 | MdACO7 | |
| LOC_Os02g53180 | Os02g53180 | |
| LOC_Os09g27750 | Os09g27750 | |
| LOC_Os05g05680 | Os05g05680 | |
| LOC_Os05g05670 | Os05g05670 | |
| LOC_Os01g39860 | Os01g39860 | |
| LOC_Os11g08380 | Os11g08380 | |
| LOC_Os06g37590 | Os06g37590 | |
| LOC_Os09g27820 | Os09g27820 | |
| Pp1s191_95V6 | PpACO1 | |
| Pp1s50_26V6 | PpACO2 | |
| Pp1s50_26V6 | PpACO3 | |
| Pp1s180_67V6 | PpACO4 | |
| Pp1s327_42V6 | PpACO5 | |
| MA_2297g0010 | PaACO1 | |
| MA_9554510g0010 | PaACO2 | |
| MA_10431299g0010 | PaACO3 | |
| MA_10437223g0010 | PaACO4 | |
| MA_54476g0010 | PaACO5 | |
| DQ480741 | PgACO1 | |
| DQ480740 | PsiACO1 | |
| ABR17770 | PsiACO2 | |
| CBL95267 | PpiACO1 | |
| GQ258776 | PtdaACO1 | |
| GQ258775 | PtdaACO2 | |
| GQ258774 | PtdaACO3 | |
| AB128037 | PsACO1 | |
| Potri002G224100 | PtACO1 | |
| Potri004G003000 | PtACO2 | |
| Potri011G020900 | PtACO3 | |
| Potri014G159000 | PtACO4 | |
| Potri.002G078600 | PtACO5 | |
| Potri.005G182700 | PtACO6 | |
| Potri.006G151600 | PtACO7 | |
| ABF20554 | PsmACO1 | |
| 116993 | SmACO1 | |
| 407386 | SmACO2 | |
| 169250 | SmACO3 | |
| 228878 | SmACO4 | |
| 117056 | SmACO5 | |
| 402706 | SmACO6 | |
| Solyc06g060070 | SlACO1 | |
| Solyc12g005940 | SlACO2 | |
| Solyc07g049550 | SlACO3 | |
| Solyc07g026650 | SlACO4 | |
| Solyc07g049530 | SlACO5 | |
| Solyc02g081190 | SlACO6 | |
| Solyc02g036350 | SlACO7 | |
| Sb02g026280 | Sb02g026280 | |
| Sb05g005710 | Sb05g005710 | |
| Sb05g005720 | Sb05g005720 | |
| Sb09g003790 | Sb09g003790 | |
| Sb09g003800 | Sb09g003800 | |
| Sb10g022640 | Sb10g022640 | |
| Sb04g034520 | Sb04g034520 | |
| GRMZM2G052422 | Zm2G052422 | |
| GRMZM2G072529 | Zm2G072529 | |
| GRMZM2G126732_T02 | Zm2G126732 | |
| GRMZM2G164883 | Zm2G164883 | |
| GRMZM2G166616 | Zm2G166616 | |
| GRMZM2G166639_T01 | Zm2G166639_T01 | |
| GRMZM2G166639_T02 | Zm2G166639_T02 | |
| GRMZM2G332423 | Zm2G332423 |
Figure 1Different alignments used in this study. Alignments began by translating coding sequences and aligning them with ClustalW2 (“gapopen” option set to 15), and then back-translating them into nucleotides (A,B). From these only the 1st and 2nd codon positions were taken (C), and then all G and C nucleotides converted to “N” (D), which is read as “any nucleotide” in the phylogenetic program TNT. This alignment then had gappy regions removed by the program Gblocks (E) or had all of its GC bases and gaps converted to “?” (F), which is read as “any nucleotide or gap” in TNT. The alignment in B also had all codon positions except those coding for Serine converted to “N”, then the 3rd codon positons removed (G). This also had all Ns and gaps converted to “?” (H).
Figure 2Frequencies of nucleotides (T, C, A, and G) at each codon position (1–3) for 151 . Bars represent 95% confidence intervals.
Figure 3The different morphologies postulated for PIN proteins. PIN proteins are classified into 5 groups according to their length and structure. PINs with two complete TMDs and a long (A), reduced (B), or short (C) central hydrophylic loop. PINs with reduced protein length and presence of a TMD at the C-terminal (D), or N-Terminal end (E) only.
Figure 4Phylogeny of the PIN genes, using only the 1st and 2nd codon positions and with al Gs and Cs converted to “N.” The strict consensus of 85 equally parsimonious trees (A), and the most likely tree under the GTR model (B). Branches are colored according to gene morphology, with parsimony-based historical reconstructions in both trees; equally parsimonious ancestral reconstructions are shown by multi-colored branches. Terminals are colored according to plant taxon, with icons used as guides. Clades recovered under the likelihood optimality criterion (B) which were also recovered under parsimony (A) are noted with the letter “p.”
Figure 5Strict consensuses of equally parsimonious trees using various alignments of the PIN gene data set with only 1st and 2nd codon positions. With Gs and Cs removed, the influence of gaps was minimized so as to mimic how gaps and Ns are treated in likelihood programs by replacing them with “?” (A). Alternatively, the alignment had gappy regions removed using the program Gblocks, and then all Gs, Cs, and remaining gaps replaced by “?” (B). A Serine-only alignment was also used, with only 1st and 2nd positions and all gaps and non-Serine postions replaced with “?” (C).
Figure 6Phylogeny of the The strict consensus of 42 equally parsimonious trees is shown in (A), and the most likely tree under the GTR model is shown in (B). Terminals are colored according to plant taxon. Clades recovered under the likelihood optimality criterion (B) which were also recovered under parsimony (A) are noted with the letter “p.” Scale bar equals per-site mutation probability.
Figure 7The most parsimonious reconstructions of the . Terminals are colored according to plant taxon.
The number of most parsimonious trees found and their lengths, or the log likelihoods, for each of the alignments searched.
| D (Figure | 85 | 11,745 | ||
| D (Figure | −12279.865284 | |||
| F (Figure | 1 | 2960 | ||
| E (Figure | 2 | 843 | ||
| H (Figure | 1 | 494 | ||
| A (Figure | 1 | 22,706 | ||
| D (Figure | 42 | 4215 | ||
| D (Figure | −6064.187817 | |||
| A (Figure | 1 | 8688 |
Alignment codes (D, F, H, etc.) follow those illustrated in Figure 1, and the figure showing the resulting tree is noted.
Alignments, illustrated in Figure 1:
(A) Amino acids.
(D) G and C replaced by N.
(E) Gappy regions removed by g-blocks, remaining gaps replaced by “?.”
(F) G, C, and gaps replaced by “?.”
(H) Non-Serine and gaps replaced by “?.”