| Literature DB >> 18253473 |
Yu Yang1, Song Qin, Fangqing Zhao, Xiaoyuan Chi, Xiaowen Zhang.
Abstract
To elucidate the evolution of cyanobacterial envelopes and the relation between gene content and environmental adaptation, cell envelope structures and components of unicellular and filamentous cyanobacteria were analyzed in comparative genomics. Hundreds of envelope biogenesis genes were divided into 5 major groups and annotated according to their conserved domains and phylogenetic profiles. Compared to unicellular species, the gene numbers of filamentous cyanobacteria expanded due to genome enlargement effect, but only few gene families amplified disproportionately, such as those encoding waaG and glycosyl transferase 2. Comparison of envelope genes among various species suggested that the significant variance of certain cyanobacterial envelope biogenesis genes should be the response to their environmental adaptation, which might be also related to the emergence of filamentous shapes with some new functions.Entities:
Year: 2007 PMID: 18253473 PMCID: PMC2211374 DOI: 10.1155/2007/25751
Source DB: PubMed Journal: Comp Funct Genomics ISSN: 1531-6912
Absolute and relative numbers of envelope related genes in four cyanobacteria. PBR, LBR, EBR, OMP, and OU represent peptidoglycan biosynthesis-related, lipopolysaccharide biosynthesis-related, exopolysaccharide biosynthesis-related, outer membrane proteins coding, and other unknown genes, respectively. The data in the brackets were the percentage of each group within the total envelope-related genes.
| Species | Total | PBR | LBR | EBR | OMP | OU |
|---|---|---|---|---|---|---|
|
| 100 | 29 (29.0%) | 40 (40.0%) | 14 (15.0%) | 16 (15.0%) | 2 (2.0%) |
|
| 186 | 37 (19.9%) | 73 (39.2%) | 28 (15.1%) | 40 (21.5%) | 8 (4.3%) |
|
| 266 | 47 (17.7%) | 90 (33.1%) | 48 (18. 0%) | 63 (23.7%) | 18 (6.8%) |
|
| 294 | 48 (16.3%) | 113 (38.4%) | 61 (20.7%) | 60 (20.4%) | 12 (4.1%) |
waaG homologous genes in Anabaena sp. PCC 7120. Information of the 43 genes was provided.
| NCBI accession | IMG accession | Locus Tag | Product | Position in Genome |
|---|---|---|---|---|
| NP_484203 | 4210510 | Alr0159 | Alr0159 protein | 163382–164575 |
| NP_484204 | 4210520 | All0160 | All0160 protein | 164558–165712 |
| NP_484626 | 4214800 | Alr0582 | Alr0582 protein | 676349–677545 |
| NP_484628 | 4214820 | Alr0584 | Alr0584 protein | 679928–681130 |
| NP_484962 | 4218190 | All0919 | All0919 protein | 1063224–1064513 |
| NP_485043 | 4219010 | Alr1000 | Alr1000 protein | 1171949–1173031 |
| NP_485160 | 4220180 | Alr1117 | Alr1117 protein | 1308038–1309267 |
| NP_485388 | 4222480 | All1345 | All1345 protein | 1596626–1597858 |
| NP_485708 | 4225730 | Alr1668 | Alr1668 protein | 1990621–1991904 |
| NP_486077 | 4229490 | All2037 | All2037 protein | 2435914–2437014 |
| NP_486305 | 4231820 | SqdX | Sulfolipid sulfoquinovosyldiacylglycerol biosynthesis protein | 2725143–2726279 |
| NP_486331 | 4232080 | All2291 | Glycosyltransferase | 2760187–2761173 |
| NP_486332 | 4232090 | All2292 | All2292 protein | 2761170–2762348 |
| NP_486547 | 4234260 | All2507 | All2507 protein | 3008236–3009423 |
| NP_486589 | 4234680 | All2549 | All2549 protein | 3051362–3052363 |
| NP_486760 | 4236410 | All2720 | All2720 protein | 3315625–3316713 |
| NP_486872 | 4237530 | Alr2832 | Alr2832 protein | 3448705–3449793 |
| NP_486879 | 4237600 | Alr2839 | Glycosyltransferase | 3459432–3460577 |
| NP_486904 | 4237850 | Alr2864 | Alr2864 protein | 3488401–3489579 |
| NP_486907 | 4237880 | Alr2867 | Alr2867 protein | 3491419–3492636 |
| NP_487097 | 4239800 | Alr3057 | Alr3057 protein | 3703378–3704592 |
| NP_487098 | 4239810 | Alr3058 | Alr3058 protein | 3704628–3705854 |
| NP_487104 | 4239870 | Alr3064 | Alr3064 protein | 3712759–3714171 |
| NP_487465 | 4243510 | Alr3425 | Alr3425 protein | 4133859–4135025 |
| NP_487738 | 4246270 | HepB | Heterocyst envelope polysaccharide synthesis protein | 4465828–4466997 |
| NP_487739 | 4246280 | Alr3699 | Alr3699 protein | 4467059–4468207 |
| NP_488208 | 4251030 | Alr4168 | Alr4168 protein | 5015231–5016502 |
| NP_488218 | 4251140 | Alr4178 | Alr4178 protein | 5025948–5027096 |
| NP_488463 | 4253590 | All4423 | All4423 protein | 5300887–5302026 |
| NP_488466 | 4253620 | All4426 | All4426 protein | 5304172–5305425 |
| NP_488476 | 4253720 | All4436 | All4436 protein | 5320348–5321526 |
| NP_488534 | 4254300 | Alr4494 | Mannosyltransferase | 5380744–5381811 |
| NP_489234 | 4261400 | All5194 | Glycosyltransferase | 6192395–6193555 |
| NP_489235 | 4261410 | All5195 | Glycosyltransferase | 6193736–6194992 |
| NP_489241 | 4261470 | Alr5201 | Glycosyltransferase | 6201983–6203275 |
| NP_489242 | 4261480 | Alr5202 | Glycosyltransferase | 6203285–6204574 |
| NP_489263 | 4261690 | Alr5223 | Glycosyltransferase | 6236642–6237991 |
| NP_489275 | 4261810 | Alr5235 | Alr5235 protein | 6247505–6248551 |
| NP_489277 | 4261830 | Alr5237 | Alr5237 protein | 6249905–6251158 |
| NP_489278 | 4261840 | Alr5238 | Glycosyltransferase | 6251167–6252315 |
| NP_489279 | 4261850 | Alr5239 | Alr5239 protein | 6252417–6253586 |
| NP_489347 | 4262550 | Alr5307 | Alr5307 protein | 6328387–6329490 |
| NP_489381 | 4262900 | All5341 | All5341 protein | 6373814–6375079 |
Figure 1Multiple sequence alignments of the 43 waaG homologous genes in Anabaena sp. PCC 7120. Only most conserved areas were shown. The number following the genus name was the gene accession in IMG database. NCBI accessions and other information of genes were provided in Table .
Genes encoding GT2 domain in Anabaena sp. PCC 7120. Information of the 36 genes was provided.
| NCBI Accession | IMG Accession | Locus Tag | Product | Position in Genome |
|---|---|---|---|---|
| NP_484086 | 4209330 | all0042 | All0042 protein | 44511–45458 |
| NP_484118 | 4209650 | alr0074 | Alr0074 protein | 78171–79187 |
| NP_484187 | 4210350 | all0143 | All0143 protein | 148503–149681 |
| NP_484819 | 4216740 | alr0776 | Alr0776 protein | 899704–900894 |
| NP_484957 | 4218140 | all0914 | All0914 protein | 1057871–1058884 |
| NP_484958 | 4218150 | all0915 | All0915 protein | 1058947–1059852 |
| NP_485777 | 4226430 | all1737 | All1737 protein | 2088106–2089074 |
| NP_485802 | 4226680 | all1762 | All1762 protein | 2117622–2118518 |
| NP_485806 | 4226720 | all1766 | All1766 protein | 2121006–2122007 |
| NP_485807 | 4226730 | all1767 | All1767 protein | 2122000–2123007 |
| NP_485926 | 4227930 | all1886 | All1886 protein | 2252568–2253362 |
| NP_486328 | 4232050 | all2288 | Glucosyltransferase | 2756810–2757841 |
| NP_486329 | 4232060 | all2289 | Glucosyltransferase | 2757927–2758916 |
| NP_486448 | 4233260 | alr2408 | Alr2408 protein | 2888194–2888949 |
| NP_486868 | 4237490 | alr2828 | Alr2828 protein | 3444428–3445441 |
| NP_486876 | 4237570 | alr2836 | Putative glycosyl transferase | 3456248–3457216 |
| NP_486877 | 4237580 | alr2837 | Glycosyltransferase | 3457336–3458310 |
| NP_486880 | 4237610 | alr2840 | Glycosyltransferase | 3460577–3461524 |
| NP_486906 | 4237870 | alr2866 | Glycosyltransferase | 3490561–3491400 |
| NP_487103 | 4239860 | alr3063 | Alr3063 protein | 3711770–3712762 |
| NP_487109 | 4239920 | alr3069 | Alr3069 protein | 3718782–3719963 |
| NP_487110 | 4239930 | alr3070 | Alr3070 protein | 3719986–3720942 |
| NP_487111 | 4239940 | alr3071 | Alr3071 protein | 3720982–3721938 |
| NP_487113 | 4239960 | alr3073 | Alr3073 protein | 3723391–3724365 |
| NP_487216 | 4241000 | alr3176 | Alr3176 protein | 3844812–3845753 |
| NP_487217 | 4241010 | alr3177 | Alr3177 protein | 3845774–3846715 |
| NP_487420 | 4243050 | alr3380 | Dolichol-phosphate mannosyltransferase | 4091498–4092511 |
| NP_488471 | 4253670 | all4431 | Glycosyl transferase | 5310064–5311017 |
| NP_488532 | 4254280 | alr4492 | Alr4492 protein | 5378788–5379816 |
| NP_488897 | 4257980 | all4857 | All4857 protein | 5785088–5786275 |
| NP_488973 | 4258750 | all4933 | All4933 protein | 5886142–5887548 |
| NP_489142 | 4260480 | all5102 | All5102 protein | 6079688–6080410 |
| NP_489158 | 4260640 | all5118 | All5118 protein | 6114366–6115355 |
| NP_489280 | 4261860 | alr5240 | Glycosyltransferase | 6253630–6254397 |
| NP_489382 | 4262910 | all5342 | All5342 protein | 6375223–6376452 |
| NP_489383 | 4262920 | all5343 | All5343 protein | 6376587–6377849 |
Genes encoding GT2 domain in Trichodesmium erythraeum IMS101. Information of the 27 genes was provided.
| NCBI accession | IMG accession | Locus Tag | Product | Position in Genome |
|---|---|---|---|---|
| YP_720085 | 636810880 | Tery_0115 | Glycosyl transferase, family 2 | 155085–157763 |
| YP_720116 | 636811045 | Tery_0148 | Glycosyl transferase, family 2 | 217777–218829 |
| YP_720694 | 636814360 | Tery_0804 | Glycosyl transferase, family 2 | 1279953–1280891 |
| YP_720758 | 636814755 | Tery_0883 | Glycosyl transferase, family 2 | 1403156–1404088 |
| YP_720935 | 636815825 | Tery_1097 | Glycosyl transferase, family 2 | 1725763–1726743 |
| YP_721031 | 636816345 | Tery_1201 | Glycosyl transferase, family 2 | 1875929–1876612 |
| YP_721128 | 636817045 | Tery_1340 | Glycosyl transferase, family 2 | 2040749–2041705 |
| YP_721156 | 636817205 | Tery_1372 | Glycosyl transferase, family 2 | 2104828–2106021 |
| YP_721969 | 636821740 | Tery_2268 | Glycosyl transferase, family 2 | 3529656–3534458 |
| YP_722405 | 636824155 | Tery_2749 | Glycosyl transferase, family 2 | 4257314–4258822 |
| YP_722496 | 636824655 | Tery_2849 | Glycosyl transferase, family 2 | 4430305–4432779 |
| YP_722503 | 636824690 | Tery_2856 | Glycosyl transferase, family 2 | 4447185–4448186 |
| YP_722586 | 636825160 | Tery_2950 | Glycosyl transferase, family 2 | 4584744–4585874 |
| YP_722664 | 636825630 | Tery_3040 | Glycosyl transferase, family 2 | 4692416–4693294 |
| YP_722816 | 636826565 | Tery_3225 | Glycosyl transferase, family 2 | 4937986–4938924 |
| YP_722946 | 636827300 | Tery_3371 | Glycosyl transferase, family 2 | 5168831–5170021 |
| YP_722999 | 636827610 | Tery_3433 | Glycosyl transferase, family 2 | 5251339–5252268 |
| YP_723000 | 636827615 | Tery_3434 | Glycosyl transferase, family 2 | 5252486–5253415 |
| YP_723155 | 636828495 | Tery_3609 | Glycosyl transferase, family 2 | 5550523–5551482 |
| YP_723304 | 636829395 | Tery_3784 | Dolichyl-phosphate beta-D-mannosyltransferase | 5816905–5817705 |
| YP_723576 | 636830965 | Tery_4095 | Glycosyl transferase, family 2 | 6315001–6316008 |
| YP_723603 | 636831105 | Tery_4122 | Glycosyl transferase, family 2 | 6360766–6361701 |
| YP_723606 | 636831120 | Tery_4125 | Glycosyl transferase, family 2 | 6363768–6364736 |
| YP_723897 | 636832695 | Tery_4437 | Glycosyl transferase, family 2 | 6839236–6842421 |
| YP_724037 | 636833455 | Tery_4588 | Glycosyl transferase, family 2 | 7057924–7058847 |
| YP_724197 | 636834370 | Tery_4771 | Glycosyl transferase, family 2 | 7329873–7332980 |
| YP_724341 | 636835285 | Tery_4954 | Glycosyl transferase, family 2 | 7547130–7548305 |
Figure 2Multiple sequence alignments of homologous genes encoding glycosyl transferase 2 (GT2) domains in Trichodesmium erythraeum IMS101 (27 genes) and Anabaena sp. PCC 7120 (36 genes). Only most conserved areas were shown. The number following the genus name was the gene accession in IMG database. NCBI accession and other information of genes were provided in Tables 3 and 4.
FAS1-containing genes from Trichodesmium erythraeum IMS101, Anabaena sp. PCC 7120, and other 15 species.
| NCBI Accession | IMG Accession | Gene | Species |
|---|---|---|---|
| NP_485363 | 4222220 | Alr1320 Alr1320 protein |
|
| NP_485859 | 4227250 | Alr1819 Alr1819 protein |
|
| NP_487837 | 4247260 | All3797 All3797 protein |
|
| NP_488687 | 4255850 | All4647 All4647 protein |
|
| NP_488934 | 4258350 | All4894 All4894 protein |
|
| NP_489304 | 4262100 | All5264 All5264 protein |
|
| YP_722947 | 636827305 | Tery_3372 beta-Ig-H3/fasciclin |
|
| YP_722948 | 636827310 | Tery_3373 beta-Ig-H3/fasciclin |
|
| AAF02137 | — | Unknown protein |
|
| CAF32145 | — | Fasciclin I family protein, putative |
|
| EAQ86204 | — | Hypothetical protein CHGG_07457 |
|
| EAM48409 | — | Beta-Ig-H3/fasciclin |
|
| AAW46332 | — | Hypothetical protein CNK01730 |
|
| CAI83309.1 | — | Fasciclin domain protein |
|
| EAS19928 | — | Putative cell adhesion protein, fasciclin domain |
|
| AAO92753 | — | Arabinogalactan protein |
|
| BAC65875 | — | Putative membrane-associated or secreted protein |
|
| AAM05399 | — | Hypothetical protein MA_1996 |
|
| ZP_00108174 | — | COG2335 |
|
| CAH58718 | — | Fasciclin-like protein precursor |
|
| CAA20163 | — | Putative secreted protein |
|
| AAB62187 | — | Putative secreted protein MPB70 |
|
| AAC49869 | — | Endosperm specific protein |
|
Figure 3The phylogenetic tree of genes containing FAS1 domain in 17 species. Besides Trichodesmium erythraeum IMS101 and Anabaena sp. PCC 7120, other 15 species were from cyanobacteria, archaebacteria, eubacteria, actinomycetes, yeast, filamentous fungi, and vascular plants. To keep the figure clear and direct, the species were written in their genus name for short. The detailed information was described in Section 3 and Table .