| Literature DB >> 32025382 |
Yongguo Zhao1,2, Hua Kong2, Yunling Guo2, Zhi Zou2.
Abstract
The Lhc (light-harvesting chlorophyll a/b-binding protein) superfamily represents a class of antennae proteins that play indispensable roles in capture of solar energy as well as photoprotection under stress conditions. Despite their importance, little information has been available beyond model plants. In this study, we presents a first genome-wide analysis of Lhc superfamily genes in jatropha (Jatropha curcas L., Euphorbiaceae), an oil-bearing plant for biodiesel purpose. A total of 27 members were identified from the jatropha genome, which were shown to distribute over nine out of the 11 chromosomes. The superfamily number is comparable to 28 present in castor (Ricinus communis, Euphorbiaceae), but relatively less than 35 in cassava (Manihot esculenta, Euphorbiaceae) and 34 in arabidopsis (Arabidopsis thaliana) that experienced one or two recent whole-genome duplications (WGDs), respectively. In contrast to a high number of paralogs present in cassava and arabidopsis, few duplicates were found in jatropha as observed in castor, corresponding to no recent WGD occurred in these two species. Nevertheless, 26 orthologous groups representing four defined families were found in jatropha, and nearly one-to-one orthologous relationship was observed between jatropha and castor. By contrast, a novel group named SEP6 was shown to have been lost in arabidopsis. Global transcriptome profiling revealed a predominant expression pattern of most JcLhc superfamily genes in green tissues, reflecting their key roles in photosynthesis. Moreover, their expression profiles upon hormones, drought, and salt stresses were also investigated. These findings not only improve our knowledge on species-specific evolution of the Lhc supergene family, but also provide valuable information for further studies in jatropha. ©2020 Zhao et al.Entities:
Keywords: Evolution; Harvesting chlorophyll a/b-binding protein; Lhc superfamily; Synteny analysis; Arabidopsis thaliana; Expression profile; Jatropha curcas; Manihot esculenta; Ricinus communis; Whole-genome duplication
Year: 2020 PMID: 32025382 PMCID: PMC6993755 DOI: 10.7717/peerj.8465
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
27 Lhc superfamily genes identified in jatropha.
| Nucleotide length (bp, from start to stop codons) | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Lhca | JCGZ_23938 | scaffold794:74368-75803(−) | 738 | 1,076 | 3 | 0 | 245 | 45 | 3 | RcLhca1 | MeLhca1.1 | AtLhca1 | Lhca1 | |
| JCGZ_17961 | scaffold502:3176588-3178724(+) | 813 | 1,491 | 4 | 1 | 270 | 58 | 3 | RcLhca2 | MeLhca2.1 | AtLhca2 | Lhca2 | ||
| JCGZ_15032 | scaffold42:178242-179765(−) | 816 | 1,171 | 2 | 8 | 271 | 38 | 3 | RcLhca3 | MeLhca3 | AtLhca3 | Lhca3 | ||
| JCGZ_11643 | scaffold328:2197266-2195814(−) | 750 | 917 | 2 | 2 | 249 | 48 | 3 | RcLhca4 | MeLhca4.1 | AtLhca4 | Lhca4 | ||
| JCGZ_04265 | scaffold159:84206-85704(+) | 795 | 1,255 | 5 | 0 | 264 | 57 | 3 | RcLhca5 | MeLhca5 | AtLhca5 | Lhca5 | ||
| JCGZ_07509 | scaffold211:3089530-3091440(−) | 792 | 1,612 | 4 | 0 | 263 | 42 | 3 | RcLhca6 | MeLhca6 | AtLhca6 | Lhca6 | ||
| Lhcb | JCGZ_04588 | scaffold160:101030-101541(+) | 798 | 798 | 0 | 28 | 265 | 35 | 3 | RcLhcb1.1 | MeLhcb1.1 | AtLhcb1.1 | Lhcb1 | |
| JCGZ_04587 | scaffold160:97668-99902(−) | 798 | 798 | 0 | 17 | 265 | 35 | 3 | RcLhcb1.1 | MeLhcb1.1 | AtLhcb1.1 | Lhcb1 | ||
| JCGZ_18481 | scaffold529:239615-242657(−) | 798 | 2,225 | 1 | 10 | 265 | 37 | 3 | RcLhcb2 | MeLhcb2.1 | AtLhcb2.1 | Lhcb2 | ||
| JCGZ_00703 | scaffold108:360030-361580(−) | 804 | 1,202 | 2 | 5 | 267 | 44 | 3 | RcLhcb3 | MeLhcb3 | AtLhcb3 | Lhcb3 | ||
| JCGZ_25025 | scaffold843:295347-296708(+) | 858 | 957 | 1 | 8 | 285 | 31 | 3 | RcLhcb4 | MeLhcb4 | AtLhcb4.1 | Lhcb4 | ||
| JCGZ_01281 | scaffold11:738853-741122(+) | 840 | 1,786 | 1 | 0 | 279 | 31 | 3 | RcLhcb8 | MeLhcb8 | AtLhcb8 | Lhcb8 | ||
| JCGZ_06701 | scaffold200:160436-162429(−) | 876 | 1,573 | 5 | 1 | 291 | 41 | 3 | RcLhcb5 | MeLhcb5 | AtLhcb5 | Lhcb5 | ||
| JCGZ_20203 | scaffold645:375633-376811(+) | 765 | 844 | 1 | 1 | 254 | 49 | 3 | RcLhcb6 | MeLhcb6 | AtLhcb6 | Lhcb6 | ||
| JCGZ_08016 | scaffold221:188978-192485(−) | 1,014 | 3,050 | 5 | 0 | 337 | 52 | 3 | RcLhcb7 | MeLhcb7 | AtLhcb7 | Lhcb7 | ||
| PsbS | JCGZ_12094 | scaffold339:857142-859344(−) | 825 | 1,726 | 3 | 2 | 274 | 62 | 4 | RcPsbS | MePsbS | AtPsbS | PsbS | |
| ELIP | JCGZ_02231 | scaffold119:1376949-1378080(−) | 585 | 808 | 2 | 6 | 194 | 86 | 3 | RcELIP | MeELIP | AtELIP1 | ELIP | |
| OHP | JCGZ_09105 | scaffold250:2426033-2426931(−) | 357 | 552 | 2 | 2 | 118 | 48 | 1 | RcOHP1 | MeOHP1 | AtOHP1 | OHP1 | |
| JCGZ_08332 | scaffold224:432321-435041(+) | 552 | 2,339 | 1 | 0 | 183 | 47 | 1 | RcOHP2 | MeOHP2.1 | AtOHP2 | OHP2 | ||
| SEP | JCGZ_23193 | scaffold7:432788-436707(−) | 438 | 3,498 | 3 | 3 | 145 | 71 | 2 | RcSEP1 | MeSEP1 | AtSEP1 | SEP1 | |
| JCGZ_09398 | scaffold255:592063-593949(+) | 582 | 1,322 | 1 | 0 | 193 | 46 | 2 | RcSEP2 | MeSEP2 | AtSEP2 | SEP2 | ||
| JCGZ_03488 | scaffold137:779011-781250(+) | 780 | 1,919 | 2 | 4 | 259 | 100 | 2 | RcSEP3 | MeSEP3.1 | AtSEP3.1 | SEP3 | ||
| JCGZ_26324 | scaffold906:2486216-2487759(−) | 759 | 932 | 2 | 0 | 252 | 88 | 2 | RcSEP6 | MeSEP6 | – | SEP6 | ||
| JCGZ_06634 | scaffold20:991705-992601(+) | 570 | 570 | 0 | 0 | 189 | 56 | 2 | RcSEP4 | MeSEP4 | AtSEP4 | SEP4 | ||
| JCGZ_07816 | scaffold211:5206555-5210867(−) | 447 | 3,909 | 4 | 0 | 148 | 75 | 1 | RcSEP5 | MeSEP5 | AtSEP5 | SEP5 | ||
| Psb33 | JCGZ_17235 | scaffold5:830004-833169(+) | 867 | 2,803 | 2 | 1 | 288 | 62 | 1 | RcPsb33 | MePsb33.1 | AtPsb33 | Psb33 | |
| FCII | JCGZ_24260 | scaffold813:61000-71385(−) | 1,500 | 9,600 | 9 | 0 | 499 | 86 | 1 | RcFCII | MeFCII | AtFCII | FCII | |
Notes.
amino acid
Arabidopsis thaliana
base pair
coding sequence
expressed sequence tag
Manihot esculenta
orthologous group
Ricinus communis
transmembrane helix
transit peptide)
Figure 1Chromosomal locations of JcLhc superfamily genes.
Chromosome serial numbers are indicated at the top of each chromosome. Chr: chromosome.
Figure 2Exon-intron structures of JcLhc superfamily genes.
The graphic representation of the gene models is displayed using GSDS. GSDS: gene structure display server.
Figure 3Phylogenetic and conserved motif analyses of jatropha, castor, cassava, and arabidopsis Lhc superfamily proteins.
(A) Phylogenetic analysis of Lhca, Lhcb, PsbS, ELIP, OHP, SEP, Psb33, and FCII subfamilies; (B) Distribution of conserved motifs. Sequence alignment was performed using MUSCLE and unrooted phylogenetic trees were constructed using MEGA7 (maximum likelihood method; bootstrap, 1,000 replicates). Only bootstrap values at nodes supported by a posterior probability of ≥50% are given. The distance scale denotes the number of amino acid substitutions per site. The name of each OG is indicated next to the corresponding group. OG: orthologous group.
Detailed information of 25 motifs identified in this study.
Motifs were identified using MEME.
| Motif 1 | 2.5e−1,988 | 205 | 21 | EJINGRLAMLGFLGFLVQEIL |
| Motif 2 | 6.2e−1,170 | 24 | 60 | ARNGVKFGEAVWFKAGAQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRVAG |
| Motif 3 | 9.4e−992 | 69 | 23 | YLDGELPGDYGFDPAGLSADPET |
| Motif 4 | 8.6e−894 | 64 | 21 | TGKGPJENLADHLADPVHNNI |
| Motif 5 | 2.6e−582 | 58 | 21 | GSFDPLGLADDPEAFAELKVK |
| Motif 6 | 3.0e−546 | 40 | 29 | TLFVIELJLIGYVEFRRWADLDNPGSVYP |
| Motif 7 | 1.9e−374 | 32 | 21 | SPWYGPDRVKYLGPFSGETPS |
| Motif 8 | 4.2e−331 | 8 | 73 | KFVDPRWIGGTWDLKQFZKDGKTDWDAVIDAEAKRRKWLEENPESSSNDEPVVFDTSIIPWWAWIKRYHLPEA |
| Motif 9 | 1.4E−216 | 9 | 41 | QNLAKNVAGDIIGTRTEAADVKSTPFQPYSEVFGLQRFREC |
| Motif 10 | 3.5E−205 | 12 | 48 | KTLLFVAVAGVLLIRKNEDIETLKKLLDETTLYDKQWQATWKDZNPSS |
| Motif 11 | 9E−200 | 5 | 80 | IENRSPAEGAYSEGLJNAKLTQDGCIVCPTTDSTFDLRTGAIKDWYPKNPVLRVLTPALRTLYVYPVKTDEENIYISLRG |
| Motif 12 | 1E−170 | 4 | 80 | KVYVGMRYWHPFTEEAIEQIKRDGITKLVVLPLYPQFSISTSGSSLRLLESIFREDEYLVNMQHTVIPSWYQREGYIKAM |
| Motif 13 | 8.1E–166 | 13 | 29 | DVGYPGGLWFDPLGWGSGSPEKVKELRTK |
| Motif 14 | 3.6E–159 | 27 | 15 | SWYDAGKVEYFAGSS |
| Motif 15 | 1.8E–154 | 25 | 21 | TVCVKADPDRPLWFPGSTPPE |
| Motif 16 | 5.4E–153 | 24 | 11 | WAYATNFVPGK |
| Motif 17 | 7.6E–149 | 20 | 21 | PSAPEVMGNGRVTMRKTVKKA |
| Motif 18 | 8.6E–142 | 12 | 41 | TGKGLLAQLNJETGJPIYELEPLVLFNVLFALFAAINASKD |
| Motif 19 | 3.6E–140 | 80 | 11 | MATSTLAASSS |
| Motif 20 | 6.1E–140 | 4 | 80 | GAFHFIEPVWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQALLMVGPEYARYCGIEALEPLGIYLPGDINYPGG |
| Motif 21 | 3.4E–142 | 8 | 57 | GPEPLLGVZPFLINLLADPVIERLPYVGAFLVKPLEAFISLVPLPKVEEGLASYGGG |
| Motif 22 | 2E–98 | 5 | 53 | PPPKPAAQVALDDKNVITLEFQRQKAKELQEYFKQKKLEETDQGPFFGFLGKN |
| Motif 23 | 4.3E–98 | 23 | 11 | PLGEVTDPJYP |
| Motif 24 | 4.6E–92 | 4 | 57 | AYQSRVGPVEWLKPYTDETIIELGRKGVKNLLAVPISFVSEHIETLEEIDVEYKELA |
| Motif 25 | 1.4E–106 | 10 | 29 | NWVPAVPLAALPGGZATYJGQPVPTGLLP |
Figure 4Tissue-specific expression profiles of JcLhc superfamily genes.
Color scale represents FPKM normalized log10 transformed counts where navy indicates low expression and firebrick3 indicates high expression. FPKM: fragments per kilobase of exon per million fragments mapped; IND, undifferentiated inflorescence of 0.5 cm diameter; PID1, female flower with carpel primordia beginning to differentiate; PID2, female flower with three distinct carpels formed; STD1, male flower with stamen primordia beginning to differentiate; STD2, male flower with ten complete stamens formed.
Figure 5Expression profiles of JcLhc superfamily genes upon drought, salt, BA, or GA treatments.
Color scale represents RPKM normalized log10 transformed counts where navy indicates low expression and firebrick3 indicates high expression. RPKM, Reads per kilobase per million mapped reads.