| Literature DB >> 30692573 |
Erxi Liu1,2, Chaozhu Yang2, Jiangdong Liu1, Surong Jin3, Nunung Harijati4, Zhongli Hu1, Ying Diao1, Lingling Zhao5.
Abstract
Amorphophallus (Araceae) contains more than 170 species that are mainly distributed in Asia and Africa. Because the bulbs of Amorphophallus are rich in glucomannan, they have been widely used in food, medicine, the chemical industry and so on. To better understand the evolutionary relationships and mutation patterns in the chloroplast genome of Amorphophallus, the complete chloroplast genomes of four species were sequenced. The chloroplast genome sequences of A. albus, A. bulbifer, A. konjac and A. muelleri ranged from 162,853 bp to 167,424 bp. The A. albus chloroplast (cp) genome contains 113 genes, including 79 protein-coding genes, 30 tRNA genes and 4 rRNA genes. The A. bulbifer cp genome contains 111 genes, including 78 protein-coding genes, 29 tRNA genes and 4 rRNA genes. A. muelleri contains 111 and 113 genes, comprising 78 and 80 protein-coding genes, respectively, 29 tRNA genes and 4 rRNA genes. The IR (inverted repeat) region/LSC (long single copy) region and IR/SSC (short single copy) region borders of the four Amorphophallus cp genomes were compared. In addition to some genes being deleted, variations in the copy numbers and intron numbers existed in some genes in the four cp genomes. One hundred thirty-four to 164 SSRs (simple sequence repeats) were detected in the four cp genomes. In addition, the highest mononucleotide SSRs were composed of A and T repeat units, and the majority of dinucleotides were composed of AT and TA. SNPs (single nucleotide polymorphisms) and indels (insertion-deletions) were calculated from coding genes and noncoding genes, respectively. These divergences comprising SSRs, SNPs and indel markers will be useful in testing the maternal inheritance of the chloroplast genome, identifying species differentiation and even in breeding programs. Furthermore, the regression of ndhK was detected from four Amorphophallus cp genomes in our study. Complete cp genome sequences of four Amorphophallus species and other plants were used to perform phylogenetic analyses. The results showed that Amorphophallus was clustered in Araceae, and Amorphophallus was divided into two clades; A. albus and A. konjac were clustered in one clade, and A. bulbifer and A. muelleri were clustered in another clade. Phylogenetic analysis among the Amorphophallus genus was conducted based on matK and rbcL. The phylogenetic trees showed that the relationships among the Amorphophallus species were consistent with their geographical locations. The complete chloroplast genome sequence information for the four Amorphophallus species will be helpful for elucidating Amorphophallus phylogenetic relationships.Entities:
Mesh:
Year: 2019 PMID: 30692573 PMCID: PMC6349887 DOI: 10.1038/s41598-018-37456-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Gene maps of the four Amorphophallus cp genomes. (A) A. albus, (B) A. bulbifer, (C) A. konjac, (D) A. muelleri. The annotation of the genome was performed using DOGMA. The genes that are drawn on the outside of the circle are transcribed clockwise, while those inside are transcribed counter clockwise. Genes belonging to different functional groups are color coded. Small single copy (SSC), large single copy (LSC), and inverted repeats (IRa, IRb) are indicated.
Summary of the sequencing data for the four Amorphophallus species.
| Species | Raw reads no. | Clean reads no. | Gene no. | Protein coding genes no. | tRNA genes no. | rRNA genes no. | Cp genome length (bp) | LSC length (bp) | IRa length (bp) | SSC length (bp) | IRbLength (bp) | GC content (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Amorphophallus albus | 7383690 | 6572597 | 113 | 79 | 30 | 4 | 166867 | 92249 | 25926 | 22766 | 25926 | 35.47 |
| Amorphophallus bulbifer | 8590079 | 7645378 | 111 | 78 | 29 | 4 | 162853 | 90467 | 25379 | 21628 | 25379 | 35.90 |
| Amorphophallus konjac | 7282525 | 6532968 | 111 | 78 | 29 | 4 | 167424 | 92660 | 25973 | 22839 | 26012 | 35.39 |
| Amorphophallus muelleri | 10581954 | 8610905 | 113 | 80 | 29 | 4 | 164669 | 90789 | 26120 | 21640 | 26120 | 35.69 |
List of genes encoded by the four Amorphophallus chloroplast genomes.
| Category for genes | Group of genes | Name of genes |
|---|---|---|
| Self-replication | rRNA genes | rrn16a, rrn23a, rrn4.5a, rrn5a |
| tRNA genes | trnA-UGC*,a, trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnfM-CAU, trnG-GCC (Aa, Ab, Am), trnG-UCC, trnH-GUG, trnI-CAU a, trnI-GAU *,a, trnK-UUU, trnL-CAA a (Aa, Ak), trnL-UAA*, trnL-UAG, trnM-CAU a, trnN-GUU a, trnP-GGG, trnP-UGG, trnQ-UUG, trnR-ACGa, trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-UGU, trnT-GGU (Aa, Ab a, Ak, Am a), trnV-GAC,a, trnV-UAC*, trnW-CCA, trnY-GUA | |
| Small subunit of ribosome | rps2, rps3, rps4, rps7a, rps8, rps11, rps12 b, rps14, rps15, rps16, rps18, rps19 | |
| Large subunit of ribosome | rpl2 (Aa*, Ab*, a, Am*, a, Ak*, a), rpl20, rpl23 (Aa*, Aba, Am*, a, Ak*, a), rpl33, rpl36, rpl14, rpl16, rpl22, rpl32 | |
| RNA polymerase | rpoA, rpoB, rpoC1*, rpoC2 | |
| Phytosynthesis | NADH-dehydrogenase | ndhA a, ndhB*, a, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhK, ndhJ psbG, |
| Photosystem I | psaA, psaB, psaC, psaI, psaJ, ycf3** | |
| Photosystem II | psbA, psbB, psbC psbD, psbE (Aa, Ab, Am), psbF (Aa, Ab, Am, Ak*), psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ | |
| Cytochrome b/f complex | petA, petB, petD, petL, petG, petN, | |
| ATP synthase | atpA, atpB, atpE, atpF*, atpH, atpI | |
| Large subunit to rubisco | rbcL | |
| Other genes | Maturase | matK |
| Protease | clpP(Aa**, Ab, Am, Ak**) | |
| Envelope membrane protein | cemA | |
| Subunit of Acetyl-CoA-Carboxylase | ||
| c-type cytochrome synthesis gene | ccsA | |
| Translational initiation factor | infA** | |
| Genes of unknown function | Open Reading Frames | ycf1 (Aa, Ak, Am), ycf2 (Aaa, Aba, Am*, a, Aka), ycf4, |
| Putative pseudogenes | ycf15 (Aa*, a, Aba, Am*, a, Aka), ycf68 a |
aGene with two copies; bGene with three copies; *Gene with one intron; **Gene with two introns.
( )Gene existed in which species cp genome as well as copy number and intron number in each cp genome.
Aa, Amorphophallus albus cp genome; Ab, Amorphophallus bulbifer cp genome; Ak, Amorphophallus konjac cp genome; Am, Amorphophallus muelleri cp genome.
Summary of genes estimate deletion in the four Amorphophallus cp genomes.
| Gene | Species (cp genome) | Region | |||
|---|---|---|---|---|---|
|
| + | − | + | + | SSC |
|
| IRA+; IRB− | + | + | + | IR |
|
| IRA+; IRB− | + | + | + | IR |
|
| + | − | + | − | IR |
|
| + | + | − | + | LSC |
|
| − | − | − | + | LSC |
|
| + | + | − | + | LSC |
+ Gene existing. − Gene estimate deletion.
Simple sequence repeats (SSRs) in the four Amorphophallus species cp genomes.
| Species | SSR loci no. | P1 loci no. | P2 loci no. | P3 loci no. | P4 loci no. | P5 loci no. | P6 loci no. |
|---|---|---|---|---|---|---|---|
| Amorphophallus albus | 163 | 86 | 49 | 27 | 1 | / | / |
| Amorphophallus bulbifer | 134 | 94 | 25 | 14 | 1 | / | / |
| Amorphophallus konjac | 164 | 82 | 51 | 30 | 1 | / | / |
| Amorphophallus muelleri | 144 | 80 | 45 | 15 | 1 | / | 3 |
Comparisons of mutation changes, number of synonymous (S) and nonsynonymous (N) substitutions per gene of protein coding cp genes among A. bulbifer, A. konjac and A. muelleri.
| Gene |
|
|
| Region | ||||
|---|---|---|---|---|---|---|---|---|
| S | N | S | N | S | N | |||
| Photosynthetic apparatus |
| 9 | 0 | 0 | 0 | 9 | 0 | LSC |
|
| 7 | 2 | 0 | 0 | 8 | 2 | LSC | |
|
| 2 | 1 | 0 | 0 | 2 | 1 | LSC | |
|
| 3 | 1 | 0 | 0 | 3 | 1 | LSC | |
|
| 2 | 0 | 1 | 0 | 2 | 0 | LSC | |
|
| 2 | 5 | 1 | 0 | 2 | 5 | LSC | |
|
| 2 | 0 | 0 | 0 | 2 | 0 | LSC | |
|
| 3 | 1 | 0 | 0 | 3 | 1 | LSC | |
|
| 1 | 0 | 1 | 0 | 1 | 0 | LSC | |
|
| 1 | 0 | 0 | 0 | 1 | 0 | LSC | |
|
| 10 | 4 | 0 | 0 | 10 | 4 | LSC | |
|
| 10 | 2 | 2 | 2 | 9 | 2 | LSC | |
|
| 1 | 0 | 0 | 0 | 2 | 0 | SSC | |
|
| 1 | 0 | 0 | 0 | 1 | 0 | LSC | |
|
| 0 | 1 | 0 | 0 | 0 | 0 | LSC | |
|
| 3 | 3 | 1 | 0 | 3 | 3 | LSC | |
|
| 3 | 0 | 0 | 0 | 3 | 0 | LSC | |
|
| 2 | 1 | 0 | 0 | 3 | 1 | LSC | |
|
| 1 | 0 | 1 | 0 | 1 | 0 | LSC | |
|
| 0 | 0 | 0 | 0 | 0 | 1 | LSC | |
|
| 3 | 1 | 0 | 0 | 1 | 1 | LSC | |
|
|
|
|
|
|
|
| ||
| Photosynthetic metabolism |
| 9 | 5 | 0 | 1 | 8 | 6 | LSC |
|
| 7 | 5 | 0 | 0 | 8 | 5 | LSC | |
|
| 2 | 1 | 0 | 0 | 2 | 1 | LSC | |
|
| 0 | 4 | 0 | 0 | 0 | 4 | LSC | |
|
| 1 | 1 | 0 | 0 | 1 | 1 | LSC | |
|
| 3 | 3 | 2 | 1 | 3 | 3 | LSC | |
|
| 10 | 12 | 2 | 4 | 10 | 12 | SSC | |
|
| 1 | 2 | 1 | 1 | 1 | 2 | LSC | |
|
| 17 | 6 | 2 | 0 | 16 | 5 | SSC | |
|
| 4 | 0 | 0 | 0 | 4 | 0 | SSC | |
|
| 13 | 14 | 2 | 4 | 11 | 16 | SSC | |
|
| 2 | 4 | 1 | 2 | 2 | 5 | SSC | |
|
| 11 | 7 | 3 | 0 | 12 | 6 | SSC | |
|
| 3 | 1 | 0 | 0 | 3 | 1 | SSC | |
|
| 1 | 0 | 0 | 0 | 0 | 0 | LSC | |
|
| 12 | 2 | 2 | 0 | 12 | 2 | LSC | |
|
|
|
|
|
|
|
| ||
| Gene expression |
| 3 | 5 | 0 | 0 | 2 | 5 | LSC |
|
| 10 | 23 | 1 | 3 | 11 | 22 | LSC | |
|
| 18 | 41 | 3 | 8 | 17 | 41 | LSC | |
|
| 12 | 13 | 1 | 2 | 14 | 13 | LSC | |
|
| 4 | 30 | 1 | 1 | 4 | 31 | LSC | |
|
| 1 | 40 | 0 | 1 | 1 | 39 | LSC | |
|
| 3 | 22 | 0 | 1 | 3 | 24 | LSC | |
|
| 1 | 5 | 0 | 0 | 1 | 5 | LSC | |
|
| 7 | 30 | 0 | 0 | 7 | 33 | LSC | |
|
| 2 | 5 | 0 | 0 | 1 | 5 | LSC | |
|
| 2 | 11 | 0 | 1 | 2 | 12 | LSC | |
|
| 0 | 13 | 0 | 5 | 0 | 13 | SSC | |
|
| 0 | 6 | 0 | 0 | 0 | 8 | LSC | |
|
| 1 | 14 | 0 | 3 | 1 | 14 | LSC | |
|
| 2 | 27 | 1 | 1 | 3 | 26 | LSC | |
|
| 1 | 19 | 0 | 0 | 0 | 17 | LSC | |
|
| 1 | 10 | 0 | 0 | 1 | 10 | LSC | |
|
| 2 | 11 | 0 | 1 | 2 | 10 | LSC | |
|
| 4 | 8 | 0 | 1 | 4 | 8 | LSC | |
|
| 1 | 2 | 0 | 0 | 1 | 2 | SSC | |
|
| 1 | 2 | 0 | 1 | 1 | 3 | LSC | |
|
| 1 | 9 | 0 | 0 | 1 | 10 | LSC | |
|
|
|
|
|
|
|
| ||
| Other genes |
| 12 | 170 | 0 | 24 | 12 | 174 | SSC |
|
| 2 | 3 | 0 | 0 | 2 | 3 | LSC | |
|
| 2 | 6 | 0 | 1 | 2 | 6 | LSC | |
|
| 15 | 73 | 2 | 3 | 14 | 74 | LSC | |
|
| 1 | 2 | 0 | 0 | 1 | 2 | LSC | |
|
| 2 | 7 | 1 | 0 | 3 | 7 | LSC | |
|
|
|
|
|
|
|
| ||
*SNP markers were detected in their introns.
Figure 2SNPs statistics of A. bulbifer, A. konjac and A. muelleri cp genomes. The Amorphophallus albus cp genome was used as the reference sequence for SNPs analyses for the other three cp genomes. SNPs belonging to different type groups are color coded. (A) Number of SNPs in the A. bulbifer cp genome sequence. (B) The number of SNPs in the A. konjac cp genome sequence. (C) The number of SNPs in the A. muelleri cp genome sequence.
Comparisons of InDels of protein coding cp genes among A. bulbifer, A. konjac and A. muelleri.
| Gene | motif | size | Direction |
|---|---|---|---|
|
| Ab: accaaataccaa | 12 | Deletion |
| Am: accaaataccaa | 12 | Deletion | |
|
| Ab: aaaa | 4 | Insertion |
| Am: aataa | 5 | Insertion | |
|
| Ab: cttttt | 6 | Insertion |
| Am: tctttt | 6 | Insertion | |
|
| Ab: aat | 3 | Deletion |
|
| Ab: act | 3 | Insertion |
| Am: cat | 3 | Insertion | |
|
| Ab: a | 1 | Insertion |
| Am: tta | 3 | Insertion | |
|
| Ab: tttttt | 6 | Insertion |
| Am: tttttt | 6 | Insertion | |
|
| Ab: ttt | 3 | Deletion |
| Am: ttt | 3 | Deletion | |
|
| Ab: aaaaaaaa | 8 | Insertion |
| Am: ttggaattgggagaataaccca | 22 | Insertion | |
|
| Ab: a | 1 | Insertion |
| Am: a | 1 | Insertion | |
|
| Ab: tgat | 4 | Deletion |
| Am: tgat | 4 | Deletion | |
|
| Ab: a | 1 | Insertion |
| Am: t | 1 | Deletion | |
|
| Ab: aaaaaaaaaa | 10 | Insertion |
|
| Ab: t | 1 | Insertion |
| Am: a | 1 | Deletion | |
|
| Ab: t | 1 | Insertion |
| Am: t | 1 | Insertion | |
|
| Ab: c | 1 | Deletion |
| Am: c | 1 | Deletion | |
|
| Ab: c | 1 | Deletion |
| Am: c | 1 | Deletion | |
|
| Ab: g | 1 | Insertion |
| Am: g | 1 | Insertion | |
|
| Ab: g | 1 | Deletion |
| Am: g | 1 | Deletion | |
|
| Ab: a | 1 | Insertion |
| Am: a | 1 | Insertion | |
|
| Ab: t | 1 | Deletion |
| Am: t | 1 | Deletion | |
|
| Ab: c | 1 | Insertion |
| Am: c | 1 | Insertion | |
|
| Ab: tttccctag | 9 | Deletion |
| Am: tttccctag | 9 | Deletion | |
|
| Ab: cggg | 4 | Insertion |
| Am: ctggttg | 7 | Insertion | |
|
| Ab: aaaacg | 6 | Insertion |
| Am: aatatacg | 8 | Insertion | |
|
| Ab: aatttt | 6 | Insertion |
|
| Am: ta | 2 | Insertion |
|
| Ab: c | 1 | Insertion |
|
| Am: c | 1 | Insertion |
|
| Ab: aaa | 3 | Deletion |
| Am: aaa | 3 | Deletion | |
|
| Ab: tttgaca | 7 | Deletion |
|
| Ab: aaa | 3 | Insertion |
| Am: aatcca | 6 | Insertion | |
|
| Am: ttttttt | 7 | Insertion |
|
| Ab: g | 1 | Deletion |
|
| Ab: gttgaattaacg | 12 | Deletion |
|
| Am: cgaga | 5 | Deletion |
|
| Ab: gtttttg | 7 | Deletion |
|
| Am: aaa | 3 | Deletion |
|
| Am: g | 1 | Deletion |
|
| Ab: aaaa | 4 | Insertion |
|
| Ak: aaaa | 4 | Insertion |
|
| Ab: agg | 3 | Insertion |
| Am: tagg | 4 | Insertion | |
|
| Ab: ctttt | 5 | Insertion |
| Am: tcttt | 5 | Insertion | |
|
| Am: gttttttaa | 9 | Deletion |
|
| Am: ttatta | 6 | Insertion |
|
| Ab: aaaaaaaccccccccggggttttttttttt | 30 | Insertion |
| Am: tcgtccaggcatcaatatcgctatttattt | 30 | Insertion | |
|
| Ab: aacctt | 6 | Insertion |
| Am: ctcaat | 6 | Insertion | |
|
| Ab: aaaaccgg | 8 | Insertion |
| Am: atatcgatatcg | 12 | Insertion | |
|
| Ab: ccatataga | 9 | Deletion |
| Am: ccatataga | 9 | Deletion | |
|
| Ab: ttttctgtg | 9 | Deletion |
| Am: ttttctgtg | 9 | Deletion | |
|
| Ab: aagttt | 6 | Insertion |
| Am: gtatat | 6 | Insertion | |
|
| Ab: cat | 3 | Deletion |
| Am: cat | 3 | Deletion | |
|
| Ab: ccgtaataa | 9 | Deletion |
| Am: ccgtaataa | 9 | Deletion | |
|
| Ab: acc | 3 | Insertion |
| Am: tctatc | 6 | Insertion | |
|
| Ab: aaaacgttttttttt | 15 | Insertion |
|
| Am: tga | 3 | Insertion |
|
| Am: ttc | 3 | Insertion |
|
| Am: tt | 2 | Insertion |
|
| Am: t | 1 | Insertion |
|
| Ab: act | 3 | Insertion |
| Am: cat | 3 | Insertion | |
|
| Ab: c | 1 | Insertion |
| Am: ttc | 3 | Insertion | |
|
| Ab: a | 1 | Insertion |
| Am: tta | 3 | Insertion | |
|
| Am: g | 1 | Deletion |
|
| Ab: cctttt | 6 | Insertion |
| Am: tccttt | 6 | Insertion | |
|
| Ab: gattttcgccatcgtacttt | 20 | Deletion |
| Am: gattttcgccatcgtacttt | 20 | Deletion | |
|
| Ak: atcgatctttagattttcgcc | 21 | Deletion |
|
| Ab: g | 1 | Deletion |
| Am: g | 1 | Deletion | |
|
| Am: tctttctttctctttttctttct | 23 | Insertion |
|
| Am: tt | 2 | Insertion |
|
| Am: tc | 2 | Insertion |
|
| Ab: cccccgtttttttttttttttttt | 24 | Insertion |
|
| Ak: tcagaa | 6 | Insertion |
|
| Ab: ctt | 3 | Insertion |
|
| Am: gttcattattatcattatcattat | 24 | Insertion |
|
| Am: a | 1 | Insertion |
|
| Am: atc | 3 | Insertion |
|
| Am: attatcattatcatt | 15 | Insertion |
|
| Ab: g | 1 | Insertion |
| Am: g | 1 | Insertion | |
|
| Ab: a | 1 | Insertion |
| Am: a | 1 | Insertion | |
|
| Ab: c | 1 | Insertion |
| Am: c | 1 | Insertion | |
|
| Ab: c | 1 | Insertion |
| Am: ttc | 3 | Insertion | |
|
| Ab: gg | 2 | Insertion |
| Am: gg | 2 | Insertion | |
|
| Ab: c | 1 | Insertion |
| Am: c | 1 | Insertion | |
|
| Am: atcattcctgat | 12 | Insertion |
|
| Am: cg | 2 | Insertion |
|
| Am: ga | 2 | Insertion |
|
| Am: tc | 2 | Insertion |
|
| Ab: ttaaaaagg | 9 | Deletion |
| Am: ttaaaaagg | 9 | Deletion | |
|
| Ab: aaccccgggg | 10 | Insertion |
| Am: ttgttccagttgttccgg | 18 | Insertion | |
|
| Am: caa | 3 | Insertion |
Ab Amorphophallus bulbifer cp genome, Ak Amorphophallus konjac cp genome, Am Amorphophallus muelleri cp genome
Figure 3InDels statistics of A. bulbifer and A. muelleri cp genomes. The Amorphophallus albus cp genome was used as the reference sequence for InDels analyses for the other three cp genomes. InDels belonging to different coding genes are color coded. Only three InDels were detected in the A. konjac cp genome, so the statistics results are shown in the main text. (A) The number of InDels of each coding gene in the A. bulbifer cp genome sequence, (B) The number of InDels of each coding gene in the A. muelleri cp genome sequence.
Figure 4Phylogenetic tree based on 30 complete cp genome sequences.