| Literature DB >> 25874767 |
Lang Yan1, Xianjun Lai2, Xuedan Li1, Changhe Wei1, Xuemei Tan1, Yizheng Zhang1.
Abstract
Sweet potato [Ipomoea batatas (L.) Lam] ranks among the top seven most important food crops cultivated worldwide and is hexaploid plant (2n=6x=90) in the Convolvulaceae family with a genome size between 2,200 to 3,000 Mb. The genomic resources for this crop are deficient due to its complicated genetic structure. Here, we report the complete nucleotide sequence of the chloroplast (cp) genome of sweet potato, which is a circular molecule of 161,303 bp in the typical quadripartite structure with large (LSC) and small (SSC) single-copy regions separated by a pair of inverted repeats (IRs). The chloroplast DNA contains a total of 145 genes, including 94 protein-encoding genes of which there are 72 single-copy and 11 double-copy genes. The organization and structure of the chloroplast genome (gene content and order, IR expansion/contraction, random repeating sequences, structural rearrangement) of sweet potato were compared with those of Ipomoea (L.) species and some basal important angiosperms, respectively. Some boundary gene-flow and gene gain-and-loss events were identified at intra- and inter-species levels. In addition, by comparing with the transcriptome sequences of sweet potato, the RNA editing events and differential expressions of the chloroplast functional-genes were detected. Moreover, phylogenetic analysis was conducted based on 77 protein-coding genes from 33 taxa and the result may contribute to a better understanding of the evolution progress of the genus Ipomoea (L.), including phylogenetic relationships, intraspecific differentiation and interspecific introgression.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25874767 PMCID: PMC4398329 DOI: 10.1371/journal.pone.0124083
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Chloroplast genome of Ipomoea batats.
The outer circle shows positions of genes in the large single copy (LSC), small single copy (SSC), and two inverted repeat (IR A and IR B) regions. The inner circle is a graph depicting GC content across the genome. Plastome maps were generated in OGDraw v1.2.
Functional genes encoded by the sweet potato cp genome (72 single-copy genes and 11 two-copy genes).
| Groups | Functional System | Num | Gene names |
|---|---|---|---|
| Photosynthesis | Photosystem I | 7 |
|
| Photosystem II | 15 |
| |
| Cytochromeb6/f complex | 6 |
| |
| ATP synthase | 6 |
| |
| Calvin clycle | 1 |
| |
| C-type cytochrome synthesis | 1 |
| |
| Chloro-respiration | NADH oxidoreductase | 11 |
|
| Expression machinery | RNA polymerase | 4 |
|
| ribosomal large subunit | 9 |
| |
| ribosomal small subunit | 12 |
| |
| maturase K | 1 |
| |
| metabolic pathway | acetyl-CoA carboxylase carboxyltransferase | 1 |
|
| clp protease proteolytic subunit | 1 |
| |
| chloroplast envelope membrane protein | 1 |
| |
| pseudogenes | unknown functions | 7 |
|
Superscript
[a] means the gene in the IR region with two copies;
[b] means the gene contains a single intron;
[c] means the gene contains two introns;
[d] means the gene divided into two independent transcription units.
Codon usage of the sweet potato cp genome.
| Amino acid | Codon | Number | Amino acid | Codon | Number |
|---|---|---|---|---|---|
| Ala | GCA/GCU/GCC/GCG | 343/412/449/125 | Met | AUG | 526 |
| Cys | UGU/UGC | 192/88 | Asn | AAU/AAC | 749/190 |
| Asp | GAU/GAC | 598/276 | Pro | CCA/CCU/CCG/CCC | 329/358/252/56 |
| Glu | GAA/GAG | 842/283 | Gln | CAA/CAG | 537/242 |
| Phe | UUU/UUC | 939/342 | Arg | CGA/CGU/CGG/CGC/AGA/AGG | 312/334/87/68/408/140 |
| Gly | GGA/GGU/GGC/GGG | 538/589/312/126 | Leu | UUG/UUA/CUA/CUU/CUC/CUG | 405/694/476/382/270/103 |
| His | CAU/CAC | 407/131 | Ser | UCA/UCU/UCG/UCC/AGU/AGC | 681/477/309/187/302/70 |
| Ile | AUA/AUU/AUC | 546/1078/185 | Val | GUU/GUA/GUC | 543/412/286 |
| Lys | AAA/AAG | 798/243 | Trp | UGG | 380 |
| Thr | ACA/ACU/ACG/ACC | 551/379/163/56 | Tyr | UAA/UAC | 529/238 |
Fig 2Chloroplast genome alignment among species in Ipomoea genus.
The relative position of tRNA genes were determined and marked as short vertical line in black. The thickness of the vertical bar represented the relative length of the tRNA genes. The LSC, SSC and IRs regions of each chloroplast genome were shown in four different colors.
Fig 3Comparison of gene distribution in chloroplast genomes of sweet potato and other three inter- and intra-Ipomoea species.
The relative position of functional-protein encoding genes were determined and marked as short vertical line in black. The thickness of the vertical bar represented the relative length of the genes. The name and position of some differential genes between two genomes were marked. The LSC, SSC and IRs regions of each chloroplast genome were shown in four different colors (except for Medicago truncatula with incomplete genome).
Repeat sequences and their distribution in cp genome of sweet potato.
| Repeat type | Repeat size | Repeat1 position | Repeat2 position | Repeat1 location | Repeat2 location | Repeat sequence |
|---|---|---|---|---|---|---|
| Forward | 39 | 45230–45269 | 99641–99680 | Intron ( | IGS ( | CAGAACCGTACATGAGATTTTCACCTCATACGGCTCCTC |
| 38 | 59760–59798 | 59784–59822 |
|
| GAAAGTTCTAATGAGAATGAAAGCGAAAGTTCTAATGA | |
| 36 | 40125–40161 | 42350–42386 |
|
| AATAGCTAAATGATGGTGTGCAATATCGGTCAGCCA | |
| 22 | 9654–9676 | 37813–37835 | trnG-GCC | trnG-UCC | GATGCGGGTTCGATTCCCGCTA | |
| 22 | 8124–8146 | 36869–36891 | trnS-GCU | trnS-UGA | AGAGAGGGATTCGAACCCTCGG | |
| 20 | 54022–54042 | 144734–144754 | trnV-UAC | trnA-UGC | GCTCTACCAACTGAGCTATA | |
| 20 | 15073–15093 | 113223–113243 |
| IGS (trnN-GUU- | AAAGAAACAAGAACAACAA | |
| Reverse | 26 | 6039–6065 | 14796–14822 | IGS( |
| TCATGAATAGTCATAGGTTCTATTAT |
| 25 | 1614–1639 | 37917–37942 | IGS ( |
| AAATAAAAAAAAAAAAAAGAATTCT | |
| 17 | 76295–76312 | 127586–127603 |
|
| CGGCGAGAAATCTTTAT | |
| Palindromic | 39 | 45230–45269 | 149439–149478 | Intron ( | IGS ( | CCAGAACCGTACATGAGATTTTCACCTCATACGGCTCCT |
| 28 | 8120–8148 | 46916–46944 | trnS-GCU | trnS-GGA | TGGAAAGAGAGGGATTCGAACCCTCGGT | |
| 19 | 15073–15092 | 135861–135880 |
| IGS( | TAAAGAAACAAGAACAACA |
IGS: intergenic spacer
Fig 4Boundary gene-flow and IR region expansion/contraction events.
Comparison of the junction positions of IR boundaries among 11 basal angiosperms cp genome. ILa, ILb represented the positions between the two IRs and the LSC region, ISa and ISb represented the positions between the two IRs and the SSC region.
Fig 5Gene map and genome alignment of 13 basal important angiosperms species.
MAUVE multiple alignment implemented in Geneious. Colored outlined blocks surround regions of the genome sequence that aligned with part of another genome. The coloured bars inside the blocks are related to the level of sequence similarities. Lines link blocks with homology between two genomes.
Fig 6Phylogram based on sequence analysis of 77 chloroplast genes from 33 plant species.
Numbers above each node indicate the ML bootstrap support values. The current taxonomic classifications are indicated on the right.
Position of RNA editing sites in chloroplast transcripts of sweet potato.
| Gene | Cp genome positon | Gene position | AA position | Triplet position | Bases | Codon amino | acid change | Label |
|---|---|---|---|---|---|---|---|---|
|
| 5007 | 976 | 326 | 1 | C→U | CUA→UUA | L→L | rps16eU976LL |
|
| 5755 | 226 | 76 | - | C→U | UGC→UGU | - | rps16iU226CC |
|
| 7440 | 136 | 46 | 1 | C→U | CUC→UUC | S→L | psbKeU136SL |
|
| 11006 | 914 | 305 | 2 | C→U | UCA→UUA | S→L | atpAeU914SL |
|
| 13143 | 92 | 31 | 2 | C→U | CCA→CUA | P→L | atpFeU92PL |
|
| 16508 | 134 | 45 | 2 | C→U | ACA→AUA | T→I | rps2eU134TI |
|
| 26998 | 338 | 113 | 2 | C→U | UCU→UUU | S→F | rpoBeU338SF |
|
| 34240 | 53 | 18 | 2 | C→U | ACU→AUU | T→I | psbDeU53TI |
|
| 38487 | 149 | 50 | 2 | C→U | CCA→CUA | P→L | rps14eU149PL |
|
| 44366 | 1668 | 556 | - | C→U | GAC→GAU | - | ycf3iU1168DD |
|
| 50916 | 320 | 107 | 2 | C→U | GCG→GUG | A→V | ndhJeU320AV |
|
| 51549 | 683 | 228 | 2 | U→C | AUG→ACG | M→T | ndhKeC683MT |
|
| 51601 | 697 | 233 | 1 | C→U | CUU→UUU | L→F | ndhKeU697LF |
|
| 52001 | 231 | 77 | 3 | U→C | AAU→AAC | N→N | ndhKeC231NN |
|
| 52196 | 278 | 93 | 2 | C→U | GCA→GUA | A→V | ndhCeU278AV |
|
| 59371 | 232 | 78 | 1 | C→U | CAU→UAU | H→Y | accDeU232HY |
|
| 59681 | 551 | 184 | 2 | C→U | ACU→AUU | T→I | accDeU551TI |
|
| 63358 | 242 | 81 | 2 | C→U | CCA→CUA | P→L | cemAeU242PL |
|
| 64560 | 527 | 176 | 2 | C→U | GCU→GUU | A→V | petAeU527AV |
|
| 66431 | 214 | 72 | 1 | C→U | CCU→UCU | P→S | psbEeU214PS |
|
| 69762 | 20 | 7 | 2 | C→U | GCA→GUA | A→V | rpl33eU20AV |
|
| 70166 | 33 | 11 | 3 | C→U | UCC→UCU | S→S | rps18eU33SS |
|
| 70956 | 131 | 44 | 2 | C→U | GCU→GUU | A→V | rpl20eU131AV |
|
| 73106 | 324 | 108 | 3 | C→U | GCC→GCU | A→A | clpPeU324AA |
|
| 77516 | 296 | 98 | - | C→U | UCA→UUA | - | petBiU296SL |
|
| 78586 | 611 | 204 | 2 | C→U | CCA→CUA | P→L | petBeU611PL |
|
| 85153 | 609 | 203 | 3 | C→U | AUC→AUU | I→I | rps3eU609II |
|
| 85423 | 339 | 113 | 3 | U→C | GUU→GUC | V→V | rps3eC339VV |
|
| 85486 | 276 | 92 | 3 | C→U | ACC→ACU | T→T | rps3eU276TT |
|
| 85520 | 242 | 81 | 2 | C→U | CCA→CUA | P→L | rps3eU242PL |
|
| 85710 | 52 | 18 | 1 | C→U | CAU→UAU | H→Y | rps3eU52HY |
|
| 87527 | 194 | 65 | 2 | C→U | ACA→AUA | T→I | rpl23eU194TI |
|
| 88475 | 243 | 81 | 1 | U→C | UUU→CUU | F→L | ycf2eC243FL |
|
| 89627 | 1395 | 465 | 3 | G→A | UCG→UCA | S→S | ycf2eA1395SS |
|
| 89907 | 1676 | 559 | 2 | U→C | UUA→UCA | L→S | ycf2eC1676LS |
|
| 90243 | 2011 | 671 | 1 | G→A | GAA→AAA | E→K | ycf2eA2011EK |
|
| 91075 | 2843 | 948 | 2 | C→U | UCC→UUC | S→F | ycf2eU2843SF |
|
| 95864 | 1481 | 494 | 2 | C→U | CCA→CUA | P→L | ndhBeU1481PL |
|
| 96509 | 830 | 277 | 2 | C→U | UCA→UUA | S→L | ndhBeU830SL |
|
| 97273 | 737 | 246 | 2 | C→U | CCA→CUA | P→L | ndhBeU737PL |
|
| 97861 | 149 | 50 | 2 | C→U | UCA→UUA | S→L | ndhBeU149SL |
|
| 99558 | 144 | 48 | 3 | C→U | GCC→GCU | A→A | rps12eU144AA |
|
| 117572 | 726 | 242 | 3 | C→U | UAC→UAU | Y→Y | ndhHeU726YY |
Fig 7Differentially expressed transcripts (DETs) of cp genes in sweet potato.
(A) Numbers of DETs in leaves and stems of sweet potato. PS, CR, EM, MP, PG represented genes related to photosynthesis, chloro-respiration, expression machinery, metabolic pathway and pseudogenes. (B) Numbers of DETs among young leaves (YL), mature leaves (ML) and stems in sweet potato. The Up-(red) and down-regulated (black) statistics of DETs were calculated in edgeR.
Fig 8Expression levels of some DETs in young leaves (YL), mature leaves (ML) and stems.
(A) The expression of 9 DETs identified in digital gene expression profile. (B) Validation the expression of these DETs in relative real-time PCR-based quantification.