| Literature DB >> 28125648 |
Shin-Ichi Tsuruta1, Masumi Ebina2, Makoto Kobayashi2, Wataru Takahashi2.
Abstract
The genera Erianthus and Miscanthus, both members of the Saccharum complex, are of interest as potential resources for sugarcane improvement and as bioenergy crops. Recent studies have mainly focused on the conservation and use of wild accessions of these genera as breeding materials. However, the sequence data are limited, which hampers the studies of phylogenetic relationships, population structure, and evolution of these grasses. Here, we determined the complete chloroplast genome sequences of Erianthus arundinaceus and Miscanthus sinensis by using 454 GS FLX pyrosequencing and Sanger sequencing. Alignment of the E. arundinaceus and M. sinensis chloroplast genome sequences with the known sequence of Saccharum officinarum demonstrated a high degree of conservation in gene content and order. Using the data sets of 76 chloroplast protein-coding genes, we performed phylogenetic analysis in 40 taxa including E. arundinaceus and M. sinensis. Our results show that S. officinarum is more closely related to M. sinensis than to E. arundinaceus. We estimated that E. arundinaceus diverged from the subtribe Sorghinae before the divergence of Sorghum bicolor and the common ancestor of S. officinarum and M. sinensis. This is the first report of the phylogenetic and evolutionary relationships inferred from maternally inherited variation in the Saccharum complex. Our study provides an important framework for understanding the phylogenetic relatedness of the economically important genera Erianthus, Miscanthus, and Saccharum.Entities:
Mesh:
Year: 2017 PMID: 28125648 PMCID: PMC5268433 DOI: 10.1371/journal.pone.0169992
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Primer pairs used for amplification of Miscanthus sinensis cp genome.
| Primer pair | Primer sequence (5′ to 3′) | Tm | PCR product | Length (bp) | |||
|---|---|---|---|---|---|---|---|
| Forward | Reverse | (°C) | Location | Start | End | ||
| ES01 | 60 | LSC | 140 | 12113 | 11974 | ||
| ES02 | 58 | LSC | 11835 | 19267 | 7433 | ||
| ES03 | 58 | LSC | 18929 | 26001 | 7073 | ||
| ES04 | 60 | LSC | 25073 | 37410 | 12338 | ||
| ES05 | 56 | LSC | 36910 | 38656 | 1747 | ||
| ES06 | 60 | LSC | 37314 | 50226 | 12913 | ||
| ES07 | 58 | LSC | 49983 | 57728 | 7746 | ||
| ES08 | 58 | LSC | 56347 | 63939 | 7593 | ||
| ES09 | 60 | LSC | 63690 | 76120 | 12431 | ||
| ES10 | 58 | LSC | 75648 | 82881 | 7234 | ||
| ES11 | 58 | LSC-IRA | 82318 | 94864 | 12547 | ||
| ES12 | 56 | IRA-SSC | 94130 | 106419 | 12290 | ||
| ES13 | 54 | SSC | 105400 | 112986 | 7587 | ||
| ES14 | 56 | SSC | 111573 | 119368 | 7796 | ||
| ES15 | 56 | SSC-IRB | 118477 | 130428 | 11952 | ||
| ES16 | 60 | IRB-LSC | 129694 | 290 | 12013 | ||
1 LSC: Large single-copy, SSC: Small single-copy, IRA: Inverted repeat A, IRB: Inverted repeat B.
2 Position (base pairs) in the M. sinensis chloroplast genome sequence.
Fig 1Chloroplast genome maps of Erianthus arundinaceus and Miscanthus sinensis.
The genes of different functional groups are indicated in different colors. Genes on the inside and outside of the maps are transcribed clockwise and counter-clockwise, respectively. The thick lines on the inner circles indicate inverted repeats (IRa and IRb), which separate the genomes into the small single-copy (SSC) and large single-copy (LSC) regions.
Characteristics of the chloroplast genomes in three genera of the Saccharum complex and Sorghum bicolor.
| Species | Genome organization | Number of genes | GenBank accession No. | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Length of total genome | Length of LSC | Length of SSC | Length of IR | GC/AT contents (%) | Total | CDS | rRNA | tRNA | ||
| 141,182 | 83,048 | 12,544 | 22,795 | 38.4/61.6 | 136 | 87 (8) | 8 (4) | 41 (8) | NC006084 [ | |
| 141,210 | 83,170 | 12,516 | 22,762 | 38.5/61.5 | 136 | 87 (8) | 8 (4) | 41 (8) | LC160130 [This study] | |
| 141,416 | 83,141 | 12,681 | 22,797 | 38.4/61.6 | 134 | 85 (7) | 8 (4) | 41 (8) | LC160131 [This study] | |
| 141,372 | 83,163 | 12,659 | 22,775 | 38.4/61.6 | 134 | 85 (7) | 8 (4) | 41 (8) | NC028721 [ | |
| 141,332 | 83,207 | 12,575 | 22,775 | 38.4/61.6 | 134 | 85 (7) | 8 (4) | 41 (8) | NC028720 [ | |
| 140,754 | 82,688 | 12,502 | 22,782 | 38.5/61.5 | 135 | 86 (7) | 8 (4) | 41 (8) | NC008602 [ | |
1 Length is indicated in base pairs.
2 Including genes detected in this study (not annotated in GenBank).
3 Including ycf15 and ycf68.
4 The numbers of duplicated genes are shown in parentheses.
Fig 2Alignment of whole chloroplast genome sequences from four Panicoideae species.
Chloroplast genomes were aligned by using the mVISTA program with the Zea mays sequence as a reference. The X- and Y-scales indicate the coordinates within cp genomes and the percentage of identity (50%–100%), respectively. Genome regions (exons, introns, and conserved non-coding sequences) are color-coded. Gray arrows indicate the direction of transcription of each gene. The genes encoding transfer RNAs (trn) are indicated under gray arrows using the single-letter amino acid code (e.g., K: trnK).
SNPs and indels between Erianthus arundinaceus and Miscanthus sinensis chloroplast genomes.
| Category | SNP | Indel | Total |
|---|---|---|---|
| Photosystem I | 3 | ||
| Photosystem II | 17 | ||
| ATP synthase | 7 | ||
| Cytochrome | 8 | ||
| NADPH | 33 | ||
| Transcription | 36 | ||
| Ribosomal proteins (Small subunit) | 12 | ||
| Ribosomal proteins (Large subunit) | 7 | ||
| Other | 24 | ||
| Rubisco | 5 | ||
| Non-coding | Intron (38), IGS | 124 | 522 |
| Total | 546 (148) | 128 | 674 |
1 Intergenic spacer region.
2 Parenthesis shows SNPs in protein-coding genes.
Substitution rates on 76 protein-coding chloroplast genes in three genera of the Saccharum complex and Sorghum bicolor.
| Species | Substitution rate | d | |
|---|---|---|---|
| d | d | ||
| 0.0030±0.0007 | 0.0144±0.0030 | 0.2460 | |
| 0.0039±0.0010 | 0.0170±0.0030 | 0.2433 | |
| 0.0036±0.0010 | 0.0152±0.0030 | 0.2548 | |
| 0.0038±0.0009 | 0.0183±0.0028 | 0.2140 | |
1 dN: the rates of nonsynonymous, dS: the rates of synonymous substitutions, Zea mays was used as a reference.
Microsatellites in Erianthus arundinaceus and Miscanthus sinensis chloroplast genomes.
| Location | Motif | ||||||
|---|---|---|---|---|---|---|---|
| Sequence | Start | End | Sequence | Start | End | ||
| Mono | (A/T)8 | 3548 | 3555 | (A/T)10 | 3550 | 3560 | |
| Mono | (A/T)15 | 3756 | 3771 | (A/T)13 | 3760 | 3773 | |
| Mono | (A/T)11 | 4118 | 4129 | (A/T)10 | 4120 | 4130 | |
| Tri | (ATT)4 | 5844 | 5856 | - | - | - | |
| Mono | (A/T)11 | 6417 | 6428 | (A/T)13 | 6214 | 6227 | |
| Mono | (A/T)10 | 7757 | 7767 | (A/T)14 | 6471 | 6485 | |
| Mono | (A/T)12 | 9067 | 9079 | (A/T)10 | 8746 | 8756 | |
| Mono | (G/C)10 | 11033 | 11043 | (G/C)10 | 11032 | 11042 | |
| Mono | (A/T)10 | 13447 | 13457 | (A/T)12 | 13440 | 13452 | |
| Mono | (A/T)15 | 16614 | 16629 | (A/T)11 | 16638 | 16649 | |
| Mono | (A/T)11 | 16708 | 16719 | (A/T)9 | 16728 | 16736 | |
| Mono | (A/T)14 | 18717 | 18731 | (A/T)11 | 18736 | 18747 | |
| Mono | (A/T)14 | 19267 | 19281 | (A/T)11 | 19279 | 19290 | |
| Mono | (A/T)11 | 21124 | 21135 | (A/T)13 | 21111 | 21124 | |
| Mono | (A/T)10 | 31970 | 31980 | (A/T)10 | 31961 | 31971 | |
| Mono | (A/T)10 | 34148 | 34158 | (A/T)14 | 34140 | 34154 | |
| Mono | (A/T)9 | 34684 | 34692 | (A/T)12 | 34696 | 34708 | |
| Mono | (A/T)9 | 35871 | 35879 | (A/T)10 | 35892 | 35902 | |
| Mono | (A/T)9 | 38743 | 38751 | (A/T)10 | 38764 | 38774 | |
| Tri | (ATT)7 | 38901 | 38922 | - | - | - | |
| Mono | (A/T)13 | 44325 | 44338 | (A/T)13 | 45675 | 45688 | |
| Mono | (A/T)11 | 48910 | 48921 | (A/T)8 | 48896 | 48903 | |
| Mono | (A/T)10 | 50274 | 50284 | (A/T)8 | 50282 | 50289 | |
| Mono | (A/T)14 | 52432 | 52446 | (A/T)14 | 52438 | 52452 | |
| Tetra | (AGGT)4 | 54731 | 54747 | (AGGT)3 | 54736 | 54747 | |
| Mono | (A/T)12 | 56799 | 56811 | (A/T)11 | 56982 | 56993 | |
| Mono | (A/T)12 | 57457 | 57469 | (A/T)11 | 57454 | 57465 | |
| Mono | (A/T)10 | 59774 | 59784 | (A/T)8 | 59777 | 59784 | |
| Mono | (A/T)10 | 60225 | 60235 | (A/T)9 | 60226 | 60234 | |
| Mono | (A/T)11 | 63645 | 63656 | (A/T)9 | 63644 | 63652 | |
| Mono | (A/T)14 | 65805 | 65819 | (A/T)10 | 65604 | 65614 | |
| Mono | (A/T)12 | 68260 | 68272 | (A/T)12 | 68219 | 68231 | |
| Mono | (A/T)12 | 74056 | 74068 | (A/T)11 | 78555 | 78566 | |
| Mono | (A/T)10 | 79145 | 79155 | (A/T)10 | 79104 | 79114 | |
| Mono | (A/T)10 | 79163 | 79173 | (A/T)10 | 79122 | 79132 | |
| Mono | (A/T)13 | 81363 | 81376 | (A/T)15 | 81328 | 81343 | |
| Mono | (A/T)10 | 82624 | 82634 | (A/T)10 | 81698 | 81708 | |
| Tri | (CTT)4 | 83063 | 83075 | (CTT)4 | 83028 | 83040 | |
| Mono | (A/T)9 | 109365 | 109373 | (A/T)11 | 109392 | 109403 | |
| Mono | (A/T)7 | 116146 | 116152 | (A/T)12 | 116318 | 116330 | |
Fig 3Phylogenetic analysis of 40 species including three genera of the Saccharum complex.
A phylogenetic tree was generated using the maximum-likelihood method based on the concatenated nucleotide sequences of 76 protein-coding chloroplast genes. Numbers beside the nodes indicate the bootstrap values (%) from 1,000 replicates.
Fig 4Divergence times of the PACMAD clade.
A Bayesian relaxed-clock approach based on 76 concatenated protein-coding chloroplast genes was used to estimate divergence times.