| Literature DB >> 25398081 |
Huei-Jiun Su1, Saskia A Hogenhout2, Abdullah M Al-Sadi3, Chih-Horng Kuo4.
Abstract
The genus Citrus contains many economically important fruits that are grown worldwide for their high nutritional and medicinal value. Due to frequent hybridizations among species and cultivars, the exact number of natural species and the taxonomic relationships within this genus are unclear. To compare the differences between the Citrus chloroplast genomes and to develop useful genetic markers, we used a reference-assisted approach to assemble the complete chloroplast genome of Omani lime (C. aurantiifolia). The complete C. aurantiifolia chloroplast genome is 159,893 bp in length; the organization and gene content are similar to most of the rosids lineages characterized to date. Through comparison with the sweet orange (C. sinensis) chloroplast genome, we identified three intergenic regions and 94 simple sequence repeats (SSRs) that are potentially informative markers with resolution for interspecific relationships. These markers can be utilized to better understand the origin of cultivated Citrus. A comparison among 72 species belonging to 10 families of representative rosids lineages also provides new insights into their chloroplast genome evolution.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25398081 PMCID: PMC4232571 DOI: 10.1371/journal.pone.0113049
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Chloroplast genome map of Citrus aurantiifolia.
Gene drawn inside the circle are transcribed clockwise, whereas those outside are counterclockwise. The within-genome GC content variation is indicated in the middle circles.
Summary of the Citrus chloroplast genome characteristics.
| Attribute |
|
|
| Size (bp) | 159,893 | 160,129 |
| overall GC content (%) | 38.4 | 38.5 |
| LSC size in bp (% total) | 87,148 (54.5%) | 87,744 (54.8%) |
| SSC size in bp (% total) | 18,763 (11.7%) | 18,393 (11.5%) |
| IR size in bp (% total) | 26,991 (16.9%) | 26,996 (16.9%) |
| Protein-coding regions size in bp (% total) | 81,468 (51.0%) | 79,773 (49.8%) |
| rRNA and tRNA size in bp (% total) | 11,850 (7.5%) | 11,850 (7.4%) |
| Introns size in bp (% total) | 17,129 (10.7%) | 18,252 (11.4%) |
| Intergenic spacer size in bp (% total) | 49,446 (30.9%) | 50,254 (31.4%) |
| Number of different genes | 115 | 113 |
| Number of different protein-coding genes | 81 | 79 |
| Number of different rRNA genes | 4 | 4 |
| Number of different tRNA genes | 30 | 30 |
| Number of different genes duplicated by IR | 22 | 20 |
| Number of different genes with introns | 17 | 17 |
Each cp genome contains two copies of inverted repeats (IRs).
According to the original annotation, not including orf56.
Differences between the C. aurantiifolia and C. sinensis cp genomes.
|
| ||||
| Length (bp) | Count | |||
| 1 | 43 | |||
| 2–10 | 20 | |||
| 11–100 | 18 | |||
| 101–1,000 | 3 | |||
| Sum | 1,780 | 116 | Percentage | |
|
| ||||
| Type | Count | |||
| A <-> T | 34 | |||
| C <-> G | 15 | |||
| A <-> C | 81 | |||
| T <-> C | 64 | |||
| A <-> G | 51 | |||
| T <-> G | 85 | |||
| Sum | 330 | Percentage | ||
|
| ||||
| Region | Length | Pairwise distance | ||
|
| 234 | 0.027 | ||
|
| 276 | 0.018 | ||
|
| 231 | 0.017 | ||
|
| 118 | 0.017 | ||
| t | 59 | 0.017 | ||
|
| 449 | 0.016 | ||
|
| 1,141 | 0.015 | ||
|
| 66 | 0.015 | ||
|
| 204 | 0.015 | ||
|
| 469 | 0.013 |
Relative to the length of C. aurantiifolia.
Length in C. aurantiifolia.
List of simple sequence repeats.
| Repeat unit | Length (bp) | Number of SSRs | Start position |
| A | 10 | 6 |
|
| 11 | 6 |
| |
| 12 | 9 |
| |
| 13 | 2 |
| |
| 14 | 1 |
| |
| 15 | 1 |
| |
| 16 | 2 |
| |
| 17 | 3 |
| |
| 19 | 1 |
| |
| 22 | 1 | 70289 | |
| T | 10 | 10 |
|
| 11 | 11 | 9401; | |
| 12 | 6 |
| |
| 13 | 2 |
| |
| 14 | 2 | 1776; | |
| 15 | 2 |
| |
| 17 | 1 | 45965 | |
| 18 | 3 |
| |
| 20 | 1 |
| |
| 23 | 2 |
| |
| C | 10 | 2 |
|
| G | 10 | 1 |
|
| AT | 10 | 4 |
|
| AAG | 12 | 1 |
|
| AAT | 12 | 2 |
|
| ATA | 12 | 1 | 70220 |
| ATT | 12 | 3 |
|
| 18 | 1 |
| |
| CTT | 12 | 2 |
|
| TAA | 12 | 2 |
|
| TAT | 12 | 1 |
|
| TTC | 12 | 1 |
|
| TAAA | 12 | 2 | 4866; |
| AAAT | 12 | 3 |
|
| ATAC | 12 | 1 |
|
| ATTT | 12 | 1 | 49193 |
| 20 | 1 |
| |
| TTAA | 12 | 2 | 39175; 39188 |
| TTAG | 12 | 1 |
|
| TTTC | 12 | 1 |
|
| TCTT | 12 | 1 |
|
| AATAA | 20 | 1 | 144226 |
| TTTTA | 20 | 1 | 102781 |
| TTCAAA | 18 | 1 | 63817 |
The SSR-containing coding regions are indicated in parentheses. SSRs that are identical in the C. sinensis chloroplast genome are highlighted in bold; SSRs that are conserved but with different lengths are highlighted by underline.
List of long repeat sequences.
| Repeatsize | Type | Start position of1st repeat | Start position the repeatfound in other region | Location | Region |
| 30 | D | 1759 | 1762 | IGS ( | LSC |
| 30 | P | 1771 | 12015 | IGS ( | LSC |
| 30 | P | 8231 | 37726, 47606 |
| LSC |
| 30 | D | 23226 | 85067 |
| LSC |
| 30 | D | 23686 | 52733 | intron ( | LSC |
| 30 | P | 23687 | 70291 | intron ( | LSC |
| 30 | D | 23692 | 33280 | intron ( | LSC |
| 30 | D | 49192 | 117171 | IGS ( | LSC, IR |
| 30 | D, P | 49197 | 102764, 144233 | IGS ( | LSC, IR |
| 30 | D, P | 51215 | 102768, 144229 | IGS ( | LSC, IR |
| 30 | P | 71344 | 71344 |
| LSC |
| 30 | D, P | 102768 | 102773, 144224 | IGS ( | IR |
| 30 | D | 144225 | 144230 | IGS ( | IR |
| 31 | P | 4492 | 117868 | IGS ( | LSC, IR |
| 31 | P | 10106 | 49188 | IGS ( | LSC |
| 31 | P | 29811 | 29811 |
| LSC |
| 31 | P | 33281 | 70282 |
| LSC |
| 31 | P | 119977 | 119977 | intron ( | IR |
| 32 | D | 7615 | 74171 | IGS ( | LSC |
| 32 | P | 39166 | 39166 | IGS ( | LSC |
| 34 | P | 38774 | 38782 |
| LSC |
| 34 | P | 49186 | 70288 | rps4, IGS ( | LSC |
| 34 | D, P | 111432 | 111464, 135529, 135561 |
| IR |
| 35 | P | 10097 | 49193 | IGS | LSC |
| 36 | P | 27648 | 27648 | IGS ( | LSC |
| 40 | P | 77776 | 77776 | IGS ( | LSC |
| 41 | D | 41294 | 43518 |
| LSC |
| 41 | D, P | 102353 | 124945, 144633 |
| IR |
| 48 | P | 30626 | 30626 |
| LSC |
| 50 | D | 39020 | 39044 |
| LSC |
| 51 | P | 9984 | 9984 | IGS ( | LSC |
| 53 | P | 8869 | 31095 |
| LSC |
| 54 | P | 441 | 441 | IGS ( | LSC |
D: direct repeat; P: palindrome inverted repeat.
IGS: intergenic spacer region. Sequences conserved in the C. sinensis chloroplast genome are highlighted in bold.
Figure 2Maximum likelihood phylogeny of the representative rosids lineages.
The common grape vine (Vitis vinifera) is included as the outgroup to root the tree. The concatenated alignment includes 62 conserved chloroplast genome genes and 54,689 aligned nucleotide sites. Nodes received <70% bootstrap support are indicated by gray arrows. The putative events of gene losses are inferred based on the most parsimonious scenario.
Figure 3The phylogenetic distribution patterns of orf56 and ycf68.