| Literature DB >> 28286710 |
Wei Qing Kong1, Jin Hong Yang1.
Abstract
Trees in the Morus genera belong to the Moraceae family. To better understand the species status of genus Morus and to provide information for studies on evolutionary biology within the genus, the complete chloroplast (cp) genomes of M. cathayana and M. multicaulis were sequenced. The plastomes of the two species are 159,265 bp and 159,103 bp, respectively, with corresponding 83 and 82 simple sequence repeats (SSRs). Similar to the SSRs of M. mongolica and M. indica cp genomes, more than 70% are mononucleotides, ten are in coding regions, and one exhibits nucleotide content polymorphism. Results for codon usage and relative synonymous codon usage show a strong bias towards NNA and NNT codons in the two cp genomes. Analysis of a plot of the effective number of codons (ENc) for five Morus spp. cp genomes showed that most genes follow the standard curve, but several genes have ENc values below the expected curve. The results indicate that both natural selection and mutational bias have contributed to the codon bias. Ten highly variable regions were identified among the five Morus spp. cp genomes, and 154 single-nucleotide polymorphism mutation events were accurately located in the gene coding region.Entities:
Keywords: Chloroplast genome; Codon usage; Morus cathayana; Morus multicaulis; Mutation
Year: 2017 PMID: 28286710 PMCID: PMC5345388 DOI: 10.7717/peerj.3037
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Summary of the features of Morus chloroplast genomes.
| Species | GenBank No. | Genome size/GC content | LSC size/GC content | SSC size /GC content | IR size/GC content |
|---|---|---|---|---|---|
|
| 159,265/36.16 | 88,143/33.77 | 19,844/29.20 | 25,639/42.95 | |
|
| 159,103/36.19 | 87,940/33.82 | 19,809/29.26 | 25,677/42.91 | |
|
| 158,484/36.37 | 87,386/34.12 | 19,742/29.35 | 25,678/42.92 | |
|
| 158,459/36.29 | 87,367/33.97 | 19,736/29.33 | 25,678/42.92 | |
|
| 158,680/36.36 | 87,470/34.11 | 19,776/29.34 | 25,717/42.89 |
Figure 1Gene map for M. cathayana and M. multicaulis plastomes.
Genes lying outside the circle are transcribed in a clockwise direction, whereas genes inside are transcribed in a counterclockwise direction. Different colors denote known functional groups. The GC and AT contents of the genome are denoted by dashed darker and lighter gray in the inner circle. LSC, SSC, and IR indicate large single-copy, small single-copy, and inverted repeat regions, respectively.
SSRs in M. cathayana and M. multicaulis chloroplast genomes.
| SSR | Size | NUMBER | LOCI |
|---|---|---|---|
| a | 10 | 8 | |
| 11 | 4 |
| |
| 12 | 2 |
| |
| 13 | 1 |
| |
| 14 | 1 | ||
| 15 | 1 |
| |
| 16 | 2 |
| |
| t | 10 | 17 | |
| 11 | 8 | ||
| 12 | 8 | ||
| 13 | 3 |
| |
| 14 | 1 |
| |
| 16 | 1 |
| |
| 17 | 1 | ||
| 18(20) | 1 |
| |
| at | 5 | 2 | 69153, |
| 6 | 1 | ||
| ta | 6 | 3 | |
| tc | 6 | 1 | |
| aat | 4 | 1 | |
| ttc | 4 | 1 | |
| tat | 4 | 1 | 50114 |
| aaag | 3 | 1 | |
| aaat | 3 | 3 | |
| atta | 3 | 2 | |
| attt | 3 | 2 | |
| tatt | 3 | 1 | |
| tctt | 3 | 1 | |
| ttat | 3 | 1 | 118221 |
| aagga | 3 | 1 | |
| atata | 3 | 1 | 117818 |
| atttc | 3 | 1 |
|
Notes.
Parentheses, containing coding regions; boxed type, absent in M. cathayana but present in M. multicaulis; bolded type, absent in M. multicaulis but present in M. cathayana; underline and superscript, nucleotide length polymorphism; bracket, nucleotide content polymorphism, others are identical.
Long repeat sequences in the chloroplast genome of five Morus species.
| Type | Repeat size(bp) | Location | Region | ||||
|---|---|---|---|---|---|---|---|
| P | 51 | – | – | – | – | IGS( | LSC |
| F | 39 | 39 | 39 | 39 | 39 | Intron( | LSC; IRB |
| P | 39 | 39 | 39 | 39 | 39 | Intron( | LSC; IRA |
| F | – | – | – | 32 | 32 | IGS( | LSC |
| F | – | 31 | 31 | 31 | 31 | IGS( | LSC |
| F | – | – | – | – | 31 | Intron( | LSC |
| P | 30 | 30 | 30 | 30 | 30 | LSC | |
| P | 30 | – | 30 | – | Intron( | SSC | |
| R | 28 | – | 26 | 26 | 27 | IGS( | LSC |
| P | 28 | 28 | 28 | – | 28 | IGS( | SSC |
| F | 27 | – | – | – | – | IGS( | IRA(IRB) |
| P | 27 | – | – | – | – | IGS( | IRA; IRB |
| P | 26 | 26 | 26 | 26 | 26 | IGS( | LSC |
| F | – | 25 | 26 | – | – | IGS( | LSC |
GC content of coding regions in the chloroplast genome of five Morus species.
| Organism | CDS | Codons | GCc% | GC1% | GC2% | GC3% | GC3s% | ENc |
|---|---|---|---|---|---|---|---|---|
| 85 | 26599 | 37.29 | 45.13 | 37.42 | 29.3 | 26.28 | 49.28 | |
| 85 | 26599 | 37.3 | 45.13 | 37.44 | 29.3 | 26.28 | 49.27 | |
| 84 | 26301 | 37.32 | 45.17 | 37.54 | 29.25 | 26.23 | 49.24 | |
| 84 | 26544 | 37.33 | 45.19 | 37.56 | 29.24 | 26.23 | 49.22 | |
| 84 | 26301 | 37.32 | 45.11 | 37.55 | 29.31 | 26.29 | 49.25 |
Notes.
Coding sequence
GC content in coding regions
GC content at first, second, third, and synonymous third codon positions, respectively
effective number of codons
Figure 2ENc plots for the chloroplast genome of five Morus species.
Solid lines are expected ENc from GC3. (A) M. cathayana; (B) M. multicaulis; (C) M. indica; (D) M. mongolica; (E) M. notabilis.
Figure 3Sliding window analysis of the whole plastome for five Morus species (window length 600 bp, step size 200 bp).
X-axis: position of the midpoint of a window; Y-axis: nucleotide diversity of each window.
Transitions (Ts) and transversions (Tv) in the chloroplast genome of five Morus species.
| Ts | Tv | |||||
|---|---|---|---|---|---|---|
| a < − >g | t < − >c | a < − >t | a < − >c | t < − >g | g < − >c | |
| 13 | 22 | 4 | 9 | 18 | 4 | |
| 16 | 28 | 2 | 8 | 6 | 1 | |
| – | – | – | 1 | – | 1 | |
| 4 | 2 | – | 2 | 1 | 1 | |
| 7 | 3 | 1 | – | – | – | |
| sum | 40 | 55 | 7 | 20 | 25 | 7 |
Synonymous (S) and non-synonymous (N) substitutions in chloroplast coding genes for five Morus species.
| Gene | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| S | N | S | N | S | N | S | N | S | N | ||
| Photosynthetic apparatus | 1 | 1 | |||||||||
| 1 | 1 | 21 − | 2 | 1 | |||||||
| 2 | |||||||||||
| 3 | 1 | ||||||||||
| 1 | |||||||||||
| 2 | |||||||||||
| 1 | |||||||||||
| 1 | |||||||||||
| 1 | 2 | ||||||||||
| 1 | |||||||||||
| 1 | |||||||||||
| 2 | |||||||||||
| 1 | |||||||||||
| Photosynthetic metabolism | 2 | ||||||||||
| 2 | 2 | 1 | |||||||||
| 1 | 1 | ||||||||||
| 1 | 1 | ||||||||||
| 1 | |||||||||||
| 1 | 2 | 1 | |||||||||
| 1 | |||||||||||
| 1 | 3 | 1 | 1 | 1 | 1 | ||||||
| 1 | |||||||||||
| 1 | |||||||||||
| 6 + | 1 | ||||||||||
| Gene expression | 1 | 1 | |||||||||
| 1 | |||||||||||
| 1 | 1 | ||||||||||
| 1 | |||||||||||
| 1 | 1 | ||||||||||
| 1 | 1 | ||||||||||
| 2 | |||||||||||
| 2 | 3 | ||||||||||
| 1 | |||||||||||
| 2 | |||||||||||
| 2 | 2 | ||||||||||
| 1 | 1 | ||||||||||
| 3 | 5 | ||||||||||
| 1 | |||||||||||
| 1 | |||||||||||
| 1 | 2 | ||||||||||
| Other genes | 1 | 1 | 1 | ||||||||
| 1 | |||||||||||
| 1 | |||||||||||
| 2 | 3 | 1 | 1 | ||||||||
| 5 | 11 + | 2 | 2 | 1 | |||||||
| 1 | |||||||||||
| 8 | 17 | 1 | 0 | 1 | 2 | 0 | 3 | 1 | 1 | ||
Notes.
positive selection
negative selection