| Literature DB >> 36092433 |
Jiao Zhang1, Xiulian Chi2, Juying Zhong1, Alisdair Fernie3, Saleh Alseekh3, Luqi Huang2, Dan Qian1.
Abstract
The internal transcribed spacer (ITS) is one of the most extensively sequenced molecular markers in plant systematics due to its generally concerted evolution. While non-concerted evolution has been found in some plant taxa, such information is missing in Lycium. Molecular studies of six species and two variants of the genus Lycium revealed high levels of intra- and inter-individual polymorphism in the ITS, indicating non-concerted evolution. All genomic DNA ITS paralogues were identified as putative pseudogenes or functional paralogues through a series of comparisons of sequence features, including length and substitution variation, GC content, secondary structure stability, and the presence of conserved motifs in the 5.8S gene, and the rate of evolution. Approximately, 60% of ITS pseudogenes could be easily detected. Based on phylogenetic analysis, all pseudogenes were highly distinct from their corresponding functional copies, tended to evolve neutrally, and clustered randomly together in the evolutionary tree. The results probably suggest that this ITS non-concerted evolution is related to the recent divergence between tandem repeats within the Lycium genome and hybridization between species. Our study complements those of pseudogenes in plant taxa and provides a theoretical basis for the phylogeny and genetic origin of the genus Lycium while having important implications for the use of ITS molecular markers for phylogenetic reconstruction.Entities:
Keywords: Lycium; internal transcribed spacer (ITS); non-concerted evolution; phylogeny; polymorphism; pseudogenes
Year: 2022 PMID: 36092433 PMCID: PMC9453804 DOI: 10.3389/fpls.2022.984579
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Tandem repeats found among ITS sequences in Lycium.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| ITS1/ITS4 | GAATCATT | 8 | 2.3 | 90 | ITS1 | |
| ATTCACTGAATTCTGCA | 17 | 3 | 57 | 5.8S | ||
| ATTCACTAAATTATCGCA |
| 18 | 2.4 | 70 | 5.8S | |
| ATTCACTGAATTCTGCA |
| 17 | 3 | 57 | ITS2 | |
| TCTGGGGCGCCAAAATACGCTGA | 23 | 2 | 72 | ITS1 | ||
| AGTGAATTGCGAGAAGTA |
| 18 | 2.4 | 70 | 5.8S | |
| C |
| 1 | 15 | 100 | ITS1 | |
| TGAATCAT | 8 | 4.5 | 66 | 5.8S | ||
| TGGTGTCGCGG | 11 | 2.5 | 87 | ITS2 | ||
| TCCCCCCATTTTTC | 14 | 1.9 | 92 | ITS1 | ||
| GTGAATCGCAGAATCA | 16 | 2.6 | 62 | 5.8S | ||
| GAATCATT | 8 | 3.4 | 75 | 5.8S | ||
| TGCAGAATACAGTGAAT | 17 | 3 | 57 | 5.8S | ||
| P1/P2 | TCAGCACGCGCG |
| 12 | 2.2 | 85 | ITS1 |
| TGCAGAATACAGTGAAT |
| 17 | 3 | 57 | 5.8S | |
| GAATCATT |
| 8 | 3.4 | 75 | 5.8S | |
| TGGTGTCGCGG | 11 | 2.5 | 87 | ITS2 | ||
| CG | 2 | 10.5 | 78 | ITS2 | ||
| GC | 2 | 10.5 | 78 | ITS1 | ||
| TATATTAAGAACT | 12 | 2 | 76 | ITS1 | ||
| GACGCGCATGCG | 12 | 2.2 | 85 | ITS2 | ||
| ATTCACTGAATTCTGCA | 17 | 3 | 57 | 5.8S |
Figure 1Plot of the G + C content of all paralogues obtained from five Lycium species. (A,B) ITS1 vs. ITS2. (C,D) 5.8S vs. the minimum free energy (ΔG at 37°C) of the secondary structure of 5.8S. (E,F) ITS2 vs. the minimum free energy (ΔG at 37°C) of the secondary structure of ITS2. The left panel shows the results for primer pair ITS1/ITS4, and the right panel shows those for primer pair P1/P2. Shaded symbols indicate putative functional sequences and blank symbols indicate putative pseudogenes.
Average lengths and GC contents of ITS regions and free energy of 5.8S secondary structures of five species of Lycium.
|
|
|
|
|
| Δ | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| ||||
| ITS1/ITS4 | LD | 7 | P | 572 (35.4) | 197.6 (18.5) | 157.1 (3.0) | 217.1 (31.7) | 45.0 (0.0) | 45.3 (0.0) | 47.8 (0.0) | −45.0 (3.7) | −66.3 (9.4) |
| LTΔ | 12 | P | 571 (1.7) | 199.3 (3.8) | 153.5 (3.0) | 218.2 (0.8) | 44.3 (0.0) | 45.1 (0.0) | 49.2 (0.0) | −27.1 (0.9) | −66.8 (1.1) | |
| LRΔ | 1 | F | 687 (0.0) | 251 (0.0) | 165 (0.0) | 271 (0.0) | 62.2 (0.0) | 50.91 (0.0) | 63.5 (0.0) | −41.4 (0.0) | −110.7 (0.0) | |
| 5 | P | 588 (4.5) | 219.8 (2.1) | 155.6 (0.8) | 212.6 (3.2) | 59.5 (0.0) | 47.9 (0.0) | 58.9 (0.0) | −44.72 (5.0) | −80.7 (0.2) | ||
| LBVA | 1 | F | 691 (0.0) | 251 (0.0) | 159 (0.0) | 281 (0.0) | 62.2 (0.0) | 50.9 (0.0) | 64.4 (0.0) | −41.4 (0.0) | −122.2 (0.0) | |
| 9 | P | 583.7 (24.1) | 203 (23.7) | 157.0 (0.0) | 213.7 (0.5) | 51.7 (0.0) | 45.1 (0.0) | 50.9 (0.0) | −46.5 (1.0) | −72.6 (7.1) | ||
| LCΔ | 5 | F | 657 (25.5) | 231.2 (24.7) | 154.8 (2.64) | 268.2 (6.8) | 58.9 (0.0) | 54.0 (0.0) | 59.6 (0.0) | −46.06 (4.4) | −104.56 (9.2) | |
| 118 | P | 565.9 (16.4) | 199.4 (13.1) | 154.1 (3.7) | 212.6 (11.5) | 47.8 (0.1) | 46.3 (0.0) | 50.9 (0.0) | −47.8 (2.8) | −67.3 (8.2) | ||
| ALL | 7 | F | 666.1 (26.0) | 236.9 (22.7) | 156.9 (4.3) | 270.4 (7.2) | 59.8 (0.0) | 53.1 (0.0) | 60.8 (0.0) | −44.7 (4.3) | −108 (9.9) | |
| 151 | P | 567.3 (17.0) | 200.0 (14.0) | 154.4 (3.6) | 213.0 (12.0) | 47.9 (0.1) | 46.1 (0.0) | 50.8 (0.0) | −47.4 (2.9) | −67.7 (7.7) | ||
| P1/P2 | LC | 19 | F | 657.5 (36.7) | 229.6 (29.9) | 161.3 (6.9) | 266.6 (7.4) | 59.0 (0.0) | 50.9 (0.0) | 60.4 (0.0) | −45.0 (4.1) | −108.2 (4.5) |
| 3 | P | 684.7 (0.5) | 265.0 (2.2) | 158.0 (0.0) | 261.7 (1.9) | 43.4 (0.0) | 47.1 (0.0) | 52.6 (0.0) | −46.5 (0.8) | −89.5 (8.9) | ||
F, putative functional ITS sequences; P, presumed ITS pseudogene sequences; SD, standard deviation; Δ, Direct sequencing succeeded.
Nucleotide diversity of individual parts and the entire ITS region in Lycium.
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| ITS1/ITS4 | ITS1 | C | 31 | 119 | 331 | 0.60701 | 72.234 | 0.96829 |
| F | 7 | 169 | 266 | 0.48451 | 99.80952 | 0.57444 | ||
| P | 151 | 168 | 504 | 0.64888 | 109.01168 | 3.57845 | ||
| 5.8S | C | 31 | 149 | 411 | 0.65802 | 98.045 | 1.35911 | |
| F | 7 | 140 | 209 | 0.51003 | 77.52381 | 0.07743 | ||
| P | 151 | 150 | 406 | 0.50048 | 75.07223 | 3.42726 | ||
| ITS2 | C | 31 | 183 | 542 | 0.68977 | 126.228 | 1.73270** | |
| F | 7 | 257 | 562 | 0.69472 | 179.23810 | −0.13655 | ||
| P | 151 | 185 | 552 | 0.62197 | 115.06414 | 3.49745 | ||
| ITS | C | 31 | 540 | 1,584 | 0.67503 | 364.514 | 1.51015* | |
| F | 7 | 591 | 1,000 | 0.58723 | 371.71429 | 0.72990 | ||
| P | 151 | 532 | 1,596 | 0.67572 | 359.48342 | 3.63702 | ||
| P1/P2 | ITS1 | C | 52 | 159 | 450 | 0.59544 | 94.67572 | 1.77397** |
| F | 19 | 154 | 454 | 0.70165 | 108.05381 | 2.10623** | ||
| P | 7 | 126 | 133 | 0.25196 | 39.80952 | −1.57319 | ||
| 5.8S | C | 52 | 148 | 386 | 0.64316 | 95.18824 | 1.69113** | |
| F | 19 | 148 | 418 | 0.66241 | 98.03704 | 2.01404** | ||
| P | 7 | 157 | 231 | 0.53426 | 87.61905 | 0.17043 | ||
| ITS2 | C | 52 | 172 | 491 | 0.54802 | 94.259 | 1.34687 | |
| F | 19 | 169 | 496 | 0.65802 | 111.20539 | 1.94645** | ||
| P | 7 | 124 | 227 | 0.60789 | 77.80952 | 0.03303 | ||
| ITS | C | 52 | 515 | 1,509 | 0.61647 | 317.484 | 1.99136** | |
| F | 19 | 485 | 1,439 | 0.69542 | 337.27743 | 2.14384** | ||
| P | 7 | 446 | 959 | 0.68381 | 307.71429 | −0.10613 |
aC, cDNA functional sequence; F and P as in Table 2. But primer pairs P1/P2 pseudogenes include four sequences with deletions greater than 100 bp.
bNucleotide diversity (π).
cAverage number of nucleotide differences (K).
d*P < 0.05, **P < 0.02.
Figure 2Schematic illustration of the distribution of substitution sites across the entire ITS region. (A) Assumed pseudogenes (excluding sequences obtained via direct sequencing) were yielded by primers ITS1-ITS4 using the putative functional sequence from LC17-1-1 as a reference. (B) Putative functional copies and cDNA paralogues from primers P1-P2 using CHB1-1 as a reference (red = GG > AG, cyan = GA > AA, green = GC > AC, magenta = GT > AT, black = not a G > A transition, yellow = gap) (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article).
Figure 3Neighbor-joining trees constructed by using the P-distance matrix from the entire ITS region of all paralogues (1,000 bootstraps). (A) The phylogenetic tree constructed using all sequences from the primer pair ITS1/ITS4. (B) The phylogenetic tree constructed using all sequences from the primer pair P1/P2. F represents presumed functional ITS sequences, P represents putative ITS pseudogene sequences, and cDNA represents ITS cDNA sequence bootstrap values >50%.