| Literature DB >> 35406909 |
Ruixin Cao1, Shuyan Tong1, Tianjing Luan1, Hanyun Zheng1, Wei Zhang1.
Abstract
A compensatory base change (CBC) that coevolves in the secondary structure of ribosomal internal transcribed spacer 2 (ITS2) influences the estimation of genetic distance and thus challenges the phylogenetic use of this most popular genetic marker. To date, however, the CBC effect on ITS2 genetic distance is still unclear. Here, ITS2 sequences of 46 more recent angiosperm lineages were screened from 5677 genera and phylogenetically analyzed in sequence-structure format, including secondary structure prediction, structure-based alignment and sequence partition of paired and unpaired regions. ITS2 genetic distances were estimated comparatively by using both conventional DNA substitution models and RNA-specific models, which were performed in the PHASE package. Our results showed that the existence of the CBC substitution inflated the ITS2 genetic distances to different extents, and the deviation could be 180% higher if the relative ratio of substitution rate in ITS2 secondary structure stems was threefold higher than that in the loops. However, the CBC effect was minor if that ratio was below two, indicating that the DNA model is still applicable in recent lineages in which few CBCs occur. We thus provide a general empirical threshold to take account of CBC before ITS2 phylogenetic analyses.Entities:
Keywords: ITS2; RNA substitution model; compensatory base change; genetic distance; secondary structure; sister species pairs
Year: 2022 PMID: 35406909 PMCID: PMC9003356 DOI: 10.3390/plants11070929
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1ITS2 secondary structures and the compensatory base change (CBC) on their base pairs. (A) A snapshot of ITS2 stem II showing how CBC occurs on RNA secondary structure; (B) an example of ITS2 consensus secondary structure derived from the sister species pair of Sinosenecio bodinieri and S. confervifer. The four stems are labelled I–IV. The pyrimidine–pyrimidine bulge in stem II, the UGGU in stem III and the high adenine content between stems that are typical of nearly all angiosperm ITS2 secondary structures are indicated in red. The degree of conservation over the entire alignment is displayed in color grades from green (conservative) to red (variable), and the variable bases are labeled with site numbers.
Figure 2Comparisons of distinct likelihoods obtained from the best-fitting DNA and RNA models. (A) -ln L likelihood; (B) AICc likelihood. The same ITS2 sequence-structure alignments analyzed separately with DNA and RNA-models are connected with lines.
Figure 3A scatter plot showing the effect of compensatory base change on genetic distance. As the rate of substitution between stem and loop regions increases, the rate of genetic distance between RNA and DNA models becomes less and less, making RNA models more and more effective and play a leading role when the substitution rate ratio is greater than three.