Literature DB >> 35498650

5S Ribosomal DNA of Genus Solanum: Molecular Organization, Evolution, and Taxonomy.

Yurij O Tynkevich1, Antonina Y Shelyfist1, Liudmyla V Kozub1, Vera Hemleben2, Irina I Panchuk1,2, Roman A Volkov1.   

Abstract

The Solanum genus, being one of the largest among high plants, is distributed worldwide and comprises about 1,200 species. The genus includes numerous agronomically important species such as Solanum tuberosum (potato), Solanum lycopersicum (tomato), and Solanum melongena (eggplant) as well as medical and ornamental plants. The huge Solanum genus is a convenient model for research in the field of molecular evolution and structural and functional genomics. Clear knowledge of evolutionary relationships in the Solanum genus is required to increase the effectiveness of breeding programs, but the phylogeny of the genus is still not fully understood. The rapidly evolving intergenic spacer region (IGS) of 5S rDNA has been successfully used for inferring interspecific relationships in several groups of angiosperms. Here, combining cloning and sequencing with bioinformatic analysis of genomic data available in the SRA database, we evaluate the molecular organization and diversity of IGS for 184 accessions, representing 137 species of the Solanum genus. It was found that the main mechanisms of IGS molecular evolution was step-wise accumulation of single base substitution or short indels, and that long indels and multiple base substitutions, which arose repeatedly during evolution, were mostly not conserved and eliminated. The reason for this negative selection seems to be association between indels/multiple base substitutions and pseudogenization of 5S rDNA. Comparison of IGS sequences allowed us to reconstruct the phylogeny of the Solanum genus. The obtained dendrograms are mainly congruent with published data: same major and minor clades were found. However, relationships between these clades and position of some species (S. cochoae, S. clivorum, S. macrocarpon, and S. spirale) were different from those of previous results and require further clarification. Our results show that 5S IGS represents a convenient molecular marker for phylogenetic studies on the Solanum genus. In particular, the simultaneous presence of several structural variants of rDNA in the genome enables the detection of reticular evolution, especially in the largest and economically most important sect. Petota. The origin of several polyploid species should be reconsidered.
Copyright © 2022 Tynkevich, Shelyfist, Kozub, Hemleben, Panchuk and Volkov.

Entities:  

Keywords:  5S rDNA; Solanum; genomics; hybridization; molecular evolution; polyploidy; taxonomy

Year:  2022        PMID: 35498650      PMCID: PMC9043955          DOI: 10.3389/fpls.2022.852406

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   6.627


Introduction

Regions coding for 5S rRNA (rDNA) are present in genomes of all cellular organisms. In eukaryotes, 5S rDNA belongs to the class of moderately repeated sequences and is represented by hundreds or thousands of copies of tandemly arranged repeated units (repeats). 5S rDNA clusters are mostly located in one or two chromosomes, although multiple loci are also found (Volkov et al., 2004; Hasterok et al., 2006; Garcia et al., 2012; Bustos et al., 2020; Vozárová et al., 2021). In contrast to majority of repeated sequences whose functions largely remain uncertain, the activity of 5S rDNA is vital for cells, providing rRNA indispensable for assembly of functional ribosomes. The copy number of rDNA repeats is higher than what is required for rRNA synthesis, and redundant copies of rDNA are transcriptionally silenced (Volkov et al., 2004; Tucker et al., 2010; Layat et al., 2012; Matyasek et al., 2016). Each 5S rDNA repeat consists of an evolutionarily conserved region encoding 5S rRNA (coding sequence, CDS) and a rapidly evolving intergenic spacer (IGS) (Volkov et al., 2001; Ishchenko et al., 2018; Simon et al., 2018; Qin et al., 2019). The high evolutionary stability of the CDS is the result of purifying selection to maintain the function of 5S rRNA as a component of a large ribosome subunit (Vizoso et al., 2011; Mahelka et al., 2013). Transcription of 5S rDNA is provided by RNA polymerase III (Pol III) and corresponding transcription factors (TFs). The Pol III promoter consists of internal and external elements. The internal elements of the promoter, A-Box, IE, and C-Box, are located in the CDS and represent targets for TFs, which are necessary for the recruitment of Pol III to the transcription initiation complex (Douet and Tourmente, 2007; Layat et al., 2012). Respectively, mutations in internal elements of the promoter should not only disturb the structure of 5S rRNA and ribosome but also affect binding of TFs to the promoter and, thus, the expression of 5S rDNA. In contrast to the CDS, the main part of the IGS is not transcribed and probably does not have any function. Accordingly, it is believed that any mutation in the IGS is selectively neutral; therefore, this region evolves with a high rate. However, it was found that the IGS of Arabidopsis thaliana contains short-sequence motifs involved in initiation (external elements of the promoter) and termination (terminator) of 5S rDNA transcription (Douet and Tourmente, 2007; de Souza et al., 2020). Thus, one would expect that these regions are to be relatively conservative, as has been demonstrated in several taxonomic groups (Falistocco et al., 2007; Tynkevich and Volkov, 2014; Tynkevich et al., 2015; Mlinarec et al., 2016; Ishchenko et al., 2018; Alexandrov et al., 2021). However, the existing knowledge of the organization of external promoter elements and their molecular evolution is still incomplete. In many diploid species, numerous copies of rDNA repeats in the same genome tend to be nearly identical because of sequence homogenization (Volkov et al., 1999b,2001, 2003), i.e., individual copies of repeated elements do not evolve independently but in a concerted manner (Arnheim et al., 1980; Coen et al., 1982). To explain the high intragenomic similarity of 5S rDNA repeats, the “concerted evolution” and “birth and death” hypotheses were proposed. The mechanisms and intensity of homogenization may differ in different taxa and for different groups of repeated sequences, e.g., for 5S and 35S rDNA (Pinhal et al., 2011; Vizoso et al., 2011; Song et al., 2012; Mahelka et al., 2013; Galián et al., 2014; Barman et al., 2016; Volkov et al., 2007, 2017; Chen et al., 2021). Additive inheritance of both parent variants of 5S and 35S rDNA is usually observed in the first generation of interspecific hybrids/allopolyploids. However, in ancient allopolyploids, 35S rDNA can be the subject of interlocus sequence conversion (Volkov et al., 1999a,b, 2007; Vizoso et al., 2011), while different variants of 5S rDNA can coexist in a plant genome for a long time without being homogenized (Fulnecek et al., 2002; Song et al., 2012; Mahelka et al., 2013; Mlinarec et al., 2016; Volkov et al., 2017; Ishchenko et al., 2018). Especially, a significant sequence divergence was found for spatially distant 5S rDNA variants that are located in different loci of the same chromosome set (Cronn et al., 1996; Vozárová et al., 2021) and for different parent loci in genomes of hybrid origin (Fulnecek et al., 2002; Matyasek et al., 2002), while repeats from the same locus appeared to be highly homogenized. Accordingly, 5S rDNA became an attractive focus for investigation of molecular evolution of repeated sequences, identification of hybrids, and phylogenetic studies on angiosperms (Blöch et al., 2009; Baum et al., 2012; Simeone et al., 2018; Tynkevich and Volkov, 2019; Ishchenko et al., 2020, 2021; Cardoni et al., 2021; Vozárová et al., 2021). However, 5S rDNA is still poorly characterized in many important plant groups such as the Solanum L. genus. The Solanum (nightshade) genus is an attractive model for comparative genomics and investigation of molecular evolution of repeated sequences. With around 1,200 species, it belongs to the so-called “giant genera” and is the fifth largest genus of flowering plants (Frodin, 2004; Echeverría-Londoño et al., 2020). Solanum species are distributed worldwide from tropical to temperate areas and grow under diverse ecological conditions. Most Solanum species inhabit the New World, although secondary centers of diversity have been found in Africa, Asia, and Australia (D’Arcy, 1991). Overall, the Solanum genus is an example of unusual hyperdiversity in life forms, morphological features, and ecological preferences, representing a unique system for studying the diversification of plants (Knapp et al., 2004; Echeverría-Londoño et al., 2020). The genus includes important crops such as S. tuberosum L. (potato), S. lycopersicum L. (tomato), and S. melongena L. (brinjal eggplant, aubergine), about 20 cultivated species of local significance like S. aethiopicum L. (Ethiopian eggplant), S. betaceum Cav. (tamarillo), S. muricatum Aiton (pepino), and S. quitoense Lam. (lulo), as well as several medicinal and ornamental plants (S. marginatum L.f., S. aviculare G. Forst., S. mammosum L., and S. pseudocapsicum L.). In the Solanaceae family, the Solanum genus belongs to the strongly supported large “x = 12” clade (Olmstead et al., 2008). The most common chromosome number in Solanum is x = 12, which occurs in 97% of species examined, such as diploids (77%), tetraploids (14%), hexaploids (4%), triploids (2%), and octoploids (0.2%). Application of in situ hybridization showed that 55 out of 64 (85.9%) diploid species possess only one 5S locus per chromosome set (Chiarini et al., 2018). Up to now, the molecular organization and evolution of the 5S rDNA in the genus Solanum have only been analyzed in about 35 species and breeding lines (Volkov et al., 2001; Davidjuk et al., 2010; Sun et al., 2014). In this study, combining cloning and sequencing with analysis of available genomic data in the Sequence Read Archive (SRA) public database, we evaluate the molecular organization, diversity, and evolution of the IGS for 184 plant accessions, representing 137 species across the Solanum genus. Especially, our results shed a new light on the phylogeny of the genus and reticulate evolution of the largest and economically most important sect. Petota.

Materials and Methods

Plant Material and DNA Extraction

A plant material of the Solanum species was obtained from several collections (see Tables 1, 2). A plant material of out-group species, Lycianthes lycioides (L.) Hassl. and Physalis peruviana L. (acc. no. NK-03), was obtained from Orto Botanico di Padova (Italy) and National Botanical Garden of National Academy of Sciences of Ukraine (Kyiv, Ukraine), respectively.
TABLE 1

List of Solanum species analyzed (excluding sect. Petota).

Species nameTaxonomy
Chromosome number, 2 nAbbreviationPlant material
Nee, 1999 (Subgenus-Section)Särkinen et al., 2013 (Clades)Accession NoSource
S. abutiloides (Griseb.) Bitter and LilloSolanum-BrevantherumLeptostemonum-Brevantherum24abu 19682363 MBG
S. aculeatissimum Jacq.Leptostemonum-AcanthophoraLeptostemonum-Acanthophora24acu 79p515 WABG
S. aethiopicum L.Leptostemonum-MelongenaLeptostemonumOld World24aet SAMN 10986202 PRJNA 523664
S. albostellatum R.W. Davis and P.J.H. HurterNot indicatedNot indicatedndals SAMN 10969051 PRJNA 522689
S. americanum Mill.Solanum-SolanumMorelloid24ame1 SAMEA 3486921 PRJEB 9916
ame2 SAMEA 3486922 PRJEB 9916
ame3 SAMEA 7573861 PRJEB 38240
S. anguivi Lam.Not indicatedLeptostemonumOld World24ang SAMN 16746499 PRJNA 676007
S. anomalostemon S. Knapp and M. NeeNot indicatedNot indicated24ano SAMEA 7820352 PRJEB 42506
S. appendiculatum DunalSolanum-AnarrhichomenumPotato-Anarrhichomenum24ape1 SAMN 12623209 PRJNA 561636
ape2 SAMN 12623212 PRJNA 561636
S. aviculare G. Forst.Solanum-ArchaesolanumArchaesolanum46avi 19771009 MBG
S. betaceum Cav.Bassovia-PachyphyllaLeptostemonum-Cyphomandra24betBGUT
S. chrysotrichum Schltdl.Not indicatedNot indicated24chr SAMN 08770449 PRJNA 438407
S. clarkiae SymonNot indicatedLeptostemonumOld World24cla SAMN 12161630 PRJNA 551615
S. cleistogamum SymonLeptostemonum-MelongenaLeptostemonumOld World24cle SAMN 10969163 PRJNA 522689
S. clivorum S. Knapp Solanum-HolophyllaNot indicatedndcli SAMEA 7820346 PRJEB 42506
S. cochoae G.J. Anderson and BernardelloSolanum-BasarthrumNot indicated24coc SAMEA 7820347 PRJEB 42506
S. crinitum Lam.Leptostemonum-CrinitumLeptostemonum-Androceras/Crinitum24cri 74s1231 WABG
S. dimorphandrum S. KnappNot indicatedNot indicated24dim SAMEA 7820348 PRJEB 42506
S. diversiflorum F. Muell.Not indicatedLeptostemonumOld World24div SAMN 10969025 PRJNA 522689
S. dulcamara L.Solanum-DulcamaraDulcamaroid24dul 96065 BGUT
S. elatius A.R. BeanNot indicatedNot indicated24ela SAMN 10969339 PRJNA 522689
S. erianthum D. DonSolanum-BrevantherumNot indicated24eri SAMN 08770591 PRJNA 438407
S. esuriale Lindl.Not indicatedLeptostemonumOld World24, 48esu SAMN 10969026 PRJNA 522689
S. ferocissimum Lindl.Leptostemonum-MelongenaLeptostemonumOld World24, 48fer SAMN 10969027 PRJNA 522689
S. guamense Merr.Not indicatedNot indicatedndgua 81s39 WABG
S. hindsianum Benth.Leptostemonum-MelongenaLeptostemonum-Elaeagnifolium24hinLDZG
S. horridum Dunal ex Poir.Not indicatedLeptostemonumOld World24hor SAMN 10969028 PRJNA 522689
S. incanum L.Not indicatedLeptostemonumOld World24inc SAMN 07303451 PRJNA 392603
S. laciniatum AitonSolanum-ArchaesolanumArchaesolanum92lac SAMEA 7820351 PRJEB 42506
S. lasiophyllum Humb. and Bonpl. ex DunalNot indicatedNot indicated24, 48las SAMN 10969030 PRJNA 522689
S. linnaeanum Hepper and P.-M. L. JaegerNot indicatedLeptostemonumOld World24lin SAMN 13023229 PRJNA 577305
S. macrocarpon L.Leptostemonum-MelongenaLeptostemonumOld World24mac SAMN 16746492 PRJNA 676007
S. mammosum L.Leptostemonum-AcanthophoraLeptostemonum-Acanthophora22mamBGUT
S. medicagineum A.R. BeanNot indicatedNot indicatedndmdg SAMN 12096241 PRJNA 533457
S. melongena L.Leptostemonum-MelongenaLeptostemonumOld World24mel1cultivar Black BeautyVASSMA Ltd.
mel2 SAMN 13023228 PRJNA 577305
mel3 SAMN 07303456 PRJNA 392603
S. muricatum AitonSolanum-BasarthrumPotato-Basarthrum24murBGUT
S. nigrum L.Solanum-SolanumMorelloid24, 48, 72nig SAMN 17035829 PRJNA 683719
S. ossicruentum Martine and J. CantleyNot indicatedNot indicated48oss SAMN 12161629 PRJNA 533451
S. pachyandrum BitterLeptostemonum-HerposolanumNot indicated24pac SAMEA 7820344 PRJEB 42506
S. paposanumPhil.Not indicatedPotato-Regmandra24pap SAMEA 7820349 PRJEB 42506
S. phlomoides A. Cunn. ex Benth.Not indicatedLeptostemonumOld World24, 48phl SAMN 10969029 PRJNA 522689
S. pseudocapsicum L.Solanum-HolophyllaLeptostemonum-Geminata24pseBGChNU
S. pseudolulo HeiserNot indicatedLeptostemonum-Lasiocarpa24psl XX-GZU-88100737 BGUG
S. quitoense Lam.Not indicatedLeptostemonum-Lasiocarpa24qui XX-GZU-00120822 BGUG
S. scabrum Mill.Solanum-SolanumMorelloid72sca SAMN 08456262 PRJNA 432637
S. seaforthianumAndrewsSolanum-DulcamaraNot indicated24sea 74p1254 WABG
S. sejunctum Brennan, Martine and SymonLeptostemonumOld World24sej SAMN 12161632 PRJNA 551616
S. sisymbriifolium Lam.Leptostemonum-MelongenaLeptostemonum-Sisymbriifolium24sis SAMN 16746501 PRJNA 676007
S. spirale Roxb.Not indicatedNot indicated48spi SAMN 08770592 PRJNA 438407
S. torvum Sw.Leptostemonum-TorvaLeptostemonum-Torva24, 48trv SAMN 16746498 PRJNA 676007
S. valdiviense DunalNot indicatedUnclear24val SAMEA 7820350 PRJEB 42506
S. vespertilio ssp. vespertilio AitonLeptostemonum-MelongenaLeptostemonumOld World24vesBGUT
S. villosum Mill.Solanum-SolanumMorelloid48vilBGChNU
S. wendlandii Hook.f.Leptostemonum-HerposolanumLeptostemonum-Allophyllum and Wendlandii24wen 77c37 WABG
S. wrightii Benth.Leptostemonum-CrinitumLeptostemonum-Androceras/Crinitum24wri SAMN 16746495 PRJNA 676007

Taxonomy is shown according to

TABLE 2

List of analyzed Solanum species of sect. Petota.

Species nameTaxonomy
Chromosome number, 2 nPlant material
Hawkes, 1990, Nee, 1999 (Subsection-Series)Huang et al., 2019 (Clade)Abbre-viationAccession NoSource
S. abancayense OchoaPotatoe-Tuberosa (ii)Clade 4 north24abn SAMN 07540430 PRJNA 394943
S. acaule BitterPotatoe-AcauliaNot indicated48aclCIP
S. achacachense CardenasPotatoe-Tuberosa (iii)Clade 4 north24ach SAMN 07540512 PRJNA 394943
S. acroglossum Juz.Potatoe-PiuranaClade 324acg SAMN 07540377 PRJNA 394943
S. acroscopicum OchoaPotatoe-Tuberosa (ii)Clade 324acs SAMN 07540369 PRJNA 394943
S. ahanhuiri Juz. and BukasovPotatoe-Tuberosa (cult.)Not indicated24ajh SAMN 12684889 PRJNA 556263
S. albornozii CorrellPotatoe-PiuranaClade 324abz SAMN 07540378 PRJNA 394943
S. ambosinum OchoaPotatoe-Tuberosa (ii)Clade 4 north24amb1 SAMN 07540480 PRJNA 394943
amb2 SAMN 07540482 PRJNA 394943
S. andreanum BakerPotatoe-Tuberosa (i)Clade 324, 48adr SAMN 07540382 PRJNA 394943
S. arcanum PeraltaEstolonifera-NeolycopersiconNot indicated24arc SAMEA 2335233 PRJEB 5226
S. avilesii Hawkes and Hjert.Potatoe-Tuberosa (iii)Clade 4 south24avl SAMN 07540476 PRJNA 394943
S. berthaultii HawkesPotatoe-Tuberosa (iii)Clade 4 south24ber1 BGRC 18548 GDC
ber2 SAMN 07540477 PRJNA 394943
S. blanco-galdosii OchoaPotatoe-PiuranaClade 324blg SAMN 07540379 PRJNA 394943
S. boliviense DunalPotatoe-MegistacrolobaNot indicated24blv SAMN 06564709 PRJNA 378971
S. brevicaule BitterPotatoe-Tuberosa (iii)Clade 4 south24, 48, 72brc SAMN 07540508 PRJNA 394943
S. bukasovii Juz. ex RybinPotatoe-Tuberosa (ii)Clade 4 north24buk1 BGRC N 15424 GDC
buk2 SAMN 07540400 PRJNA 394943
buk3 SAMN 07540415 PRJNA 394943
buk4 SAMN 07540419 PRJNA 394943
buk5 SAMN 07540466 PRJNA 394943
buk6 SAMN 07540519 PRJNA 394943
buk7 SAMN 07540520 PRJNA 394943
S. bukasovii f. multidissectum (Hawkes) OchoaPotatoe-Tuberosa (ii)Clade 4 north24bukm1 SAMN 07540390 PRJNA 394943
bukm2 SAMN 07540456 PRJNA 394943
bukm3 SAMN 07540457 PRJNA 394943
S. bulbocastanum DunalPotatoe-BulbocastanaClade 1+224blb1 BGRC N 08006 GDC
blb2 SAMN 07540359 PRJNA 394943
S. cajamarquense OchoaPotatoe-Tuberosa (ii)Clade 4 south24cjm SAMN 07540364 PRJNA 394943
S. canasense HawkesPotatoe-Tuberosa (ii)Clade 4 north24can SAMN 07540552 PRJNA 394943
S. candolleanum P. BerthaultPotatoe-Tuberosa (iii)Not indicated24cnd SAMN 06564692 PRJNA 378971
S. cardiophyllum Lindl.Potatoe-PinnatisectaClade 1+224, 36cph SAMN 07540547 PRJNA 394943
S. chacoense BitterPotatoe-YungasensaClade 4 south24chc1B2MPI
24, 36chc2 SAMN 07540432 PRJNA 394943
S. chaucha Juz. and BukasovPotatoe-Tuberosa (cult.)Not indicated72cha SAMN 12684891 PRJNA 556263
S. cheesmaniae (L. Riley) FosbergEstolonifera-NeolycopersiconNot indicated24che SAMEA 2340812 PRJEB 5235
S. chilense (Dunal) ReicheEstolonifera-NeolycopersiconNot indicated24chi SAMEA 2340822 PRJEB 5235
S. chmielewskii (C.M. Rick et al.) D.M. Spooner et al.Estolonifera-NeolycopersiconNot indicated24cml SAMEA 2340810 PRJEB 5235
S. chomatophilum BitterPotatoe-ConicibaccataClade 324chm SAMN 07540374 PRJNA 394943
S. circaeifolium subsp. quimense Hawkes and Hjert.Potatoe-CircaeifoliaNot indicated24crc BGRC N 27036 GDC
S. commersonii DunalPotatoe-CommersonianaNot indicated24cmm1 BGRC N 17654 GDC
24, 36cmm2 SAMN 06564712 PRJNA 378971
S. corneliomuelleri J.F. Macbr.Estolonifera-NeolycopersiconNot indicated24crm SAMEA 2340786 PRJEB 5235
S. curtilobum Juz. and BukasovPotatoe-Tuberosa (cult.)Not indicated60cur SAMN 12684896 PRJNA 556263
S. demissum Lindl.Potatoe-DemissaNot indicated72dmsCIP
S. ehrenbergii (Bitter) Rydb.Potatoe-PinnatisectaNot indicated24ehr SAMN 06564745 PRJNA 378971
S. etuberosum Lindl.Estolonifera-EtuberosaOutgroup24etb SAMN 07540542 PRJNA 394943
S. galapagense S.C. Darwin and PeraltaEstolonifera-NeolycopersiconNot indicated24gal SAMEA 2340846 PRJEB 5235
S. gourlayi HawkesPotatoe-Tuberosa (iii)Clade 4 south24grl15.6GFP
grl2 SAMN 07540506 PRJNA 394943
S. habrochaites S. Knapp and D.M. SpoonerEstolonifera-NeolycopersiconNot indicated24hab SAMEA 2340830 PRJEB 5235
S. hondelmannii Hawkes and Hjert.Potatoe-Tuberosa (iii)Clade 4 south24hdm SAMN 07540500 PRJNA 394943
S. huaylasense PeraltaEstolonifera-NeolycopersiconNot indicated24hua SAMEA 2340821 PRJEB 5235
S. hypacrarthrum BitterPotatoe-PiuranaClade 324hcr SAMN 07540375 PRJNA 394943
S. incamayoense K.A. Okada and A.M. ClausenPotatoe-Tuberosa (iii)Clade 4 south24inm SAMN 07540492 PRJNA 394943
S. infundibuliforme Phil.Potatoe-CuneoalataNot indicated24ifd SAMN 06564699 PRJNA 378971
S. iopetalum (Bitter) HawkesPotatoe-DemissaNot indicated72iop GLSK 161 IPK
S. jamesii Torr.Potatoe-PinnatisectaClade 1+224jam1 BGRC N 10054 GDC
jam2 SAMN 07540363 PRJNA 394943
S. juzepczukii BukasovPotatoe-Tuberosa (cult.)Not indicated36juz SAMN 12684892 PRJNA 556263
S. kurtzianum Bitter and Wittm.Potatoe-Tuberosa (iii)Clade 4 south24ktz SAMN 07540435 PRJNA 394943
S. laxissimum BitterPotatoe-ConicibaccataClade 4 north24lxs1 GLKS 154.3 IPK
lxs2 SAMN 07540550 PRJNA 394943
S. leptophyes BitterPotatoe-Tuberosa (ii)Clade 4 north24lph8.27GFP
S. limbaniense OchoaPotatoe-ConicibaccataClade 4 north24lmb SAMN 07540465 PRJNA 394943
S. lycopersicoides DunalEstolonifera-JuglandifoliaNot indicated24lpd SAMN 10809628 PRJNA 516877
S. lycopersicum L.Estolonifera-NeolycopersiconOutgroup24lyc1--
lyc2 SAMN 15097861 PRJNA 637170
lyc3 SAMN 11163599 PRJNA 527863
S. lycopersicum var. cerasiforme (Dunal) D.M. Spooner et al.Estolonifera-NeolycopersiconNot indicated24lycc1lycc2 SAMN 09229594 SAMN 09229698 PRJNA 454805PRJNA 454805
S. maglia Schltdl.Potatoe-MagliaClade 4 south36mag BGRC N032571 GDC
S. marinasense VargasPotatoe-Tuberosa (ii)Clade 4 north24mrn SAMN 07540408 PRJNA 394943
S. medians BitterPotatoe-Tuberosa (ii)Clade 4 north24, 36med SAMN 07540469 PRJNA 394943
S. megistacrolobum BitterPotatoe-MegistacrolobaClade 4 south24mga SAMN 07540385 PRJNA 394943
S. microdontum BitterPotatoe-Tuberosa (iii)Clade 4 south24mcd1 BGRC 27351 GDC
24, 36mcd2 SAMN 07540501 PRJNA 394943
S. multiinterruptum BitterPotatoe-Tuberosa (ii)Clade 324mtp SAMN 07540388 PRJNA 394943
S. neorickii D.M. Spooner, G.J. Anderson and R.K. JansenEstolonifera-NeolycopersiconNot indicated24neo SAMEA 2340816 PRJEB 5235
S. neorossii Hawkes and Hjert.Potatoe-Tuberosa (iii)Not indicated24nrs11.42GFP
S. okadae Hawkes and Hjert.Potatoe-Tuberosa (iii)Not indicated24oka1 BGRC 17550 GDC
oka2 BGRC 24719 GDC
oka3 SAMN 06564702 PRJNA 378971
S. palustre Poepp. ex Schltdl.Estolonifera-EtuberosaOutgroup24pal1 BGRC N 17441 GDC
pal2 SAMN 07540543 PRJNA 394943
S. pampasense HawkesPotatoe-Tuberosa (ii)Clade 4 north24pam SAMN 07540427 PRJNA 394943
S. paucissectum OchoaPotatoe-PiuranaClade 324pcs SAMN 07540376 PRJNA 394943
S. pennellii CorrellEstolonifera-NeolycopersiconNot indicated24pen SAMN 14984469 PRJNA 557253
S. peruvianum L.Estolonifera-NeolycopersiconNot indicated24per SAMEA 2340809 PRJEB5235
S. phureja Juz. and BukasovPotatoe-Tuberosa (cult.)Cultivated24phu1 IVP 101 CPBR
phu2 SAMN 07540523 PRJNA 394943
S. pimpinellifolium L.Estolonifera-NeolycopersiconNot indicated24pim SAMN 09229654 PRJNA 454805
S. pinnatisectum DunalPotatoe-PinnatisectaClade 1+224pnt1 BGRC N 08168 GDC
pnt2 SAMN 07540354 PRJNA 394943
S. polyadenium Greenm.Potatoe-PolyadeniaClade 1+224pld1 BGRC N 08176 GDC
pld2 SAMN 07540357 PRJNA 394943
S. raphanifolium Cardenas and HawkesPotatoe-MegistacrolobaNot indicated24rap1 BGRC N 07207 GDC
rap2 BGRC N 08189 GDC
rap3 SAMN 06564696 PRJNA 378971
S. sitiens I.M. Johnst.Estolonifera-JuglandifoliaNot indicated24sit SAMN 14932980 PRJNA 633104
S. sogarandinum OchoaPotatoe-MegistacrolobaClade 324sgr1 SAMN 07540395 PRJNA 394943
sgr2 SAMN 07540416 PRJNA 394943
S. sparsipilum (Bitter) Juz. and BukasovPotatoe-Tuberosa (ii)Clade 4 south24, 48spl114.9GFP
spl2 SAMN 07540479 PRJNA 394943
S. spegazzinii BitterPotatoe-Tuberosa (iii)Clade 4 south24spg117.45GFP
spg2 SAMN 07540411 PRJNA 394943
S. stenophyllidium BitterPotatoe-PinnatisectaClade 1+224ste SAMN 07540355 PRJNA 394943
S. stenotomum Juz. and BukasovPotatoe-Tuberosa (cult.)Cultivated24stn1CIP
stn2 SAMN 07540540 PRJNA 394943
S. stenotomum subsp. goniocalyx (Juz. and Bukasov) HawkesPotatoe-Tuberosa (cult.)Cultivated24gon SAMN 07540541 PRJNA 394943
S. stoloniferum Schltdl. and C.D.BouchéPotatoe-LongipedicellataNot indicated48sto SAMEA 4949197 PRJEB 28862
S. tarijense HawkesPotatoe-YungasensaClade 4 south24trj SAMN 07540392 PRJNA 394943
S. tuberosum L.Potatoe-Tuberosa (iii)Breading lines24tbr1B15BLBP
24tbr2R1RAGIS
24tbr3 BP1076 Bio
24tbr4B1BLBP
S. tuberosum subsp. andigena (Juz. and Bukasov) HawkesPotatoe-Tuberosa (iii)Not indicated24, 36, 48tbrA1 SAMN 06564721 PRJNA 378971
tbrA2 SAMN 06564717 PRJNA 378971
S. venturii Hawkes and Hjert.Potatoe-Tuberosa (iii)Clade 4 south24vnt SAMN 07540366 PRJNA 394943
vrn1GDC
vrn2 SAMN 07540493 PRJNA 394943
vrn3 SAMN 07540514 PRJNA 394943
S. verrucosum Schltdl.Potatoe-Tuberosa (i)Clade 4 south24, 36, 48ver SAMN 07540496 PRJNA 394943
S. violaceimarmoratum BitterPotatoe-ConicibaccataClade 4 north24vio SAMN 07540551 PRJNA 394943

Taxonomy and species name abbreviations are shown according to

List of Solanum species analyzed (excluding sect. Petota). Taxonomy is shown according to List of analyzed Solanum species of sect. Petota. Taxonomy and species name abbreviations are shown according to Total genomic DNA was isolated from herbarium specimens according to the CTAB method of DNA extraction (Porebski et al., 1997). In addition, DNA was treated with Proteinase K (Sigma-Aldrich, United States).

Amplification, Cloning, and Sequencing of 5S rDNA Repeats

Repeated units of 5S rDNA were amplified using the primers Pr5S-L and Pr5S-R, complementary to the 5S rRNA CDS. These primers provide amplification of complete 5S IGS and flanking regions of the CDS (Volkov et al., 2001). PCR amplification was performed as described previously (Tynkevich and Volkov, 2019). PCR products were ligated into plasmid vector pJET 1.2/blunt using CloneJET PCR Cloning Kit (Thermo Fisher Scientific, United States). Screening of recombinant clones and size selection of inserts were performed by colony PCR with pJET 1.2 forward and reverse primers. Two to eight clones per plant accession were Sanger-sequenced by LGC Genomics (Germany). Primary processing of nucleotide sequences and calculation of sequence similarity levels were performed using the Chromas software and the DNASTAR software package. The obtained sequences were deposited in the GenBank database under the accession numbers listed in Table 3.
TABLE 3

Characteristics of the 5S intergenic spacer region (IGS) of Solanum species analyzed (excluding sect. Petota).

Species nameAbbreviationClade—Figure 7SequencingTNSSRA/clone NoGC content, %IGS length, bpSIM, %
S. abutiloides abu2.1CS2OM100771-254.721496.7
S. aculeatissimum acu2.3.1CS5OM100773-748.2418996.8–100
S. aethiopicum aet2.3.3DGA4SRX543853448.2516590.3–98.9
S. albostellatum als2.3.3BGA3SRX546280755.3719392.4–94.4
S. americanum ame11.1GA7ERX104311149.4122593.4–98.2
ame21.1GA6ERX104312348.822392.5–99.6
ame31.1GA2ERX470676049.822699.6
S. anguivi ang2.3.3DGA8SRX947354349.318989.6–99
S. anomalostemon ano2.1GA5ERX490718257.0221596.7–99.5
S. appendiculatum ape11.4.1GA3SRX676353047.5721985.5–96.4
ape21.4.1GA6SRX676355247.4222084.1–99.1
S. aviculare avi1.3CS3OM100778-8042.1320898.6–99.5
S. betaceum bet2.2CS2OM100795-653.218798.4
S. chrysotrichum chr2.3.2GA3SRX40430854918594.1–97.8
S. clarkiae cla2.3.3AGA5SRX637630856.1419997.5–99
S. cleistogamum cle2.3.3CGA4SRX546272552.4817478.8–95
S. clivorum cli1.4.1GA4ERX490717643.7520998.6–99.5
S. cochoae coc2.2GA5ERX490717761.2815696.2–99.4
S. crinitum cri2.3CS2OM100781-25318497.3
S. dimorphandrum dim1.2GA4ERX490717850.2516790.8–99.4
S. diversiflorum div2.3.3AGA4SRX546295556.119897–98.5
S. dulcamara dul1.2CS5AJ226026-3057.6822196–99.6
S. elatius ela2.3.3BGA4SRX546244253.619993.5–99
S. erianthum eri2.1GA6SRX404322753.1821394.8–98.1
S. esuriale esu2.3.3BGA4SRX546295255.918496.2–98.4
S. ferocissimum.fer2.3.3CGA3SRX546295349.7320398.5–99
S. guamense gua2.3.2CS4OM100797-80048.3518593–96.8
S. hindsianum.hin2.3.3CS2OM100783, OM74471055.4525899
S. horridumhor2.3.3CGA6SRX546295051.3817582.5–97.8
S. incanum inc2.3.3DGA5SRX297743050.7820696.6–99
S. laciniatum lac1.3GA7ERX490718143.3320681.8–99.5
S. lasiophyllum las2.3.3CGA6SRX54629484816959.4–98.9
S. linnaeanum lin2.3.3DGA3SRX699503049.0320796.2–99.5
S. macrocarpon mac2.3.3BGA6SRX947355446.4217791.6–97.8
S. mammosum mam2.3.1CS2OM100801-254.4520399.5
S. medicagineum mdg2.3.3CGA7SRX609522750.8417197.1–99.4
S. melongena mel12.3.3DCS3HM042870-1, OM10080349.3719856–99.6
mel22.3.3DGA8SRX699502949.5833851.4–99.7
mel32.3.3DGA3SRX29774275034995.2–99.4
S. muricatum mur1.4.1CS2OM100804-545.520999
S. nigrum nig1.1GA4SRX965446049.6822697.8–99.6
S. ossicruentum oss2.3.3BGA6SRX637630752.8219385.4–98
S. pachyandrum pac2.2GA6ERX490717448.5821083.4–98.6
S. paposanum pap1GA4ERX490717953.2322697.8–99.1
S. phlomoides phl2.3.3AGA4SRX546295153.717987.4–98.4
S. pseudocapsicum pse2.2CS3OM100784-563.917395.3–97.1
S. pseudolulo psl2.3.1CS3OM100806-849.8721981.2–98.3
S. quitoense qui2.3.1CS4OM100809-1249.619871.4–99.6
S. scabrum sca1.1GA4SRX364160246.8522792.5–96
S. seaforthianum sea1CS3OM100813-540.4723089.6–93
S. sejunctum sej2.3.3AGA4SRX637630953.6320297–99
S. sisymbriifolium sis2.3GA2SRX947354554.7521199.5
S. spiralespi2.3.3DGA6SRX404322849.0319188.7–99
S. torvum trv2.3.2GA5SRX947354249.1618595.1–99.5
S. valdiviense val1.3GA3ERX490718051.521198.6–99.5
S. vespertilio ves2.3.3DCS3OM100816-749.1720492.2–93.7
S. villosum vil1.1CS2OM100818-948.6522696.9
S. wendlandiiwen2.2CS8OM100786-948.9922058.4–96.9
S. wrightii wri2.3GA1SRX947355753183100

Methods used for generation of sequences: CS, cloning and sequencing; DS, direct sequencing of PCR product; GA, genomic assembly; TNS, total number of 5S IGS sequences (clones or ribotypes) analyzed in this study; SIM, intragenomic similarity between clones/ribotypes. For GC content and length of IGS, average values are shown.

Characteristics of the 5S intergenic spacer region (IGS) of Solanum species analyzed (excluding sect. Petota). Methods used for generation of sequences: CS, cloning and sequencing; DS, direct sequencing of PCR product; GA, genomic assembly; TNS, total number of 5S IGS sequences (clones or ribotypes) analyzed in this study; SIM, intragenomic similarity between clones/ribotypes. For GC content and length of IGS, average values are shown.

Assembly of 5S rDNA Repeats From Illumina Short Reads

De novo assembly of 5S rDNA repeats was performed using libraries of pre-filtered paired or single Illumina reads from raw data of Solanum species genomes available in SRA (Tables 3, 4). Read filtering was carried out using the built-in tool on the sequence download page: https://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=search_seq_name. To filter reads containing 5S rDNA fragments, 20-bp long fragments of CDS were used for matching. De novo assembly was conducted using SeqMan NGen 14 (DNASTAR Lasergene suite). Libraries of filtered reads were automatically trimmed for quality, and the following assembly parameters were used: mer size 31, minimum match percentage 100%, and coverage threshold 100 reads. In the obtained contigs with highest coverage from 2 to 12, 5S rDNA repeats that contain one full IGS flanked by two fragments of CDS were identified and collected.
TABLE 4

Characteristics of the 5S IGS of Solanum species of sect. Petota.

Species nameAbbreviationCluster—Figure 8SequencingTNSSRA/clone NoGC content, %Length, bpSIM, %
S. abancayense abnA3GA7SRX464506049.0922296–99.6
S. acaule aclD10CS4AJ226031-3449.4321997.3–99.5
S. achacachense achD1, D7GA5SRX464506151.0620887.1–99.1
S. acroglossum acgA5GA7SRX464506451.0422396.4–99.6
S. acroscopicum acsA4GA11SRX464506350.7622495.6–99.6
S. ahanhuiri ajhD10GA4SRX696307749.221995.9–99.5
S. albornozii abzA1GA5SRX464506546.9219594.1–99
S. ambosinum amb1D1GA4SRX464506849.1820789.2–98.6
amb2D1, D10GA12SRX464507049.2521894.1–99.5
S. andreanum adrA4GA3SRX464507348.8321497.7–98.6
S. arcanum arcA6GA3ERX37659545.3723197.4–98.3
S. avilesii avlD1GA7SRX464507752.0720372.5–99.1
S. berthaultii ber1D5CS5AJ226037-4150.921398.1–100
ber2D5GA7SRX464507949.2420673.3–98.6
S. blanco-galdosii blgA5GA5SRX464508250.2822096.4–99.5
S. boliviense blvD6GA4SRX264603050.421496.3–98.1
S. brevicaule brcD6GA4SRX464509150.1321193.4–99.1
S. bukasovii buk1A3DS1AF33213048.2222nd
buk2A3GA4SRX464509248.322298.6–99.5
buk3D8, D9GA5SRX464509349.0221496.7–99.5
buk4D8, D9GA8SRX464509448.7520890.2–99.5
buk5D4GA5SRX464509550.1421398.6–99.5
buk6A3, D10GA5SRX464509848.5822084.7–99.6
buk7D6, D8, D10GA12SRX464509950.1621493.2–99.1
S. bukasovii f. multidissectumbukm1A3, D6GA5SRX464518449.9421286.9–99.5
bukm2D1, D10GA5SRX464519049.8621694.5–99.5
bukm3D7GA4SRX464519150.1321497.7–99.5
S. bulbocastanum blb1ndCS3AJ226012-1450.7318998.4–99.5
blb2ndGA3SRX464510051.9318897.9–98.4
S. cajamarquense cjmD2GA4SRX464510250.922396–99.6
S. canasense canD1, D7GA4SRX464511351.8320590.6–99.1
S. candolleanum cndD3, D6GA4SRX264604750.121394.4–98.6
S. cardiophyllum cphA1GA5SRX464511651.8622496.4–99.1
S. chacoense chc1D3DS1AF33105550.7213nd
chc2D3GA9SRX464512053.1621390.1–99.1
S. chaucha chaA3, D8, D10GA10SRX696656749.0221783.4–99.6
S. cheesmaniae cheA6GA4ERX38438746.0823297.4–99.1
S. chilense chiA6GA4ERX38439746.723097–99.6
S. chmielewskii cmlA6GA5ERX38438544.9523295.3–97
S. chomatophilum chmA5GA5SRX464512351.3222396.4–99.6
S. circaeifolium subsp. quimensecrcA4CS8AJ226015-2249.7322794.3–100
S. commersonii cmm1A5DS1AF33105651.3224nd
cmm2D3GA4SRX264602750.0522097.3–99.1
S. corneliomuelleri crmA6GA3ERX38436146.8722289.5–96.1
S. curtilobum curA2, D1, D2, D9, D10GA8SRX696656850.0921883.3–99.6
S. demissum dmsD10CS3AJ226023-2549.0321998.6–99.5
S. ehrenbergii ehrA2GA5SRX264599150.0820883.1–99.1
S. etuberosum etbA1GA7SRX464512448.2122394.6–99.1
S. galapagense galA6GA1ERX38442146.4233100
S. gourlayi grl1D6DS1AF33105751.8213nd
grl2D6GA3SRX464513850.721497.2–99.5
S. habrochaites habA6GA3ERX38440545.6323396.6–97.9
S. hondelmannii hdmD6GA5SRX464514550.5621790.1–98.3
S. huaylasense huaA6GA1ERX38439646.7229100
S. hypacrarthrum hcrA1GA5SRX464514845.9620289.3–99.5
S. incamayoense inmD3, D6GA4SRX464515349.4321590.4–99.1
S. infundibuliforme ifdD6GA4SRX264604049.921397.2–99.5
S. iopetalum iopD6CS4AJ226042-4549.021296.2–99.1
S. jamesii jam1BDS1AF33105850.2213nd
jam2BGA2SRX464515550.021299.1
S. juzepczukii juzA2, D2GA8SRX696656651.6822182.9–99.6
S. kurtzianum ktzD3GA6SRX464515751.2821478.2–99.1
S. laxissimum lxs1CCS5AJ226046-5049.120297–100
lxs2CGA4SRX464516349.3320294.6–99.5
S. leptophyes lphD6DS1AF33105949.8213nd
S. limbaniense lmbD1, D10GA8SRX464517150.0121991.5–99.6
S. lycopersicoides lpdA6GA2SRX530195745.2522992.4
S. lycopersicum lyc1A6CS1X5569746235nd
lyc2A6GA5SRX846771045.323393.6–98.7
lyc3A6GA5SRX553872545.6623398.3–99.6
S. lycopersicum var. cerasiformelycc1A6GA4SRX418331046.0523497.4–99.6
lycc2A6GA1SRX418317145.7234100
S. maglia magD3CS8AF331047-5447.3518972.1–100
S. marinasense mrnD9, D10GA4SRX464517346.3520166.7–98.2
S. medians medA2, D1GA11SRX464517849.621673.8–99.6
S. megistacrolobum mgaD2GA9SRX464517951.321388.2–99.5
S. microdontum mcd1D3CS9AJ226051-5950.2721091.2–100
mcd2D3, D6GA7SRX464525048.4319968.8–98.6
S. multiinterruptum mtpA3GA4SRX464518348.6322297.3–99.1
S. neorickii neoA6GA5ERX38439145.523095.7–98.7
S. neorossii nrsD3DS1AF33106050.2213nd
S. okadae oka1D4CS3AJ226060-6250.3322096.4–99.5
oka2D1, D4CS4AJ226063-6651.2820672.3–98.6
oka3D1GA3SRX264603752.5720497.1–98
S. palustre pal1A1CS6AJ226035-6353.5717466.4–100
pal2A1GA4SRX464519347.9522497.3–99.1
S. pampasense pamD1, D8GA5SRX464519750.3621085.5–96.7
S. paucissectumpcsA5GA5SRX464519849.2621069.1–99.1
S. pennellii penA6GA4SRX837112247.0522996.9–99.1
S. peruvianum perA6GA4ERX38438446.8823098.3–99.6
S. phureja phu1D1DS1AF33106150.0212nd
phu2D1, D8GA6SRX464519949.5521396.2–99.5
S. pimpinellifolium pimA6GA4SRX418309146.4522998.3–99.1
S. pinnatisectum pnt1BCS5X82779, AJ226008-1149.2421092.4–96.2
pnt2BGA18SRX311579650.1821183.5–99.5
S. polyadenium pld1A1CS3AF331044-649.9719786.9–96.6
pld2A1GA6SRX464521050.1720482.2–99.1
S. raphanifolium rap1CDS1AF33213150.0172nd
rap2A3DS2AF332132-350.4520173.5
rap3CGA8SRX264604350.7617688.8–98.9
S. sitiens sitA6GA5SRX853791945.422997.4–99.6
S. sogarandinum sgr1A1GA10SRX464521151.1521254.9–99.1
sgr2A1GA6SRX464521245.4720483.5–99.6
S. sparsipilum spl1D6DS1AF33106249.1216nd
spl2D6GA5SRX464521650.820687.3–98.6
S. spegazzinii spg1D3DS1AF33106352.2205nd
spg2D3GA6SRX464521951.2821995.4–99.1
S. stenophyllidium steA2GA3SRX311579750.4721998.6–99.5
S. stenotomum stn1D6DS1AF33106447.5200nd
stn2D1, D9GA8SRX464523150.6421594–99.5
S. stenotomum subsp. goniocalyxgonD8GA4SRX464512849.5321396.7–98.6
S. stoloniferum stoD1, D3GA6ERX282524050.4820386–99.5
S. tarijense trjD5GA7SRX464523251.1621192.5–99.1
S. tuberosum tbr1D3CS3X82780, Y16650-5152.3720599.5–100
tbr2D10CS1X8278149.1216nd
tbr3D3CS4Y16652-5552.3320599.5–100
tbr4D1CS4Y16656-5949.4521392–98.1
tbrA1D6, D8, D9GA5SRX264601850.1221493.5–99.1
tbrA2D3, D10GA4SRX264602251.121290.4–99.5
S. venturii vntD3, D4GA8SRX464514649.821893.6–99.5
S. vernei vrn1D7DS1AF33212950.0202nd
vrn2D7GA4SRX464524751.4520183.1–98.6
vrn3D7GA9SRX464525151.6320470.1–99.1
S. verrucosum verD6GA7SRX464524849.8321393.9–99.1
S. violaceimarmoratum vioCGA4SRX464525648.420391.1–98.5

Methods used for generation of sequences: CS, cloning and sequencing; DS, direct sequencing of PCR product; GA, genomic assembly; TNS, total number of 5S IGS sequences (clones or ribotypes) analyzed in this study; SIM, intragenomic similarity between clones/ribotypes. For GC content and length of IGS, average values are shown. Sequences generated by cloning for members of sect. Petota were obtained from our previous publication (

Characteristics of the 5S IGS of Solanum species of sect. Petota. Methods used for generation of sequences: CS, cloning and sequencing; DS, direct sequencing of PCR product; GA, genomic assembly; TNS, total number of 5S IGS sequences (clones or ribotypes) analyzed in this study; SIM, intragenomic similarity between clones/ribotypes. For GC content and length of IGS, average values are shown. Sequences generated by cloning for members of sect. Petota were obtained from our previous publication (

Prediction of 5S rRNA Secondary Structure

Hypothetical secondary structures of potential 5S rRNA transcripts were predicted using the Fold online tool in the RNAstructure server (Reuter and Mathews, 2010).[1] Lowest free energy structures were calculated using the following default parameters: temperature (in K) 310.15; maximum loop size 30; minimum helix length 3.

Median-Joining Network and Phylogenetic Analysis

Relationships among IGS sequences of the Solanum species were analyzed applying the median-joining network approach implemented in SplitsTree 5 (Huson and Bryant, 2006). Alignments of the IGS sequences were performed in the MAFFT server using the G-INS-I method, which is most suitable for sequences with global homology (Katoh et al., 2019). For alignment of the IGS sequences of Solanum species belonging to different taxonomic groups in the genus, we applied the E-INS-I method implemented in the MAFFT server (Katoh et al., 2019). The generated alignment was checked and adjusted manually with the UGENE software. The best-fit nucleotide substitution model was estimated with the lowest value of Bayesian Information Criterion (BIC) using the Find Best-Fit Substitution Model tool in Mega X (Kumar et al., 2018). A maximum likelihood (ML) phylogenetic tree was generated with the PhyML plugin for Geneious Prime 2021.0.3.[2] The IGS sequences of L. lycioides and Ph. peruviana produced in this study (acc. nos. OM100793-4 and OM744711-3) as well as those of four Capsicum species available in GenBank (C. baccatum L.: AF217951, C. frutescens L.: AF217952, C. chinense Jacq.: AF217953, and C. pubescens Ruiz and Pav.: AF217954) were used as outgroups. Branch support was calculated by approximate likelihood ratio tests, aLRT-Chi2 (Anisimova and Gascuel, 2006), and bootstrap analysis with 1,000 resampling replicates. Phylogenetic analysis was also performed by Bayesian inference using the MrBayes 2.2.4 plugin for Geneious Prime 2021.0.3. Four independent Monte Carlo Markov Chains (MCMCs) of 1,000,000 iterations each were run to generate phylogenetic trees with Bayesian posterior probabilities. Trees were sampled every 500 generations. The resulting trees were exported in Newick format and annotated using “Interactive tree of life” (iTOL v6).

Results

Cloning of 5S rDNA Repeats

5S rDNA repeats of 17 Solanum species representing different taxonomic groups were amplified by PCR using primers complementary to the coding region, cloned, and sequenced (Table 3 and Supplementary Material). Analysis of the obtained sequences showed that majority of the clones contained IGS flanked on both sides by fragments of the coding region including the primers used for PCR. Besides, we obtained 5S rDNA clones of S. vespertilio and S. pseudocapsicum that contain rDNA dimers, i.e., two adjacent copies of IGS, and the whole sequence of the CDS between them. Also, two clones containing 5S rDNA dimers and one clone containing a trimer were sequenced for S. wendlandii.

Intragenomic Diversity of Intergenic Spacer: In-Depth Analysis of Sequence Read Archive Data

In order to assess the intragenomic variability of 5S rDNA, we evaluated how many different types/variants of repeated units (ribotypes) are present in genomes of the Solanum species. Genomes of three diploid species, S. lycopersicum-3 (SRX5538725), S. stenotomum-2 (SRX4645231) of sect. Petota, and S. melongena-2 (SRX6995029) of sect. Melongena, were selected for detailed analysis. For these genomes, we assembled de novo 5S rDNA repeats composed of complete IGS and two flanking fragments of CDS. If the CDS contained indels or several SNP, the repeat was considered a pseudogene and excluded from further analysis. Variants of IGS that differed in at least one SNP were considered as distinct ribotypes. The total number of IGS ribotypes was 45, 177, and 31 in S. lycopersicum-3, S. stenotomum-2, and S. melongena-2, respectively. In order to visualize the intragenomic diversity of the ribotypes found in the three species, median-joining networks were constructed (Figure 1).
FIGURE 1

Analysis of 5S rDNA intragenomic diversity in diploid Solanum species. (A) Median joining networks for IGS types/variants (ribotypes) of S. lycopersicum-3, S. stenotomum-2, and S. melongena-2. Ribotypes are designated by the first letter of corresponding species name with index numbers. The size of the circles is proportional to the relative content (in %) of each ribotype in the genome. (B) Relative content (in %) of ribotypes.

Analysis of 5S rDNA intragenomic diversity in diploid Solanum species. (A) Median joining networks for IGS types/variants (ribotypes) of S. lycopersicum-3, S. stenotomum-2, and S. melongena-2. Ribotypes are designated by the first letter of corresponding species name with index numbers. The size of the circles is proportional to the relative content (in %) of each ribotype in the genome. (B) Relative content (in %) of ribotypes. After that, we mapped the reads of complete genomic libraries to the reference sequences of all collected ribotypes in order to estimate their relative content in the genomes. The obtained results showed that the IGS ribotypes differ significantly in this parameter. Accordingly, we classified the ribotypes as major (≥10% of all IGS copies present in the genome), minor (<10 but ≥5%), or rare (<5%). The number of major, minor, and rare ribotypes is 5, 0, and 40 in S. lycopersicum-3: 3, 5 and 169 in S. stenotomum-2; 2, 5, and 24 in S. melongena-2 (Figure 1). Altogether, the major and minor ribotypes represent 93, 68, and 78% of all rDNA repeats present in the genomes of these three species. Based on the results obtained, in the further analysis of 5S rDNA in other species, we considered only major and minor ribotypes. The variability of IGS sequences in each examined sample is given in Supplementary Material.

Length and GC Content of the 5S rDNA Repeated Units

Using sequences of clones and major + minor ribotypes, we determined GC content in the IGS of the Solanum species (Tables 3, 4) and found that this value ranges from 40.5% in S. seaforthianum to 63.9% in S. pseudocapsicum. In 90% of the species, intragenomic difference in GC content between individual ribotypes and clones was less than 4%. A greater difference was observed in repeats that were subjected to deletions, particularly in the AT-rich region of the IGS. No significant changes in GC content were found for taxonomic groups in the Solanum genus, suggesting that this parameter remained relatively constant during evolution. The typical length of IGS in members of the Solanum genus is about 190–220 bp (Tables 3, 4). The shortest IGS were found in S. cochoae, 155–158 bp, and in S. aethiopicum, 162–175 bp. In S. lasiophyllum, however, one ribotype (las-C2R1) is even shorter, 115 bp, although five other ribotypes in this species are 180 bp in length. The longest IGSs were found in S. melongena, 344–360 bp, and in S. lycopersicum, 234–235 bp. The extremely long IGS length in S. melongena is associated with large duplication of the spacer sequence. There is no significant difference in IGS length among the taxonomic groups in the Solanum genus. In general, our data show that the length remained largely unchanged during the evolution of the Solanum genus.

Long Duplication in the Intergenic Spacer of S. melongena

Two structural variants of IGS, long (∼350 bp) and short (∼200 bp) were identified in S. melongena-2 (mel2). The long variant was found in three accessions, mel1 (analyzed by cloning and sequencing) and in mel2 and mel3 (extracted from SRA), while the short variant was only detected in mel1 and mel2. In mel2, all major and minor as well as majority of rare ribotypes belong to the long variant, while the short variant is only represented by two rare ribotypes, M17 and M29 (Figure 1B), whose relative content in the genome is below 1%. Numerous single nucleotide polymorphisms (SNPs) and two oligonucleotide indels are present in the M29 sequence, so this ribotype appears to be a pseudogene. The ribotype M17 (mel2-C17R1, see Supplementary Material) also contains several SNPs compared to other ribotypes of S. melongena. Interestingly, this ribotype is identical to the most common ribotype of a closely related species, S. linnaeanum. A detailed sequence analysis showed that the long variant contains a 146-bp-long tandem duplication in the central part of the IGS (Figure 2). The duplicated region consists of a 32-bp-long 3′-fragment of the coding region and an adjacent 114-bp fragment of the IGS. Two copies of the duplicated segment differ by 6 SNPs and one 8-bp-long indel. All mutations are localized in the fragment of the IGS, not in the coding region.
FIGURE 2

(A) Schematic representation of the molecular organization and (B) sequence alignment of the 3′ fragment of the coding sequence (CDS) and the complete intergenic spacer region (IGS) the of 5S rDNA of S. melongena (mel) and closely related species S. aethiopicum (aet), S. incanum (inc), S. linnaeanum (lin). Two copies of duplication are shown separately and highlighted in different colors. Ribotype and clone names are shown after abbreviations of species names; cons, consensus sequence.

(A) Schematic representation of the molecular organization and (B) sequence alignment of the 3′ fragment of the coding sequence (CDS) and the complete intergenic spacer region (IGS) the of 5S rDNA of S. melongena (mel) and closely related species S. aethiopicum (aet), S. incanum (inc), S. linnaeanum (lin). Two copies of duplication are shown separately and highlighted in different colors. Ribotype and clone names are shown after abbreviations of species names; cons, consensus sequence.

High Diversity of 5S rDNA in S. wendlandii

For the 5S rDNA of S. wendlandii, we sequenced four clones, pSowen-3,-13,-14, and-18, which bore inserts of different lengths, 732, 912, 319, and 657 bp, respectively. Sequence analysis showed that the shortest insert contains one copy of IGS flanked by CDS fragments. The longer inserts represent two dimers and a trimer composed of adjacent copies of 5S rDNA repeats (Figure 3A).
FIGURE 3

Molecular organization of 5S rDNA repeats in S. wendlandii (wen). (A) General organization of 5S rDNA clones. Pr1 and Pr2, position of primers Pr5S-L and Pr5S-R used for PCR/cloning. (B) Sequences alignment of IGS and flanking fragments of CDS. The consensus sequence of CDS of the genus Solanum (Solanum CDS) is shown for comparison. The arrows indicate the location of repeated motifs.

Molecular organization of 5S rDNA repeats in S. wendlandii (wen). (A) General organization of 5S rDNA clones. Pr1 and Pr2, position of primers Pr5S-L and Pr5S-R used for PCR/cloning. (B) Sequences alignment of IGS and flanking fragments of CDS. The consensus sequence of CDS of the genus Solanum (Solanum CDS) is shown for comparison. The arrows indicate the location of repeated motifs. The sequence alignment revealed an obvious difference among IGS sequences of the adjacent 5S rDNA copies (Figure 3B), which is due to numerous nucleotide substitutions and insertions of different lengths of 1–82 bp. The 82-bp-long insertion harbors three tandem copies of the adjacent sequence, which is normally present once in the IGS. The level of sequence similarity among the compared IGS copies ranges from 58.4 to 96.9%, which indicates high intragenomic heterogeneity of the IGS in S. wendlandii. Comparison of the 5S rRNA CDS of S. wendlandii and several Solanum species representing different intrageneric clades revealed that the CDS is, as expected, highly conserved in the genus. Analysis of the 5S rDNA clones/ribotypes of several Solanum species showed that a single CDS usually contains no more than two mutations compared to the respective consensus sequence (data not shown), which agrees well with the observation on other plant taxa (Park et al., 2000; Mahelka et al., 2013). In contrast, the complete CDS sequences of S. wendlandii each contain 5–16 base substitutions (Figure 4A).
FIGURE 4

Comparison of Solanum 5S rRNA CDS. (A) Alignment of the CDS of distantly related Solanum species and 5S rDNA clones of S. wendlandii. (B) Predicted secondary structures of 5S rRNA transcripts. Abbreviations of species names are given in Table 1.

Comparison of Solanum 5S rRNA CDS. (A) Alignment of the CDS of distantly related Solanum species and 5S rDNA clones of S. wendlandii. (B) Predicted secondary structures of 5S rRNA transcripts. Abbreviations of species names are given in Table 1. The presence of numerous mutations in the CDS suggests its transformation into a pseudogene. To test this possibility, we calculated the secondary structure for transcripts of the complete CDS from the clones pSowen-3,-13, and-18. For comparison, the secondary structure was also calculated for (i) the total consensus CDS of the Solanum genus, (ii) consensus CDS of S. melongena, which differs from the total consensus by one base substitution, and (iii) CDS of S. pseudocapsicum (dimer clone pSpse-5S7), which contains two base substitutions (Figure 4B). The sequences examined formed a secondary structure typical for 5S rRNA (Sun and Caetano-Anollés, 2009), with the exception of the CDS of S. wendlandii, which appeared to be significantly changed, suggesting that the transcripts cannot fulfill their function in the ribosome. Hence, the 5S rDNA of S. wendlandii appears to be very heterogeneous in both the IGS and CDS regions and likely contains numerous pseudogenes. Unfortunately, the complete genome sequence of S. wendlandii is currently not presented in the GeneBank and cannot, therefore, be used to further elucidate the unusual organization of 5S rDNA in this species.

Intergenic Spacer Organization in Distantly Related Solanum Species

To reveal the molecular organization and evolution of IGS in Solanum, we compared the IGS sequences of 37 species representing distantly related groups (D’Arcy, 1991; Nee, 1999; Bohs, 2005; Särkinen et al., 2013) of the genus (Figure 5). The total length of the alignment obtained is 287 bp. Only 9 identical nucleotides were found in the compared sequences, and average pairwise identity value was 55.1%, indicating significant divergence of the IGS in the genus. Multiple base substitutions and indels of various lengths are scattered along the entire IGS in the species studied compared to the consensus sequence. The largest 31-bp-long indel is located in the central part of the IGS between the positions 144 and 174 bp. Despite numerous species-specific mutations, the sequence of the central indel shows an obvious sequence similarity in the species compared. Analysis of the phylogenetic dendrogram obtained by comparing IGS sequences (Figure 6, see also below) revealed that the central indel is present in the species belonging to major clade 1 (with the exception of S. muricatum) but is partially or completely absent in members of clade 2 (with the exception of S. anomalostemon). Hence, the central indel was present in the common ancestor of the Solanum genus and was later lost in some species during the course of evolution.
FIGURE 5

Schematic representation of the sequence alignment of the 5S rDNA IGS of distantly related Solanum species. The level of sequence similarity is shown in different colors. Abbreviations of species names are given in Table 1. Taxonomic assignment of species to Clades 1 and 2 in the maximum likelihood (ML) dendrogram (see Figure 7) is shown.

FIGURE 6

Phylogenetic ML and Bayesian dendrograms constructed by comparison of 5S IGS sequences of Solanum species. The aLRT support and posterior probabilities are represented by different branch colors. Numbers in the ML dendrogram represent bootstrap support values. Terminal clades are collapsed to species level. Numbers in brackets near the collapsed nodes indicate the number of ribotypes used. IGS structural variants A-D are given for members of sect. Petota (Clade 1.4.2). ML clades not confirmed by Bayesian analysis are highlighted in gray color.

Schematic representation of the sequence alignment of the 5S rDNA IGS of distantly related Solanum species. The level of sequence similarity is shown in different colors. Abbreviations of species names are given in Table 1. Taxonomic assignment of species to Clades 1 and 2 in the maximum likelihood (ML) dendrogram (see Figure 7) is shown.
FIGURE 7

Schematic representation of sequence alignment of the 5S IGS of sect. Petota members. The level of sequence similarity is shown in different colors. Positions of group-specific indels I and II and GC-duplication (GC-DUP) are indicated. Abbreviations of species names are given in Table 2. Structural features of presented IGS sequences are indicated as structural variants (SVs) A-D.

Phylogenetic ML and Bayesian dendrograms constructed by comparison of 5S IGS sequences of Solanum species. The aLRT support and posterior probabilities are represented by different branch colors. Numbers in the ML dendrogram represent bootstrap support values. Terminal clades are collapsed to species level. Numbers in brackets near the collapsed nodes indicate the number of ribotypes used. IGS structural variants A-D are given for members of sect. Petota (Clade 1.4.2). ML clades not confirmed by Bayesian analysis are highlighted in gray color.

Intergenic Spacer Organization in Sect. Petota

Analysis of IGS molecular organization in the species-rich sect. Petota was performed separately. By sequence comparison, numerous base substitutions and indels were detected, which appear to be randomly distributed along the IGS (Figure 7), except for the presumptive external promoter region just upstream of the CDS (see section “Discussion”). Schematic representation of sequence alignment of the 5S IGS of sect. Petota members. The level of sequence similarity is shown in different colors. Positions of group-specific indels I and II and GC-duplication (GC-DUP) are indicated. Abbreviations of species names are given in Table 2. Structural features of presented IGS sequences are indicated as structural variants (SVs) A-D. The alignment of the sequences revealed that in the central part of the IGS there are two group-specific indels, I and II. Also, a lot of species contain a GC-duplication (GC-DUP) in the IGS (Figure 7). It is likely that these structural rearrangements occurred in different stages during the evolution of sect. Petota. With regard to the presence/absence of these molecular features, four structural variants (SVs) of the IGS can be distinguished. The evolutionary ancestral SV-A contains both specific indels, while independent deletions of indels II and I resulted in the formation of the derived SV-B and –C/D, respectively. SV-D additionally contains a GC-DUP.

Phylogenetic Analysis

The IGS sequences obtained by cloning as well as major and minor ribotypes extracted from SRA were used to reconstruct the phylogenetic relationships among Solanum species representing different taxonomic groups of the genus. For sect. Petota, seven species were selected whose IGS sequences belong to different structural classes (see section “Discussion”). Multiple sequence alignment for the whole genus Solanum phylogeny was generated with the Mafft E-INS-I method and then manually corrected. The final 609-bp length alignment presented only one identical site, with an average pairwise identity of 54.7%. The best-fit phylogenetic model was estimated using Mega X to be general time-reversible (GTR) + gamma (G) (Kumar et al., 2018). The obtained maximum likelihood (ML) phylogenetic tree has 302 leaves, which correspond to the IGS sequences of 65 Solanum species (Figure 6). Calculating the statistical support applying the aLRT-Chi2 method and boot-strap analysis showed that majority of the tree’s nodes have a high or moderate support. The ML tree mostly matched the dendrogram generated by Bayesian inference. On the dendrograms, all investigated species of the Solanum genus form a well-supported monophyletic group with L. lycioides as sister taxon. In the ML-tree, the Solanum species are divided into two major clades, 1 and 2, with high statistical support. In the clades, several well-supported minor clades were found. The monophyly of the Solanum genus and clade 1 is also confirmed by Bayesian inference. In contrast, clade 2 is represented by polytomy in the Bayesian dendrogram.

Discussion

Phylogeny of the Solanum Genus

Since the nineteenth century, the Solanum genus has been traditionally divided into two main groups, the so-called spiny and non-spiny solanums (Dunal, 1852; Seithe, 1962), which were further subdivided into sections, subsections, and series using morphological characters (D’Arcy, 1991; Nee, 1999). However, application of molecular methods shed a new light on the phylogeny of Solanum, demonstrating that these groups are mainly not monophyletic, and that the genus can be divided into 13 clades (Bohs, 2005; Weese and Bohs, 2007; Särkinen et al., 2013). Some of these clades have high statistical support, while the taxonomic placement and composition of the others are uncertain. Analysis of several chloroplast genes and nuclear regions (e.g., ITS1/2 and waxy) is often performed in molecular phylogenetics. However, incongruence of results obtained by application of different markers is a well-known problem. Respectively, other genomic regions, particularly the 5S rDNA IGS, can additionally be used to clarify the phylogeny of lower-ranking taxa (Blöch et al., 2009; Tynkevich and Volkov, 2019; Cardoni et al., 2021; Ishchenko et al., 2021), including sect. Petota of the Solanum genus (Volkov et al., 2001). To evaluate the possibility of using this region to reconstruct phylogenetic relationships in the Solanum genus, we constructed an ML dendrogram that embraces 68 accessions from 63 species. The ML dendrogram includes two major clades (Figure 6). Similar to our data, two clades in the Solanum genus were found by comparing sequences of plastid, nuclear ribosomal ITS and low-copy nuclear (waxy) genes (Särkinen et al., 2013, 2015). Particularly, four species, S. abutiloides, S. erianthum, S. cochoa, and S. pseudocapsicum, are included in Clade 2 of our dendrogram, which is in agreement with recent molecular data (Särkinen et al., 2013, 2015) but in contrast to the previous taxonomy of Nee (1999), who placed the species in the sections Brevantherum, Basarthrum, and Holophylla (see Table 1). Clade 1 is composed of four smaller clades. Clade 1.1 contains four species of the Morelloid clade, S. americanum, S. nigrum, S. scabrum, and S. villosum (Särkinen et al., 2013, 2015). Two other species, S. anomalostemon and S. valdiviense previously associated with Morelloids are placed outside Clade 1.1, further supporting the phylogeny of the group proposed by Särkinen et al. (2015). Clades 1.2–1.4 are combined in a polytomy. S. dimorphandrum of the Thelopodium clade (Bohs, 2005) and S. dulcamara of the Dulcamaroid clade (Bohs, 2005; Särkinen et al., 2013) belong to Clade 1.2, while another member of the Dulcamaroid clade, S. seaforthianum, occupies a basal position in Clade 1. S. valdiviense is included in Clade 1.3, which also comprises two species of sect. Archaesolanum, S. aviculare and S. laciniatum. The taxonomic position of S. valdiviense found in our analysis is fully consistent with previous data (Särkinen et al., 2015). S. aviculare and S. laciniatum are closely related (Figure 6): There are several ribotypes in the genome of S. aviculare that are very similar and even identical to those of S. laciniatum. These data indicate incomplete lineage sorting during speciation or subsequent hybridization among these species. The close relationship between S. aviculare and S. laciniatum confirms the taxonomy derived from sequencing of three chloroplast and two nuclear regions in which these two species represent sister taxa (Poczai et al., 2011; Särkinen et al., 2015). Clade 1.4.1 comprises Central American S. appendiculatum and South American S. clivorum, which were previously assigned, respectively, to sect. Anarrhichomenum and Holophylla (Nee, 1999) as well as S. muricatum of sect. Basarthrum (Nee, 1999; Särkinen et al., 2013), while Clade 1.4.2 embraces numerous species of sect. Petota (including tomato) (Hawkes, 1990; Nee, 1999; Komarova et al., 2008). According to a recent analysis (Särkinen et al., 2015; Gagnon et al., 2021), S. appendiculatum and S. muricatum, similar to our results, belong to the potato clade, in contrast to S. clivorum, which was placed outside clade I. Clade 1 also includes the South American species S. paposanum, which represents the Regmandra clade (Bohs, 2005; Särkinen et al., 2013). It was found that this clade was resolved in different positions in three data sets used for comparison (Särkinen et al., 2015; Gagnon et al., 2021). Clade 2 consists of three smaller clades, 2.1–2.3. Clade 2.1 comprises two closely related species, S. abutiloides and S. erianthum, which were assigned by Nee (1999) to sect. Brevantherum of the Solanum subgenus. Later, sect. Brevantherum was transferred to clade II consisting of predominantly spiny and shrubby species (Särkinen et al., 2013, 2015; Gagnon et al., 2021). Similarly, the third member of Clade 2.1, S. anomalostemon, was assigned to the Morelloid clade (Bohs, 2005) but later transferred to clade II (Särkinen et al., 2015; Gagnon et al., 2021). Accordingly, the inclusion of S. abutiloides, S. erianthum, and S. anomalostemon in clade II is further supported by our results. Clade 2.2 contains five species, which were previously assigned to different taxonomic groups. According to Nee (1999), two Central/South American species, S. wendlandii and S. pachyandrum, are members of sect. Herposolanum. Later, it was shown that S. wendlandii belongs to clade Wendlandii/Allophyllum, while the position of S. pachyandrum appeared unclear (Bohs, 2005; Särkinen et al., 2013, 2015). Thereafter, both species were assigned to sect. Aculeigerum (Clark et al., 2015). Our data also confirm the phylogenetic affinity of S. wendlandii and S. pachyandrum. The next two species, South American S. cochoae and S. pseudocapsicum, have been previously assigned to different sections, Basarthrum and Holophylla (Anderson and Bernardello, 1991; Nee, 1999). In contrast, S. cochoae and S. pseudocapsicum are combined in a well-supported clade in our ML dendrogram. Originally, S. cochoae was included in sect. Basarthrum on the basis of morphological analyses and crossing experiments, although all crosses with related wild species were unsuccessful. Surprisingly, the only species crossed with S. cochoae was cultivated S. muricatum, despite large differences in karyotypes of these two species (Anderson and Bernardello, 1991). However, the possibility of obtaining hybrids cannot be seen as a decisive argument for the close relationship between these two species, as it is sometimes possible to successfully cross distant Solanum species (Daunay et al., 2019). The close relationship between S. cochoae and S. muricatum is also supported by recent molecular data (Gagnon et al., 2021). In our dendrogram, however, S. cochoae does not appear to be related to S. muricatum but to S. pseudocapsicum, a member of the Geminata clade (Gagnon et al., 2021). It should be noted that the common feature of the 5S rDNA repeats of S. cochoae and S. pseudocapsicum is short length due to deletion in the central part of the IGS. In addition, each species possesses specific deletions in other IGS regions (Figure 5). Altogether, these structural features can affect the position of the species in the dendrogram. Accordingly, we believe that the taxonomic position of S. cochoae close to S. pseudocapsicum should be interpreted with appropriate reservation in this stage, and that further studies should be carried out in order to finally clarify the question. The last member of Clade 2.2 is S. betaceum, which has been previously treated as a member of separate genus Cyphomandra (D’Arcy, 1991) and then later placed to Solanum (Bohs, 1995) and assigned to sect. Pachyphylla of the Bassovia subgenus (Nee, 1999) or clade Cyphomandra in clade II (Särkinen et al., 2013, 2015; Gagnon et al., 2021). In our dendrogram, S. betaceum is a sister taxon for the other members of Clade 2.2. Clade 2.3 includes three clades of lower ranks, 2.3.1–2.3.3. Clade 2.3.1 comprises two pairs of species, the South American S. aculeatissimum and S. mammosum of the section/clade Acanthophora as well as Andean cultivated species S. quitoense (naranjilla or lulo) and its wild relative S. pseudolulo of clade Lasiocarpa (Nee, 1999; Bohs, 2005; Levin et al., 2006; Särkinen et al., 2013). According to a molecular analysis, the clades Acanthophora and Lasiocarpa represent sister taxa (Särkinen et al., 2013; Gagnon et al., 2021). In the genome of S. quitoense, a ribotype similar to that of S. pseudolulo was detected, which could be due to hybridization between these species (Fory Sánchez et al., 2010). Clade 2.3.2 comprises two Central/South American species, S. chrysotrichum and S. torvum of the section/clade Torva, as well as S. guamense, an endangered endemic species in Northern Mariana Islands (Stone, 1970) whose taxonomic status remains unclear (Nee, 1999; Bohs, 2005; Särkinen et al., 2013; Aubriot et al., 2016). Our analysis revealed that the three species share common ribotypes and are, therefore, unresolved in the dendrogram. The high genetic affinity of S. chrysotrichum and S. torvum agrees well with their morphological similarity. S. guamense also appeared to be closely related to these species. Clade 2.3.3 includes S. hindsianum (clade Elaeagnifolium, Bohs, 2005; Särkinen et al., 2013), an endemic to the Sonoran Desert region of southern Arizona and northern Mexico (Knapp et al., 2017), and a well-supported monophyletic clade of 21 species, most of which have been assigned to sect. Melongena (Nee, 1999) or the Old World clade (Levin et al., 2006; Särkinen et al., 2013; Aubriot et al., 2016) of the Leptostemonum subgenus. However, the taxonomic position of five species (S. albostellatum, S. elatius, S. lasiophyllum, S. medicagineum, and S. spirale) has not yet been clarified, especially with molecular methods. In the Old World clade, there are four groups, A–D, of closely related species. Clade 2.3.3A comprises members of “Dioicum Complex,” a set of several dioecious species (Whalen, 1984; Bean, 2004) from tropical Australia. Our data show that S. diversiflorum and S. phlomoides are closely related, and that S. clarkiae is a more distant species. S. sejunctum is placed outside clade 2.3.3A. This result agrees with the phylogeny based on the analysis of trnK–matK and ITS data sets (Martine et al., 2006, 2009). Four other Australian species, S. albostellatum, S. esuriale, S. elatius, S. ossicruentum, as well as S. macrocarpon (African eggplant), belong to the next clade, 2.3.3B, although with a moderate statistical support. The West African species S. macrocarpon was previously assigned to Anguivi Grade (Aubriot et al., 2016, 2018; Gagnon et al., 2021), a group of Old World Leptostemonum species closely related to S. melongena (see our clade 2.3.3D below). Respectively, the phylogenetic affinity of S. macrocarpon to the Australian species seems somewhat unexpected and can be explained by the presumptive hybrid origin of S. macrocarpon (Daunay et al., 2019). According to our data, S. albostellatum and S. esuriale from Western Australia show the closest relationship in Clade 2.3.3B, which is in good agreement with the high morphological similarity of these species (Davis and Hurter, 2012). S. elatius is also a member of the S. esuriale group (Bean, 2013). S. ossicruentum represents a functionally dioecious bush tomato from northwestern Australia. Earlier, it was recognized as a variant of S. dioicum, a member of “Dioicum Complex.” However, later molecular analysis shows that S. ossicruentum is either a sister taxon to the rest of this group or represents an independent dioecious lineage (Martine et al., 2016). Our data further supported the second opinion and indicate a phylogenetic affinity between S. ossicruentum and members of the S. esuriale group. Clade 2.3.3C comprises five Australian species. Two species, S. cleistogamum and S. horridum, contain very similar sets of ribotypes in their genomes and appear unresolved in the dendrogram. A sister taxon to them is S. medicagineum, while S. lasiophyllum and S. ferocissimum are more distantly related species. A close relationship among S. cleistogamum, S. horridum, and S. medicagineum has been shown earlier (Bean, 2004, 2012; Levin et al., 2006). Hence, the Australian Solanum species studied here belong to three clades, 2.3.3A, B, and C. Similarly, monophyly of the Australian species was not supported by the analysis of seven nuclear genes (Martine et al., 2019). Clade 2.3.3D comprises seven species naturally distributed in Africa and Asia. In particular, this clade includes two very morphologically and genetically similar domesticated plants, S. aethiopicum (bitter tomato, Ethiopian eggplant) and S. melongena as well as their presumptive wild ancestors, S. anguivi and S. incanum. The second species is very similar and can even be confused with S. linnaeanum (Daunay et al., 2001; Doganlar et al., 2002; Prohens et al., 2012). S. vespertilio, a species endemic to the Canary Islands, appears to be closely related to the other members of clade 2.3.3D. Previously, the phylogeny of Old World “spiny solanums” was clarified using plastid and nuclear markers (Aubriot et al., 2016, 2018; Vorontsova and Knapp, 2016; Knapp et al., 2019; Gagnon et al., 2021). It was demonstrated that S. incanum, S. linnaeanum, and S. melongena are closely related and belong to the Eggplant clade, and that S. aethiopicum, S. anguivi, S. vespertilio (and S. macrocarpon, which is placed to clade 2.3.3B in our dendrogram) are included in Anguivi Grade outside the Eggplant clade. Hence, our novel data mainly confirm these results. Surprisingly, S. anguivi and morphologically different S. spirale, a tetraploid (Randell and Symon, 1976) species from East Asia, are not resolved in the dendrogram (see Figure 6). A possible explanation for this result could be the allopolyploid origin of S. spirale. In this case, the 5S rDNA inherited from the parent related to S. anguivi could be retained in the genome, while the DNA of the other parent was lost. The uniparental inheritance of 5S rDNA in allopolyploids, both young and old, has been reported for several taxonomic groups including Solanaceae (Pontes et al., 2004; Volkov et al., 2017). Clade 2.3 also comprises three species that do not belong to clades 2.3.1–2.3.3 presented above. Two species, S. crinitum and S. wrightii, represent the clade Androceras/Crinitum, while S. sisymbriifolium belongs to the clade Sisymbriifolium (Levin et al., 2006; Särkinen et al., 2013; Gagnon et al., 2021). The taxonomic position of these species in our dendrogram agrees well with previous results of molecular phylogenetics studies. Majority of the clades identified in the ML tree was also recognized in the Bayesian dendrogram (Figure 6). However, the monophyly of Clade 2 was not confirmed by Bayesian inference: Clades 2.1, 2.2, and 2.3 are not combined with each other but belong to a basal polytomy in the Solanum genus. Thus, in this study, we present the phylogeny of the Solanum genus derived from the analysis of 5S IGS sequences. Six species (S. albostellatum, S. elatius, S. guamense, S. lasiophyllum, S. medicagineum, and S. spirale) were characterized here for the first time using molecular taxonomy methods. The obtained dendrograms are mainly congruent with the published data for other regions of nuclear and plastid genomes: same major and minor clades were found for the species examined. However, taxonomic relationships between these clades and position of some species (e.g., S. cochoae, S. clivorum, S. macrocarpon, S. spirale) differ from previous results and require further clarification. Taken together, our results show that the 5S IGS represents a convenient molecular marker for phylogenetic studies on the Solanum genus. In particular, the simultaneous presence of several variants of rDNA in the genome enables the detection of cases of reticular evolution such as incomplete lineage sorting and interspecific hybridization.

Molecular Evolution and IGS Diversity in Sect. Petota

One of the species-rich groups in genus Solanum is sect. Petota, which has about 250 members (Hawkes, 1990; Nee, 1999). In our ML dendrogram (Figure 6), sect. Petota belongs to Clade 1.4.2. To analyze the molecular evolution of 5S rDNA in this section and in more details, we assembled IGS ribotypes for 125 accessions representing 83 species (Table 4) and compared the results with our previous data, obtained by cloning and sequencing of 5S rDNA of 32 wild species and breeding lines of sect. Petota (Volkov et al., 2001). Analysis of the IGS sequences revealed that they differ in base substitutions and indels (Figure 7). Same indels mostly occur in a single or some closely related species and, therefore, represent convenient molecular markers for their identification. For example, non-tuber-bearing species S. etuberosum and S. palustre (series Etuberosa; Hawkes, 1990) possess a common specific deletion at the beginning of the IGS, or S. laxissimum and S. violaceimarmoratum (series Conicibaccata) have a deletion in the central part of it (Figure 7). Several species-specific indels in the IGS of the Solanum species have already been described (Volkov et al., 2001), and our actual analysis additionally identifies new ones for the novel species. This finding further confirms our earlier assumption that indels are a characteristic feature of IGS evolution in sect. Petota. We have also argued that because of the high frequency of indels compared to base substitutions, IGS cannot be used for phylogenetic reconstruction of this section applying standard algorithms. However, the indels represent unique evolutionary events that should be considered in taxonomic studies. Considering the location of group-specific indels I and II as well as GC-duplication (Figure 7), four major structural variants of the IGS were identified. Accordingly, members of sect. Petota can be divided into four groups, A–D. Group A comprises species that belong to the subsection Estolonifera including the tomato group, and to the series Pinnatisecta, Polyadenia, Commersoniana, Circaeifolia, Megistacroloba, Conicibaccata, and Piurana of the subsection Potatoe (Hawkes, 1990; Nee, 1999), or to clades 1+2 and 3 (Spooner et al., 2014; Huang et al., 2019). Also, SV-A was found in four species (S. acroscopicum, S. andreanum, S. abancayense, S. multiinterruptum) that were assigned to ser. Tuberosa (Hawkes, 1990; Nee, 1999) or clade 4 (Spooner et al., 2014; Huang et al., 2019). Group B includes only two species, S. jamesii and S. pinnatisectum of ser. Pinnatisecta (Hawkes, 1990; Nee, 1999) or clade 1+2 (Huang et al., 2019). This means that the SR-B arose relatively recently during speciation in clade 1+2, just before the divergence of S. jamesii and S. pinnatisectum but after their separation from the sister taxon, which was similar to S. stenophyllidium. Group C includes accessions of three species, S. raphanifolium, S. laxissimum, and S. violaceimarmoratum, which belong to the series Megistacroloba and Conicibaccata (Hawkes, 1990) or clade 4 (Spooner et al., 2014; Huang et al., 2019). Group D embraces numerous species that belong to the series Yungasensa, Megistacroloba, Cuneoalata, Maglia, and Tuberosa (Hawkes, 1990) or clade 4 (Spooner et al., 2014; Huang et al., 2019). SV-C and D were not found outside of clade 4, which, however, also includes four species possessing SV-A. Therefore, SV-C and D arose from SV-A after separation of clades 3 and 4. Interestingly, in some cases, rDNA repeats representing different structural variants were found in the same plant accession (see below). We have also found that the central part of the IGS is completely deleted in S. bulbocastanum of the series Bulbocastana (Hawkes, 1990) or clade 1+2 (Huang et al., 2019). Accordingly, the structural organization of ITS characteristic of this species cannot be assessed and used for phylogenetic reconstruction. Analysis of our data showed that the most common IGS variants are SV-A and-D, and SV-B and-C were found only in four and three species, respectively. In order to assess the molecular diversity of SV-A and-D, we constructed median-joining networks for these two IGS variants using 204 and 353 sequences (Figure 8). In the median-joining networks, the sequences of SV-A and-D are distributed between the six and ten main clusters according to their similarity. In the vast majority of cases, each node corresponds to only one sequence, with the exception of one node in median-joining network A and seven nodes in median-joining network D. These nodes include two to nine ribotypes that mainly represent genomes of different species. Therefore, identical IGS sequence variants can be present in genomes of different species or plant accessions, suggesting their common origin.
FIGURE 8

Median-joining networks depicting relationships of 5S IGS structural variants A and D of the species of sect. Petota. Names of main clusters are given. The affiliation of the species to the series proposed by Hawkes (1990) is indicated by different colors. Numbers in brackets indicate the number of ribotypes used. Abbreviations of species names are given in Table 2.

Median-joining networks depicting relationships of 5S IGS structural variants A and D of the species of sect. Petota. Names of main clusters are given. The affiliation of the species to the series proposed by Hawkes (1990) is indicated by different colors. Numbers in brackets indicate the number of ribotypes used. Abbreviations of species names are given in Table 2. SV-D sequences are distributed among ten clusters, D1-D10 (Figure 8). The largest clusters, D1, D3, D6, and D10, contain 50, 56, 68, and 51 sequences, respectively. The sequences included in cluster D1 are nearly identical to SV-D consensus sequence and, therefore, represent evolutionary ancestral ribotypes, while the other clusters comprise derived sequences containing specific base substitution and indels. Starting from cluster D1, five evolutionary lineages can be distinguished. Taken together, our data agree well with modern taxonomy, which is based on the application of molecular methods (Spooner et al., 2014; Huang et al., 2019) but are less consistent with the traditional classification of Hawkes (1990). In particular, the sections proposed by Hawkes (1990) are not confirmed, because species from different sections are mixed up and belong to different clusters in the median-joining network. In contrast, our results agree well with the molecular data, since clades 4 North and 4 South (Spooner et al., 2014; Huang et al., 2019) can also be recognized in our median-joining network: members of the North and South clades belong to clusters D1, D7-D10, and D2-D6.

Conserved Sequence Motifs in the Intergenic Spacer of Solanum Species

Comparative sequence analysis revealed that the most conservative regions of the IGS in Solanum species are the 7- and 40-bp-long fragments at the 5′ and 3′ ends (Figure 5). The evolutionary conservation of these regions has already been observed in other plants (Hemleben and Werts, 1988; Crisp et al., 1999; Tynkevich and Volkov, 2019; Ishchenko et al., 2020, 2021), and a possible reason for this seems to be their involvement in the transcription of 5S rDNA by RNA polymerase III (Pol III). External elements of the Pol III promoter have been previously characterized in Arabidopsis thaliana (Douet and Tourmente, 2007; Vaillant et al., 2007; Layat et al., 2012; Simon et al., 2018). These signals include the TATATA motif (so-called TATA-box), GC-dinucleotide, and C nucleotide in positions-28,-13, and-1 bp, respectively. Similar sequences were also found in other plants (Tynkevich and Volkov, 2014; Tynkevich et al., 2015; Ishchenko et al., 2018, 2021). In representatives of the Solanum genus, as well as in Quercus (Tynkevich and Volkov, 2019) and Rosa (Tynkevich and Volkov, 2014), the TATA-box has a length of 7 bp and begins in position-30. Its sequence (TTTAATA) in Solanum is slightly different from that in other groups of plants. Another external element of the Pol III promoter, the GC-dinucleotide (Douet and Tourmente, 2007) is duplicated in several Solanum species and is located, respectively, both in the typical position-12 and, additionally, in position-14. Similar to Solanum, duplication of this presumptive external promoter element was also found in the Quercus species (Tynkevich and Volkov, 2019). The third conservative promoter element, cytosine, in position-1 (Douet and Tourmente, 2007; Simon et al., 2018), has been replaced by thymine in more than half of the Solanum species. In addition, we found that the dinucleotide GA in position-3 in the IGS is highly conserved, indicating its possible involvement in transcription initiation. At the beginning of the IGS in Solanum, like in other genera, the oligo-T motif TTTTT was found, which probably represents a transcription termination site (Hemleben and Werts, 1988; Simon et al., 2018; de Souza et al., 2020). The most variable central region of the IGS can be subdivided into (i) AT-rich and (ii) subrepeated regions. Previously, AT-rich regions were found in the IGS of Fabaceae (Hemleben and Werts, 1988) and Poaceae (Röser et al., 2001). AT-rich regions demonstrate a similarity to amplification-promoting sequences (Borisjuk et al., 2000), which may be involved in amplification of 5S rDNA repeats. Also, regions composed of subrepeats were described for the IGS of several plant taxa, e.g., Rosaceae (Tynkevich and Volkov, 2014) and Poaceae (Ishchenko et al., 2018, 2021). Previously, we have demonstrated that structural rearrangements of the variable central region of the IGS in Solanum species of sect. Petota as well as in distantly related S. melongena and S. betaceum are preferentially associated with four classes (A–D) of short direct subrepeats: the IGS evolved mainly by duplications of some sequence motifs, resulting in formation of several variants of subrepeats, which were independently amplified in different sections of the genus after radiation from a common ancestor (Volkov et al., 2001; Davidjuk et al., 2010, 2013).

Intragenomic Heterogeneity and Molecular Evolution of 5S rDNA

It is widely believed that 5S rDNA repeats present in the same genome (at least in diploid species) should be nearly identical because of concerted evolution (Coen et al., 1982; Tynkevich and Volkov, 2014; Barman et al., 2016). In our study, we performed a detailed analysis of 5S rDNA intragenomic sequence diversity and found several ribotypes in all the species studied. Comparative analysis of all available sequences showed that the IGS sequence similarity in Solanum species ranges from 51.4 to 100%. The highest levels were found in S. wrightii and four representatives of the tomato group, namely lycopersicum var. cerasiforme-2, S. galapagense, and S. huaylasense; each of which had only one ribotype detected. The high intragenomic homogeneity of IGS (over 95%) is also characteristic of other representatives of the tomato group with the exception of S. lycopersicum-2 and S. corneliomuelleri. The relatively low IGS similarity in S. corneliomuelleri (89.5%) is due to the presence of 10-bp deletion in one ribotype, while no further indels are found in any of the other members of the tomato group. Hence, deletions are very rare during the evolution of IGS in the tomato group, which is in obvious contrast to other Solanum groups, especially to closely related tuber-bearing species of Petota. Our calculations indicated that the lowest level of intragenomic IGS sequence similarity is demonstrated by S. melongena-1 and -2 (56 and 51.4%), S. sogarandinum (54.9%), S. wendlandii (58.4%), and S. lasiophyllum (59.4%). In S. melongena-1 and -2, it is due to simultaneous existence of short and long (containing extra-long duplication, see Figure 2 and Supplementary Material) repeats in the genome, while in S. melongena-3, which possesses only long repeats, the similarity amounts to 95.2–99.4%. Similarly, in S. lasiophyllum, three adjacent deletions (65 bp in total) present in one of six ribotypes is the main reason for the low level of intragenomic similarity. In contrast, two mechanisms contribute to the low similarity of IGS in S. sogarandinum: (i) a long deletion in one ribotype and (ii) multiple base substitutions in another. In the second case, 33 of 210 bp in the same ribotype was changed compared to the consensus sequence. Notably, these mutations are present in the 3′ IGS region, which likely contains external promoter elements, suggesting putative pseudogenization of the ribotype. Putative 5S rDNA-related pseudogenes have already been described for members of Solanum and other genera of Solanaceae (Volkov et al., 2001, 2017). In S. wendlandii, similar to S. sogarandinum, two mechanisms, an insertion and a large number of base substitutions, cause increased heterogeneity of IGS sequences (Figure 3). Accordingly, we excluded long (more than 5 bp) deletions and multiple base substitutions from our calculations and found that in this case the minimum level of intragenomic similarity of the IGS in Solanum species is around 85–90%. In general, our results indicate that there are two mechanisms, long indels and multiple base substitutions, that significantly affect the heterogeneity of the IGS in Solanum species. Multiple base substitutions are rare events: out of about 900 analyzed sequences, only five ribotypes bearing multiple base substitutions were identified in four plant accessions (S. kurtzianum, S. pinnatisectum-2, S. sogarandinum-1, and S. vernei-3), while long indels are much more common. The question, “what can be the source of the IGS intragenomic polymorphism?” arises. There are at least two possible options: (i) new variants emerge in the genome itself by accumulation of mutations and (ii) new variants appear in the genome as a result of introgression of genetic material due to interspecific hybridization. It is well known that in sect. Petota, especially in the S. brevicaule complex, interspecific hybridization is widespread at both the diploid and polyploid levels (Hawkes, 1990; Spooner et al., 2014). Among the 125 examined accessions representing sect. Petota, two or more structural variants of the IGS were found in 25 cases, and interspecific hybridization seems to be a plausible explanation for this polymorphism, especially when structurally different IGS variants (e.g., A and D) occur in the same genome. However, further research is needed to confirm this option. Our data suggest that long indels and multiple base substitutions appeared repeatedly during the molecular evolution of IGS in the Solanum genus. However, it seems that they were mostly not conserved and eliminated. Accordingly, the length of IGS and contents of GC pairs did not change significantly during the course of speciation (see above). The likely reason for this negative selection could be the association between indels/multiple base substitutions and pseudogenization of 5S rDNA. Accordingly, it looks that the main road of the IGS molecular evolution seems to be step-wise accumulation of single base substitution or short indels.

Intraspecific 5S rDNA Heterogeneity

It could be anticipated that different accessions of same species possess identical/similar sets of ribotypes. To check this assumption, we examined two to three accessions for 25 diploid species (Tables 1, 2 and Supplementary Material), and in most cases, only one IGS variant was actually found. However, in seven species (S. ambosinum, S. commersonii, S. microdontum, S. okadae, S. phureja, S. raphanifolium, and S. stenotomum; Figure 8 and Supplementary Material), one or two structural variants were detected in different accessions, indicating presumptive interspecific hybridization. Among the species studied, we examined the highest number of accessions in S. bukasovii, which was considered one of the ancestors of cultivated potato (Ugent, 1970; Hawkes, 1990; Hosaka, 1995; Spooner et al., 2005; Hardigan et al., 2015). This close relationship was also confirmed in our previous study by analyzing the 5′-external transcribed spacer (ETS) region of nuclear 35S rDNA (Volkov et al., 2003). According to the comparison of whole plastid genomes, the species belongs to clade 4 North (Huang et al., 2019). Respectively, based on its taxonomic position, it might be expected that S. bukasovii should possess the IGS variant D. However, SV-A3 was previously found in the buk1 accession (Volkov et al., 2001), which indicates incongruence among different phylogenetic markers. To further clarify the issue, we analyzed ten accessions of S. bukasovii in this study and found an extreme variability of the 5S IGS set. Two accessions (buk1 and buk2) contain only SV-A3 in the genome, while six others (buk3, buk4, buk5, buk7, bukm2, and bukm3) possess different D variants, SV-D1,-D4,-D7,-D8,-D9, and-D10 (see Table 4). Two remaining accessions (buk6 and bukm1) have both SV-A3 and –D4 or-D10. The D variants detected in accessions of S. bukasovii belong to different clusters in the median-joining network (Figure 8) and are identical or very similar to the D variants of several species (e.g., S. achacachense, S. ambosinum, S. canasense, S. marinasense, S. phureja, S. stenotomum, etc.) belonging to clade 4 North + cultivated (Huang et al., 2019). Similarly, the SV-A3 found in S. bukasovii is identical/similar to that of S. abancayense, S. multiinterruptum, S. raphanifolium, and S. chaucha. This unusual diversity of IGS might indicate a complex hybridogenic origin: i.e., some of the examined accessions could represent natural hybrids between S. bukosovii and different Petota species. The genetic heterogeneity of S. bukosovii accessions and the putative hybrid origin of some of them by crossing with S. sparsipilum or S. raphanifolium have recently been demonstrated using plastid and mitochondrial markers (Achakkagari et al., 2020, 2021; Bozan, 2021).

Origin of Polyploid Species

There are several polyploids among the studied species. Two of them, S. chaucha and S. juzepczukii, are triploids. It was originally postulated that S. chaucha arose as a result of hybridization between diploid S. phureja and tetraploid S. tuberosum ssp. andigena (Bukasov, 1939), but later, S. chaucha was recognized as the autotriploid of S. phureja (Bukasov, 1978). In contrast, Hawkes (1962, 1990) suggested that S. chaucha originated from hybridization between S. tuberosum ssp. andigena and diploid species S. stenotomum. Our analysis indicates an allotriploid origin of S. chaucha, as three structural variants of IGS, SV-A3,-D8, and-D10, are found in its genome (Table 4 and Figure 8). The molecular data confirm the close relationship among S. chaucha, S. phureja (accession phu2 but not phu1), S. tuberosum ssp. andigena, and a diploid species, S. stenotomum subsp. goniocalyx; all of which have SV-D8. However, the origin of S. chaucha appeared to be more complicated, because other structural variants of the IGS had to be inherited from species belonging to clusters A3 and D10. It is widely believed that the triploid species S. juzepczukii originated from a natural cross between a cultivated diploid, S. stenotomum, and the wild tetraploid species S. acaule (maternal form), and pentaploid species S. curtilobum arose from a combination of non-reduced gamete of S. jusepczukii (3×, maternal form) with a reduced (2×) gamete of tetraploid S. tuberosum ssp. andigena (Bukasov, 1939; Hawkes, 1962, 1990; Schmiediche et al., 1980). The application of nuclear molecular markers confirmed this scenario (Rodríguez et al., 2010), but an alternative origin of S. curtilobum by hybridization of triploid species of the Andigenum group (non-reduced gamete, maternal form) and S. acaule was later proposed (Gavrilenko et al., 2013; Spooner et al., 2014). We found that the above-mentioned species contain the following IGS variants: S. acaule: SV-D10, S. jusepczukii: SV-A2 and-D2; S. tuberosum ssp. andigena: SV-D6,-D8, and-D9; S. curtilobum: SV-A2,-D1,-D2,-D9, and-D10. Three sets of IGS variants were identified for three accessions of S. stenotomum (stn1: SV-D1,-D9; stn2: SV-D4; gon: SV-D8). Remarkably, five IGS structural variants have been identified for pentaploid S. curtilobum that demonstrate the complex hybrid nature of this species. The analysis of the results showed that several common IGS structural variants are present in genomes of the examined species, which are correspondingly co-localized in the median-joining network (see Figure 8). However, no expected additivity of IGS structural variants from the presumptive parents in the derived alloploid progeny was found. It looks probable that the origin of S. jusepczukii and S. curtilobum may involve more parental diploids and requires further clarification using more plant accessions and additional molecular markers. Two structural variants of IGS, SV-A2, and -D1, are present in the genome of S. medians, indicating its origin from a cross between potato species included in clusters A2 and D1. For S. medians, diploid and triploid populations were reported (Hijmans et al., 2007). Unfortunately, we could not find in SRA information about the ploidy level of the plant accession analyzed here. According to a GISH analysis of meiotic preparations, S. stoloniferum appears to be an allotetrapolyploid species with a genomic constitution of AABB. It has been suggested that the species originated from S. verrucosum as the A genome donor and another North or Central American diploid species (e.g., S. cardiophyllum, S. ehrenbergii, or S. jamesii) as the B genome donor (Hijmans et al., 2007). In that case, regarding our data (Table 4 and Figure 8), it is expected that S. stoloniferum inherited SV-A1, -A2, or -B from S. cardiophyllum, S. ehrenbergii, or S. jamesii, respectively, as well as SV-D4 from S. verrucosum. However, S. stoloniferum possesses only D-variants of IGS, namely, SV-D1 and SV-D3, that could be inherited from S. ambosinum and from one of the species that belong to cluster D3. Previously, we have discussed the presumptive origin of a tetraploid, S. acaule, and S. tuberosum as well as hexaploids S. demissum and S. iopetalum (Volkov et al., 2001). The new data confirm the close relationship between S. demissum and S. acaule (cluster D10) and more distant position of S. iopetalum (cluster D6), and provide new information on putative diploid ancestors of the polyploid species. Our novel data also show that the IGS variants of two accessions of S. tuberosum ssp. andigena belong to different clusters (tbrA1: clusters D4, D8, and D9, and tbrA2: clusters D3 and D10), indicating that these accessions have an independent hybrid origin from different parental diploids.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author Contributions

RV, YT, and IP conceived and designed the study. YT, AS, and LK performed the experiments and collected the material. YT, RV, AS, IP, and VH analyzed the data. RV and VH wrote the manuscript. All authors contributed to the article and approved the submitted version.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  62 in total

1.  Phylogenetic relationships among the polyploid and diploid Aegilops species inferred from the nuclear 5S rDNA sequences (Poaceae: Triticeae).

Authors:  B R Baum; T Edwards; M Mamuti; D A Johnson
Journal:  Genome       Date:  2012-02-16       Impact factor: 2.166

2.  Tobacco ribosomal DNA spacer element stimulates amplification and expression of heterologous genes.

Authors:  N Borisjuk; L Borisjuk; S Komarnytsky; S Timeva; V Hemleben; Y Gleba; I Raskin
Journal:  Nat Biotechnol       Date:  2000-12       Impact factor: 54.908

3.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms.

Authors:  Sudhir Kumar; Glen Stecher; Michael Li; Christina Knyaz; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

4.  IGS sequences in Cestrum present AT- and GC-rich conserved domains, with strong regulatory potential for 5S rDNA.

Authors:  Thaíssa Boldieri de Souza; Marcos Letaif Gaeta; Cesar Martins; André Luís Laforga Vanzela
Journal:  Mol Biol Rep       Date:  2019-09-30       Impact factor: 2.316

5.  Interpopulation hybridization generates meiotically stable rDNA epigenetic variants in allotetraploid Tragopogon mirus.

Authors:  Roman Matyášek; Eva Dobešová; Dalibor Húska; Ivana Ježková; Pamela S Soltis; Douglas E Soltis; Aleš Kovařík
Journal:  Plant J       Date:  2016-02       Impact factor: 6.417

6.  RNAstructure: software for RNA secondary structure prediction and analysis.

Authors:  Jessica S Reuter; David H Mathews
Journal:  BMC Bioinformatics       Date:  2010-03-15       Impact factor: 3.169

7.  Molecular evolution of rDNA external transcribed spacer and phylogeny of sect. Petota (genus Solanum).

Authors:  Roman A Volkov; Nataliya Y Komarova; Irina I Panchuk; Vera Hemleben
Journal:  Mol Phylogenet Evol       Date:  2003-11       Impact factor: 4.286

8.  A comparative genetic linkage map of eggplant (Solanum melongena) and its implications for genome evolution in the solanaceae.

Authors:  Sami Doganlar; Anne Frary; Marie-Christine Daunay; Richard N Lester; Steven D Tanksley
Journal:  Genetics       Date:  2002-08       Impact factor: 4.562

9.  Solanaceae--a model for linking genomics with biodiversity.

Authors:  Sandra Knapp; Lynn Bohs; Michael Nee; David M Spooner
Journal:  Comp Funct Genomics       Date:  2004

10.  Genetic and epigenetic variation in 5S ribosomal RNA genes reveals genome dynamics in Arabidopsis thaliana.

Authors:  Lauriane Simon; Fernando A Rabanal; Tristan Dubos; Cecilia Oliver; Damien Lauber; Axel Poulet; Alexander Vogt; Ariane Mandlbauer; Samuel Le Goff; Andreas Sommer; Hervé Duborjal; Christophe Tatout; Aline V Probst
Journal:  Nucleic Acids Res       Date:  2018-04-06       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.