Literature DB >> 30807571

Mobile genomic element diversity in world collection of safflower (Carthamus tinctorius L.) panel using iPBS-retrotransposon markers.

Fawad Ali1,2, Abdurrahim Yılmaz1, Muhammad Azhar Nadeem1, Ephrem Habyarimana3, Ilhan Subaşı4, Muhammad Amjad Nawaz5, Hassan Javed Chaudhary2, Muhammad Qasim Shahid6, Sezai Ercişli7, Muhammad Abu Bakar Zia8, Gyuhwa Chung9, Faheem Shehzad Baloch1.   

Abstract

Safflower (Carthamus tinctorius L.) is a multipurpose crop of dry land yielding very high quality of edible oil. Present study was aimed to investigate the genetic diversity and population structure of 131 safflower accessions originating from 28 different countries using 13 iPBS-retrotransposon markers. A total of 295 iPBS bands were observed among which 275 (93.22%) were found polymorphic. Mean Polymorphism information content (0.48) and diversity parameters including mean effective number of alleles (1.33), mean Shannon's information index (0.33), overall gene diversity (0.19), Fstatistic (0.21), and inbreeding coefficient (1.00) reflected the presence of sufficient amount of genetic diversity in the studied plant materials. Analysis of molecular variance (AMOVA) showed that more than 40% of genetic variation was derived from populations. Model-based structure, principal coordinate analysis (PCoA) and unweighted pair-group method with arithmetic means (UPGMA) algorithms clustered the 131 safflower accessions into four main populations A, B, C, D and an unclassified population, with no meaningful geographical origin. Most diverse accessions originated from Asian countries including Afghanistan, Pakistan, China, Turkey, and India. Four accessions, Turkey3, Afghanistan4, Afghanistan2, and Pakistan24 were found most genetically distant and might be recommended as a candidate parents for breeding purposes. The findings of this study are most probably supported by the seven similarity centers hypothesis of safflower. This is a first study to explore the genetic diversity and population structure in safflower accessions using the iPBS-retrotransposon markers. The information provided in this work will therefore be helpful for scientists interested in safflower breeding.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30807571      PMCID: PMC6391045          DOI: 10.1371/journal.pone.0211985

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


1. Introduction

Safflower (Carthamus tinctorius L.) belongs to the Compositae family, it is self-pollinated and has a haploid genome size of about 1.4 GB and 2n = 24 chromosomes [1]. This crop is cultivated over wide geographical zones throughout the world for several uses including production of dyes, extraction of edible oil, and various medicinal utilizations [2]. Safflower has been in use since ancient times and the archeological remains of Carthamus spp. were found at sites in Syria about 7500 BC ago [3]. From these sites, cultivation of safflower spread to other regions like, Egypt, the Aegean, and southeastern Europe. Safflower accessions specific to some regions show similarity on the basis of their morphological traits and such a region can be considered as a safflower similarity center. However, there is still a debate on the actual number of similarity centers in the world as ascertained by molecular markers [4]. Knowles [5] proposed seven similarity centers (1: Far East, 2: India-Pakistan, 3: Middle East, 4: Egypt, 5: Sudan, 6: Ethiopia, and 7: Europe) for safflower while, Ashri [6] identified ten similarity centers (1: Near East, 2: Iran/Afghanistan, 3: Turkey, 4: Egypt, 5: Ethiopia, 6: Sudan, 7: Far East, 8: India/Pakistan, 9: Europe, and 10: Kenya). Similarly, Chapman et al. [4] proposed five safflower similarity centers for safflower (1: Near East, 2: Iran & Afghanistan, Turkey, 3: Egypt, Ethiopia, (Sudan), 4: Far East, India/Pakistan, (Sudan), 5: Europe). It is estimated that safflower is cultivated in nearly 20 countries with a total cultivated area of 1,140,002 hectares and the production of 948,516 tons [7]. Safflower major producer countries include, Russian Federation (286,351 tons), Kazakhstan (167,243 tons), Mexico (121,767 tons), USA (99,830 tons), Turkey (58,000 tons), and India (53,000 tons) which account for about 71% of the total world production [7]. In spite of containing good amount of polyunsaturated fatty acids and being resistant to dry conditions, still safflower did not gain the status of major oilseed crop. The primary factors which prevented its cultivation on large scale are low seed yield, low oil content, biotic stresses susceptibility, and spininess [8]. Therefore, the enhanced acceptability and utilization of safflower as an oilseed crop will require genetic improvement for the traits of interest. To this end, genetic diversity can be an effective approach by providing a good source of variations upon which breeding programs can build [9]. However, it is unfortunate that current safflower germplasm and breeding lines displayed low levels of genetic diversity, and were therefore of reduced usefulness in breeding programs. An extensive genetic and phenotypic diversity characterization among global safflower germplasm can help broaden the genetic base and diversity in the safflower crop, and identify elite accessions [10-11]. Safflower genetic diversity was investigated using different molecular markers; Random Amplified Polymorphic DNA (RAPD), Inter Simple Sequence Repeat (ISSR), Amplified Fragment Length Polymorphism (AFLP), Simple Sequence Repeats (SSRs), and Single Nucleotide Polymorphism (SNPs) [4-11-12-13-14-15-16-17-18-19-20], but so far, iPBS-retrotransposon markers have not been used to investigate the genetic diversity in safflower. Retrotransposons are known as an important component of the plant genome in terms of structural evolution and have great potential of changing its position and copy number across plant genome [21]. Retrotransposons are genetic elements ranging from 50 to 90% in various plant genomes depending upon the plant species [22]. Long terminal repeat (LTR) and non- long terminal repeat (non-LTR) are the two classes of retrotransposons, and plant genome reveal higher proportions of LTR retrotransposons as compare to non-LTR [23]. Limitations in the retrotransposon marker systems resulted in the development of a new marker system named Inter-primer binding site (iPBS) retrotransposons having universal applicability [23-24]. iPBS is a PCR-based, universal marker system and depends upon the presence of tRNA as a reverse transcriptase primer binding site [25]. Minimum cost and high efficiency of iPBS-retrotransposons make them good marker system [23]. Various crops like, pea, chickpea, Lens, Turkish okra, Tobacco, and common bean have been studied efficiently using iPBS-retrotransposon markers system [26-27-28-29-30]. Several crop species have been improved utilizing molecular markers in various crop breeding programs [31]. However, for safflower, its genetics and genomics were less studied, which can explain the lack of reliable marker systems for use in the process of developing superior safflower cultivars [32-33]. This study was conducted to evaluate the genetic diversity and population structure of safflower accessions using iPBS-retrotransposons as a start for further scientific investigations and practical breeding use cases.

2. Materials and Methods

2.1. Plant materials and DNA isolation

Experimental materials comprising 131 safflower accessions collected from 28 different countries were evaluated in this study. Among these accessions, 94, 17, and 20 originated from the United States Department of Agriculture (USDA), Plant Genetic Resources Institute Pakistan, and from the Turkish Central Research Institute for Field Crops (Table 1). A total of 94 accessions from USDA and 17 from Pakistan used in this study were landraces. The 20 Turkish accessions were single plant selection among international germplasm from USDA and are candidate cultivars. Seeds of each accession were sown at the research and experimental area of Bolu Abant Izzet Baysal University. Fresh, young healthy leaves were harvested at proper time for the isolation of DNA, brought to laboratory and frozen at -80°C for later use. DNA extraction was performed using the bulk leaves of each accession, and followed CTAB protocol [34] with slight modifications [35]. DNA concentration of each accession was measured using agarose gel (0.8%) and was also confirmed with the help of NanoDrop (DeNovix DS-11 FX, USA). Final DNA concentration for the 131 accession samples to be used in polymerase chain reactions (PCR) was adjusted to 5 ng/μL; the samples were stored at -25 oC till the start of PCR amplifications.
Table 1

Passport data of 131 world safflower panel.

Accession NumberGenotype NameAccession NoDonor OrganizationLocationProvince/DistrictCountry OriginPlant IDContinent
G1Isreal-130548USDA--IsrealP1-198990Asia
G2Romania-130549USDA--RomaniaP1-209287Europe
G3Morocco-130552USDA--MoroccoP1-239042Africa
G4Egypt-130563USDA--EgyptP1-250082Africa
G5Pakistan-130564USDA--PakistanP1-250194Asia
G6Pakistan-230565USDA--PakistanP1-250201Asia
G7Pakistan-330567USDA--PakistanP1-250345Asia
G8Pakistan-430568USDA--PakistanP1-250346Asia
G9Pakistan-530569USDA--PakistanP1-250351Asia
G10Pakistan-630570USDA--PakistanP1-250353Asia
G11Pakistan-730573USDA--PakistanP1-250481Asia
G12Egypt-230574USDA--EgyptP1-250528Africa
G13Egypt-330577USDA--EgyptP1-250532Africa
G14Egypt-430578USDA--EgyptP1-250540Africa
G15India-130579USDA--IndiaP1-250601Asia
G16Egypt-430580USDA--EgyptP1-250605Africa
G17Egypt-630581USDA--EgyptP1-250608Africa
G18Iran-130588USDA--IranP1-250720Asia
G19Jordan-130589USDA--JordanP1-251284Asia
G20Jordan-230590USDA--JordanP1-251285Asia
G21Isreal-230594USDA--IsrealP1-253386Asia
G22Spain-130595USDA--SpainP1-253388Europe
G23Spain-230596USDA--SpainP1-253391Europe
G24Spain-330597USDA--SpainP1-253394Europe
G25Spain-430598USDA--SpainP1-253395Europe
G26Portugal-130604USDA--PortugalP1-253553Europe
G27Portugal-230605USDA--PortugalP1-253556Europe
G28Morocco-230606USDA--MoroccoP1-253560Africa
G29Portugal-330608USDA--PortugalP1-253564Europe
G30Portugal-430610USDA--PortugalP1-253569Europe
G31Portugal-530611USDA--PortugalP1-253571Europe
G32Iraq-130612USDA--IraqP1-253761Asia
G33Iraq-230613USDA--IraqP1-253762Asia
G34Afghanistan-130614USDA--AfghanistanP1-253764Asia
G35Isreal-33015USDA--IsrealP1-253892Asia
G36Syria-130616USDA--SyriaP1-253898Asia
G37Syria-230617USDA--SyriaP1-253900Asia
G38Portugal-630620USDA--PortugalP1-258412Europe
G39Uzbekistan-130623USDA--UzbekistanP1-262435Asia
G40China-130624USDA--ChinaP1-262452Asia
G41China-230625USDA--ChinaP1-262453Asia
G42Iran-230631USDA--IranP1-304444Asia
G43Iran-330633USDA--IranP1-304448Asia
G44Turkey-130646USDA--TurkeyP1-304498Asia
G45Turkey-230648USDA--TurkeyP1-304502Asia
G46Turkey-330650USDA--TurkeyP1-304504Asia
G47Turkey-430651USDA--TurkeyP1-304505Asia
G48Afghanistan-230653USDA--AfghanistanP1-304592Asia
G49India-230662USDA--IndiaP1-305195Asia
G50Russia-130663USDA--RussiaP1-305535Asia
G51India-330673USDA--IndiaP1-306926Asia
G52India-430674USDA--IndiaP1-306941Asia
G53India-530677USDA--IndiaP1-306976Asia
G54Kazakhstan-130681USDA--KazakhstanP1-314650Asia
G55Turkey-530688USDA--TurkeyP1-340086Asia
G56Argentina-130695USDA--ArgentinaP1-367833America
G57Uzbekistan-230696USDA--UzbekistanP1-369846Asia
G58Uzbekistan-330697USDA--UzbekistanP1-369853Asia
G59Syria-330700USDA--SyriaP1-386174Asia
G60Thailand-130701USDA--ThailandP1-387821Asia
G61Iran-430713USDA--IranP1-405958Asia
G62Iran-530718USDA--IranP1-405967Asia
G63Bangladesh-131509USDA--BangladeshPI-401472Asia
G64Bangladesh-231510USDA--BangladeshPI-401478Asia
G65Bangladesh-331511USDA--BangladeshPI-401480Asia
G66India-633538USDA--IndiaPI 199878Asia
G67Afghanistan-333541USDA--AfghanistanPI 220647Asia
G68Australia-133542USDA--AustraliaPI 235660Oceania
G69Turkey-633543USDA--TurkeyPI 237538Asia
G70Pakistan-833547USDA--PakistanPI 250474Asia
G71Pakistan-933548USDA--PakistanPI 250478Asia
G72Iran-633556USDA--IranPI 250840Asia
G73Jordan-333559USDA--JordanPI 251265Asia
G74Jordan-433560USDA--JordanPI 251267Asia
G75Jordan-533561USDA--JordanPI 251268Asia
G76Israel-433564USDA--IsraelPI 251290Asia
G77Turkey-733565USDA--TurkeyPI 251978Asia
G78Turkey-833567USDA--TurkeyPI 251984Asia
G79Austria-133568USDA--AustriaPI 253519Europe
G80Hungary-133575USDA--HungaryPI 288983Europe
G81Libya-133608USDA--LibyaPI 393499Africa
G82Bangladesh-433609USDA--BangladeshPI 401470Asia
G83Iran-733621USDA--IranPI 406010Asia
G84Turkey-933627USDA--TurkeyPI 406701Asia
G85Turkey-1033628USDA--TurkeyPI 406702Asia
G86Pakistan-1033635USDA--PakistanPI 426521Asia
G87China-333638USDA--ChinaPI 543979Asia
G88China-433639USDA--ChinaPI 543982Asia
G89China-533642USDA--ChinaPI 544001Asia
G90China-633651USDA--ChinaPI 568809Asia
G91China-733661USDA--ChinaPI 568874Asia
G92France-133662USDA--FrancePI 576985Europe
G93Austria-233670USDA--AustriaBVAL-901352Europe
G94Pakistan-11CheckPGRI-Pakistan--PakistanThori-78Asia
G95Pakistan-1216266PGRI-PakistanJacobabadSindhPakistan-Asia
G96Pakistan-1316267PGRI-PakistanShikarpurSindhPakistan-Asia
G97Pakistan-1416268PGRI-PakistanShikarpurSindhPakistan-Asia
G98Pakistan-1516269PGRI-PakistanLarkanaSindhPakistan-Asia
G99Pakistan-1616270PGRI-PakistanLarkanaSindhPakistan-Asia
G100Pakistan-1716355PGRI-PakistanDaduSindhPakistan-Asia
G101Pakistan-1816356PGRI-PakistanDaduSindhPakistan-Asia
G102Pakistan-1916357PGRI-PakistanKarachiSindhPakistan-Asia
G103Pakistan-2016358PGRI-PakistanKarachiSindhPakistan-Asia
G104Pakistan-2116359PGRI-PakistanGilgitGBPakistan-Asia
G105Pakistan-2219233PGRI-PakistanGilgitGBPakistan-Asia
G106Pakistan-2320920PGRI-PakistanIslamabadFederal AreasPakistan-Asia
G107Pakistan-2421933PGRI-PakistanKarachiSindhPakistan-Asia
G108Pakistan-2524779PGRI-PakistanQuettaBalochistanPakistan-Asia
G109Pakistan-2627549PGRI-PakistanHyderabadSindhPakistan-Asia
G110Pakistan-2730698PGRI-PakistanHyderabadShindhPakistan-Asia
G111Pakistan-2835803PGRI-PakistanGakoochGilgit/BalistanPakistan-Asia
G112Afganistan-47-TCRIFC-Turkey--Afganistan-Asia
G113Afganistan-59-TCRIFC-Turkey--Afganistan-Asia
G114China-827-TCRIFC-Turkey--China-Asia
G115China-929-TCRIFC-Turkey--China-Asia
G116Turkey-1136-TCRIFC-Turkey-TarmeTurkey-Asia
G117Turkey-1237-TCRIFC-Turkey-TarmeTurkey-Asia
G118Turkey-1357-TCRIFC-Turkey-ElbistanTurkey-Asia
G119Turkey-1458-TCRIFC-Turkey-ElbistanTurkey-Asia
G120Canada-174-TCRIFC-Turkey- Canada-America
G121Canada-275-TCRIFC-Turkey- Canada-America
G122USA-180-TCRIFC-Turkey-MontanaUSA-America
G123Iran-8116-TCRIFC-Turkey- Iran-Asia
G124USA-2130-TCRIFC-Turkey- USA-America
G125USA-3132-TCRIFC-Turkey- USA-America
G126Turkey-15134-TCRIFC-Turkey-TarmeTurkey-Asia
G127USA-4149-TCRIFC-Turkey-İdohaUSA-America
G128Iran-9152-TCRIFC-Turkey- Iran-Asia
G129USA-5153-TCRIFC-Turkey-İdohaUSA-America
G130Iran-10177-TCRIFC-Turkey- Iran-Asia
G131Turkey-16277-TCRIFC-Turkey-TarmeTurkey-Asia

USDA: United States Department of Agriculture; PGRI: Plant Genetic Resources Institute; CRIFC: Central Research Institute for Field Crop;—Not known.

USDA: United States Department of Agriculture; PGRI: Plant Genetic Resources Institute; CRIFC: Central Research Institute for Field Crop;—Not known.

2.2. iPBS-retrotransposon PCR amplifications

Seventy iPBS-retrotransposon primers were initially screened using eight randomly selected accessions of safflower for PCR amplifications [25]. Out of the 70 iPBS-retrotransposon primers, 13 were found polymorphic and selected for PCR amplification, and produced strong bands (Table 2). A total reaction volume of 20 μL for PCR amplifications were comprised of 3 ng/ul template DNA, 2 μL dNTPs (Thermo Scientific), 0.2 μL U Taq DNA polymerase (Thermo Scientific), 3.2 μL primer, 2 μL 1x PCR buffer (Thermo Scientific), 2 μL MgCl2 and 7.6 μL distilled water. Reactions were performed in the sequence of denaturation at 95 oC for 3 min, subsequently followed by 30 denaturation cycles at 95 oC for 15 sec, annealing temperature 50–65 oC for one minute depending upon the primer, and a final extension for five minute at 72 oC [25]. The amplified fragments were electrophoresed on agarose gel 1.2% (w/v) using 0.5x TBE buffer at a constant voltage of 120 V for 230 minute. Staining of the gel was performed with ethidium bromide and visualized using UV Imager Gel Doc XR+ system (Bio-Rad, USA) light and photographed. A 100 bp+ DNA ladder was used as molecular weight marker.
Table 2

List of 13 iPBS-retrotransposon primers with their sequence and annealing temperature used to determine genetic diversity among 131 safflower accessions.

Primer nameSequenceAnnealing temperature (oC)
iPBS2252TCATGGCTCATGATACCA52
iPBS2376TAGATGGCACCA52
iPBS2377ACGAAGGGACCA53
iPBS2391ATCTGTCAGCCA52
iPBS2398GAACCCTTGCCGATACCA51
iPBS2228CATTGGCTCTTGATACCA53
iPBS2374CCCAGCAAACCA53
iPBS2399AAACTGGCAACGGCGCCA52
iPBS2401AGTTAAGCTTTGATACCA53
iPBS2239ACCTAGGCTCGGATGCCA52
iPBS2375TCGCATCAACCA52
iPBS2383GCATGGCCTCCA53
iPBS2392TAGATGGTGCCA52

2.3. Data analysis

Strong, clear, and unambiguous bands were selected for scoring. iPBS-retrotransposon markers are dominantly inherited markers and were therefore scored using the binary system: 0 or 1, respectively, for the absence and presence of specific bands with respect to 100 bp+ DNA ladder (Fig 1). For individual iPBS-retrotransposon markers, PopGene ver. 1.32 [36] was used to estimate various important genetic diversity parameters including effective alleles number (Ne), Shannon's Information Index (I), and gene diversity (He) (Table 3). Polymorphism information content (PIC) was computed for each iPBS-retrotransposon marker following Baloch et al. [28] criteria. At the safflower samples level, the diversity metrics evaluated included the overall gene diversity (Ht), inbreeding coefficient (Fis) and the pair-wise FST (measure of genetic structure), all of which were determined using hierfstat R package [37] following the algorithms of Goudet et al. [38] and Yang, [39]. R statistical software was used to compute pairwise genetic distance (GDj) as measured by Jaccard’s coefficient [40]. The population structure was assessed using the Bayesian clustering model-based STRUCTURE software, unweighted pair group method with arithmetic mean (UPGMA), and Principle coordinate analysis (PCoA). The most suitable number of clusters (K subpopulations) was determined following the protocol of Evanno et al. [41] using STRUCTURE software. A total of ten independent runs were set for each K value, and for each run, the initial burn-in period was set to 500 with 500,000 MCMC (Markov chain Monte Carlo) iterations with no prior information on the origin of individuals. We plotted the clusters number (K) against logarithm probability relative to standard deviation (ΔK). Final assignment of individual accessions was based on the magnitude of the membership coefficient being greater than or equal to 50% as suggested by Habyarimana, [42] and Nadeem et al. [9]. R statistical software was used to analysis of molecular variance (AMOVA) for considering two main population strata: the model based structure and the country of origin of the accessions.
Fig 1

A representative gel imaging picture revealing genetic diversity among 131 safflower accessions using 13 iPBS-retrotransposon markers.

Table 3

List of various diversity parameters computed to evaluate genetic diversity among 131 safflower accessions using 13 iPBS- retrotransposon primers.

PrimersTotal BandsPolymorphic BandsPolymorphism (%)PICNeIHeHt
iPBS22522015750.4321.23990.26660.16090.16092
iPBS2376322990.60.5311.44610.41710.27460.26511
iPBS2377362980.60.7811.49350.45780.30110.28914
iPBS2391108800.6631.46720.41430.27450.27452
iPBS2398222090.90.3161.30230.29010.18350.18353
iPBS2228161487.50.3231.18130.19990.120.12005
iPBS2374272696.30.3741.29040.3130.19390.19393
iPBS2399282692.90.2711.22930.22480.140.12747
iPBS2401221986.40.2311.15780.19980.11170.07055
iPBS2239282692.90.6231.3240.33530.20840.18431
iPBS2375222090.90.5871.44510.40550.26550.25677
iPBS2383151173.30.4881.26030.27870.16930.12818
iPBS2392171482.40.5821.50530.43720.29090.27281
Mean22.6919.7786.10.4771.3340.32610.20730.19441
Total295275      

PIC: Polymorphism information content, Ne: effective alleles number, I: Shannon's Information Index, He: gene diversity, Ht: overall gene diversity

PIC: Polymorphism information content, Ne: effective alleles number, I: Shannon's Information Index, He: gene diversity, Ht: overall gene diversity

3. Results

3.1.iPBS-retrotransposon marker analysis and genetic diversity

Thirteen most polymorphic iPBS-retrotransposon primers produced a total of 295 clear and strong scorable bands with an average of 22.69 bands per primer across 131 safflower accessions. Out of the 295 scorable bands, 275 (93.22%) were polymorphic with an average of 19.77 bands per primer (Table 3). The highest (36) and lowest (10) number of scorable bands were observed for primers iPBS2377 and iPBS2391, respectively. The primers iPBS2376 and iPBS2377 revealed highest number of polymorphic bands (29) each and exhibited highest information content (PIC), while primer iPBS2391 revealed least number of polymorphic bands (8) and was least informative. The PIC value ranged from 0.23 (iPBS2401 primer) to 0.78 (iPBS2377 primer) with a mean of 0.48. Highest (1.51) and lowest (1.16) number of effective alleles were observed for primers iPBS2392 and iPBS2401, respectively with an average of 1.33 effective numbers of alleles. Similarly, maximum (0.46) and minimum (0.20) Shannon's information index was reported for primers iPBS2377 and iPBS2401 and iPBS2228 respectively, having an average value of 0.33. Highest (0.30) level of gene diversity was recorded for primer iPBS2377 while, lowest (0.11) level of gene diversity was observed for primer iPBS2401 with an average of 0.21. At the safflower accession samples level, the overall gene diversity (Ht), Fstatistic (Fst) and inbreeding coefficient (Fis) were 0.19, 0.21, and 1, respectively. The mean genetic diversity indices; observed number of alleles (1.86), effective number of alleles (1.34), Nei's gene diversity (0.21), Shannon's information index (0.33), and overall gene diversity (0.20) across four populations and one unclassified population were also determined (Table 4). Population A revealed observed number of alleles (1.68), effective number of alleles (1.28), Nei's gene diversity (0.17), Shannon's information index (0.28), and overall gene diversity (0.15). Population B revealed observed number of alleles (1.70), effective number of alleles (1.33), Nei's gene diversity (0.20), Shannon's information index (0.31), and overall gene diversity (0.19). Population C revealed observed number of alleles (1.63), effective number of alleles (1.26), Nei's gene diversity (0.16), Shannon's information index (0.26), and overall gene diversity (0.15). Population D revealed observed number of alleles (1.65), effective number of alleles (1.29), Nei's gene diversity (0.18), Shannon's information index (0.28), and overall gene diversity (0.17). Unclassified population revealed observed number of alleles (1.51), effective number of alleles (1.22), Nei's gene diversity (0.14), Shannon's information index (0.22), and overall gene diversity (0.13).
Table 4

Various diversity parameters computed to evaluate genetic diversity among 131 safflower across populations using 13 iPBS-retrotransposon primers.

PopulationsNaNeHIHtMean Jaccard Genetic distance (GD)GD Range
Population A1.68141.28310.17480.27540.14980.2220.05–0.339
Population B1.69831.32550.19920.30960.19440.2420.057–0.33
Population C1.63051.25720.16160.25530.14590.2380.126–0.357
Population D1.65421.29310.18160.28400.16850.3090.148–0.455
UP1.50851.21500.13730.21760.13110.2770.134–0.372
Overall1.86441.33990.21060.33120.19710.2880.05–0.507

Na: observed number of alleles, Ne: effective alleles number, I: Shannon's Information Index, h: gene diversity, Ht: overall gene diversity, UP: unclassified population

Na: observed number of alleles, Ne: effective alleles number, I: Shannon's Information Index, h: gene diversity, Ht: overall gene diversity, UP: unclassified population To clearly understand the broader picture of genetic diversity, pairwise genetic distance among 131 safflower accessions was measured with the Jaccard coefficient. The mean Jaccard genetic distance across the evaluated accessions was 0.288. The highest genetic distance (0.51) was observed between Turkey3 and Afghanistan4 accessions. Similarly, lowest genetic distance (0.05) was present between Afghanistan4 and Afghanistan5 accessions. Genetic distance was also calculated across the populations and mean genetic distance for population A (0.22), population B (0.24), population C (0.24), population D (0.31), and unclassified population (0.28). Analysis of molecular variance (AMOVA) was carried out considering two main population strata: the model based structure and the country of origin of the accessions (Table 5). AMOVA revealed that the country of origin was not significant, while the model statistically significant effects on the molecular genotypic variability resulted from model-based structure (P = 0.005), country within model-based populations (P = 0.02), and model-based populations within country (P = 0.047). Variations between countries were not significant (P = 0.07), whereas variations within countries (P = 0.037) and between populations (P = 0.046) were significant (Table 6). The variations within and between populations explained 43 and 5 percent, respectively, of the genetic structure (Table 7). The country within population and the population within country explained 35 and 52 percent of the observed structure.
Table 5

Analysis of molecular variance (AMOVA) revealing genetic diversity in; (a) country within STRUCTURE populations, (b) populations within country.

A
SourceDfSSMSF.ModelR2Pr(>F)
Country279417348.781.47890.223640.152
country: group2614531558.892.36980.345090.02*
Residuals7718160235.840.43126
Total130421081   
B
SourceDfSSMSF.ModelR2Pr(>F)
Structure42177544.352.30810.051710.005 **
group: country4921771444.31.88390.517030.047 *
Residuals7718160235.840.43126
Total13042108  1 

“**” significance at the 0.1% nominal level and

“*” significance at the 1% nominal level; Country:group = country within STRUCTURE populations; Group:country = populations within country

Table 6

Analysis of molecular variance (AMOVA) revealing genetic diversity within the studied 131 safflower accessions.

TestObsStd.ObsAlterPvalue
Variations within samples235.839-1.9713Less0.037
Variations between samples92.26061.69048greater0.07
Variations between group-2.31092.06289greater0.046
Table 7

Analysis of molecular variance (AMOVA) revealing intra-genetic diversity within different Structure populations.

SourceDfSSMSF.ModelR2Pr(>F)
Populations4278.6369.6588.39810.210490.001 ***
Within populations1261045.118.2950.78951
Total1301323.74  1 

“***” corresponds to significance at the 0.05% nominal level

“**” significance at the 0.1% nominal level and “*” significance at the 1% nominal level; Country:group = country within STRUCTURE populations; Group:country = populations within country “***” corresponds to significance at the 0.05% nominal level In accordance with the observed most suitable goodness of fit (K = 4), the Bayesian clustering model implemented in STRUCTURE software divided the evaluated safflower accessions into four main populations; 31 accessions (23.66% of the total accessions) in the population A (black), 22 accessions (16.79% of the total accessions) in the population B (red), 33 accessions (25.19% of the total accessions) in the population C (blue), 27 accessions (20.61% of the total accessions) in the population D (pink) (Fig 2). Eighteen accessions (on the right-most end of the structure graph) did not reach the membership threshold (50%) and were named unclassified population. The UPGMA based clustering divided 131 safflower accessions into four main clusters corresponding to the four populations (populations A, B, C, D) identified using the model-based structure. The unclassified accessions were dispersed throughout the four populations, particularly in population D where 9 (50%) of the unclassified accessions clustered. With the UPGMA algorithm, two (Jordan4, Jordan5) and five (Israel2, Egypt2, Egypt3, Spain2, Spain4) population B accessions clustered with population D and population C, respectively. Similarly, relative to model-based clustering algorithm, UPGMA discrepantly clustered the accession Iran8 (population C) in population A (Fig 3). PCoA divide all accessions into four populations; A, B, C, and D similar to structure based clustering, with the unclassified accessions being dispersed particularly throughout populations B, C, and D (Fig 4).
Fig 2

Structure-based clustering among 131 safflower accessions using 13 iPBS-retrotransposon markers.

Fig 3

UPGMA based clustering among 131 safflower accessions using 13 iPBS-retrotransposon markers.

Fig 4

Principal coordinate analysis (PCoA) among 131 safflower accessions using 13 iPBS-retrotransposon markers.

4. Discussion

4.1. iPBS-retrotransposons in assessing genetic diversity of safflower panel

To the best of our knowledge, the present investigation represents the first attempt to elucidate the genetic diversity and population structure of safflower accessions at DNA level using iPBS-retrotransposons. It was observed that retrotransposons are abundant and widely distributed throughout plant genome [43] and huge amount of error-prone retroviral replications lead to the accumulation of these genetic variations [44-45]. iPBS based markers have been used greatly for fingerprinting and genetic diversity investigation in plants [26-35-46-47]. A total of 13 polymorphic iPBS- retrotransposon markers were used in this study to carry out genetic diversity in a panel of 131 safflower accessions from 28 different countries, and 295 clear and strong bands were recorded. The average number of bands per primer was 22.69 while, 275 (93.22%) out of 295 bands were polymorphic. Mean polymorphism found in this study was higher than that of Yang et al. [20], as they reported 82.7% polymorphism using ISSRs in 48 safflower accessions. Furthermore, Sehgal et al. [19] obtained even lower polymorphism levels of 57.6, 68.0, and 71.2% using RAPD, SSR and AFLP markers, respectively. Polymorphism is one of the key requirements to determine good quality genetic markers; therefore, iPBS markers satisfy this requirement in safflower. Polymorphism information content (PIC) is a widely used metric of the usefulness of molecular markers [48]. The PIC was found higher (0.48) in this work than in the findings by Ambreen et al. [12], Ambreen et al. [13], Barati and Arzani [49], Derakhshan et al. [50], Hamdan et al. [33], and Lee et al. [17], all of whom used SSR markers to evaluate the genetic diversity in safflower. In their works, Houmanat et al. [51] found lower PIC value of 0.23 relative to this study, using ISSRs markers in safflower. These results clearly suggest that more diverse iPBS-retrotransposon markers loci can be identified and effectively used as a tool for assessing genetic diversity and other investigations relying on genetic variants. Maximum number of effective alleles is desirable because it represent the presence of higher level of genetic variations. Number of effective alleles (1.16 to 1.51) found in this work was in the similar range (1.29 to 1.72) to that of Panahi and Neghab [52] using ISSR markers to assess the genetic diversity in Iranian safflower germplasm. Similarly, Sung et al. [53] obtained lower range of effective number of alleles (1.02 to 1.09) than us using RAPD markers. Possible reason behind the presence of higher number of effective alleles in this study might be the differences of experimental materials used during evaluation and also the different molecular marker system. Shannon's information index usually distinguishes the level of available genetic diversity in a population, combining abundance and evenness. Kumar et al. [54] reported lower range of Shannon's information index (0.24 to 0.44) than this study using AFLP markers, highlighting the safflower accessions evaluated in this work were more diverse with genetic variants being more evenly distributed throughout the population. This was confirmed also by the level of gene diversity which was found higher than that of Ambreen et al. [12] and Pearl and Burke [18]. To know the genetic diversity more clearly, diversity metrics like; overall gene diversity (0.24), Fst (0.21), and Fis (1) were also computed. The Fst (a measure of genetic differentiation) obtained in this work was comparable with the findings of Ambreen et al. [12] as they obtained Fst in the range 0.08 to 0.29. On the other hand, Mokhtari et al. [55] obtained mean Fis value of 0.01 which is lower than that (1.00) presented in this work. Safflower is a self-pollinated crop, higher Fis values are therefore expected. In this study, the estimated Fst value (0.21) was higher than the variation explained by the genetic population as evaluated by the analysis of molecular variance (AMOVA). The difference of magnitude between the two metrics was expected as Fst accounted only for genetic populations as a source of variation, while AMOVA accounted for genetic populations and the provenance of the accessions. To understand the variations level more clearly, various diversity indices were calculated at the population’s level and population B was found superior by representing higher values for these diversity indices. On the other hand, unclassified population reflected lesser level of diversity by accounting lower values for these diversity indices. The evaluation of pairwise genetic distance showed a mean of 0.288, with the highest genetic distance between accessions Turkey3 and Afghanistan4, followed by Afghanistan2 and Pakistan24 with respective distance values of 0.51 and 0.49. Greater similarity was found between Afghanistan4 and Afghanistan5 accessions showing least genetic distance of 0.05. One understandable reason behind the presence of maximum genetic similarity might be due to their origin from the common parents. To explore the genetic diversity more clearly, genetic distances were also calculated at the population level and mean maximum genetic distance was reflected by the population D and minimum was resulted by population A. Within populations, Turkey16 and China9 reflected maximum genetic distance and minimum was present between Afghanistan4 and Afghanistan5 accessions belonging to population A. Within population B, maximum genetic distance was observed between accessions Iraq1 and Jordan4, while minimum genetic distance was shown between accessions Jordan4 and Jordan5. Argentina1 and Iran8 were the two most distinct accessions reflecting maximum genetic distance in the population C and Australia1 and Turkey6 were found two most genetically similar accession of population C representing minimum genetic distance. Within population D, Turkey3 and Iran9 were most diverse accessions and Kazakhstan1 and Pakistan14 were two genetically distinct accessions belonging to unclassified population. Germplasm containing desirable plant traits can be usefully integrated in breeding programs to develop superior cultivars [24], particularly through controlled hybridizations involving genetically distant parental lines. The above four most diverse accessions identified in this work can be recommended as a candidate parents for future safflower breeding programs. The analysis of molecular variance (AMOVA) was used to determine the pattern of the partition of the total gene diversity among and within populations, and the countries of origin [56]. AMOVA showed that most of genetic structure was explained by variations from individuals within populations, the genetic populations within countries and the countries within genetic populations. These findings are in agreement with Wodajo et al. [57], as they reported more within-population (98.9%) importance on genetic structure than among populations (1.1%) using ISSR markers to evaluate 70 safflower accessions from Ethiopia. The discrepancy in terms of the magnitude of variance components explained by the differing sources of variation included in the AMOVA model. The authors included in their model only the population as a source of variation, while in this work two sources of variation were considered including the population and the country of origin. The model-based structure application proved more robust and informative in previous investigations [58-59]. Structure was therefore used in this work as a benchmark for clustering algorithms. Using structure, the 131 safflower accessions were partitioned into four main populations (A, B, C, and D), and 18 individuals with poor membership coefficients across clusters were considered unclassified population (Fig 2). A total of 31, 22, 33, 27, and 18 safflower accessions were found in populations A, B, C, D, and unclassified population, respectively. Population A comprised of 31 safflower accessions from Turkey, USA, Canada, Iran, Afghanistan, China, and Pakistan. This population represents the accessions from the Asian and North American regions. Population B consisted of 22 safflower accessions from different countries including Syria, Israel, Jordan, Afghanistan, Portugal, Spain, Morocco, Iraq, and Egypt. Population B contained the accessions from the Mediterranean countries and all clustering of these accessions together represents their genetic similarity. The 33 safflower accessions found in population C were collected from Pakistan, Bangladesh, Australia, Afghanistan, Turkey, Iran, Libya, Uzbekistan, Thailand, India, China, Syria, and Argentina. Population C comprised of accessions from the Asian and Mediterranean countries and clustering of accessions from both regions proposed the distribution of safflower from Mediterranean region to Asia through Turkey. Population D comprised of 27 safflower accessions from Afghanistan, Turkey, Israel, Egypt, China, Morocco, Romania, Pakistan, India, Iran, and Russia. The unclassified population composed of 18 safflower accessions from Pakistan, Spain, Egypt, France, India, Austria, China, Kazakhstan, Uzbekistan, Hungry, Jordan, Portugal, and Israel. Clustering of accessions from Mediterranean countries confirmed this region as center of origin for safflower especially Syria [3] and from this region, it is distributed to other parts of the world. Turkey, represents a great level of biodiversity, differentiation center among the continents, and played a vital role to connect the continents with each other [24]. On continents basis, population A clustered a total of 7 and 24 accessions belonging to American and Asian continents respectively. In population B, 3, 11 and 8 accessions originated from Africa, Asia and Europe, respectively. Population C comprised accessions from America (2), Asia (29), Europe (1), and Oceania (1). In population D most of the accessions originated from Asia (23), while a few accessions came from Africa (3) and Europe (1). The unclassified population contained genotypes mostly from Asia (11), while the other few came from Africa (2) and Europe (5) accessions also made divergence from above four populations by making their separate group. Clearly, the clustering based on molecular markers did not discriminate the origins of the safflower accessions evaluated in this work, which was also confirmed by the AMOVA inferences. Accessions from different countries clustered together, implying that kinship was more determinant for the population structure than the geographical provenance. In addition to sharing common parentage, similarities of accessions in same group during clustering might also be due to convergent evolution and selection [60]. It can therefore be inferred that populations from different geographical regions shared a great proportion of genetic diversity. The design of the experiment in this work cannot provide explanation of the observed predominance of Asian safflower accessions. However, the above countries of origin are part of the seven "centers of similarity" (the Far East, India-Pakistan, the Middle East, Egypt, Sudan, Ethiopia and Europe) as recognized by Knowles [5]. Safflower accessions from Afghanistan, Pakistan, Turkey, India, and particularly from China were found more diverse as they were present in all populations. The higher diversity observed in the Asian safflower accessions is a strong evidence of their wider adaptability, which is supported by the findings of Yang et al. [20] and Zhang [61]. In 1969, Knowles recognized the existence of seven safflower similarity centers across the world. Overall, the centers of similarity were represented by several accessions in this study. However, the molecular marker data used in this study did not provide much support to the above Knowles’s hypothesis on the similarity centers. Indeed accessions belonging to different similarity centers were clustered together. This lack of importance of similarity centers in defining molecular-based populations was reported in scientific literature [62]. In population A, the safflower accessions locally collected from Pakistan were mostly (12 accessions) part of the India-Pakistan similarity center. Also, six accessions from Turkey, two from Afghanistan, and two from Iran were present in this population and can be assigned to the Middle East similarity center. Population B comprised of safflower accessions from Syria (2), Israel (2), Jordan (4), Afghanistan (1), and Iraq (2) belonging to the Middle East similarity center. Similarly, population B contains safflower accessions from Spain (3), Portugal (5), and Morocco (1) which are part of the Europe similarity center. Population C exhibited safflower accessions from Afghanistan (1), Turkey (6), Iran (5), and Syria (1) revealing the Middle East similarity center. Also, population C contains accessions from Pakistan (3), Bangladesh (4), and India (2) showing the India-Pakistan similarity center. Population D revealed the India-Pakistan similarity center by containing accessions from India (3) and Pakistan (9). Population D also exhibits the Middle East similarity center because it contains accessions from Afghanistan (1), Turkey (4), Israel (1), and Iran (3). The unclassified population revealed the presence of Europe similarity center as it contains one accession from each country; Spain, France, Austria, Hungry, and Portugal. In the same way, India-Pakistan similarity center was also available in the unclassified population due to the presence of safflower accessions from Pakistan (4) and India (1). There is a still need for more research in order to shed more light on the safflower similarity centers at molecular level by collecting and evaluating accessions from all known similarity centers. The investigation of genetic relationships between the 131 accessions using UPGMA clustering algorithm resulted in a clustering pattern comparable with the model-based algorithm with a few exceptions as two and five population B accessions clustered with population D and population C, respectively, and UPGMA discrepantly clustered the accession Iran8 (population C) in population A (Fig 3). Since these accessions displayed mostly full membership coefficients in model-based Structure, the discrepancy observed in UPGMA clustering approaches can be explained by its reduced resolution power relative to the model-based Structure [58-59]. Principal coordinate analysis (PCoA) greatly supported the structure based clustering of 131 safflower accessions using 13 iPBS-retrotransposon primers (Fig 4). The four populations were clearly distinguishable, and the unclassified population was disseminated throughout the other populations, particularly throughout populations B, C, and D. These light discrepancies between PCoA and model-based structure can derive from differing clustering resolution, with model-based structure exhibiting more resolution. Indeed, 40% of the variation in the overall genetic structure was not accounted for by the first two PCoA dimensions presented in this work. The above-mentioned misclassifications of accessions in the principal coordinate space can be explained by the existence of genomic admixture. PCoA analysis revealed the same pattern of distribution of similarity centers as identified by structure based analysis. Population A, B, and D exhibited the Middle East similarity centers as they contain safflower accessions from Turkey, Afghanistan, Iran, Syria, Israel, Jordan, and Iraq. Population C comprised of India-Pakistan similarity center by containing safflower accession from India, Pakistan, and Bangladesh. Europe similarity center is present in population B and in the unclassified population of PCoA based analysis. It suggests more research work regarding the confirmation of safflower similarity centers at molecular level. Overall, iPBS-retrotransposons revealed a good spectrum of genome diversity in safflower and the explored genetic diversity can be used in future safflower breeding programs. As iPBS-retrotransposon marker system demonstrated competitive results in this work and in previous investigations, it is warranted to focus further attention on collecting and evaluating safflower germplasm at molecular level using iPBS-retrotransposons as an important tool for enhancing productivity. To contribute to the yet unending discussion on the safflower similarity centers, a robust sampling techniques including random sampling without replacement can be implemented on the accessions in major world safflower seed repositories; the sampled materials can be evaluated using clustering algorithms such as those implemented in this work.

5. Conclusion

A good level of genetic diversity was identified among 131 safflower accessions. The importance of genetic populations on the genetic structure was significant, but its magnitude was lesser than the importance the variations of individuals within genetic populations. The provenance of the samples showed no effects on the genetic structures in the 131 accessions. Our results most probably obey the seven similarity centers hypothesis of safflower but still there is need to conduct further research works to confirm these similarity centers at the molecular level. Generally, safflower accessions from Asian countries like Afghanistan, Pakistan, China, Turkey, and India were found diverse. Specifically, among 131 safflower germplasm, accessions Turkey3, Afghanistan4, Afghanistan2, and Pakistan24 were found most diverse at molecular level and might be recommended as a candidate parents for future safflower breeding programs.
  26 in total

1.  iPBS: a universal method for DNA fingerprinting and retrotransposon isolation.

Authors:  Ruslan Kalendar; Kristiina Antonius; Petr Smýkal; Alan H Schulman
Journal:  Theor Appl Genet       Date:  2010-07-10       Impact factor: 5.699

2.  Optimizing parental selection for genetic linkage maps.

Authors:  J A Anderson; G A Churchill; J E Autrique; S D Tanksley; M E Sorrells
Journal:  Genome       Date:  1993-02       Impact factor: 2.166

3.  Nested retrotransposons in the intergenic regions of the maize genome.

Authors:  P SanMiguel; A Tikhonov; Y K Jin; N Motchoulskaia; D Zakharov; A Melake-Berhan; P S Springer; K J Edwards; M Lee; Z Avramova; J L Bennetzen
Journal:  Science       Date:  1996-11-01       Impact factor: 47.728

4.  Testing differentiation in diploid populations.

Authors:  J Goudet; M Raymond; T de Meeüs; F Rousset
Journal:  Genetics       Date:  1996-12       Impact factor: 4.562

Review 5.  Eukaryotic transposable elements and genome evolution.

Authors:  D J Finnegan
Journal:  Trends Genet       Date:  1989-04       Impact factor: 11.639

6.  ESTIMATING HIERARCHICAL F-STATISTICS.

Authors:  Rong-Cai Yang
Journal:  Evolution       Date:  1998-08       Impact factor: 3.694

7.  Analysis of population genetic structure with RAPD markers.

Authors:  M Lynch; B G Milligan
Journal:  Mol Ecol       Date:  1994-04       Impact factor: 6.185

8.  Genetic characterization of Iranian safflower (Carthamus tinctorius) using inter simple sequence repeats (ISSR) markers.

Authors:  Bahman Panahi; Mahmoud Ghorbanzadeh Neghab
Journal:  Physiol Mol Biol Plants       Date:  2013-04

9.  Genetic structure, linkage disequilibrium and signature of selection in Sorghum: lessons from physically anchored DArT markers.

Authors:  Sophie Bouchet; David Pot; Monique Deu; Jean-François Rami; Claire Billot; Xavier Perrier; Ronan Rivallan; Laëtitia Gardes; Ling Xia; Peter Wenzl; Andrzej Kilian; Jean-Christophe Glaszmann
Journal:  PLoS One       Date:  2012-03-13       Impact factor: 3.240

10.  Association Mapping for Important Agronomic Traits in Safflower (Carthamus tinctorius L.) Core Collection Using Microsatellite Markers.

Authors:  Heena Ambreen; Shivendra Kumar; Amar Kumar; Manu Agarwal; Arun Jagannath; Shailendra Goel
Journal:  Front Plant Sci       Date:  2018-03-29       Impact factor: 5.753

View more
  3 in total

1.  Development of molecular markers based on LTR retrotransposon in the Cleistogenes songorica genome.

Authors:  Tiantian Ma; Xingyi Wei; Yufei Zhang; Jie Li; Fan Wu; Qi Yan; Zhuanzhuan Yan; Zhengshe Zhang; Gisele Kanzana; Yufeng Zhao; Yingbo Yang; Jiyu Zhang
Journal:  J Appl Genet       Date:  2021-09-23       Impact factor: 3.240

2.  Genome-wide association study identifies various loci underlying agronomic and morphological traits in diversified potato panel.

Authors:  Muhammad Abu Bakar Zia; Ufuk Demirel; Muhammad Azhar Nadeem; Mehmet Emin Çaliskan
Journal:  Physiol Mol Biol Plants       Date:  2020-03-27

3.  Genetic relationships and diversity analysis in Turkish laurel (Laurus nobilis L.) germplasm using ISSR and SCoT markers.

Authors:  Abdurrahim Yilmaz; Vahdettin Ciftci
Journal:  Mol Biol Rep       Date:  2021-06-20       Impact factor: 2.316

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.