Literature DB >> 25541702

Genetic diversity and population structure in a legacy collection of spring barley landraces adapted to a wide range of climates.

Raj K Pasam1, Rajiv Sharma1, Alexander Walther2, Hakan Özkan3, Andreas Graner1, Benjamin Kilian1.   

Abstract

Global environmental change and increasing human population emphasize the urgent need for higher yielding and better adapted crop plants. One strategy to achieve this aim is to exploit the wealth of so called landraces of crop species, representing diverse traditional domesticated populations of locally adapted genotypes. In this study, we investigated a comprehensive set of 1485 spring barley landraces (Lrc1485) adapted to a wide range of climates, which were selected from one of the largest genebanks worldwide. The landraces originated from 5° to 62.5° N and 16° to 71° E. The whole collection was genotyped using 42 SSR markers to assess the genetic diversity and population structure. With an average allelic richness of 5.74 and 372 alleles, Lrc1485 harbours considerably more genetic diversity than the most polymorphic current GWAS panel for barley. Ten major clusters defined most of the population structure based on geographical origin, row type of the ear and caryopsis type - and were assigned to specific climate zones. The legacy core reference set Lrc648 established in this study will provide a long-lasting resource and a very valuable tool for the scientific community. Lrc648 is best suited for multi-environmental field testing to identify candidate genes underlying quantitative traits but also for allele mining approaches.

Entities:  

Mesh:

Year:  2014        PMID: 25541702      PMCID: PMC4277474          DOI: 10.1371/journal.pone.0116164

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Cultivated barley, Hordeum vulgare L. ssp. vulgare, the domesticated form of wild barley H. vulgare L. ssp. spontaneum (C. Koch) Thell., is one of the oldest cereal crops [1]. Barley withstands hot and dry climates, marginal soils, to some extent salinity and a broad range of soil pH conditions [2], [3]. Today, barley is grown from 70° N in Norway to 46° S in Chile. The morphological, physiological and functional variation in barley reflects the underlying genetic diversity, which eases the environmental adaptation of this species [4], [5]. In barley, as in most crops, genetic bottlenecks occurred during domestication and crop improvement. For most loci current elite varieties harbour less genetic diversity than their wild relatives or early domesticates [6]–[9]. Landraces are traditional domesticated populations of locally adapted genotypes maintained by local farmers over generations. Early barley cultivars were direct selections among landraces or descended from genetic recombination of different landraces. Since then, new barley varieties are mainly developed through the reshuffling of alleles resulting in a more or less constant repertoire of alleles within the elite gene pool. Overall, the genetic basis in present elite barley breeding materials is rather limited. Since the beginning of the 20th-century landraces were largely replaced by modern cultivars [10], [11], which are higher yielding under optimal conditions but which can completely fail under harsh environments [12]. Today, most barley landraces have disappeared from practical farming. Many of them are still maintained in ex situ repositories. Globally, landraces represent the largest part of barley germplasm conserved in genebanks (44%, 128.870 accessions) [13], [14]. Examples demonstrating the utilization of landrace genetic diversity include the introgression of i) plant height dwarfing alleles (Rht1 and Rht2) derived from the Japanese wheat landrace “Shiro Daruma” [15], ii) several disease and insect resistance genes in wheat [16], [17], iii) submergence tolerance (Sub1) in rice [18], iv) broad spectrum powdery mildew resistance allele mlo11 (from an Ethiopian barley landrace) [19], the rym4 virus resistance [20] or v) the boron-toxicity tolerance in barley, which was obtained from the Algerian landrace “Sahara” [21]. Genome-wide association analysis (GWAS), a population based method to identify marker-trait associations based on linkage disequilibrium (LD) is being recently used extensively in crop plants [22]. Genetic diversity, population size and stratification, extent of genome-wide LD, allele frequency distribution, marker type and coverage as well as other parameters determine the accuracy, resolution and power of GWAS [22]–[24]. As a consequence of genetic bottlenecks during domestication and crop improvement, allele frequency changes resulted in different levels of LD and genetic diversity. Thus, the extent of LD increases from wild barley to landrace and to elite cultivars, whereas the reverse trend was observed for genetic diversity [25]. The trade-off of higher LD in current cultivars is lower resolution in GWAS [26]. To fine-map QTL, the varying extent of LD observed in different genepools of barley (e.g. wild, landraces, and cultivars) could be exploited and provides a great opportunity for high-resolution association mapping [27]. Several studies were performed in barley to investigate genetic diversity in different germplasm collections using molecular markers, and few of these collections (panels) were established for GWAS. However, most studies were based on either cultivar collections [28]–[30], or mixtures of cultivars and landraces [31]–[33] or landrace panels from distinct geographical regions only [34]–[37]. Very few collections of barley landraces collected from a wider geographic range were established but none of them was designed for higher resolution GWAS [11], [38]–[39].Here we describe the establishment of a comprehensive and diverse collection of spring barley landraces adapted to a wide range of climates (Landrace Collection 1485, “Lrc1485”). We studied genetic diversity and population structure using microsatellite (Simple Sequence Repeat, SSR) markers, which provided the basis for targeted research activities. The legacy core reference set (CRS) Lrc648 established here is available for the scientific community to integrate data and to improve our elite barley varieties under changing environmental conditions.

Material and Methods

Plant material

In total, 1491 spring barley landrace accessions adapted to diverse climate conditions were originally considered for this study. This representative collection was carefully selected among 22,093 Hordeum accessions available in 2008 at the “Federal ex-situ Genebank for Agricultural and Horticultural Crop Plants” maintained at the Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) in Gatersleben, Germany. The landraces originated from 41 countries in Europe, West and Central Asia, North and East Africa and covering 5.63°–62.47° N and 16.62°–71.5° E. The selection was based on the following criteria: i) Mansfeld's taxonomical classification system considering growth habit, row type of the ear, kernel coverage, spike density, and seed colour [40], ii) the collection site had to be well documented (S1 Table), and iii) passport data (Characterization and Evaluation data since 1946) - obtained from IPK's Genebank Information System, GBIS, http://gbis.ipk-gatersleben.de). Barley landraces collected during targeted expeditions before 1992 were considered whenever possible to work with trustable materials. From Syria and Jordan only a few landraces (six accessions) were included - as barley landraces from this region were comprehensively investigated by [34], [41]. Apart from other considerations, the proportion of landraces selected from each country should represent the overall composition of IPK's spring barley landrace collection containing 6,800 accessions meaning the number of accessions per country differed (Fig. 1, Table 1, S1 Table).
Figure 1

Geographic information system (GIS)-based topographic maps indicate the natural distribution of 1485 barley landrace accessions.

(a) Structure (K = 4); (b) Structure (K = 10) - inferred clusters. Every individual accession is represented by a coloured circle indicating the membership to a cluster. Admixed accessions are indicated by black stars. See Fig. 2, Fig. 3, S2 Fig., S3 Fig., S4 Fig., Table 1, Table 3, Table 4, S3 Table and S1 Table for more information.

Table 1

Distribution of landraces considered in this study according to their countries of origin, caryopsis type and row type.

HulledNaked
Country of originNo. of accessionsTwo- rowedSix- rowedTwo- rowedSix- rowed
Afghanistan107987110
Albania2251025
Algeria52300
Armenia44000
Austria56362000
Azerbaijan10100
Bulgaria1501401
Croatia32100
Czech177172
Denmark20002
Egypt51400
Ethiopia299544695104
Finland33000
France92601
Georgia80473300
Germany37 (1)2673 (1)1
Greece70 (1)1950 (1)01
Hungary32001
Iran84443190
Iraq37 (1)14 (1)2210
Italy4292805
Kazakhstan10010
Latvia11000
Libya1231310703
Lithuania11000
Macedonia10100
Morocco5014801
Netherlands11000
Norway11000
Poland58408100
Romania105311
Russia23 (1)5312 (1)3
Slovakia149 (2)146 (2)300
Spain3403400
Sweden32100
Switzerland108101
Syria65100
Tunisia41300
Turkey99485010
Ukraine62301
Yugoslavia* 94500
Total Considered1491 (6) 1485570 (3) 567635 (1) 634143 (2) 141143 (0) 143

*Former Yugoslavia (incl. Serbia, Kosovo, Bosnia and Herzegovina). Numbers in brackets indicate the number of accessions excluded from analyses. Thus in total 1485 accessions were considered for analyses.

Geographic information system (GIS)-based topographic maps indicate the natural distribution of 1485 barley landrace accessions.

(a) Structure (K = 4); (b) Structure (K = 10) - inferred clusters. Every individual accession is represented by a coloured circle indicating the membership to a cluster. Admixed accessions are indicated by black stars. See Fig. 2, Fig. 3, S2 Fig., S3 Fig., S4 Fig., Table 1, Table 3, Table 4, S3 Table and S1 Table for more information.
Table 3

Assignment of 1485 landraces into Structure inferred groups.

AccessionsHulledNaked
GroupassignedTwo-rowedSix-rowedTwo-rowedSix-rowed
a)G1314108 (34.30)151 (48.00)43 (13.60)12 (3.80)
G227744 (15.80)36 (12.90)89 (32.10)108 (38.90)
G343913 (2.90)420 (95.60)1 (0.20)5 (1.10)
G4323316 (97.80)3 (0.00)3 (0.00)1 (0.00)
Admixed13284 (63.60)24 (18.10)7 (5.30)17 (12.80)
Total1485565 (0.380)634 (0.426)143 (0.096)143 (0.096)
b)G11940 (0.00)0 (0.00)88 (45.30)106 (54.6)
G2661 (15.00)64 (96.90)0 (0.00)1 (1.50)
G32268 (3.50)218 (96.40)0 (0.00)0 (0.00)
G45633 (58.90)23 (41.00)0 (0.00)0 (0.00)
G58343 (51.80)40 (48.10)0 (0.00)0 (0.00)
G6801 (1.20)70 (87.50)0 (0.00)9 (11.20)
G79791 (93.80)6 (6.10)0 (0.00)0 (0.00)
G8295288 (97.60)3 (1.00)4 (1.30)0 (0.00)
G9540 (0.00)0 (0.00)42 (77.70)12 (22.20)
G101385 (3.60)132 (95.60)0 (0.00)1 (0.72)
Admixed19695 (48.40)78 (39.70)9 (4.50)14 (0.71)
Total1485565 (38.00)634 (42.60)143 (9.60)143 (9.60)

(a) groups at K = 4 and (b) at K = 10. Assigned - accessions with membership coefficient above the threshold of 60%. Admixed - accessions with less than 60% membership coefficient to a particular group. Numbers in brackets represent the proportion of accessions assigned to either group in percentage (%).

Table 4

Diversity and summary statistics for Structure inferred groups a) K = 4; and b) K = 10.

GroupSample sizeMajor allele frequencyAllele No.Mean allele No.AvailabilityGene diversityHetero-zygosityAllelic richnessGroup specific rare alleles
a) G13140.5553 296 7.04760.98570.57500.00976.674345
G22770.7506 206 4.90480.98620.33010.01236.507257
G34390.5744 296 7.04760.97940.54390.01477.284877
G43230.70292305.47620.98820.40840.00826.277057
b) G11940.79121744.14290.98720.28050.01213.24257
G2660.62511894.50000.98380.48160.01644.4395
G32260.6460 252 6.00000.98020.46740.01494.917 78
G4560.77591403.33330.98680.30060.00523.3235
G5830.81501463.47620.9845 0.2589 0.01223.3095
G6800.63152044.85710.98360.48100.00974.6846
G7970.61272135.07140.9880 0.4929 0.01144.7749
G82950.71262185.19050.98840.39470.00843.94670
G9540.7218 129 3.07140.98680.36060.00803.0658
G101380.66492205.23810.97810.44280.01144.66348

*alleles <1% frequency in the whole collection are considered rare alleles.

*Former Yugoslavia (incl. Serbia, Kosovo, Bosnia and Herzegovina). Numbers in brackets indicate the number of accessions excluded from analyses. Thus in total 1485 accessions were considered for analyses. The IPK genebank practices the splitting of original landrace accessions and maintains them as morphologically distinct accessions to counteract the possible loss of rare alleles in the original population of genotypes [42], [43]. For this reason, a current landrace accession might correspond to a single representative genotype of the original landrace population collected. Whenever possible, latitude and longitude coordinates were inferred using the original collection site descriptions. A broader source location (nearby city or province or country capital) was considered to infer the geographic coordinates, when the exact collection location was not documented. Searches were performed using Google maps (http://maps.google.com/maps) and global Gazetteer version 2.2 (http://www.fallingrain.com/world/index.html).

Genotyping using SSR markers

Four seeds per landrace accession were randomly selected and sown under greenhouse conditions at IPK in 2008. Leaf material from one representative plant per accession was harvested three weeks after sowing. DNA was isolated from freeze-dried leaves using a BioRobot 9600 Work Station and the MagAttract 96 DNA plant core kit (Qiagen, Germany). Forty-five fluorescence-labelled SSR markers were selected based on their mapping position in the barley genome, covering all seven chromosomes [44], [45] (Table 2, S1 Fig.). Primers were labelled with HEX, FAM and TAMRA dyes allowing multiplexing of primers pairs into 15 multiplexes (M1 to M15) with three primer pairs per amplification. PCR reactions were performed following the protocols described by [46]. Amplification products were separated on a MegaBACE 1000 capillary sequencer (Amersham Biosciences). Fragment sizes were recorded and analyzed using MegaBACE fragment profiler software version 1.2 and inspected manually. Allele sizes and peak intensities were recorded. Low intensity bands were assigned missing values. Two markers that were monomorphic (GBM1043 & GBM1036) and one marker that amplified multiple fragments (GBM1326) were excluded from further analysis (Table 2, S1 Fig.). Six accessions were excluded from diversity and population differentiation analysis due to pure DNA quality leaving 1485 accessions for analysis (S1 Table).
Table 2

Diversity statistics for 1485 landrace accessions.

MarkerMplexDyeSSRLinkage groupPositionFragment rangeAllele numberMissing (%)Hetero-zygosityPICMAFAllelic richnessGene diversity
motif
GBM1363M1HEX(AGG)75H120.68110–12031.54880.00680.37760.52042.19500.5014
GBM1404M1FAMTATG6H129.76220–29081.81820.00620.04990.97442.74700.0506
GBM1461M1TAMRA(CA)6n1H135.94190–240208.08080.00220.81080.264812.43000.8307
GBM1033M2FAM(AT)97H67.13270–29081.95290.00000.58320.48496.45200.6432
GBM1110M2HEX(AAG)63H60.27210–245101.01010.03940.50510.63605.75200.5460
GBM1326M2TAMRA(CTT)87H31.24Excluded
GBM1013M3FAM(CTG)91H67.5160–17550.87540.00750.35420.71373.73200.4231
GBM1015M3HEXACAT4H115.94190–275221.21210.02040.82150.336115.18200.8350
GBM1176M3TAMRAAT5H18.59280–29574.04040.00000.64300.35415.59400.7001
GBM1031M4HEXAG3H50.26280–29560.60610.00140.60900.43835.09200.6697
GBM1043M4FAMAAC3H90.39Excluded
GBM1212M4TAMRA(AGG)56H55.1100–11150.60610.00140.48570.51323.36200.5740
GBM1003M5TAMRACTT4H79.53185–220111.41410.00750.54500.56866.11600.5975
GBM1035M5HEXCT2H29.46270–28551.14480.00200.70750.31884.84200.7515
GBM1064M5FAMAGGG5H157.6280–30081.07740.00000.40780.71024.45700.4515
GBM1020M6TAMRAAC4H64.05240–25042.08750.00070.38110.52202.40500.5033
GBM1036M6HEXCT2H156.39Excluded
GBM1334M6FAM(GGC)81H70.69120–14052.35690.00140.31620.75172.88900.3799
GBM1029M7TAMRAAG1H60.42220–23050.20200.00070.37680.61443.02700.4825
GBM1047M7HEXAGC2H129.66205–22260.13470.00270.57920.41104.37100.6494
GBM1413M7FAM(TCATA)63H49.69150–17560.20200.03030.57380.46744.57400.6423
GBM1021M8FAMAC6H40.17250–280151.07740.01290.73130.30677.99300.7677
GBM1060M8HEXGGT7H8.78200–22061.01010.00070.43430.68244.55400.4832
GBM1075M8TAMRAGT6H50.08290–30550.80810.00070.48770.63304.12200.5384
GBM1007M9FAMAC1H26.45180–230220.94280.02920.69320.465712.83600.7212
GBM1256M9TAMRA(GA)86H75.4340–36091.61620.00340.83960.21288.63000.8562
GBM1483M9HEX(GCG)75H80.64150–18061.61620.01500.39010.62533.61100.4856
GBM1221M10HEX(AC)104H14.65105–135120.67340.00270.73040.36018.95700.7628
GBM1405M10TAMRA(CGCA)53H86.33270–29040.53870.00470.68530.35424.00000.7335
GBM1516M10FAMCT7H81.2190–115100.53870.04190.65990.39827.74300.7074
GBM1063M11FAM(ACAT)76H63.49195–22070.20200.03370.68640.36405.65500.7326
GBM1280M11HEXCTT3H3.83270–29560.20200.00540.59700.43254.11500.6639
GBM1323M11TAMRA(GCC)84H28.96110–13570.47140.00200.51590.56224.69900.5823
GBM1002M12TAMRACCT1H101.5250–355120.47140.00410.33550.77274.45500.3723
GBM1464M12FAMAT7H53.53130–220150.67340.01020.72540.30456.77800.7641
GBM1501M12HEX(TAGA)64H0250–290120.60610.02640.48300.63864.95600.5329
GBM1026M13FAMAC5H53.08210–22052.22220.00140.39460.66874.02600.4694
GBM1419M13TAMRACTCAT7H95.7590–13081.68350.01640.50250.62345.42500.5509
GBM1459M13HEX(AC)72H64.35150–175102.28960.04890.66700.45427.90900.7067
GBM1018M14FAM(CCG)64H132.69250–28571.81820.00340.39470.70233.83600.4505
GBM1061M14HEX(GGT)61H130.75320–350105.99330.05080.63180.38655.80000.6886
GBM1208M14TAMRA(AG)62H102.85130–160102.22220.00340.53730.50486.47800.6089
GBM1008M15HEX(AAC)106H95.37150–180101.68350.03970.64680.38847.92900.6967
GBM1054M15FAMCCG5H132.16255–27591.88550.00340.58660.50005.00100.6448
GBM1218M15HEXGA2H72.45130–150112.55890.00690.54980.57296.40300.5989
Mean8.85711.52800.01190.54840.51225.74120.6036

Marker name, multiplex (Mplex) number, SSR motif, chromosomal linkage group, genetic position in cM, range of fragment sizes, number of alleles per locus, percentage of missing data per marker, heterozygosity, polymorphic information content (PIC), major allele frequency (MAF), allelic richness and gene diversity values across 45 SSR loci.

Marker name, multiplex (Mplex) number, SSR motif, chromosomal linkage group, genetic position in cM, range of fragment sizes, number of alleles per locus, percentage of missing data per marker, heterozygosity, polymorphic information content (PIC), major allele frequency (MAF), allelic richness and gene diversity values across 45 SSR loci.

Inferring the population structure

The population structure of the 1485 landraces (Lrc1485) considered was inferred using Structure 2.2 [47], [48] based on 42 SSR markers. This approach uses a Bayesian clustering analysis to assign individuals to clusters (K) without prior knowledge of their population affinities. Structure simulations were performed with the number of presumed clusters from K = 1 to K = 20 and five runs per K value. For each run, the initial burn-in period was set to 50,000 followed by 100,000 Markov Chain Monte Carlo (MCMC) iterations. The most probable number of clusters was determined by plotting the estimated likelihood values [LnP(D)] as a function of K [48]. Furthermore, delta (K) values were also calculated as proposed by [49]. A cut-off limit of 60% membership coefficient (Q-matrix) was considered to assign the individuals to a particular group as suggested by [50]. Accessions that did not meet this criterion were considered as admixed. Principal Component Analyses (PCA) were performed using Past 2.12 [51]. Neighbor-Joining tree and Neighbor-Net planar graph based on Hamming distances (uncorrected p-distance) between 1485 landraces were constructed using SplitsTree 4.13.1 [52].

Genetic diversity and population differentiation

The number and frequency of alleles, gene diversity and heterozygosity (He), were determined for all loci across the total population using Powermarker 3.25 [53]. Polymorphism Information Content (PIC) values were determined according to [54]. Genetic variation within and among populations was estimated by Analysis of Molecular Variance (AMOVA) using Arlequin 3.1 [55]. AMOVA was conducted between morphological, geographical and Structure inferred groups. Genetic differentiation among groups was calculated based on unbiased Fst estimators [56]. Pair wise population comparisons using Fixation statistics (Fst) were determined among all groups as well as allelic richness, gene diversity (GD) and the number of alleles for each group were calculated using Fstat 2.9.3 [57]. Allelic richness values for subpopulations were calculated based on rarefaction to account for varying sample size [58]. In all analyses, statistical significance was determined by performing 1000 permutations.

Spatial distribution of groups in relation to geography and climate

Geographic ground distances in kilometres between accessions were calculated based on latitude and longitude coordinates. The genetic diversity index [59] was calculated, and the genetic distance matrix was calculated using the shared allele distance approach of [60] based on allele frequencies at 42 loci. Based on this, Mantel tests were conducted between genetic distance and all other distance matrices. Mantel correlograms were generated analogous to the autocorrelation function [61] - and allowed to assess the overall relationship between matrices and to determine the significance level of correlation for each distance class. Mantel correlograms were constructed using Passage 2.1 [62]. The spatial distribution of accessions, categorized into population-based groups, was visualized in relation to climate zones based on the Köppen-Geiger classification [63]. These climate zones are obtained by classifying the mean climate conditions on land areas around the globe using climate variables such as annual and seasonal mean temperatures and precipitation. The five main zones range from tropical climate through arid and temperate climates to polar climates. Subgroups can be determined by, for example, hot summers, dry and cold winters, year-round precipitation or monsoonal conditions.

Establishing a Core Reference Set (CRS) for high resolution LD-mapping

To build a legacy CRS [64] comprising c. 600–700 genotypes, we first genetically purified the entire collection using two rounds of single seed descent (SSD) [65] under greenhouse conditions in 2008–2009 (started with the same plants from which we isolated DNA, see above). Afterwards, the materials were multiplied and survey-phenotyped under field conditions at IPK using local standard agricultural practices in two subsequent years: i) 2010 (single row per landrace genotype, isolated by one row of a spring wheat genotype, 8 plants per row, 20 cm distance between the plants), and ii) in 2011 (micro plots, 1.2×1 m; all available seeds from the 2010 harvest were sown and equally distributed over six rows). Plants were harvested and threshed manually to avoid mixtures of seeds. In total, 1014 genotypes produced sufficient seeds after two rounds SSD and the two subsequent multiplication cycles (S1 Table). Subsequently, the M strategy as implemented in Mstrat [66] was used to establish the legacy CRS. All data sets available were considered for this: i) geographical origin; ii) 42 SSR markers, and iii) morphological traits as quantitative parameters, which were obtained during seed multiplication in 2011, such as row type, caryopsis type, heading date, plant height and spike length (S1 Table). The best CRS that maximizes the number of observed alleles was then established based on five replications using Mstrat. Diversity scores calculated based on allelic richness [67] were compared and validated with a random sampling approach as well as by comparing phenotypic diversity.

Results

We considered a collection of 1485 landraces originating from 41 countries (Fig. 1). This collection comprised of 708 two-rowed (47.7%) and 777 six-rowed (52.3%) barley genotypes. In total, 284 (19.1%) naked genotypes were considered (Table 1).

High levels of polymorphism and many rare alleles in the collection

Data quality was high, and the level of missing information across SSR markers was very low (1.528%). Based on the 42 SSR markers considered, 372 alleles were obtained with fragment sizes ranging from 90 to 360 bp (Table 2, S2 Table). The number of alleles per locus ranged from 3 (GBM1363) to 22 (GBM1007, GBM1015) with an average of 8.86 alleles per locus. Major allele frequencies per locus ranged between 0.213 (GBM1256) and 0.974 (GBM1404) (mean value of 0.512) and PIC values ranged from 0.050 (GBM1404) to 0.839 (GBM1256) (average of 0.548). The majority of markers (78%) showed PIC values between 0.4 and 0.8. The average allelic richness was 5.7412, ranging from 2.1950 (GBM1363) to 15.1820 (GBM1015) (Table 2). An average gene diversity (GD) value of 0.6036 was obtained, indicating a high level of genetic variation among the accessions. Heterozygosity levels (He) were very low ranging 0–0.05, with an average value of 0.0119 per locus (Table 2). A total of 157 rare alleles (allelic frequency <1% in the total collection) were identified amounting to 42% of the total number of alleles discovered (Table 2, S2 Table).

Population structure within the panel

Structure runs were performed for K = 1 to K = 20 based on the distribution of 372 alleles at 42 SSR loci among 1485 accessions. LnP(D) values increased slowly starting from K = 10, thus probably representing the most appropriate number of major clusters in this collection. However, the maximum delta (K) value was reached at K = 4 (Fig. 2).
Figure 2

Structure analysis for 1485 barley landraces.

a) mean Log probability values (LnP(D)) plotted as function of K (number of clusters); b) Delta K vs. K plotted as proposed by [44]. The graph indicates the maximum change at K = 4.

Structure analysis for 1485 barley landraces.

a) mean Log probability values (LnP(D)) plotted as function of K (number of clusters); b) Delta K vs. K plotted as proposed by [44]. The graph indicates the maximum change at K = 4. The primary division at K = 2 was observed mainly between Ethiopian (Group 1) and non-Ethiopian landraces (Group 2) (S1 Table). Here 92.59% of the landraces were assigned to either of two groups (G) considering the 60% membership coefficient (Q-matrix). The least number of admixtures (110 accessions; 7.4%) was observed at this K value. At K = 4, 91.1% of the landraces were assigned to one of the following groups – G1 (314): two-rowed and six-rowed barleys mainly from Southwest Asia; G2 (277): two-rowed and six-rowed landraces mostly from Ethiopia; G3 (439): mainly six-rowed hulled barleys from a wide geographical range including northern Africa; and G4 (323): mainly two-rowed hulled forms from Europe and Southwest Asia. The population structure inferred using PCA provided congruent results (Fig. 1, Fig. 3, S2 Fig., S3 Fig., Table 3, Table 4, S1 Table).
Figure 3

Population structure of 1485 landraces using 42 SSR inferred by Structure.

a) for K = 4; and b) for K = 10. Genotypes were ordered according to their membership coefficient (Q) values to one group.

Population structure of 1485 landraces using 42 SSR inferred by Structure.

a) for K = 4; and b) for K = 10. Genotypes were ordered according to their membership coefficient (Q) values to one group. (a) groups at K = 4 and (b) at K = 10. Assigned - accessions with membership coefficient above the threshold of 60%. Admixed - accessions with less than 60% membership coefficient to a particular group. Numbers in brackets represent the proportion of accessions assigned to either group in percentage (%). *alleles <1% frequency in the whole collection are considered rare alleles. Known key determinants of the population structure for barley like i) row type of the ear, ii) kernel coverage, and iii) geographical origin divided the collection at K = 10 (Fig. 1, Fig. 3, S2 Fig., Table 3, Table 4, S1 Table). The clusters were also relatively well associated to distinct climate zones (S4 Fig.). G1 (194) consisted of naked barleys mostly from Ethiopia. G2 comprised of 66, mostly six-rowed landraces from a wide geographical range were hot desert climate is prevailing. G3 was the second largest group (226; 15.2%) and harboured the highest average number of alleles (252) and 78 group specific rare alleles. Allelic richness based on rarefaction provided the highest value for this group (Table 4b). Mainly six-rowed hulled barleys from the Mediterranean were included here: Libya (61), Morocco (48), Spain (30), Italy (21), Greece (28) and Turkey (10). All 56 landraces assigned to G4 originated from or close to Georgia. Interestingly, both two-rowed and six-rowed hulled types were present in this group. All but one of the 83 accessions assigned to G5 were hulled barleys from Ethiopia but with different row types: including all assigned deficiens (9); intermedium (4); and labile (7) genotypes. Gene diversity was lowest for G5 (0.258) (Table 4). Altogether, 80 landraces clustered in G6, comprising mainly hulled barley from Afghanistan (61) and Iran (7). Among nine naked barleys considered here, five were collected in Afghanistan. Ninety-seven, mainly two-rowed landraces from Minor Asia and the eastern part of the Fertile Crescent were assigned to G7 including lines from Turkey (29), Iraq (11), Iran (32), Afghanistan (5) and Georgia (7). Highest gene diversity value was found for this group (0.492) (Table 4). G8, the largest group at K = 10, consisted of 295 accessions of mainly two-rowed hulled types. Among them, 276 accessions originated from Europe. G9 was the smallest group harbouring 54 naked landraces, mainly of non-Ethiopian origin (52), and only 129 alleles (Table 4). Finally, G10 consisted of 138 landraces, majorly six-rowed hulled barleys from a broad geographical range north of the Mediterranean Sea (Fig. 1, S4 Fig.). At K = 10, principal components (PC) 1 and 2 explained 9.90% and 8.58% of the variation, respectively, and separated most Structure inferred groups (S2 Fig.). Relationships among accessions within each Structure inferred group are shown in S5 Fig. Variation explained by PC1 and PC2 together, per group, ranged from 11% to 51% (G1, 23.95%; G2, 39.67%; G3, 11.17%; G4, 25.32%; G5, 21.15%; G6, 18.036%; G7, 26.04%; G8, 19.99%; G9, 51.05%; G10, 16.84%) indicating especially within G1, G2, G4, G5; G7 and G9 further substructure based on e.g. row type or geographical origin. Pairwise Fst values among Structure inferred groups at K = 10 ranged from 0.196 (between G4 - from Georgia and G7) to 0.556 (between G1 – naked Ethiopian, and G4). Similarly pairwise Fst comparisons for inferred groups at K = 4 ranged from 0.187 to 0.388 (S3 Table). Highly significant levels of genetic differentiation between and within populations were found using AMOVA. The percentage of genetic variation among populations ranged from 6.38% (all two-rowed vs. all six-rowed Ethiopian) to 37.6% (Structure inferred groups() and the percentage within the populations ranged from 62.40% (Structure inferred groups() to 93.62% (all two-rowed vs. all six-rowed Ethiopian). The values for the comparison of all Ethiopian naked vs. all non-Ethiopian naked barleys are interesting and possibly indicating two evolutionary lineages (S4 Table).

Association between eco-geographical factors and genetic diversity

Significant relationships were found between genetic distances of accessions and eco-geographical parameters at the site of origin (S5 Table). Most significantly correlated with genetic distances was geographical distance, followed by latitudinal distance (S5 Table). The Mantel test between the genetic distance matrix (shared allele distance matrix) and the geographical distance matrix revealed a significant Mantel correlation of 0.357 (P <0.0001). The correlation between genetic distance and longitudinal distance (r = 0.305, P <0.0001) was high. The correlation between genetic distance and latitudinal differences (r = 0.193, P <0.0001) was 2-fold lower than the correlation between genetic and geographic distances. In the Mantel correlogram for genetic distance vs. geographic distance, the matrix was subdivided into 20 discrete distance classes. Within the distance class of 0-300 km between the collection sites of accessions, the correlation was highest (r = 0.523). The r - values (Mantel correlation) and their significances declined with increasing distances. Spatial Mantel correlograms provided similar trends (S6 Fig., S5 Table). Structure inferred clusters at K = 10 were assigned to distinct Köppen climates, although value should not be given to the few suspect lines (see below) (Fig. 1, S3 Fig., S4 Fig.). S6 Table shows the major Köppen climates for each of the Structure inferred group. Climate zones range from winter dry tropical climates (Aw) to fully humid boreal climates (Dfb). Most samples are located in warm temperate climates, either summer dry (Csa) or fully humid (Cfb). In more detail one can observe, that all accessions from G1 and G5 are located in winter dry climates in Ethiopia; G4 lines were only sampled from Georgia, where fully humid boreal climates (Dfb) prevail. G2 is mainly located in hot and dry desert climates (BWh) and G6 (six-rowed) prevails in wintercold steppe climates (BSk). G7 occurs along a narrow latitudinal range (30–43° N) in Minor Asia and the Fertile Crescent in warm temperate summer dry climates (Csa). G8 (two-rowed) and G10 (six-rowed) were sampled further north, majorly between 40–50°N (Cfb, temperate fully humid) and 36–47°N (Csa, temperate summer dry), respectively.

Establishment of a core reference set

To determine the theoretical optimum size of a core group representing most of the genetic diversity within the whole Lrc1485 collection, different sizes of core groups from n = 1 to n = 1485, with a step size increase of 50 accessions were computed using Mstrat and plotted against their diversity score calculated based on allelic richness. The M strategy performed better in efficiently capturing maximum allelic diversity than the random sampling approach (S7 Fig., Table 5). The theoretical optimum size of the core group was determined as in the range between c. 600 to 750 samples. More specifically, 745 individuals must be selected to harbour the maximum allelic diversity (S7 Fig.). Subsequently, based on the 1014 genotypes, which were available after two rounds of SSD and multiplication, the core reference set Lrc648 was established using Mstrat consisting of 308 two-rowed (242 hulled and 66 naked types) and 340 six-rowed (285 hulled and 55 naked types) genotypes (S8 Fig., S1 Table, S7 Table). We validated the superiority of the M strategy by comparing phenotypic diversity between the core set Lrc648 and a random set of landraces comprising the same number of accessions (Lrc648r) (S8 Table).
Table 5

Comparison of diversity statistics for different panels.

Genobar collectionLrc1485Lrc648Lrc648r Lrc648 2-rowedLrc648 6-rowed
Number of accessions2241485648648304344
Number of SSR markers424242424242
Average allele number5.36 8.95 8.387.547.527.00
Gene diversity0.570.600.600.590.610.58
Polymorphism information content0.520.550.550.540.560.52
Total number of alleles225 372 352317316294
Major allele frequency0.570.510.510.510.510.53
Number of unique alleles53a 159a 141a 135 a 11297
Number of rare alleles (freq <0.01)31 157 1381059095

Genobar, Lrc1485, Lrc648, Lrc648r (648 randomly selected lines of Lrc1485 as additional validation), Lrc648 two-rowed sub-panel, and Lrc648 six-rowed sub-panel. atwo way comparisons: Genobar vs Lrc1485, Genobar vs Lrc648, Genobar vs Lrc648 (304 two-rowed accessions) and Genobar vs Lrc648 (344 six-rowed accessions) and Genobar vs Lrc648r.

Genobar, Lrc1485, Lrc648, Lrc648r (648 randomly selected lines of Lrc1485 as additional validation), Lrc648 two-rowed sub-panel, and Lrc648 six-rowed sub-panel. atwo way comparisons: Genobar vs Lrc1485, Genobar vs Lrc648, Genobar vs Lrc648 (304 two-rowed accessions) and Genobar vs Lrc648 (344 six-rowed accessions) and Genobar vs Lrc648r.

Discussion

The demand for higher yielding and better-adapted crop varieties has raised the need to exploit the large ex situ genebank collections [68]. So far, in case of barley, only very few collections of landraces were investigated, mostly sampled from particular geographical regions [34], [35], [69], [70]. The primary aim of this study was to establish and to characterize a diverse legacy collection of barley landraces adapted to a wide range of climates.

Lessons learned from genebank materials

Most probably due to careful selection and accurate description of the material (conservation management of IPK genebank certified according to - DIN EN ISO 9001:2008; line splitting practice for heterogeneous accessions), very few geographical outliers were observed. Another reason for the good fit of accessions into geographical clusters could be that 1035 (69.7%) accessions were sampled during IPK collection missions (or from collections which are hosted at IPK e.g. AUTMAYR-22-32, HIND-35/36, IRNFAOKU-52-54, S1 Table) and then maintained at IPK – thus significantly reducing confusions arising from seed exchange with other genebanks. For most accessions considered here, the collection site information was available (which is usually rarely the case for landraces from other genebanks), thus significantly improving spatial-evolutionary studies, too. Few accessions appeared to be suspect and do not seem to represent their original collection site (e.g. at K = 10 for G1: the seven lines sampled in Europe and Morocco). These accessions were probably not collected originally in these areas. We assume they were sampled elsewhere and then grown ex situ in these countries, or they may have been incorrectly recorded or mixed up during seed exchange and propagation. As recently shown by [50] consequent elimination of any doubtful line identified would provide the best resolution.

Higher values of genetic diversity within the Lrc1485 and the legacy collection Lrc648 compared to the world-wide Genobar GWAS panel

In the collection of 1485 barley landraces, 372 alleles were detected using 42 SSR markers, with an observed average allele number (AN) of 8.95 alleles per locus, which is larger than in most other studies, including [71] using 45 SSRs in 223 cultivars of worldwide origin but also including some wild barleys (AN = 7.7), [69] using 39 SSRs for Eritrean landraces (AN = 7.6), [35] with 44 SSRs for Himalayan landraces (AN = 5.54) or [72] with 12 SSRs for Sardinian landrace populations (AN = 5.6). On the other hand, [73] reported AN = 16.7 for a worldwide collection of 953 barley accessions including wild barleys based on 48 genomic SSRs – which are much more polymorphic than the cDNA derived SSRs in our study. Certainly, the number of alleles per locus depends on the genotypes considered, the loci investigated and the marker type. A total of 157 rare alleles were detected in the whole collection. Rare alleles were mostly detected at SSR loci, which displayed a high number of polymorphic alleles. The presence of 42% rare alleles highlights the potential of this collection for subsequent allele mining studies but could potentially limit GWAS. To evaluate the potential of Lrc1485 for GWAS in more detail, genetic diversity values were directly compared to the potentially most diverse GWAS panel for barley comprising of 224 spring barley cultivars and landraces of worldwide origin (“Genobar” panel) [32], [46], [74], [75] - using the same set of 42 SSR markers (Table 5). Overall, the Lrc1485, the CRS Lrc648 but also the subpanels of Lrc648 based on the row type of the ear - are all more diverse than the Genobar panel as indicated by e.g. the total number of alleles, number of unique alleles or average gene diversity. Unique alleles observed in the Genobar panel came from genotypes collected from East Asia as well as from the Americas. Such regions were not considered in Lrc1485 (Table 5). LD estimates between SSR markers were computed (S9 Fig.). All pair-wise comparisons showed very low LD (r2 <30), which is not surprising due to the relatively low marker coverage (S1 Fig.) and the large population size. Inferring genome wide LD dynamics in this population from few SSR markers would be a vague interpretation. Therefore, high-density marker coverage across the genome is required (a least a few hundred of mapped and equally spaced SNPs) to estimate more precisely the LD extent and the pattern at the population level and across the genome.

Ten major clusters define most of the population structure within the spring Lrc1485 panel

We assessed population structure by different approaches. Based on Bayesian clustering and PCA analyses we considered K = 10 as the most appropriate number of major clusters in the Lrc1485 collection. Although the maximum delta (K) value was reached at K = 4, clusters at K = 10 were better defined based on geographical origin, row type of the ear and caryopsis type – and also associated to distinct Köppen climate zones. Clusters G1, G3, G4, G5 G8 and G10 were distinct and well supported. An intrinsic genetic substructure was visible for groups, which included accessions from a larger geographical range (G2, G6, G7, G9). PCA provided mostly congruent results for these clusters. However, PCA does not classify accessions into discrete clusters in all cases, especially not when admixed accessions and accessions of various geographical origins with a constant gene flow are included [76]. Neighbor-Joining and Neighbor-Net analysis supported these findings. All clusters obtained by Structure analysis were distinguishable, although a high amount of reticulation was evident, owing to the fact that common alleles per locus were shared among geographic regions (S10 Fig., S11 Fig.). To explore the genetic diversity and relationships among and within Structure inferred groups, various diversity statistics were assessed (Table 4). Gene diversity (GD) over 42 loci was highest for G7, G6, G2 and G3, which is comparable with other local collections [69], [77]. Highest values for allelic richness and group specific rare alleles were found in G3, which might be due to i) materials sampled from a wide range of eco-geographical conditions around the Mediterranean Basin (local climates and niches at similar latitudes) (Fig. 1, S4 Fig.); or ii) gene flow among barley genepools [31]. As expected for barley, the row type of the ear was an important determinant of the population structure [28], [78] and six groups comprised majorly either two-rowed (G7, G8) or six-rowed (G2, G3, G6, G10) types, respectively (Table 3). Karyopsis type defined G1 and G9 as they included only naked types. Adaptation to local climates is defined e.g. through G4 (from Georgia) and G5 (from Ethiopia) containing both row types. However, within groups G1, G4, G5 and G9 subclusters can largely be explained by the row type of the ear.

Eco-geographical factors and spatial genetics

Detailed knowledge of environmental parameters at a certain collection site can help determining the role of relevant climatic factors influencing the genetic differentiation and adaptation of genotypes to their environments. Furthermore, this knowledge might help selecting most suitable parental lines for population development and breeding programs. The distribution of accessions was found rather along the latitudes, meaning a wider W-E than N-S window. Just Ethiopia is the southern-most sampling area, i.e. stretching the latitudinal direction the most. In general, climatologically, climate zones are determined by i) incoming solar radiation (more at equator, less at poles); ii) b) altitude (higher elevation will yield similar climate zones as usually on higher latitudes); and iii) proximity to the sea yielding maritime vs. continental climates. Thus the general pattern of climate zones (as the word “zonal” means in this context) is band along the latitudes. The distribution pattern of Structure inferred groups probably indicate a preferred distribution path rather zonal (W-E) than meridional (N-S), i.e. higher correlation with longitude and indeed this was observed for Lrc1485 groups. As domesticated barley spread meridionally from the Fertile Crescent to north-western Europe, the crop encountered considerable ecological and environmental change. Natural mutation, selection and enrichment of favorable alleles at key loci such as Ppd-H1 [79], HvCEN [80] or vernalization-related genes VRN-H1 [81], [82], VRN-H2 [83] and VRN-H3 (also known as HvFT1, [84]) contributed to successful environmental adaptation and range extension in barley. However, based on the IPK genebank information system, we selected only spring barleys, which flowered at IPK without the need for any cold period to promote flowering. In Central Europe (such as at IPK), spring types are sown between early March and end of April depending on the weather conditions every year [68]. Thus, vernalization-or frost tolerance-related loci should be less relevant for the adaptation of spring barleys to their environments. However, as shown by [85], the Genobar spring barley collection, harbored 8 haplotypes at VRN-H3. Genotyping the same set of 224 accessions at VRN-H2 suggested the presence of recessive alleles at VRN-H2 (Kilian et al. unpublished), observed as deletions of a cluster of up to three ZCCT family genes, which contribute to the spring growth habit [86]. Thus, ideally, genotypes should be characterized at molecular and phenotypic levels, before assigning them to a specific growth habit (i.e. winter, facultative, spring). However, in the genebank context this has not been achieved for most of the collections yet. Therefore, we expect some facultative types within Lrc1485. As already shown partly for the Genobar collection, which harbors six haplotypes at PpdH1 [85] and six out of seven haplotypes detected in the domesticated genepool at HvCEN studied by [80], we also expect a remarkable number of haplotypes at key genes responsible for environmental adaptation within Lrc648. Thus, we suggest using the legacy CRS Lrc648 or their two subpanels in particular for allele mining and gene discovery studies. Although altitude must be considered as an important factor influencing the genetic diversity distribution [87], this factor was not considered in our studies because precise geographic coordinates were not available for all landraces. The initial Köppen analysis presented here does not give a very clear picture due to outliers and thus a rather high within-group-variability (S6 Table). However for some groups a particular climate applies. Regarding climate change one could investigate Köppen maps based on future climate projections and check where climate zones suitable for barley cultivation will be located in the future. Modelling crop performance under changing climates will help guiding the breeding programs to the expected future needs [88]–[90].

New insights into barley evolution: two examples

With our genetic analysis in 1485 barley landraces, new insights into barley domestication history can be obtained. In total, 299 accessions from Ethiopia were genotyped (Table 1), thus providing probably the largest SSR data set generated for Ethiopian landraces so far (S1 Table, S2 Table). Overall, Ethiopian barleys were apparently found to be distinct from all other groups (S3 Table, S4 Table), which is in line with previous studies [87], [91]–[93]. Different evolutionary forces (environment) and domestication histories (e.g. agricultural practices, cultural preferences of human tribes) in the Ethiopian highlands compared to the Fertile Crescent might be reasons for this distinctness [5], [94]. At K = 10, 277 Ethiopian landraces were assigned to either of two groups comprising naked (G1, 194 lines) and hulled types (G5, 83 lines). Both groups were further sub-structured according to their row type of the ear (S5 Fig.). Although morphologically diverse, Ethiopian hulled barleys harboured a relatively low level of nucleotide diversity as indicated by the lowest gene diversity value, the second lowest level for allelic richness and only five group specific rare alleles. These results provide further evidence that Ethiopian barleys went through a major genetic bottleneck followed by adaptation to climatic (e.g. rainfall patterns, altitude) and edaphic conditions in the Abyssinian highlands (Table 4, S6 Table) [87], [95]–[98]. It is interesting that the early flowering haplotype IV at HvCEN (which derived from major haplotype II) predominated in genotypes assigned to G5 (89%) [80]; S1 Table). Furthermore, based on our preliminary data set at PpdH1 (Sharma et al. unpublished), most lines from G5 carry insensitive alleles, thus in combination with haplotype IV at HvCEN, providing favorable alleles for the two growing seasons of barley in Ethiopia - Meher and Belg [87]. Interestingly, Ethiopian naked barleys (G1) harbored 57 group specific alleles (second largest number found) and higher value for allelic richness compared to G9. Pairwise comparison of Fst values between Structure inferred groups showed that G1 is most closely related to G5 (0.35) supporting a common origin of Ethiopian barleys. However, G1 and G9 (the two naked groups) are genetically less related (0.49), while G9 is closer connected to G7 (Fst 0.29). Thus, our data suggest at least two evolutionary lineages of naked barleys, both of which probably originated in the eastern Fertile Crescent from a monophyletic natural mutation (17 kb deletion harboring an ethylene response factor (ERF) family transcription factor gene) at the nud locus on chromosome 7H [99]–[101]. However, resequencing larger germplasm sets at the nud locus are required to test this hypothesis and to shed more light on the origin of naked barley. Interestingly, outside Ethiopia, two-rowed naked landraces were mainly sampled from Iraq and Iran, while six-rowed naked types were collected further east (Afghanistan) [102], [103]. Interestingly, the second highest variation explained among groups by AMOVA was found between all Ethiopian naked vs. all non-Ethiopian naked barleys (32.02%) (and thus supporting two evolutionary lineages for naked barley), while all Ethiopian hulled vs. all non-Ethiopian hulled barleys explained only 18.50% (S4 Table). AMOVA of all hulled vs. all naked types explained 16.17% of variation, which is nearly two-fold higher compared to the variation between all two-rowed vs. all six-rowed types (8.87%). Also AMOVA of all hulled Ethiopian vs. all naked Ethiopian types (26.69) compared to all two-rowed Ethiopian vs. all six-rowed Ethiopian landraces (6.38) suggested that hulled and naked genepools are genetically more distant than the two-rowed and six-rowed clusters. Hulled and naked types probably evolved largely independent under cultivation and were domesticated for different end-use qualities [35].

Conclusions

The Lrc1485 harbors great genetic diversity. However, the collection size of 1485 genotypes is not manageable in most phenotypic studies, and smaller panels are needed. The legacy CRS Lrc648 established here is best suited for multi-environmental field testing under various climates. However, to work with even more manageable sets, we suggest dividing the Lrc648 into two-rowed and six-rowed subpanels, depending on the trait of interest (Table 5). Increased marker coverage [80] and precise phenotypic data for Lrc648 will help to identify candidate genes also for agronomic and adaptation-related traits using GWAS [27]. Re-sequencing candidate genes [104], [105] or genomic regions underlying quantitative traits using next generation sequencing (NGS) approaches [106]–[109] can be applied for Lrc648. Seeds of the Lrc648 can be requested in small quantities from IPK. Seed delivery just awaits the Standard Material Transfer Agreement (SMTA) procedure. Distribution of 45 SSR markers used across the seven linkage groups of barley. (TIF) Click here for additional data file. Scatter plot of 1485 barley landraces based on Principal Component Analysis calculated from 42 SSR data. a) for K = 4, b) for K = 10. Colours correspond to the different Structure inferred groups. (TIF) Click here for additional data file. Geographical distribution of 1485 landraces over Köppen climate zones according to S  = 4. Each group (G1-G4) and admixed types were separately plotted. (a) G1; (b) G2; (c) G3; (d) G4; (e) admixed types. Climate abbreviations are explained in S6 Table. (TIF) Click here for additional data file. Geographical distribution of 1485 landraces over various Köppen climate zones according to S  = 10. Each group (G1-G10) and admixed types were separately plotted. (a) G1; (b) G2; (c) G3; (d) G4; (e) G5; (f) G6; (g) G7; (h) G8; (i) G9; (j) G10; (k) admixed types. Abbreviations are explained in S6 Table. (TIF) Click here for additional data file. Individual PCA's for each S  = 10. Each plot represents a single group: (a) G1, (b) G2, (c) G3, (d) G4, (e) G5, (f) G6, (g) G7, (h) G8, (i) G9, (j) G10. Blue circles indicate two-rowed and red circles six-rowed barleys. (TIF) Click here for additional data file. Correlograms showing spatial genetic autocorrelation patterns among: (a) genetic distance and geographical distance; (b) genetic distance and longitude; (c) genetic distance and latitude; (d) genetic distance and annual mean temperature; (e) genetic distance and mean diurnal range; (f) genetic distance and mean temperature of warmest quarter; (g) genetic distance and annual precipitation. The x-axis represents distinct classes and the y-axis represents the mantel r values. (TIF) Click here for additional data file. Comparing the sampling efficiency based on M. Average diversity score calculated based on allelic richness was plotted against the sample size. Red circles indicate scores of the core collection using the M strategy and blue circles indicate scores of randomly selected accessions. (TIF) Click here for additional data file. Scatter plot of L Landraces selected for Lrc648 are indicated in red colour. (TIF) Click here for additional data file. Linkage Disequilibrium (LD) display of 1485 barley landraces using 42 SSR markers. LD was calculated in TASSEL 2.1(www.maizegenetics.net/tassel) using 1000 permutations. Markers are arranged according to the genetic positions on barley genome (see S1 Fig.). (TIF) Click here for additional data file. Evolutionary relationships of 1485 barley landraces I. The evolutionary history was inferred by a) a Neighbor-Joining tree, and b) a Neighbor-Joining strict consensus tree computed in SplitsTree software. (TIF) Click here for additional data file. Evolutionary relationships of 1485 barley landraces II. The Neighbor-Net planar graph of uncorrected p-distances visualizes the high amount of reticulation in the collection (Taxa = 1485; Chars = 372; Fit = 90,615; Splits = 4604). (TIF) Click here for additional data file. The L a) Details of accession names, their taxonomical designations, row type and kernel coverage, collection sites, collection missions and all other information available from IPK genebank documentation System (GEBIS); and b) Structure assignments to groups K = 2 to K = 20, phenotypic data for Lrc648 and haplotype information for HvCEN obtained from [74], if available are given. Accessions selected for the CRS Lrc648 are indicated. Six accessions were excluded from further analysis due to large extent of missing values. NS – not selected for core group of 648 genotypes; *lost during single seed descent and multiplication. (XLSX) Click here for additional data file. Final SSR data set for the whole collection of 1491 barley landraces investigated. Fragment sizes and 0/1 matrix are provided. (XLSX) Click here for additional data file. Pairwise comparison of F for K = 4 and b) for K = 10. Significance of P-values computed after 1000 permutations are represented above diagonal and the Fst values are presented below. (DOCX) Click here for additional data file. Analysis of Molecular Variance (AMOVA). Summary of partitioning of genetic variation among and within different groups of Lrc1485. (DOCX) Click here for additional data file. Mantel correlogram tables. a) Mantel correlogram tables between: genetic distance and geographic distance; b) genetic distance and longitude difference matrix; c) genetic distance and latitude difference matrix; d) genetic distance and annual mean temperature; e) genetic distance and mean diurnal range; f) genetic distance and temperature of warmest quarter; and g) genetic distance and annual precipitation. 1different distance classes are shown; 2lower and upper boundary values for each class; 3number of pairs for which the correlation was calculated within each distance class; 4the mantle correlation for each class; 5the significance of mantel correlation for each class. (DOCX) Click here for additional data file. Overview of Köppen climates prevailing in S Variety - number of different climates within the group; *the few geographical outliers were not removed. (XLSX) (XLSX) Click here for additional data file. Comparison of diversity statistics for different sample sizes of core groups generated from 1485 accessions as well as L N - number of accessions; AN - average allele number; GD - gene diversity; PIC - polymorphism information content; MAF - average major allele frequency. (DOCX) Click here for additional data file. Comparison of phenotypic diversity between the core set L Min. - minimum; Max. – maximum; SD. - standard deviation of the measured traits heading date (Hd) (in days to flowering), spike length (Sl) (in cm) and plant height (Ht) (cm). (DOCX) Click here for additional data file.
  69 in total

1.  Patterns of polymorphism detected in the chloroplast and nuclear genomes of barley landraces sampled from Syria and Jordan.

Authors:  J R Russell; A Booth; J D Fuller; M Baum; S Ceccarelli; S Grando; W Powell
Journal:  Theor Appl Genet       Date:  2003-07-03       Impact factor: 5.699

2.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.

Authors:  Daniel Falush; Matthew Stephens; Jonathan K Pritchard
Journal:  Genetics       Date:  2003-08       Impact factor: 4.562

3.  The Horn of Africa as a centre of barley diversification and a potential domestication site.

Authors:  Jihad Orabi; Gunter Backes; Asmelash Wolday; Amor Yahyaoui; Ahmed Jahoor
Journal:  Theor Appl Genet       Date:  2007-02-06       Impact factor: 5.699

Review 4.  Seed banks and molecular maps: unlocking genetic potential from the wild.

Authors:  S D Tanksley; S R McCouch
Journal:  Science       Date:  1997-08-22       Impact factor: 47.728

5.  Analysis of >1000 single nucleotide polymorphisms in geographically matched samples of landrace and wild barley indicates secondary contact and chromosome-level differences in diversity around domestication genes.

Authors:  Joanne Russell; Ian K Dawson; Andrew J Flavell; Brian Steffenson; Eva Weltzien; Allan Booth; Salvatore Ceccarelli; Stefania Grando; Robbie Waugh
Journal:  New Phytol       Date:  2011-03-28       Impact factor: 10.151

6.  A physical, genetic and functional sequence assembly of the barley genome.

Authors:  Klaus F X Mayer; Robbie Waugh; John W S Brown; Alan Schulman; Peter Langridge; Matthias Platzer; Geoffrey B Fincher; Gary J Muehlbauer; Kazuhiro Sato; Timothy J Close; Roger P Wise; Nils Stein
Journal:  Nature       Date:  2012-10-17       Impact factor: 49.962

7.  A barley cultivation-associated polymorphism conveys resistance to powdery mildew.

Authors:  Pietro Piffanelli; Luke Ramsay; Robbie Waugh; Abdellah Benabdelmouna; Angélique D'Hont; Karin Hollricher; Jørgen Helms Jørgensen; Paul Schulze-Lefert; Ralph Panstruga
Journal:  Nature       Date:  2004-08-19       Impact factor: 49.962

Review 8.  Construction of a genetic linkage map in man using restriction fragment length polymorphisms.

Authors:  D Botstein; R L White; M Skolnick; R W Davis
Journal:  Am J Hum Genet       Date:  1980-05       Impact factor: 11.025

9.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

10.  Evolutionary history of wild barley (Hordeum vulgare subsp. spontaneum) analyzed using multilocus sequence data and paleodistribution modeling.

Authors:  Sabine S Jakob; Dennis Rödder; Jan O Engler; Salar Shaaf; Hakan Ozkan; Frank R Blattner; Benjamin Kilian
Journal:  Genome Biol Evol       Date:  2014-03       Impact factor: 3.416

View more
  12 in total

1.  Exome sequencing of geographically diverse barley landraces and wild relatives gives insights into environmental adaptation.

Authors:  Joanne Russell; Martin Mascher; Ian K Dawson; Stylianos Kyriakidis; Cristiane Calixto; Fabian Freund; Micha Bayer; Iain Milne; Tony Marshall-Griffiths; Shane Heinen; Anna Hofstad; Rajiv Sharma; Axel Himmelbach; Manuela Knauft; Maarten van Zonneveld; John W S Brown; Karl Schmid; Benjamin Kilian; Gary J Muehlbauer; Nils Stein; Robbie Waugh
Journal:  Nat Genet       Date:  2016-07-18       Impact factor: 38.330

2.  Detection and validation of genomic regions associated with resistance to rust diseases in a worldwide hexaploid wheat landrace collection using BayesR and mixed linear model approaches.

Authors:  Raj K Pasam; Urmil Bansal; Hans D Daetwyler; Kerrie L Forrest; Debbie Wong; Joanna Petkowski; Nicholas Willey; Mandeep Randhawa; Mumta Chhetri; Hanif Miah; Josquin Tibbits; Harbans Bariana; Matthew J Hayden
Journal:  Theor Appl Genet       Date:  2017-03-02       Impact factor: 5.699

3.  Genome-Wide Association Study of Seed Dormancy and the Genomic Consequences of Improvement Footprints in Rice (Oryza sativa L.).

Authors:  Qing Lu; Xiaojun Niu; Mengchen Zhang; Caihong Wang; Qun Xu; Yue Feng; Yaolong Yang; Shan Wang; Xiaoping Yuan; Hanyong Yu; Yiping Wang; Xiaoping Chen; Xuanqiang Liang; Xinghua Wei
Journal:  Front Plant Sci       Date:  2018-01-05       Impact factor: 5.753

Review 4.  Assessing and Exploiting Functional Diversity in Germplasm Pools to Enhance Abiotic Stress Adaptation and Yield in Cereals and Food Legumes.

Authors:  Sangam L Dwivedi; Armin Scheben; David Edwards; Charles Spillane; Rodomiro Ortiz
Journal:  Front Plant Sci       Date:  2017-08-29       Impact factor: 5.753

5.  Deciphering Genomic Regions for High Grain Iron and Zinc Content Using Association Mapping in Pearl Millet.

Authors:  N Anuradha; C Tara Satyavathi; C Bharadwaj; T Nepolean; S Mukesh Sankar; Sumer P Singh; Mahesh C Meena; Tripti Singhal; Rakesh K Srivastava
Journal:  Front Plant Sci       Date:  2017-05-01       Impact factor: 5.753

6.  Geographical and environmental determinants of the genetic structure of wild barley in southeastern Anatolia.

Authors:  Reza Pournosrat; Selma Kaya; Salar Shaaf; Benjamin Kilian; Hakan Ozkan
Journal:  PLoS One       Date:  2018-02-08       Impact factor: 3.240

7.  Landraces of snake melon, an ancient Middle Eastern crop, reveal extensive morphological and DNA diversity for potential genetic improvement.

Authors:  Samer Omari; Yuri Kamenir; Jennifer I C Benichou; Sarah Pariente; Hanan Sela; Rafael Perl-Treves
Journal:  BMC Genet       Date:  2018-05-23       Impact factor: 2.797

8.  A tiered approach to genome-wide association analysis for the adherence of hulls to the caryopsis of barley seeds reveals footprints of selection.

Authors:  Celestine Wabila; Kerstin Neumann; Benjamin Kilian; Volodymyr Radchuk; Andreas Graner
Journal:  BMC Plant Biol       Date:  2019-03-06       Impact factor: 4.215

9.  The influence of breeding history, origin and growth type on population structure of barley as revealed by SSR markers.

Authors:  Seyyed Abolghasem Mohammadi; Nayyer Abdollahi Sisi; Behzad Sadeghzadeh
Journal:  Sci Rep       Date:  2020-11-05       Impact factor: 4.379

Review 10.  Importance of Landraces in Cereal Breeding for Stress Tolerance.

Authors:  Daniela Marone; Maria A Russo; Antonia Mores; Donatella B M Ficco; Giovanni Laidò; Anna M Mastrangelo; Grazia M Borrelli
Journal:  Plants (Basel)       Date:  2021-06-22
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.