| Literature DB >> 20502861 |
Marília Lobo Burle1, Jaime Roberto Fonseca, James A Kami, Paul Gepts.
Abstract
Brazil is the largest producer and consumer of common bean (Phaseolus vulgaris L.), which is the most important source of human dietary protein in that country. This study assessed the genetic diversity and the structure of a sample of 279 geo-referenced common bean landraces from Brazil, using molecular markers. Sixty-seven microsatellite markers spread over the 11 linkage groups of the common bean genome, as well as Phaseolin, PvTFL1y, APA and four SCAR markers were used. As expected, the sample showed lower genetic diversity compared to the diversity in the primary center of diversification. Andean and Mesoamerican gene pools were both present but the latter gene pool was four times more frequent than the former. The two gene pools could be clearly distinguished; limited admixture was observed between these groups. The Mesoamerican group consisted of two sub-populations, with a high level of admixture between them leading to a large proportion of stabilized hybrids not observed in the centers of domestication. Thus, Brazil can be considered a secondary center of diversification of common bean. A high degree of genome-wide multilocus associations even among unlinked loci was observed, confirming the high level of structure in the sample and suggesting that association mapping should be conducted in separate Andean and Mesoamerican Brazilian samples.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20502861 PMCID: PMC2940433 DOI: 10.1007/s00122-010-1350-5
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.699
Fig. 1Collecting sites of common bean landraces in Brazil
Genetic and mapping information for microsatellites and other markers used in this study
| Marker | Linkage group | Relative position within linkage group (cM) | Referencea | Type | SSR sequence | No. of alleles | Gene diversity |
| PIC |
|---|---|---|---|---|---|---|---|---|---|
| BMd45 | 1 | 0 | 1 | Gene-based | (AG)5 | 5 | 0.43 | 0 | 0.34 |
| PVag003 | 1 | 146 | 4 | Gene-based | (AG)8 | 3 | 0.54 | 0 | 0.43 |
| BMd10 | 1 | 152 | 1 | Gene-based | (GA)8 | 5 | 0.55 | 0.012 | 0.46 |
| PVBR139 | 1 | 174 | 3 | Genomic | (AG)7 | 2 | 0.21 | 0.004 | 0.18 |
| BM157 | 1 | NA | 2 | Genomic | (GA)16 | 5 | 0.35 | 0 | 0.33 |
| BM146 | 1 | NA | 2 | Genomic | (CTGTTG)4(CTG)4(TTG)3(CTG)3(CTG)4 | 4 | 0.17 | 0 | 0.15 |
| PvTFL1y | 1 | NA | 5 | Not applicable | 2 | 0.23 | 0 | 0.21 | |
| BM156 | 2 | 0 | 2 | Genomic | (CT)32 | 8 | 0.43 | 0.004 | 0.41 |
| PVBR243 | 2 | 14 | 3 | Genomic | (CT)20 | 4 | 0.42 | 0 | 0.37 |
| GATS91 | 2 | 21 | 2 | Genomic | (GA)17 | 16 | 0.85 | 0.004 | 0.83 |
| PVgccacc001 | 2 | 37 | 4 | Gene-based | (GCCACC)5 | 3 | 0.64 | 0 | 0.56 |
| BM143 | 2 | 53 | 2 | Genomic | (GA)35 | 14 | 0.81 | 0.019 | 0.79 |
| BM139 | 2 | 101 | 2 | Genomic | (CT)25 | 8 | 0.38 | 0 | 0.36 |
| PVBR106 | 2 | NA | 3 | Genomic | (CTT)2(CT)5 | 5 | 0.35 | 0 | 0.3 |
| PVBR125 | 2 | NA | 3 | Genomic | (GA)6A(GA)3 | 4 | 0.55 | 0.007 | 0.49 |
| SW13 | 2 | NA | 6 | Not applicable | 2 | 0.45 | 0 | 0.35 | |
| AG1 | 3 | 0 | 2 | Genomic | (GA)8GGTA(GA)5GGGGACG(AG)4 | 3 | 0.09 | 0.004 | 0.09 |
| BMd1 | 3 | 8 | 1 | Gene-based | (AT)9 | 6 | 0.62 | 0 | 0.56 |
| PVat008 | 3 | 8 | 4 | Gene-based | (AT)9 | 12 | 0.71 | 0 | 0.67 |
| BM159 | 3 | 41 | 2 | Genomic | (CT)9(CA)8 | 5 | 0.42 | 0 | 0.38 |
| BMd36 | 3 | 72 | 1 | Genomic | (TA)8 | 7 | 0.74 | 0.011 | 0.7 |
| BM172 | 3 | 96 | 2 | Genomic | (GA)23 | 8 | 0.37 | 0 | 0.35 |
| PVBR131 | 3 | NA | 3 | Genomic | (TGA)5 | 3 | 0.47 | 0.004 | 0.39 |
| PVBR87 | 3 | NA | 3 | Genomic | (GA)16 | 11 | 0.76 | 0.004 | 0.73 |
| PVctt001 | 4 | 0 | 4 | Gene-based | (CTT)3(T)3(CTT)6 | 6 | 0.7 | 0.011 | 0.64 |
| PVat001 | 4 | 23 | 4 | Gene-based | (TA)22 | 37 | 0.96 | 0.008 | 0.96 |
| PVat003 | 4 | 49 | 4 | Gene-based | (AT)6 | 4 | 0.43 | 0 | 0.34 |
| BMd26 | 4 | 82 | 1 | Gene-based | (GAT)6 | 6 | 0.38 | 0.022 | 0.32 |
| PVatgc002 | 4 | 133 | 4 | Gene-based | (ATGC)4 | 5 | 0.58 | 0 | 0.54 |
| PVatct001 | 4 | NA | 4 | Gene-based | (ATCT)3 | 1 | 0 | 0 | 0 |
| PVBR112 | 4 | NA | 3 | Genomic | (TC)13 | 4 | 0.4 | 0 | 0.36 |
| PVBR182 | 4 | NA | 3 | Genomic | (AG)12 | 6 | 0.44 | 0.007 | 0.38 |
| SB12 | 4 | NA | 7 | Not applicable | 2 | 0.47 | 0 | 0.36 | |
| SF10 | 4 | NA | 8 | Not applicable | 2 | 0.4 | 0 | 0.32 | |
| APA | 4 | NA | 9 | Not applicable | 2 | 0.31 | 0 | 0.26 | |
| BMd53 | 5 | 0 | 1 | Gene-based | (GTA)5 | 5 | 0.36 | 0.004 | 0.31 |
| BM175 | 5 | 42 | 2 | Genomic | (AT)5(GA)19 | 8 | 0.41 | 0.004 | 0.38 |
| BMd20 | 5 | 72 | 1 | Gene-based | (TA)5 | 4 | 0.51 | 0.018 | 0.46 |
| BMd28a | 5 | 94 | 1 | Gene-based | (GT)4 | 2 | 0.15 | 0 | 0.14 |
| BMd28b | NA | 4 | 0.53 | 0.009 | 0.43 | ||||
| BMd12 | 6 | 0 | 1 | Genomic | (AGC)7 | 6 | 0.16 | 0.046 | 0.16 |
| PVBR163 | 6 | 16 | 3 | Genomic | (AG)16 | 15 | 0.84 | 0.009 | 0.82 |
| PVBR198 | 6 | 29 | 3 | Genomic | (AG)10 | 4 | 0.5 | 0.008 | 0.47 |
| ROC11 | 6 | NA | 10 | Not applicable | 2 | 0.36 | 0 | 0.3 | |
| BM160 | 7 | 0 | 2 | Genomic | (GA)15(GAA)5 | 7 | 0.4 | 0 | 0.37 |
| BM201 | 7 | 76 | 2 | Genomic | (GA)15 | 6 | 0.75 | 0.018 | 0.71 |
| BM210 | 7 | 80 | 2 | Genomic | (CT)15 | 13 | 0.75 | 0.012 | 0.71 |
| BMd40 | 7 | 90 | 1 | Genomic | (AT)6 | 7 | 0.6 | 0.004 | 0.55 |
| BM209 | 7 | 157 | 2 | Genomic | (TA)4(TG)16 | 5 | 0.6 | 0 | 0.52 |
| PVBR35 | 7 | NA | 3 | Genomic | (TC)10 | 8 | 0.54 | 0.018 | 0.51 |
| PVatcc001 | 7 | NA | 4 | Gene-based | (ATCC)3(AG)2(TAC)3 | 5 | 0.34 | 0.004 | 0.3 |
| PVatcc002 | 7 | NA | 4 | Gene-based | (ATCC)3(AG)2(TAC)3T(CTA)3 | 4 | 0.13 | 0.004 | 0.13 |
| PVatcc003 | 7 | NA | 4 | Gene-based | (ATCC)3 | 2 | 0.01 | 0 | 0.01 |
| Phaseolin | 7 | NA | 11 | Not applicable | 2 | 0.33 | 0 | 0.28 | |
| BMd25 | 8 | 0 | 1 | Gene-based | (GAT)6 | 2 | 0.26 | 0.004 | 0.22 |
| BM189 | 8 | 44 | 2 | Genomic | (CT)13 | 11 | 0.81 | 0.019 | 0.78 |
| BM151 | 8 | 84 | 2 | Genomic | (CT)14 | 6 | 0.67 | 0.015 | 0.62 |
| PVBR173 | 8 | NA | 3 | Genomic | (GA)21 | 1 | 0 | 0 | 0 |
| PVBR83 | 8 | NA | 3 | Genomic | (GA)9 | 5 | 0.47 | 0 | 0.37 |
| BM188 | 9 | 0 | 2 | Genomic | (CA)18(TA)7 | 2 | 0.4 | 0 | 0.32 |
| BM141 | 9 | 66 | 2 | Genomic | (GA)29 | 13 | 0.68 | 0.011 | 0.64 |
| BM114 | 9 | 103 | 2 | Genomic | (TA)8(GT)10 | 16 | 0.8 | 0.008 | 0.77 |
| PVat007 | 9 | NA | 4 | Gene-based | (AT)12 | 14 | 0.87 | 0.02 | 0.85 |
| PVBR60 | 9 | NA | 3 | Genomic | (CT)9 | 10 | 0.64 | 0 | 0.59 |
| GATS11B | 10 | 0 | 2 | Genomic | (CT)8 | 6 | 0.46 | 0.007 | 0.42 |
| BMd42 | 10 | 30 | 1 | Genomic | (AT)5 | 7 | 0.67 | 0.014 | 0.62 |
| PVBR181 | 10 | NA | 3 | Genomic | (AG)11 | 5 | 0.4 | 0.004 | 0.36 |
| PVBR185 | 10 | NA | 3 | Genomic | (TC)11 | 5 | 0.57 | 0 | 0.53 |
| BM212 | 10 | NA | 2 | Genomic | (CA)13 | 3 | 0.32 | 0 | 0.28 |
| BMd22 | 11 | 0 | 1 | Gene-based | (TC)6 | 5 | 0.55 | 0.008 | 0.46 |
| BMd33 | 11 | 7 | 1 | Genomic | (ATT)9 | 3 | 0.34 | 0 | 0.28 |
| BMd41 | 11 | 11 | 1 | Genomic | (ATT)9 | 6 | 0.37 | 0.008 | 0.34 |
| BM205b | 11 | 53 | 2 | Genomic | (GT)11 | 5 | 0.43 | 0.011 | 0.39 |
| PVag001 | 11 | 108 | 4 | Gene-based | (GA)11 | 4 | 0.33 | 0 | 0.28 |
| PVBR215 | 11 | NA | 3 | Genomic | (CT)6 | 2 | 0.28 | 0 | 0.24 |
| Mean | 6 | 0.46 | 0.005 | 0.42 | |||||
PIC polymorphism information content, NA not available
a1, Blair et al. (2003); 2, Gaitán-Solís et al. (2002); 3, Grisi et al. (2007); 4 Yu et al. (2000); 5, Kwak et al. (2009); 6, Melotto et al. (1996); 7, Mendez de Vigo et al. (2002); 8, Corrêa et al. (2000); 9, Kami et al. (2006); 10, Johnson et al. (1997); 11, Kami et al. (1995)
Fig. 2Structure bar plot of membership coefficients for all the accessions of common bean in the study sample sorted in the same order and classified according to successive selected preset K values ranging from 2 to 10. For K = 2 and K = 5, the groups are identified. G2 and G4: groups 2 and 4, respectively. Hybrid represents a large group of accessions resulting from hybridization mostly among Mesoamerican groups
Estimate of the number hybrid accessions between the Andean and Mesoamerican gene pools based on phaseolin and Structure analyses
| Status | Andean | Mesoamerican |
|---|---|---|
| Phaseolin analysisa | ||
| Non-hybrid | 53 | 216 |
| Hybrid | 6 | 6 |
|
| ||
| Non-hybrid | 58 | 221 |
| Hybrid | 1 | 1 |
aBased on the tandem repeat test of Kami et al. (1995)
bBased on a posterior membership coefficient threshold of 0.80 determined by the Structure 2.1 software (Pritchard et al. 2000)
Mean likelihoods of models and their standard deviations, Delta K for simulations for different K values, mean similarity coefficients and their standard deviations for different Structure simulations within each preset K
|
| Ln | Delta | Similarity coefficientb | ||
|---|---|---|---|---|---|
| Mean | Standard deviation | Mean | Standard deviation | ||
| 2 | −25559.83c | 62.91 | 208.16c | 1.00 | 0.00030 |
| 3 | −23300.03 | 23.07 | 89.66 | 0.99 | 0.00098 |
| 4 | −23108.59 | 1515.67 | 0.86 | 0.62 | 0.26332 |
| 5 | −21603.28 | 283.127 | 2.96 | 0.70 | 0.23525 |
| 6 | −20944.23 | 133.70 | 1.86 | 0.60 | 0.26913 |
| 7 | −20534.81 | 299.00 | 0.666 | 0.51 | 0.18273 |
| 8 | −20323.73 | 400.98 | 0.56 | 0.4 | 0.17342 |
| 9 | −19908.24 | 407.93 | 0.06 | 0.36 | 0.17416 |
| 10 | −19520.35 | 207.81 | NA | 0.43 | 0.16021 |
a20 iterations for each K level
bSimilarity coefficients between different runs (20) within each preset K, according to Nordborg et al. (2005)
cA different set of Structure simulations was run for the estimation of Delta K for K = 2; these simulations used the same parameters (admixture model, correlated allele frequencies, running length of 5,000 burn-in and 50,000 MCMC repetitions)
Fig. 3Neighbor-joining tree reconstructed for the Brazilian germplasm collection of landraces based on the C.S. Chord distances and all 73 polymorphic molecular markers. Branches are colored according to the Structure simulation for K = 3. 271: hybrid accession identified at K = 2 (posterior probability < 0.80); gray branches hybrid accessions identified at K = 3 (see Fig. 2 and text)
Counts for comparisons between pair of loci according to significance (P < 0.01) for genome-wide linkage disequilibrium test in the whole study sample and in Mesoamerican and Andean groups separately
| Whole study sample | Mesoamerican group | Andean group | ||||
|---|---|---|---|---|---|---|
| Counts | % | Counts | % | Counts | % | |
| In LD ( | 1,337 | 80 | 360 | 23 | 96 | 8 |
| No. of comparisons evaluated | 1,676 | 1,568 | 1,166 | |||
Fig. 4Magnitude of multilocus associations as measured by r 2 (a) and D′ (b) among Brazilian landraces. The abscissa represents the genetic distance expressed in cM for locus pairs within linkage groups (LG). Along the same axis, “Across LG” represents the r 2 and D′ values for locus pairs distributed among linkage groups