| Literature DB >> 33630231 |
John Ocampo1,2, Tatiana Ovalle1, Ricardo Labarta1, Dung Phuong Le1, Stefan de Haan1, Nguyen Anh Vu3, Le Quy Kha4, Luis A Becerra Lopez-Lavalle5.
Abstract
KEY MESSAGE: A molecular analysis using informative SNP markers in 1570 clones of cassava from Vietnam reveals varietal composition from farmers' field and genebank collections Cassava is the most important smallholder cash crops in Southeast Asia and is especially used in industrial products. Yet, systematic genetic studies on molecular markers from Vietnamese germplasm have not been considered for breeding and conservation programs. We conducted a molecular analysis of 1570 clones of cassava germplasm from farms across six agro-ecological zones using informative SNP markers. We unraveled the genetic diversity and population structure and provided insights into the value of breeding and conservation programs. Duplicated genotypes comprised 98% of the total sample of the Central Highlands region. Ninety-six SNPs were amplified Central Highlands and South East provinces had the highest allelic richness, covering up to 83% of alleles. The average observed heterozygosity (Ho = 0.43) was slightly higher than expected (He = 0.40) across SNP markers, suggesting an excess of heterozygotes plants. Diversity indexes indicated that cassava populations from North West and Eastern Vietnam are genetically diverse (mean He = 0.40). Genetic parentage tests identified 85 unique genetic groups within the varieties KM94, KM419, BRA1305, KM101, KM140, PER262, KM60, KM57 and two unidentified varieties, which accounted for 82% of the frequency distribution. KM94 is the most dominant variety in Vietnamese farms surveyed (38%), reflecting its superior quality and productivity. Discriminant analysis of principal components (DAPC) revealed four main subgroups, which were partially corroborated by neighbor joining (NJ) analyses. After removing duplicates, 31 unique genotypes were distributed across five of the agro-ecological zones. These were well distributed in the subgroups revealed via DAPC and NJ analyses. The genetic groups identified herein could be used to select unique accessions that should ideally conform with ex situ germplasm collections and identify areas where on-farm conservation programs should be targeted. Newly identified genotypes may also contribute as genetic breeding resources that could be used to adapt cassava to future changes and farmers' needs.Entities:
Keywords: Cassava varieties; Genetic variability; Germplasm; SNP markers
Mesh:
Year: 2021 PMID: 33630231 PMCID: PMC9162981 DOI: 10.1007/s11103-021-01124-0
Source DB: PubMed Journal: Plant Mol Biol ISSN: 0167-4412 Impact factor: 4.335
Fig. 1Dot map distribution of cassava plantations across different provinces and agro-ecological zones in Vietnam
Fig. 2Pedigree of cassava varieties released in Thailand and Vietnam
Fig. 3Numbers of varietal names recorded across 1570 genotypes collected in a survey of farms
Fig. 4Frequency of heterozygosity observed across 96 SNPs markers selected (a) and from a set of 85 different cassava genetic groups identified across 1570 clones’ evaluated (b)
Genetic diversity of cassava in six agro-ecological zones, as estimated by the single nucleotide polymorphism (SNP) analysis
| Agoecologycal zone | Genotypes (n) | Farmer variety names (n) | % Duplicate genotypes | No. Heterozygous indivudials | |||
|---|---|---|---|---|---|---|---|
| North West | 209 | 16 | 12.0 | 89 | 0.43 | 0.40 | 0.31 |
| North East | 256 | 18 | 16.0 | 112 | 0.44 | 0.40 | 0.31 |
| North Central Coast | 218 | 31 | 13.0 | 100 | 0.46 | 0.36 | 0.29 |
| Central Highlands | 419 | 32 | 24.5 | 180 | 0.43 | 0.36 | 0.29 |
| South Central Coast | 190 | 29 | 12.0 | 83 | 0.44 | 0.30 | 0.27 |
| South East | 278 | 35 | 17.0 | 121 | 0.44 | 0.38 | 0.30 |
| Whole sample | 1570 | 97 | 98.0 | 685 | 0.43 | 0.40 | 0.30 |
Fig. 5Spatial distribution of cassava plantations modeled based on allele richness; red areas indicate high concentrations of alleles with single nucleotide polymorphisms (SNPs; 63.73% to 82.94%)
Fig. 6Spatial frequency distribution of the ten main cassava varieties identified in farmer’s fields across 92 villages in mainland Vietnam
Varieties most predominant in Vietnam and their Institutional or Genebank names in Asia and Colombia (CIAT)
| Variety | Frequency (%) | Names given by farmers | No. samples | Status | Year of registration or release | Pedigree | Relationship first degree | Traits of interest |
|---|---|---|---|---|---|---|---|---|
| KM94 | 37.7 | 40 | 592 | Improved | 1995 | Rayong 1 × Rayong 90 | G44, Unique sample (2738) | High yield and starch content, highly branched |
| KM419 | 9.8 | 16 | 154 | Improved | 2016 | BKA900 x KM318 | IND59, Dòng 1 | High yield |
| BRA1305 | 7.0 | 8 | 111 | Landrace | Unknown | Unknown | Unique sample (346) | Suitable for fresh consumption |
| KM101 | 6.3 | 16 | 99 | Improved | 2017 | Rayong 1 × CMR 23-149-128 | Uniques samples (1864,397) | High yield and starch content |
| KM140 | 5.8 | 18 | 90 | Improved | 2007—2009 | KM146 × KM98-1 | Undetermined | High yield, low cyanogenic content, suitable for fresh consumption, best harvesting time |
| G44 | 4.0 | 11 | 64 | Unknown | Unknown | Unknown | KM94 | High yield and starch content |
| PER262 | 3.4 | 10 | 54 | Landrace | Unknown | Unknown | KM57, Unique sample (473) | Suitable for fresh consumption |
| G7 | 3.2 | 6 | 51 | Unknown | Unknown | Unknown | BRA1305, PER308 | Suitable for fresh consumption |
| KM60 | 2.5 | 11 | 40 | Improved | 1995 | Rayong 60 × SM937-26 | Unique sample (2636), G49, COL1684 | High yield, good root shape, early harvest, yellow flesh |
| KM57 | 2.3 | 9 | 37 | Landrace | Unknown | Unknown | PER262 | Suitable for fresh consumption |
Fig. 7Scatterplots of discriminant analysis of principal components (DAPC) of a set of 85 varieties representing maximum variability
Genetic distances for all samples (n = 1570) by agro-ecological zone following Nei (1973)
| Agro-ecological zone | North West | North East | North Central Coast | Central Highlands | South Central Coast |
|---|---|---|---|---|---|
| North East | 0.0050 | ||||
| North Central Coast | 0.0242 | 0.0258 | |||
| Central Highlands | 0.0200 | 0.0224 | 0.0086 | ||
| South Central Coast | 0.0455 | 0.0445 | 0.0163 | 0.0152 | |
| South East | 0.0277 | 0.0287 | 0.0269 | 0.0218 | 0.0375 |
Fig. 8Cluster analysis of a portfolio of 85 varieties constructed with the neighbor joining (NJ) method using shared alleles to define genetic distances; differing internal colors represent DAPC groups
Set of 31 unique genotypes of cassava identified according to single nucleotide variants
Different colors are indicating four subgroups identified by DAPC and NJ analyses