| Literature DB >> 27076998 |
Nidia Gutiérrez-López1, Isidro Ovando-Medina1, Miguel Salvador-Figueroa1, Francisco Molina-Freaner2, Carlos H Avendaño-Arrazate3, Alfredo Vázquez-Ovando1.
Abstract
Cacao trees have been cultivated in Mesoamerica for at least 4,000 years. In this study, we analyzed sequence variation in the chloroplast DNA trnH-psbA intergenic spacer from 28 cacao trees from different farms in the Soconusco region in southern Mexico. Genetic relationships were established by two analysis approaches based on geographic origin (five populations) and genetic origin (based on a previous study). We identified six polymorphic sites, including five insertion/deletion (indels) types and one transversion. The overall nucleotide diversity was low for both approaches (geographic = 0.0032 and genetic = 0.0038). Conversely, we obtained moderate to high haplotype diversity (0.66 and 0.80) with 10 and 12 haplotypes, respectively. The common haplotype (H1) for both networks included cacao trees from all geographic locations (geographic approach) and four genetic groups (genetic approach). This common haplotype (ancient) derived a set of intermediate haplotypes and singletons interconnected by one or two mutational steps, which suggested directional selection and event purification from the expansion of narrow populations. Cacao trees from Soconusco region were grouped into one cluster without any evidence of subclustering based on AMOVA (F ST = 0) and SAMOVA (F ST = 0.04393) results. One population (Mazatán) showed a high haplotype frequency; thus, this population could be considered an important reservoir of genetic material. The indels located in the trnH-psbA intergenic spacer of cacao trees could be useful as markers for the development of DNA barcoding.Entities:
Keywords: Chloroplast DNA; Haplotype; Indels; Nucleotide diversity; trnH-psbA
Year: 2016 PMID: 27076998 PMCID: PMC4830229 DOI: 10.7717/peerj.1855
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Geographic populations* and genetic classification of the analyzed Theobroma cacao trees.
For populations 1–5 (from farms in Soconusco, Mexico) genetic clustering was based on membership to the Criollo group (%) described by Vázquez-Ovando et al. (2014) using SSR markers. For the reference trees** (populations 6–9), the genetic grouping was suggested by Avendaño-Arrazate et al. (2010) and the database accessions (ICGD; Turnbull & Hadley, 2016).
| Pop* | Coordinates latitude (N)/longitude (W) | Criollo ( | Non-Criollo ( | Admixtures ( |
|---|---|---|---|---|
| 1 | 14°59′28″N, | TASG12 (93%) | HUJF01 (9%) | TASG16 (86%) |
| 92°26′44″W (Huehuetán) | TASG18 (95%) | HUJF03 (2%) | ||
| 14°52′55″N, | ||||
| 92°21′42″W (Tapachula) | ||||
| 2 | 14°56′41″N, | TCHR04 (98%) | CAAM12 (1%) | CAAM04 (53%) |
| 92°09′59″W (Tuxtla Chico) | ||||
| 14°59′53″N, | ||||
| 92°10′44″W (Cacahotán) | ||||
| 3 | 14°47′31″N, | FHSA06 (1%) | FHSA02 (36%) | |
| 92°11′11″W (Frontera Hidalgo) | SUED02 (2%) | |||
| 14°38′27″N, | SUED03 (1%) | |||
| 92°13′47″W (Suchiate) | SUED06 (1%) | |||
| 4 | 14°48′56″N, | MAMG12 (98%) | MAMG03 (2%) | MAMG10 (24%) |
| 92°29′06″W (Mazatán) | MAMG04 (1%) | |||
| MAMG07 (1%) | ||||
| MAMG08 (9%) | ||||
| 5 | 15°28′07″N, | MAJH02 (96%) | MAJH03 (63%) | |
| 92°48′42″W (Mapastepec) | VCHL01 (97%) | |||
| 15°10′31″N, | VCHL02 (96%) | |||
| 92°38′06″W (Villa Comaltitlán) | VCHL03 (97%) | |||
| 15°11′17″N, | VCHL04 (97%) | |||
| 92°36′55″W (Villa Comaltitlán) | VCLB02 (97%) | |||
| VCLB03 (98%) | ||||
| VCLB04 (98%) | ||||
| 6** | 20°32′29.25″N, | Yaxcabá | ||
| 88°50′35.82″W (Yucatán) | Xocen | |||
| 7** | 16°06′42.92″N, | Lacandón 06 | ||
| 90°56′31.28″W (Selva Lacandona) | Lacandón 28 | |||
| SL01 | ||||
| 8** | INIFAP (Several) | Lagarto | CATONGO | RIM 24 |
| Carmelo | EET 399 | |||
| 9** | Accessions (ICGD) | CRIOLLO 22 | SCA 6 (MIA 29885) | ICS 1 (TARS 16656) |
| AMELONADO (TARS 16542) | ICS 6 (TARS 16658) | |||
| MATINA 1/6 | ICS 39 (TARS 16664) |
Notes.
International Cocoa Germplasm Database
Tropical Agriculture Research Station
Instituto Nacional de Investigaciones Forestales, Agrícolas y Pecuarias
Nucleotide polymorphic sites and cpDNA haplotypes in cacao populations based on variation in the intergenic trnH-psbA spacer region.
| Haplotype | Polymorphic site | Populations (see | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 22 | 134 | 206 | 309 | 310 | 487 | Pop1 | Pop2 | Pop3 | Pop4 | Pop5 | Pop6 | Pop7 | Pop8 | Pop9 | |
| H1 | – | T | – | A | A | A | 3 | 2 | 3 | 2 | 5 | 1 | 3 | ||
| H2 | C | T | – | A | A | A | 1 | 1 | 2 | 1 | |||||
| H3 | C | T | A | – | – | – | 1 | 1 | |||||||
| H4 | – | A | – | A | A | A | 1 | ||||||||
| H5 | – | T | – | A | A | – | 1 | ||||||||
| H6 | C | T | – | A | – | A | 1 | ||||||||
| H7 | – | T | – | A | – | – | 1 | 1 | |||||||
| H8 | – | T | A | – | – | A | 1 | 1 | |||||||
| H9 | – | T | A | – | – | – | 1 | 1 | 1 | ||||||
| H10 | – | A | – | A | A | – | 1 | ||||||||
| H11 | – | T | – | A | – | A | 1 | 1 | |||||||
| H12 | – | T | A | A | – | – | 6 | ||||||||
Figure 1Location of indels (blue arrows) and the transversion (red arrow) in a sequenced fragment of the chloroplast DNA trnH-psbA intergenic spacer from Theobroma cacao trees.
See Table 1 for sample details.
Genetic diversity in cacaos from Soconusco (Chiapas, Mexico) grouped by the geographic approach (Pop) and genetic origin approach.
| Pop | Locality | ||||||
|---|---|---|---|---|---|---|---|
| 1 | Huehuetán, Tapachula | 5 | 2 | 1 | 3 | 0.70 ± 0.21 | 0.0019 ± 0.0017 |
| 2 | Cacahoatán, Tuxtla Chico | 3 | 2 | 1 | 2 | 0.67 ± 0.31 | 0.0026 ± 0.0026 |
| 3 | Frontera Hidalgo, Suchiate | 5 | 5 | 0 | 3 | 0.70 ± 0.21 | 0.0042 ± 0.0032 |
| 4 | Mazatán | 6 | 5 | 3 | 5 | 0.93 ± 0.12 | 0.0048 ± 0.0035 |
| 5 | Mapastepec, Villa Comaltitlán | 9 | 5 | 0 | 4 | 0.69 ± 0.14 | 0.0039 ± 0.0027 |
| 6 | Yucatán | 2 | 1 | 1 | 2 | 1.00 ± 0.50 | 0.0019 ± 0.0027 |
| 7 | Selva Lacandona | 3 | 0 | 0 | 1 | 0 | 0 |
| Total | 33 | – | 6 | – | |||
| Mean ± sd | 0.66 ± 0.08 | 0.0032 ± 0.0021 | |||||
| Genetic origin approach | |||||||
| “Criollo” | 12 | 6 | 1 | 4 | 0.64 ± 0.13 | 0.0025 ± 0.0019 | |
| “Non-Criollo” | 11 | 5 | 1 | 5 | 0.62 ± 0.16 | 0.0030 ± 0.0021 | |
| “Admixtures” | 5 | 5 | 1 | 5 | 1.00 ± 0.12 | 0.0060 ± 0.0041 | |
| Criollo-reference | 8 | 4 | 1 | 5 | 0.79 ± 0.15 | 0.0033 ± 0.0025 | |
| Forastero-reference | 5 | 3 | 0 | 3 | 0.80 ± 0.16 | 0.0031 ± 0.0025 | |
| Trinitario-reference | 4 | 4 | 0 | 2 | 0.50 ± 0.27 | 0.0038 ± 0.0032 | |
| Total | 45 | – | 4 | – | |||
| Mean ± sd | 0.80 ± 0.05 | 0.0038 ± 0.0024 | |||||
Notes.
Samples sizes
Number of segregating
Singletons
Number of haplotypes
Haplotype diversity
Nucleotide diversity
standard deviation
Including sequences GenBank (Criollo-reference n = 1, Forastero-reference n = 3, Trinitario-reference n = 3).
Classification based on membership (>90%) to Criollo type, see Table 1 (Vázquez-Ovando et al., 2014).
Figure 2Median joining network for chloroplast DNA trnH-psbA intergenic spacer haplotypes of Theobroma cacao trees from Soconusco, Mexico, and the outgroup haplotype (Theobroma bicolor).
The map indicates the geographic distribution of the haplotypes. The colored portions represent the proportions of the same haplotype occurring in each sampling locality. Trees employed as the references (Pop 6 and Pop 7) are shown outside the map. The population code and details are shown in Table 1.
Figure 3Median joining network for the chloroplast DNA trnH-psbA intergenic spacer haplotypes of Theobroma cacao trees cultivated in Soconusco, Mexico, and the reference accessions.
The circle sizes are proportional to the haplotype frequencies, and the color represents the proportions of the same haplotype occurring in each genetic group. For genetic group details, see Table 1.
Spatial analysis of molecular variance (K = 2) for cacao populations and the statistical analysis of molecular variance fixation indices corresponding to the groups.
| Source of variation | df | SS | VC | Variation (%) | Fixation indices | |
|---|---|---|---|---|---|---|
| Among groups | 1 | 1.61 | 0.1282 | 13.98 | 0.7341 | |
| Among populations within groups | 5 | 2.51 | −0.0879 | −9.59 | 0.0068 | |
| Within populations | 26 | 22.80 | 0.8765 | 95.61 | 0.1496 | |
| Total | 32 | 26.91 | 0.9168 |
Notes.
degrees of freedom
Sum of squares
Variance components