Literature DB >> 30697115

Assessing hidden parentage and genetic integrity of the "United Fruit Clones" of cacao (Theobroma cacao) from Costa Rica using SNP markers.

Allan Mata-Quirós1, Adriana Arciniegas-Leal1, Wilbert Phillips-Mora1, Lyndel W Meinhardt2, Lambert Motilal3, Sue Mischke2, Dapeng Zhang2.   

Abstract

The international cacao collection in CATIE, Costa Rica contains nearly 1200 accessions of cacao, mainly from the center of genetic diversity of this species. Among these accessions, the United Fruit clones (UF clones) were developed by the United Fruit Company in Costa Rica, and they represent one of the earliest groups of improved cacao germplasm in the world. Some of these UF clones have been used as key progenitors for breeding resistance/tolerance to Frosty Pod and Black Pod diseases in the Americas. Accurate information on the identity and background of these clones is important for their effective use in breeding. Using Single Nucleotide Polymorphism (SNP) markers, we genotyped 273 cacao germplasm accessions including 44 UF clones and 229 reference accessions. We verified the true-to-type identity of UF clones in the CATIE cacao collection and analyzed their population memberships using maximum-likelihood-based approaches. Three duplicate groups, representing approximately 30% of the UF clones, were identified. Both distance- and model-based clustering methods showed that the UF clones were mainly composed of Trinitario, ancient Nacional and hybrids between ancient Nacional and Amelonado. This result filled the information gap about the UF clones thus will improve their utilization for cacao breeding.

Entities:  

Keywords:  Amelonado; CATIE; Central America; Nacional; genebank; genetic diversity; tropical tree

Year:  2018        PMID: 30697115      PMCID: PMC6345235          DOI: 10.1270/jsbbs.18057

Source DB:  PubMed          Journal:  Breed Sci        ISSN: 1344-7610            Impact factor:   2.086


Introduction

Cacao (Theobroma cacao L.), the source of cocoa powder and cocoa butter used for chocolate, is a tropical forest species native to South America that is cultivated extensively in tropical regions. The cacao collection at CATIE (Centro Agronómico Tropical de Investigación y Enseñanza) is one of two international cacao germplasm collections in the world; the other is the “International Cocoa Genebank, Trinidad”, which is curated by the University of West Indies, St. Augustine, Trinidad and Tobago (CacaoNet 2012). The CATIE collection was initiated in 1944 to promote the exchange of germplasm of tropical crops. At the time this study was initiated, this collection maintained 1200 clones or accessions from Central America, Mexico, South America, the Caribbean, Asia, and Africa. Some of these accessions, such as the ancient Criollo, Amelonado and accessions from Brazil, are unique in this collection in terms of international distribution (Phillips-Mora ). The collection has also been an important source for resistance to Frosty Pod and Phytophthora Pod diseases. In 1978, the collection was catalogued by the International Board for Plant Genetic Resources (IBPGR, now Bioversity International) as one of the two “International Cacao Collections”. Since 2004, it has been under the auspices of the Food and Agriculture Organization and covered by an international treaty for the protection of plant genetic resources (CacaoNet 2012, Phillips-Mora et al. 2006). Although the CATIE collection has been characterized using microsatellite DNA markers, comprehensive assessment of genetic integrity and genetic diversity of different germplasm groups remains to be accomplished (Zhang ). Some information gaps need to be filled in order to improve the accuracy and efficiency of conservation and utilization of this international collection. Among the CATIE cacao germplasm holdings, there are several groups of improved breeding lines which were selected from earlier breeding activities in Costa Rica and other countries in Central America. One of them is the “United Fruit Clones” (UF clones), which were developed by the United Fruit Company in Costa Rica (Engels 1981, Johnson ). The United Fruit Company was formed in 1899 when the Boston Fruit Company merged with the Tropical Trading and Transport Company, then with several other companies that produced, imported and marketed bananas from the Caribbean islands, Central America and Colombia. The principal founder was Minor C. Keith (the banana king of Costa Rica), who developed banana plantations in Costa Rica beginning in the 1870s. The company started cocoa production in the early 20th century, hoping to replace financial losses resulting from problems in the banana industry of this country (Johnson , Keithan 1940). In 1907, cacao seedlings derived from pods introduced from Trinidad and Tobago were planted on the Caribbean coasts of Costa Rica and Panama. The pods from Trinidad were described as a Forastero Amelonado type having much larger pods and beans than the Amelonado cacao grown in Limon at that time. In 1936, after years of random hybridization and selection for pod size, bean size and yield, and after surveying thousands of trees in Limon, Costa Rica, advanced breeding lines were selected. The 12 best trees were clonally propagated and designated with UF (for United Fruit) numbers (Table 1). In mid-1940s, the United Fruit Company expanded the planting of cacao (and oil palm) as replacement crops on the west coast of Costa Rica, because of the widespread incidence of Panama disease on banana. Extensive evaluation was carried out in at the Quepos Los Rios farm, using planting materials introduced from Limon and other undocumented sources. An additional group of UF clones were selected after multi-location trials involving Almirante, Limon and Quepos. Today, a total of 44 UF clones are maintained at CATIE, Costa Rica in the “International Cocoa Collection” (IC3). Some of these clones, such as UF-12, UF-273 and UF-712 have been used as key progenitors for breeding new cacao varieties having resistance/ tolerance to Frosty Pod disease. This disease, caused by Moniliophthora roreri, occurs in most major cacao producing areas in the Western Hemisphere, but will potentially spread to the major cocoa producing regions in Asia and Africa (Evans 2007, Phillips-Mora and Wilkinson 2007).
Table 1

Summary list of 273 analyzed cacao accessions including UF clones, Trinitario clones and the ten reference germplasm groups

Population/groupOriginSample sizeProvider
United Fruit (UF)Costa Rica44CATIE, Costa Rica
NacionalEcuador20INIAP, Ecuador
CriolloPuerto Rico, Honduras, Nicaragua20SPCL, TARS, USDA
AmelonadoPuerto Rico, Honduras, Nicaragua20SPCL, TARS, USDA
LCT EEN (Curaray)Ecuador20INIAP, Ecuador
IMC (Iquitos)Peru20ICG, T, Trinidad
NanayPeru20ICG, T, Trinidad
ParinariPeru25ICG, T, Trinidad
Scavina/Ucayali (Contamana)Peru20ICG, T, Trinidad and ICT, Peru
PurusBrazil25SPCL, TARS, USDA
French GuianaFrench Guiana25CIRAD, France
TrinitarioTrinidad14ICG, T, Trinidad
So far, the utilization of the UF clones as progenitors outside the Americas (e.g. Southeast Asia and West Africa) has been minimal. The lack of information on genetic background of the UF clones is one of the reasons for their limited utilization in breeding. The UF clones have been characterized morphologically and genotyped using SSR markers and a field guide has been developed based on these works (Johnson ), which significantly improved our understanding about genetic diversity in this germplasm group. However, because at that time the diversity analysis was not conducted in the context of the entire cacao primary gene pool, the genetic background of the UF clones, in terms of their ancestry and relationships with other known germplasm groups, was not analyzed. It is believed that the UF clones were selected from MATINA (an Amelonado cacao variety in Costa Rica) material and Nacional type germplasm from Ecuador (Bartley 2005), but detailed information has not been available. The main objective of the present study was to verify the genetic identity and analyze the ancestry and genetic background of these breeding lines. The resultant information will be highly useful for improving the accuracy and efficiency of cacao genebank management in CATIE and will facilitate the further use of this germplasm in breeding new varieties with enhanced resistance to diseases, productivity and quality attributes.

Materials and Methods

Plant materials

The 44 UF Clones were sampled from the international cacao germplasm collection at CATIE, located in Turrialba in the province of Cartago, Costa Rica (Table 1). These trees were morphologically characterized in the 1970s (Engels 1981) based on a comprehensive list of morphological descriptors. Two examples of UF clones, showing different pod shapes were presented in Fig. 1. In addition to the 44 UF clones, 229 international clones were included in this experiment as references. These 229 reference clones represent 10 known germplasm groups of cultivated cacao, as classified by Motamayor . Genetic identities of these clones were determined through an international initiative of DNA fingerprinting of cacao (Zhang , 2009b) and various studies on cacao germplasm management (Cosme , Ji , Motilal ). The majority of the reference clones were maintained in the two international gene banks in Trinidad and Costa Rica (Motilal , Zhang , 2009b). These reference trees were sampled from the original collections maintained at Marper Farm and San Juan Estate in Trinidad, and Cabiria Farm at CATIE in Costa Rica. The rest of the reference clones were from the cacao samples collected by the USDA ARS Sustainable Perennial Crops laboratory from various national collections, including the Agricultural Research Institute (INIAP) of Ecuador and the Tropical Crop Institute (ICT) of Peru and Comissão Executiva do Plano da Lavoura Cacaueira and (CEPLAC) of Brazil. The summary list of these ten groups and UF clones were listed in Table 1.
Fig. 1

Matured pods (fruits) of two UF clones, showing the pod shape of Nacional hybrid (UF-613) and Trinitario (UF-677).

Two healthy young leaves were collected from each tree, and the samples were dried in silica gel and sent to the USDA Beltsville agricultural Research Center, Maryland, USA for genotyping. DNA was extracted from dried leaf samples with the DNeasy® Plant Mini Kit (Qiagen, Inc., Valencia, CA), which is based on the use of silica as an affinity matrix. The remainder of the extraction method followed manufacturer’s suggestions. DNA was eluted from the silica column with two washes of 50 μL Buffer AE, which were pooled, resulting in 100 μL DNA solution. Using a NanoDrop spectrophotometer (Thermo Scientific, Wilmington, DE), DNA concentration was determined by absorbance at 260 nm. DNA purity was estimated by the 260/280 ratio and the 260/230 ratio.

SNP markers and genotyping

Forty-eight SNP markers were selected from 1560 putative candidate SNPs based on cDNA sequences from a wide range of cacao organs (Allegre , Argout , Boccara, personal communication). The selection of SNPs was based on the level of polymorphism and their distribution across the ten chromosomes in cacao. The chosen 48 markers were used to design and manufacture primers for a SNPtypeTM genotyping panel by the Assay Design Group at Fluidigm Corporation (San Francisco, CA). The full list of the 48 SNPs and their flanking sequences are presented in Table 2. Genotyping was performed on the high-throughput Fluidigm EP1TM system, using the Fluidigm SNPtype Genotyping Reagent Kit according to the manufacturer’s instructions, and nanofluidic 48.48 Dynamic ArrayTM IFCs (Integrated Fluidic Circuit; Fluidigm Corp.). These chips automatically assemble PCR reactions, enabling simultaneous testing of up to 48 samples with 48 SNP markers. Fluorescent intensity was measured with the EP1TM reader and plotted in two axes. Genotypic calls were made using the Fluidigm SNP Genotyping Analysis program.
Table 2

Forty-eight SNPs and their flanking sequences used for genotyping cacao germplasm accessions in CATIE, Costa Rica

SNP codeChromosome numberSNPs and Flanking sequences
TcSNP259TTTATGTTACTAGTTTTTGTTGGGTGGTTGTGTTTTGTTTGCATTATTTTGTGGAC[C/G]GGACCTTAGTATACTGCTTGTTTGCTGTTTTTTGTAGTAGTGGGTTGTTAGT
TcSNP324TCACTTTTGAATATAGAAGATGGATGATATTGTGCAAAAATAATACCATAGT[A/T]CTAAGAAAATGCACATTTTTGTAAATAGGAAGAATGCCATGGTAATGTTGTGTTGA
TcSNP1398AAATAATACTAGTAACATTAGACCGATATTTATGTAGGGAGAAAGACATYTTGA[T/G]TTTGGCTTGGTGGTATGTTGATTTTTGTTGATACCAAAAAAGACCAGATG
TcSNP14410ACCGACACAGGATACCGTGAATGGGGAACTATGAAGGCAGCTTGCTATGTGTGGGA[A/C]ATTTGCACTTTGGGTCCAATAAGTTTAAATTACCCGACAGCTTATA
TcSNP1505ATACCCTGACGACTACTGCTGACTTGGGAACACCCACCAGCGTTTTGCCGGCAACA[T/G]GGTGACTGGTGATTTTCTAATTTTTATAGAACTTTGAAGCGTTTAAAATA
TcSNP1518TGAAGCTGGAGGGATTGTGGAGAGTGTTGGTGAGGGTGTGACTGACAACCAGGTGA[T/C]CATGTCCCATTTACTGGAGAATGCAAGGAGTGGCCACTGCTTGAGAAGAA
TcSNP1939GCATAGAAATTACGACAGCCCAATTTCTTGTTAAAAGTTCTGTTTAAT[A/C]GTTTCATCTGAAAATGTGATCAAGCTTTTGGCAGCCGCCGCAGCAA
TcSNP2269AAGCCCAAGGCCCAAGAGCTACGCGACGAAGGCAGTGAACCAGGCGCTGAGGGGGG[C/G]CACCGGCAGACTAAAGAAATGATGCTGGTGAACAAGAAAACCGAGGGGCCGG
TcSNP23010CAGCAGAGGCCGAAGACAAAGAAGGGGAGAGGATGTGACACCCAAGGTTTTT[A/G]GAAGATACAACGGAACACAAAAAGCTTTTACGCAACATAATTTGCT
TcSNP2429AGCAATCCGCCAAACAAGCCAAAAAACCCGTCTGTGCCTT[T/C]CTGTTGCCGAGAGAAACATTTGGGTCTTGCCAAT
TcSNP3096ATACGAGTAAGAAAAAAATTTACATTTTAAGTACAAGAAATTCCAAGATGCC[T/C]CTCACAAACAATATAGTAAAGCACACCGTCGTACCACCAGTGTGTT
TcSNP3724GGAAATTGAAGATTGTGAAGGAGAACTGATTAGACCTGGAATGATTTAAA[A/T]CTTGATTAAGAGGGGTGGCAATGGCAGGTTTTTGAAGACTGCTGCCTATGGGCATTTT
TcSNP4292TGGATGGATTATAAAGGTTGAGATGAATGACGCCGGGGAGTTGAAGAACTTGATGGGC[A/G]GAAGAATACCAAGTTGTGAGGAAGAAGATAAAGCACTGACCAGATGCCCTGGCC
TcSNP4697AGCCATGGCCGAGAAAAAGCCAGCCCGGTTAACCCGATTCGACCCGACTCAG[A/G]GAGACGTTACCGCGGCGTGAAAGCGCCCATGGGGCCGTTACGCGGCGGAGACGG
TcSNP5291ATAGGCCTTGAATGACTACTGACCTACAAGCAACAAAAGGCT[A/C]AGAAAAATAGGCATGTATGGCTTGCACTTGGGTTGCAGTTGCAAGCCGGCAGTA
TcSNP5341TGTAAAGAGTGCCATTGTAAGTGGATGCAGCGCCTTTTAAGCAATGGTACCTAGCA[T/C]TATGGTGGACATTGGGAGGAAGAAGAAAACAGCAGCTAAGAAGGAAGCTACTGAGGAA
TcSNP56010GAGGCAGAGAAATACAAGGCCGAGGATGAGGAGCACAAGAAGAAGGCGGAGGCCAAGAAT[T/G]CTTTGGAGAACTACGCCTATAACATGAGGAACACTGAAGGATGAGAAGATTGGCAA
TcSNP5775CAATTGCAAACACGCCATGATGAATGATGATGATAATGATGAACAGAAAACAAGAT[C/G]AACGGCCACAAATTTAACACAAATTACAAGGGAAAAAAAGGGGAGAAAATARAAAACA
TcSNP5911GGGCGGAGAAGACTGGCTGCTATTAAGGTTGAAACAGAGGGGCTTATGCGGA[A/C]GGAGAAGGCGAAAGATAGCCAGGCCATGCCGGAAAGGGCAAAATGAGGAAGC
TcSNP6196GAACAAAAATTGTAAATTAATTATGCATGGAGTAATGACCCCACAGCTTTGCAACACC[T/C]CAAAAATGGTGGTTGCCTTTGTACATAATGATTGGGATGAAATTTGTTTGTA
TcSNP6455TGAAGGCACTGGAGCCCTTTGTGAATTGGCTGGAGGAAGCAGAAGAAGAGGAATAAGTGC[A/G]ATAACAATAATATTATGAACTTTGAAAAGAAAAGGGGTGAACCCTGTACTTGTCTT
TcSNP72310AAATTGGAACCCCACCAAACCATGCATGGCCATGGATACTAGCAATGATGCTGCTT[T/G]GTTACCTTGCAGGCTGCCTTGGCTGCTTAGTGTTTGACAATGCTAAAAAC
TcSNP7506ACCTAGCCTTAAGCTGCAAGCCGCCTTGACAAAACAAACCTCTG[T/C]GTTGGTTGACACAGCATGAGCAGCTGGCTGTGGCAGCCCAACAGAGACCCAACCCA
TcSNP8362AAGGGTAACCATGTGGAAAGCTAAGTGCCATCACCATGTACATGGGAC[T/C]CCAACAACCTACATATGAGCCCATACTAAGAGTATGACCATGGAGCCCTG
TcSNP8523TGTCTTTACAACCATTGACTGATGGGTGTGAGTAAGACCAAGTGCA[C/G]TACTGTGGGTATAAGGCATTGCTTGGGCCTTTGGAGGCATGATTTGCTTGTA
TcSNP8724AGTGATGTTGCACAGGACAAAGCTAACCTTGTTGCCAGAACAAGAGGG[C/G]AGACCAAAGAACAAGGTGGTAGTCCCCAGTTACAAGCTTGTACCTTGC
TcSNP8783CCGAGGACGGCCGACACGGGCTGACGCCCGAGTACTAGTGGAACCC[C/G]ACCTGGATGACGGGAATGTAGGCCGGAGGTAAAGCGTACAGGAGGATTTGTTGA
TcSNP8864GCGAGTGCCGAGTAGGCAGGAGGAAAGTAGGGAGAAATAAGCGGGCGGTTTGAG[T/C]ACTAGAGACTAGGGGAGCTTTACTAGTATTATGTTTGACTTATGAGGTAAAG
TcSNP8912CAGAAAGCACACTTTGCACAAGGTTACACAGTATAAGAAGGGTAAGGATAGTTTGGCTGC[T/C]CAAGGGAAACGCCGTTATGAGCAAACAAAGGTTATGGTGGGCAGACCAAACCAGTG
TcSNP91710TATGGAAACTGGGTAAAGGCAAGAATGGGCGGCAATCTGGTGGACAAGCCACTTAC[T/C]TGGGCAACCCTGTCATGGGGTACCCCCTAATTGGAGAATTGCCA
TcSNP9293TGAAATGGAAATGAAAATACAAAAAACCAAGATGTAATAGCAAGTGATGCTTTT[C/G]TGCTGATTATAACAGAAGTTGTGTTAATTTTACACTTAGTTGATGTGGTTTTAAAGGG
TcSNP9534GGGTAAATGGCACTTGAGGTTGCCAGAGATAAAGCTATATGCCAAGCCAA[A/T]TTTGGGCATCAGGGAAACACCAAAAACAAGTTTTTATGGCAAAATG
TcSNP9946AACAAACCGAAATATATTAGGGAAACTTTGCATTTGCAACCCCTATTTGACTTGAT[T/C]TGCAGATGCTTTTGAACAACTTGAATAGAAYATGGGAGGAGGGTAG
TcSNP9985AACAAGAACAAGGGAATGGGGACAATATGACCACTAGACATGATGGCTTTCC[A/G]TAGTAGTGAGAAAGGAATAACATAGAATAAGCTGACAGATGCACTGCAACCT
TcSNP10385GGACACCTAGACTAGGAGCAACCTGCTTTTGACAAGAAGCAGTTTGTAACCT[A/G]TGAAGAGGTACAAAGAACTTGACACCCAAATTAGAGCCAGAGAAGCAAGAGTTGTA
TcSNP10602CAGTTTGATTTTGAGATTGAAGCTAACCTTGCACTGAGTACTTGGCACTTGGTAAC[T/C]GTTGGAAGAATATGTAGCCTGAATAAGACGTGCCATGAACATATATGGATATTAGG
TcSNP10623ATGCAAGAGGGAACTGGAGAGTAGTATGCTAAGCCGCATGAAATGCTGCAAG[A/G]GACCTTGACCCTTGATGAGAAGAACCCACGGGATTTGAGGGTGAGGCTTT
TcSNP10751TTATGGCAGCATGCACTTATAATTTATGATTGCAACCCAACTGATACATAAATG[A/T]GTAGTAGGCCTGTATAGATGAATTACGAAACAAAGCATGAGAGTGCATGT
TcSNP11446GGTGGACTTTGTGGAGGAGATTTGCAGTATTATATGGATGAGAATTATTGGG[T/C]TGTTTTGCCCAAACTGGAGAGGTTGTTAGTGAAAGTTACGTAACAAGCAAACAGGC
TcSNP11652GAAATTGCTTAAACATATTTGCATAGCATTATAGATATTTAAAACCATGATGGAGG[T/C]TGCCAAGGTTTCTAGCTTACTTGATGGCCTTAGCCTTGGCCCCCCA
TcSNP12539CACTTGCCACAAGTCACTAAAGCATTGAAACCAGCAAGAAGTGATTTA[T/G]ATTACCAGCACTTAAAACTTTAAAGGATAGGTGAGTAAAGAAATGAGGCGCT
TcSNP12707ATATTTGAGTTGTTTGATGTTTACTACAAGACCTGCCACTTCCGCAGCTT[T/C]GTACCAAGAGCCCTYCAATTTACCAATTATGTTGATTAGGGATGGCTTTCAGAT
TcSNP13501CAAAATTTTTTAACTATATGATGGACCATATAGCCTAAATAAATATAGCAAAAATG[A/C]ATAACAACAAATTATATGGCTGGCTTGCAAAAAGACTATAAGGGCTTGTGGCTAGT
TcSNP14149TGTGACTTACGGTTACCCCAAAAGAGCGTGAGGGAGTTGATTTACAAAAGAGGTTA[T/C]GGGAAGTTGAACAAGCAGCGTGTTGCTTTGACTGACAATGAAAATTGAGCAGGCTG
TcSNP14429AAAAGTGATGGAGGAAAGGGAAAGAGATGGGTGGGAGTAGAGATGGCCTTTGGTGTTC[T/C]TATGGATTTACTACAAAAGCTTTGGTGTGTGATGGAATTTACAGCTACTGTTAT
TcSNP14581TGGATGAGAGCTAAAAGAATTAGAAGGAAAAGCAGATGGCGA[C/G]CAGAGTTTGCAAAGAGAAGCCGACCTCTTACCTTGATTGAGCTG
TcSNP14846AAAACAGAAACGGGGTTGACTTAGCCGCAGCTGTGTAAACACAC[A/G]AGGGGACAGATGGCTGACTGAGAAAGACTGGACCGACGTTGAGTTTAGGG
TcSNP15208ACTGCCTAAATATATATGATGAAGAAAAAGCTTTGGAGAAAACAAGGAAGGTTGAC[T/C]GAGAAGATTGCAGCTGAACTGCTATTGACGATGTTTGCAGCCGAACTGAA

Data analysis

Key descriptive statistics for measuring informativeness of the SNP markers were calculated, including observed heterozygosity, expected heterozygosity, and probability of identity (Evett and Weir 1998, Waits ). The program GenAlEx 6.2 (Peakall and Smouse 2012) was used for computation. For clone or duplicate identification, pairwise multilocus matching was applied among individual varieties and the reference clones, using the same program. Statistical rigor was assessed for match declaration using the probability of identity (PID) that two individuals may share the same multilocus genotype by chance (Waits ). In computing PID, it was assumed that all individual genotypes were siblings (PID-sib), which was defined as the probability that two sibling individuals drawn at random from a population have the same multilocus genotype (Evett and Weir 1998, Waits ). The overall PID-sib is the upper limit of the possible ranges of PID in a population, thus providing the most conservative number of loci required to resolve all individuals, including relatives (Waits ). This can be computed using the following equation: where p is the frequency of the ith allele. The computation was carried out using the program GenAlEx 6.2 (Peakall and Smouse 2012). Accessions with different names that were fully matched at the genotyped SNP loci were declared duplicates or synonymous accessions. After examining duplicates in the analyzed samples, assignment test was applied to infer population membership and admixed ancestry (hybrids or ancestral forms) of the UF clones, using a model-based clustering method implemented in the software program STRUCTURE (Pritchard ). The UF clones were analyzed, together with samples in the 10 reference groups that potentially had made ancestral contribution to these farmer selections. The number of clusters (K-value) was set to10, assuming that each of the 10 populations may have contributed to the UF clones. Ten independent runs were assessed for K = 10. The run with the highest Ln Pr (X|K) value of the 10 runs was chosen and presented in a bar plot. Q-value was used to present the ancestral contribution (membership) from each germplasm group. Accessions possessing ≥25% membership (Q-value) in a given cluster were considered as receiving a significant ancestry contribution from that cluster (genetic group). Accessions possessing ≥75% membership were considered to be a member of that cluster. Accessions possessing >25% but <75% membership were considered as hybrids of two (or more) clusters. After assignment test, multivariate analysis was used to provide a complementary assessment of the relationship among the UF clones and their relationships with reference clones from international genebanks. In this analysis, we included only ancestry populations that are relevant to the origin of the UF clones, based on the result of assignment test. Pair-wise genetic distance was computed for every pair of accessions, using the genetic distance procedure in GenAlEx 6.2 (Peakall and Smouse 2006). The same program was then used to perform Principal Coordinates Analysis (PCoA), based on the pairwise distance matrix. Both distance and covariance were standardized.

Results

Of the 48 SNP markers, 44 were successful in genotyping across all 313 samples. The remaining four SNPs (Tc1038, Tc1144, Tc 1165 and Tc226) had a low success rate (< 90%) thus were removed from the data set. A total of 44 polymorphic SNPs were retained for further analysis. Based on the 44 SNP markers, the expected heterozygosity in the UF clones was 0.741, whereas the observed heterozygosity was 0.694. Inbreeding coefficient was negligible in the UF clones. This result revealed that UF clones are mainly composed by hybrids involving different germplasm groups. Multilocus matching, based on 44 SNP markers, revealed three synonymous groups, involving 16 clones (Table 3). SNP profiles of the repeated genotyping on DNA samples that had been independently extracted from the same accessions showed that genotyping results were highly consistent. The probability that two UF clones will have the same genotype at the 44 SNP loci is approximately 1 in 1,000,000 for the tested UF clones, as computed by the multilocus matching procedure implemented in GenAlEx 6.5. In total, the duplicated accessions accounted for approximately 30% of the UF clones maintained in this collection. The clones in the three synonymous groups were excluded in subsequent analyses of genetic diversity and population structure.
Table 3

Identified synonymous groups (duplicates) in the UF clones maintained in CATIE, Costa Rica based on SNP markers (showing truncated profiles of 48 SNPs)

Synonymous groupSample nameTcSNP 25TcSNP 32TcSNP 139TcSNP 144TcSNP 150TcSNP 151TcSNP 193TcSNP 226TcSNP 230TcSNP 242TcSNP 309TcSNP 372
1UF 10AG GA TG TA CG TC TA CC GA GC TC TA T
1UF 601G GA TG TA CG TC TA CC GA GC TC TA T
1UF 168G GA TG TA CG TC TA CC GA GC TC TA T
1UF 601G GA TG TA CG TC TA CC GA GC TC TA T
1UF 650G GA TG TA CG TC TA CC GA GC TC TA T
1UF 654G GA TG TA CG TC TA CC GA GC TC TA T
1UF 667G GA TG TA CG TC TA CC GA GC TC TA T
1UF 668G GA TG TA CG TC TA CC GA GC TC TA T
1UF676G GA TG TA CG TC TA CC GA GC TC TA T
1UF 677G GA TG TA CG TC TA CC GA GC TC TA T

2UF 29C GA TG TC CG TC TA CC CA AC CC TA T
2UF 242C GA TG TC CG TC TA CC CA AC CC TA T
2UF 705C GA TG TC CG TC TA CC CA AC CC TA T

3UF 716C GA TG GC CG TC TA CC CA GC CT TA T
3UF 717C GA TG GC CG TC TA CC CA GC CT TA T
Model-based assignment test showed that out of the 44 UF clones, twelve were identified as classical Trinitario (Table 4). These clones (UF 613, 650, 652, 654, 666, 667, 668, 672, 676, 677, 678 and 679) are descendants of Trinidad and Tobago germplasm and were selected in 1936 by the United Fruit Company in Limon, Costa Rica, after years of random hybridization and selection. At that time the germplasm of Upper Amazon Forastero had not been collected and used in cacao breeding. Therefore Criollo and Amelonado were the only available parental lines. The result is comparable with the early selections (e.g. the Imperial College Selections) in Trinidad and Tobago, which are mainly hybrids between Criollo and Amelonado.
Table 4

Assigned population membership representing inferred pedigree of 44 United Fruit clones from the international cacao genebank, CATIE, Costa Rica

Assigned membership (Q-value)

AccessionInferred pedigreeNacionalAmelonadoCriolloParinariGuianaIMCLCT EENNanayPurusScavina
UF 4Nacional × Amelonado0.4700.5100.0000.0000.0000.0100.0100.0100.0000.000
UF 10Trinitario0.0000.5100.4700.0000.0000.0000.0000.0000.0100.010
UF 11Trinitario0.0000.5200.4600.0100.0000.0100.0000.0000.0000.000
UF 12Trinitario0.0000.5200.4600.0000.0000.0000.0100.0100.0100.000
UF 168Trinitario0.0000.5200.4600.0000.0000.0000.0000.0000.0000.000
UF 221Trinitario0.0000.5200.4600.0000.0000.0000.0000.0000.0000.000
UF 601Trinitario0.0000.5200.4600.0000.0000.0000.0000.0000.0000.000
UF 650Trinitario0.0000.5500.4300.0000.0000.0000.0000.0000.0000.000
UF 654Trinitario0.0000.5100.4700.0000.0000.0000.0000.0000.0000.000
UF 667Trinitario0.0000.5200.4600.0000.0000.0000.0000.0000.0000.000
UF 668Trinitario0.0000.5100.4800.0000.0000.0000.0000.0000.0000.000
UF 676Trinitario0.0000.5200.4600.0000.0000.0000.0000.0000.0000.000
UF 677Trinitario0.0000.5200.4600.0000.0000.0000.0000.0000.0000.000
UF 242Nacional × Amelonado0.4700.5000.0000.0100.0000.0000.0000.0100.0000.000
UF 705Nacional × Amelonado0.4800.5000.0000.0000.0000.0000.0000.0100.0000.000
UF 716Nacional × Amelonado0.4100.5500.0100.0100.0000.0100.0100.0000.0000.010
UF 717Nacional × Amelonado0.4100.5500.0100.0100.0000.0100.0100.0000.0000.010
UF 20Nacional0.9900.0100.0000.0000.0000.0000.0000.0000.0000.000
UF 29Nacional × Amelonado0.4500.5100.0000.0100.0100.0100.0000.0100.0000.010
UF 36Nacional × Amelonado0.5100.4400.0100.0000.0000.0100.0000.0000.0100.010
UF 38Amelonado × Scavina0.0200.3900.0300.0100.0000.0000.0100.0000.0000.540
UF 93Trinitario0.0000.6000.3800.0100.0000.0000.0000.0000.0000.000
UF 122Amelonado0.0000.8300.1500.0000.0000.0000.0100.0000.0000.000
UF 210Nacional × Amelonado0.4400.5300.0000.0000.0000.0100.0000.0000.0000.000
UF 273 T1Nacional × Amelonado0.6300.3200.0000.0200.0100.0100.0000.0100.0100.000
UF 273 T2Nacional × Amelonado0.6300.2500.0000.0900.0100.0100.0000.0100.0100.000
UF 296Nacional × Amelonado0.3700.5900.0000.0100.0100.0100.0000.0100.0000.010
UF 602Amelonado0.0000.9500.0100.0000.0100.0000.0100.0100.0000.000
UF 613Nacional × Amelonado0.3500.5700.0400.0100.0100.0100.0000.0000.0000.010
UF 666Trinitario0.0000.5100.4700.0000.0000.0000.0000.0000.0000.000
UF 672Trinitario0.0000.6200.3700.0000.0000.0000.0000.0000.0000.000
UF 700Trinitario0.0000.6900.2800.0000.0000.0100.0000.0000.0000.010
UF 701Trinitario0.0000.7700.2100.0000.0000.0000.0100.0000.0000.000
UF 703Nacional × Amelonado0.4400.5400.0000.0000.0000.0100.0000.0100.0000.000
UF 704Nacional × Amelonado0.4800.4900.0000.0000.0000.0000.0100.0100.0100.010
UF 706Amelonado0.0000.7900.1300.0200.0200.0100.0100.0000.0000.010
UF 707Trinitario0.0000.7300.2200.0000.0000.0000.0300.0000.0100.000
UF 708Trinitario0.0000.6500.3300.0000.0000.0000.0000.0000.0100.000
UF 709Trinitario0.0000.7300.2100.0000.0100.0000.0100.0100.0200.000
UF 711Nacional × Amelonado0.7300.2000.0000.0100.0100.0100.0100.0100.0200.010
UF 712Nacional0.9900.0000.0000.0000.0000.0000.0000.0000.0000.000
UF 713Nacional × Amelonado0.4400.5100.0000.0100.0100.0100.0000.0100.0100.000
UF 714Nacional × Trinitario0.2500.4500.2000.0200.0300.0100.0000.0000.0100.020
UF 715Nacional × Trinitario0.1600.5200.1700.0000.0000.0100.0100.0000.0300.090
In addition to the Trinitario type accessions, two Nacional type clones, UF-20 and UF-712, were revealed in the present study (Table 4). Both clones have a Q-value above 95%, thus are assigned to the Nacional group, showing their full Nacional membership. These clones reflected the introduction of native cacao germplasm from Ecuador after the UF Company’s purchase of large cacao plantation in Tenguel, Ecuador. After the catastrophic disease attack in the 1910s–1920s, cacao production was replaced by banana in the coast of Ecuador and some of the cacao germplasm was transferred to Costa Rica. The last group of UF clones was all Nacional hybrids, with Nacional membership ranging from 0.25 to 0.77 (Table 4). This group of UF clones probably represented selections in the breeding program of United Fruit Company during a later stage (1944–1950) when the indigenous germplasm from Ecuador was hybridized with Amelonado cacao in Costa Rica. Result of the distance-based Principal Coordinates Analysis (Fig. 2) fully supported the Bayesian clustering outcome. The plane of the first three main PCO axes accounted for 25.3%, 12.9% and 9.0% of total variation, respectively. The relevant reference clones were clustered in six groups, which matched well with their known classification in cacao germplasm groups. The three types of UF clones, including classical Trinitario, ancient Nacional and Nacional hybrids were clearly separated in the PCoA plot (Fig. 2).
Fig. 2

PCoA plot of UF cacao clones and relevant references from other cacao germplasm collections. The plane of the first three main PCO axes accounted for 47.2% of total variation. First axis 25.3% of total information, the second 12.9% and the third 9.0%.

Discussion

The International Cacao Collection at CATIE (Centro Agronómico Tropical de Investigación y Enseñanza) in Costa Rica and ICGT in Trinidad and Tobago are two universal collections covering all of the known genetic groups (CacaoNet 2012). As with most other cacao germplasm collections, information gaps on passport data remain to be filled. Some primary and secondary contributors of germplasm were unable to guarantee the authenticity of the material supplied. This is considered a common cause of the introduction of mislabeled accessions into cacao collections (Motilal , Wadsworth and Harwood 2000). Significant efforts have been made to solve the problem of mislabeling in some international cacao collections (Motilal , Zhang ); however, the problem in most of the various national collections has not been fully resolved. In the past few years, microsatellite markers have been widely used in cacao genotyping and individual identification, enabling systematic assessment of genetic identity in national and international cacao genebanks (Motilal , 2010, Zhang ). Reference SSR profiles of cacao clones have been deposited in the International Cacao Germplasm Database at the University of Reading, UK (http://www.icgd.rdg.ac.uk/index.php). However, comparison of genotyping results from different laboratories has not been straightforward. The effectiveness of clone identification via SSR fingerprints depends on the number of loci used for genotyping, as well as the rate of genotyping error. For example, it may require multiple repeated genotyping runs to reach the “consensus genotype”. Moreover, data generated from different genotyping platforms can be difficult to compare with each other because the same allele may be binned differently, leading to false conclusions. The present study demonstrated that SNP-based multilocus fingerprints significantly improved the efficiency of genotype identification. The “UF Clones” in Costa Rica was one of the earliest groups of improved cacao germplasm now maintained in the international germplasm collection in CATIE, Costa Rica. Although morphological characterization has been done on these clones, accurate identification of an individual clone had not been achieved. Our result showed that synonymously mislabeled clones can be accurately identified through the comparison of a small set of SNP markers. Moreover, since SNP genotyping can be done in high-throughput fashion and executed by a centralized service provider, as shown by the recent example in West Africa (Padi ), verification of large numbers of trees can be achieved rapidly with reasonable cost. In addition to accurate genotype identification, this small set of SNP markers allowed us to clarify the genetic background of the UF clones, by comparing them with the ten known germplasm groups. Through multivariate clustering analysis and assignment test, we delineated their origin and genetic background. Both distance- and model-based clustering methods showed that the UF clones were composed of Trinitario and Nacional background. This genetic background is highly similar to the Refractario cacao maintained in the international genebank in Trinidad (Zhang ). Refractario cacao was collected from the coast of Ecuador by J.F. Pound after the catastrophic disease infection in the 1920s. However the proven resistance to Frosty Pod in some of the UF clones, such as UF-712 and UF-273, make them highly valuable for cacao breeders. The present study used a germplasm panel of 229 accessions to represent the 10 known germplasm groups. The classification system of 10 germplasm clusters was reported by Motamayor based on 90 SSR markers. In the present study, we were able to differentiate the 10 germplasm clusters based on the 44 SNP markers. This reference germplasm panel allowed us to assess ancestry admixture or infer parentage for the UF clones. In addition to the UF clones, the CATIE cacao collection contains several other groups of improved cacao germplasm including ARF (Área de Recursos Fitogenéticos), CC (Cacao Center Selections) and PMCT (Programa de Mejoramiento de Cultivos Tropicales). For these breeding lines, the reference germplasm panel of 10 clusters is being used to verify population membership and/or recorded pedigrees, which are essential for maintaining correct passport information records and to facilitate a better use of these breeding lines for cacao genetic improvement. Frosty Pod caused by Moniliophthora roreri, is a major concern due to its devastating effects on yields and limited available control measures (Evans 2007, Phillips-Mora and Wilkinson 2007). The breeding program in CATIE, Costa Rica started breeding for improved resistance to FPR in early 1990s. The UF clones have been used as the main source for FPR resistance (Phillips-Mora ). Moniliophthora roreri was confined to northwestern South America until the 1950s; now it is found in 11 countries in tropical America. M. roreri, is a very aggressive pathogen that has the capacity to survive under extreme environmental conditions; has a rapid dispersal mechanism and the propensity for human dispersion, and is capable of infecting most commercial cacao genotypes, all of which makes Frosty Pod disease a substantial threat to the worldwide cultivation of cacao (Bailey ). Preventive breeding in uninfected cacao-producing regions has been proposed but this practice has not been implemented except by Brazil (Gutierrez , Phillips-Mora ). So far, in Southeast Asia and West Africa where 85% of the world’s cocoa production are based, sources of resistance to FPR are still needed to be incorporated in the breeding program or in seed gardens. Lack of information on the genetic background of these breeding lines may partially explain breeders’ reluctance to use the UF germplasm. The present study fills the information gap regarding their origin and genetic background, and the FPR-resistant UF clones, with a Nacional background that differs from the currently used progenitors in Asia and Africa, should be incorporated in their breeding programs and seed gardens.
  8 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Estimating the probability of identity among genotypes in natural populations: cautions and guidelines.

Authors:  L P Waits; G Luikart; P Taberlet
Journal:  Mol Ecol       Date:  2001-01       Impact factor: 6.185

3.  Cacao diseases-the trilogy revisited.

Authors:  Harry C Evans
Journal:  Phytopathology       Date:  2007-12       Impact factor: 4.025

4.  Frosty pod of cacao: a disease with a limited geographic range but unlimited potential for damage.

Authors:  W Phillips-Mora; M J Wilkinson
Journal:  Phytopathology       Date:  2007-12       Impact factor: 4.025

5.  GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research--an update.

Authors:  Rod Peakall; Peter E Smouse
Journal:  Bioinformatics       Date:  2012-07-20       Impact factor: 6.937

6.  Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

Authors:  Mathilde Allegre; Xavier Argout; Michel Boccara; Olivier Fouet; Yolande Roguet; Aurélie Bérard; Jean Marc Thévenin; Aurélie Chauveau; Ronan Rivallan; Didier Clement; Brigitte Courtois; Karina Gramacho; Anne Boland-Augé; Mathias Tahi; Pathmanathan Umaharan; Dominique Brunel; Claire Lanaud
Journal:  DNA Res       Date:  2011-12-30       Impact factor: 4.458

7.  Towards the understanding of the cocoa transcriptome: Production and analysis of an exhaustive dataset of ESTs of Theobroma cacao L. generated from various tissues and under various conditions.

Authors:  Xavier Argout; Olivier Fouet; Patrick Wincker; Karina Gramacho; Thierry Legavre; Xavier Sabau; Ange Marie Risterucci; Corinne Da Silva; Julio Cascardo; Mathilde Allegre; David Kuhn; Joseph Verica; Brigitte Courtois; Gaston Loor; Regis Babin; Olivier Sounigo; Michel Ducamp; Mark J Guiltinan; Manuel Ruiz; Laurence Alemanno; Regina Machado; Wilberth Phillips; Ray Schnell; Martin Gilmour; Eric Rosenquist; David Butler; Siela Maximova; Claire Lanaud
Journal:  BMC Genomics       Date:  2008-10-30       Impact factor: 3.969

8.  Geographic and genetic population differentiation of the Amazonian chocolate tree (Theobroma cacao L).

Authors:  Juan C Motamayor; Philippe Lachenaud; Jay Wallace da Silva E Mota; Rey Loor; David N Kuhn; J Steven Brown; Raymond J Schnell
Journal:  PLoS One       Date:  2008-10-01       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.