Literature DB >> 22605966

Elaeis oleifera genomic-SSR markers: exploitation in oil palm germplasm diversity and cross-amplification in arecaceae.

Noorhariza Mohd Zaki1, Rajinder Singh1, Rozana Rosli1, Ismanizan Ismail2.   

Abstract

Species-specific simple sequence repeat (SSR) markers are favored for genetic studies and marker-assisted selection (MAS) breeding for oil palm genetic improvement. This report characterizes 20 SSR markers from an Elaeis oleifera genomic library (gSSR). Characterization of the repeat type in 2000 sequences revealed a high percentage of di-nucleotides (63.6%), followed by tri-nucleotides (24.2%). Primer pairs were successfully designed for 394 of the E. oleifera gSSRs. Subsequent analysis showed the ability of the 20 selected E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The average Polymorphism Information Content (PIC) value for the SSRs was 0.402, with the tri-repeats showing the highest average PIC (0.626). Low values of observed heterozygosity (H(o)) (0.164) and highly positive fixation indices (F(is)) in the E. oleifera germplasm collection, compared to the E. guineensis, indicated an excess of homozygosity in E. oleifera. The transferability of the markers to closely related palms, Elaeis guineensis, Cocos nucifera and ornamental palms is also reported. Sequencing the amplicons of three selected E. oleifera gSSRs across both species and palm taxa revealed variations in the repeat-units. The study showed the potential of E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The markers are also a valuable genetic resource for studying E. oleifera and other genus in the Arecaceae family.

Entities:  

Keywords:  Elaeis oleifera; genomic library; simple sequence repeat (SSR); transferability

Mesh:

Substances:

Year:  2012        PMID: 22605966      PMCID: PMC3344202          DOI: 10.3390/ijms13044069

Source DB:  PubMed          Journal:  Int J Mol Sci        ISSN: 1422-0067            Impact factor:   6.208


1. Introduction

Elaeis oleifera is a species in the oil palm genus along with the commercial Elaeis guineensis and occurs naturally in South-Central America, from Honduras to Colombia and in the Amazon region [1]. This American species is seen as a promising genetic resource for oil palm improvement and is currently used in oil palm hybrid (E. guineensis × E. oleifera) breeding programs. It has attracted the attention of breeders by reason of several interesting agronomic traits: low height increment, resistance to Fusarium wilt and lethal yellowing [2], which can have important economic implications if introgressed into E. guineensis. Beside the agronomic traits, the oil from E. oleifera is highly unsaturated (i.e., high iodine value, or IV) with high linoleic and oleic acids, low palmitic acid and high carotene [3]. A genomic in situ hybridization technique (GISH) using specific DNA probes to distinguish oleifera and guineensis chromosomes has been developed to assist hybrid backcross breeding programs [4]. In plant genetics and breeding studies, DNA-based assays, and especially molecular markers, are known to be efficient tools for genetic diversity assessment, molecular ecology studies, gene mapping as well as marker-assisted selection (MAS) [5]. Among all the available molecular markers, simple sequence repeats (SSR) are still among the most favored, due to their many desirable attributes, which include hypervariability, wide genomic distribution, co-dominant inheritance, a multi-allelic nature and chromosome specific location. In addition, they are easily assayed using PCR [6]. Currently, SSRs also appear to be the most promising molecular marker systems for understanding oil palm population genetic structure [7]. Furthermore, SSR markers which are highly transferable across taxa are advantageous as they save time and cost in developing SSR markers for members of taxa that have not been extensively studied. These SSR markers are also useful tools for comparative genetic studies within the genus. In oil palm, E. guineensis-based SSR markers have been used to construct genetic maps [8,9], and are also actively used to characterize germplasm collections [1]. The Malaysian Palm Oil Board (MPOB) has an extensive collection of germplasm from both species of oil palm. E. guineensis from Africa and E. oleifera maintained as ex-situ collections in Kluang, Johor, Malaysia. Assessing the performance and genetic diversity of the wild material is important for understanding the genetic structure of natural oil palm populations. Furthermore, the information is important for oil palm breeding programs, and also for continued ex-situ conservation of the germplasm in Malaysia. Currently, only the E. guineensis germplasm is well characterized, using various types of molecular markers, such as isozymes [10], restriction fragment length polymorphisms (RFLPs) [11], amplified fragment length polymorphism (AFLP) [12], random amplified polymorphic DNA (RAPD) [13] and SSRs [7,14]. However, the work on E. oleifera has been limited, only involving RAPD [15] and SSR markers developed from E. guineensis [14,16]. Nevertheless, the increasing number of sequence collections available for E. oleifera has made it possible to develop SSR markers from E. oleifera and utilize them to understand the genetics of the species. Thus, the objectives of this study were to (a) develop and characterize E. oleifera genomic SSR markers from a collection of E. oleifera genomic sequences; (b) evaluate the efficiency of these markers in assessing the genetic diversity in the MPOB E. oleifera germplasm collection; and (c) determine the transferability of E. oleifera SSR markers among selected palm genera and taxa.

2. Results and Discussion

2.1. Characterization of E. oleifera Genomic SSRs

The GeneThresher™ library is a comprehensive collection of gene sequences of oil palm obtained from sequencing of the hypomethylated region of the oil palm genome using methylation filtration technology [17]. As such, the sequences are likely to be located within or close to the genic regions in oil palm. The clear advantage is that the SSR locus may point to a gene of interest and show high levels of polymorphism associated with being genomic-based SSR markers. Of the 2000 E. oleifera GeneThresher™ derived sequences used in this study, 1861 non-redundant sequences (1735 singleton and 126 consensus) were successfully assembled with CAP3 sequence assembly software [18]. A total of 603 SSRs were identified in 472 genomic sequences, suggesting the E. oleifera genomic library is a valuable resource for this genetic marker type. One hundred and four (22%) of the genomic sequences contained more than one SSR. Mononucleotides were the most abundant repeat type (437 = 72.4%), and showed a strong bias to the A/T repeat-motifs (97.9%) over the C/G repeat motif (Table 1). Feng et al. [5] reported that mononucleotides were generally not very informative and thus were not considered for analysis in this study. With the omission of mononucleotides, the most prevalent repeats were di-nucleotides (63.3%), followed by tri-nucleotides (24.2%), tetra-nucleotides (6%), penta-nucleotides (4.8%), hexa-nucleotides (0.6%) and 1.2% of the hepta-nucleotides. Among the di-nucleotide repeats, the AG/CT (46.7%) and AT/AT motifs (43.8%) were by far the most common, while AC/GT was present in low abundance (9.5%). The abundance of the AG/CT motif has consistently been reported in EST sequences from E. guineensis [7,19,14], peach [20], coffee [21] and rubber [5]. AG/CT SSR may have a higher probability of being linked to important traits [22], based on report by Morgante et al. [23], highlighting the frequent occurrence of this di-repeat motif in the 5′ flanking regions of genes in plants. The two most common tri-nucleotide motifs were AAG/CTT (50%) and AAT/ATT (25%) followed by AGG/CCT (17.5%), AAC/GTT (5%) and ACC/GGT (2.5%). The abundance of the AAG/CTT tri-repeat motif in E. oleifera is similar to that reported for E. guineensis EST-SRR [7,14,19], Arabidopsis thaliana [24], soybean [25], barley [26] as well as coffee [27]. The most abundant tetra- and penta-nucleotide repeat-motifs were AAAT/ATTT and AAAAG/CTTTT at a frequency of 40% and 50%, respectively.
Table 1

Frequency and distribution of SSRs in 2000 Elaeis oleifera genomic sequences.

SSR MotifNumber of Repeat UnitsTotal

56789101112131415>15
Mononucleotide
A/T-----101786037292895428
C/G-----23-1--39
Di-nucleotide
AC/GT--42121----10
AG/CT--787391162549
AT/AT--6556622-21246
Tri-nucleotide
AAC/GTT-2----------2
AAG/CTT943-1111---20
AAT/ATT1321-111---10
ACC/GGT-1----------1
AGG/CCT52----------7
Tetra-nucleotide
AAAC/GTTT---1--------1
AAAG/CTTT1----------1
AAAT/ATTT13---------4
AATT/AATT-1---------1
ACAT/ATGT---1-1------2
AGCT/ATCG1-----------1
Penta-nucleotide
AAAAG/CTTTT31----------4
AAAAT/ATTTT3-----------3
AGGGG/CCCCT1-----------1
Hexa-nucleotide
AGAGGG/CCCTCT1-----------1
Hepta-nukleotide
AAACCCT/ATTTGGG-----------22
N (Mono-)-----103816038292899437
NN (Di-)--1715131115436417105
NNN (Tri-)15125111122---40
NNNN (Tetra-)34-2-1------10
NNNNN (Penta-)71----------8
NNNNNN (Heksa-)1-----------1
NNNNNNN(Hepta-)-----------22
Total603

2.2. Primers Designed for E. oleifera gSSR

With exclusion of the 437 mononucleotide repeats, attempts were made to design primer pairs for the 166 identified SSRs. Primer pairs were successfully designed for 144 SSRs (86.7%), of which 63.9% were di-repeats, 24.3% tri-repeats, 4.9% tetra and penta-repeats each, 0.7% hexa-repeats and 1.4% hepta-repeats. The failure to design primers for the remaining sequences (13.3%) was probably due to short (or absence of) flanking regions, or that the sequences submitted did not correspond to the minimum criteria required by the primer design software [7]. Nevertheless, the success rate is high compared to previous work on genomic SSRs of wheat [28] and Sorghum [29], where the success rates were only 51% to 66%. Subsequently, 20 of the 144 E. oleifera-based gSSR primer pairs (Table 2), representing a variety of motifs (di- to penta-repeats) were randomly selected to analyze samples from the oil palm germplasm collection.
Table 2

Information on the E. oleifera gSSR markers used for germplasm analysis and cross- transferability evaluation.

Primer IDPrimer Sequence (5′-3′) (F: Forward; R: Reverse)SSR MotifTa (°C)Amplicon (bp)Accession No. (ProbeDB)Allele No.PIC
Di-nucleotide
sMo00018F: TTAAATGAGAGAGAGACGAGGACR: TGGAGCCATGAGAAAGAGTA(CT)1454246Pr00994796360.555
sMo00020F: CCTTTCTCTCCCTCTCCTTTTGR: CCTCCCTCCCTCTCACCATA(AG)1558190Pr009947964120.824
sMo00024F: TCACCAAAGCAGAAGAAACAR: GGTGTTGATAATTGCCTGAA(AT)2854223Pr010315683--
sMo00027F: TTACAGTTGAGGCAGTATGTCAATR: CTGTATGTCAAACCTTCTGCAC(TC)1450209Pr00994796560.574
sMo00055F: GGCATTTCAGATAACGACAAAR: GCACCCAAGTCTCTCTACCTC(GA)1154202Pr01031568450.243
sMo00108F: AGCTTCAATTCATACGCAACR: TGTTATATGTGACTACCAGAGCA(AT)1953170Pr01031568510
Mean6.00.549
Tri-nucleotide
sMo00127F: GTGGTTTGGGAGAAAGAGTGTR: TGCGGTGGATTAGCATTATT(GAA)1256205Pr010315686--
sMo00128F: TAGCTCCAACAGCTTGCCTTATR: GGTCCCGTCCTATGATTTATTCT(AAT)1256192Pr00994796660.654
sMo00129F: TTAGTATTGGGTGTGCATAAGTGGR: GCTTCCAGCTCCTCTTTCTACC(TTC)1356229Pr00994796780.786
sMo00130F: TAAGCAAAAGATCAGGGCACTCR: GGCTGGTGAAAATAGGTTTACAAAG(AAG)1156192Pr009947968130.801
sMo00132F: ATAGCCAGAGGGCAAAACTGTR: GCAACACACGGACTCAAAACTA(TTA)1356161Pr00994796940.264
Mean7.80.626
Tetra-nucleotide
sMo00134F: TCCCAATAGTCGTTACAAACCAGR: GATTAGCAAAAGGGCAAAAAGG(ATTA)656252Pr00994797020.338
sMo00137F: AGGAAGGAGAAGGAGATGAACAGR: CTTTGGATTTGAGCAGAGGAAG(AAAT)654151Pr01031568730.141
Mean2.50.240
Penta-nucleotide
sMo00138F: AGGGTTGTCGCTCCAATTTATR: GGCATCTTTTTGACCTGTAGAAG(TTTTC)656190Pr00994797150.498
sMo00140F: TTAGATCATTTCCCTTGCTTCGR: CGCTGGTCCTGATAACACATT(AAAAT)556216Pr01031568810
sMo00141F: ACTTGACATACAGGTTCCACTGAR: CCTGCTACCTCCTAATTCTATCAAA(TTCTT)556174Pr01031702920.218
sMo00147F: TACCCAATCCCACCGAGTTAR: CGTCTCCACTGAACCACAAAA(AAAAG)554240Pr01031703030.225
Mean2.750.314
Compound
sMo00152F: GGAACAGAGGACAAGAAAGAAAR: TGTATCAAGCCTCAAGTATCTGG(AC)6(AG)1156255Pr00994797230.209
sMo00154F: CAAAAGGGTTGTTTGTATACGTGR: TGCATGAATATCCTCTCAAAGTTAC(TG)7cgcgcgtgtgcgcgtg(TA)854161Pr01031703180.349
sMo00161F: ACTGTTTCGTCAAGCATTTGR: ATCAAGAGAAGGTCGTGTCAG(TG)8(AG)854163Pr01031703210
Mean4.00.279

2.3. Germplasm Characterization: Allelic Polymorphism and Genetic Variation in E. oleifera and E. guineensis

To ascertain the attributes of the E. oleifera-based gSSR markers in characterizing E. oleifera germplasm, 20 primer pairs (markers) were tested on a panel of 119 E. oleifera palms from the germplasm collection. Ten E. guineensis from the Nigerian collection and another 10 from the MPOB advanced breeding material population (Deli dura) were included for comparison. This allowed the study to also determine the ability of the E. oleifera-derived SSR markers to reveal the genetic diversity in the Deli dura material which had undergone several cycles of self-pollination, and also the wild Nigerian materials. This provenance is reported to be the center of diversity for E. guineensis [11]. Eighteen of the 20 primers successfully produced amplicons (Table 2), and 15 of the 18 primers (83.3%) reveal polymorphisms in at least one of the collections analyzed. The remaining three, sMo00108, sMo00140 and sMo00161 were monomorphic in all the samples tested. The high level of detected polymorphism (83.3%) shows the ability of E. oleifera gSSR markers to amplify the target sequences and detect polymorphism in both Elaeis palms. The E. oleifera gSSRs detected 89 alleles, ranging from 1 to 13 across the Elaeis samples. Of them (alleles), 48.3% and 31.5% of alleles were specific to E. oleifera and E. guineensis, respectively, and 20.2% common in both species. Within the repeats, tri-nucleotides detected more alleles (mean = 7.8 alleles) than the other repeats. It would appear that the tri-nucleotide genomic SSRs show higher average PIC values than di-nucleotide repeats. This is most likely a reflection of the specific region of the genome targeted by the methylation filtration technique. Botstein et al. [30] defined any locus (marker) with PIC > 0.5 as highly polymorphic. All the loci derived from the di- and tri-repeat gSSRs met this criterion, except sMo00055 and sMo00132. This shows that both the repeat types are generally informative in the samples analyzed. However, the mean PIC (0.402) from this study was slightly lower than that previously reported for E. guineensis-derived EST-SSRs (7, mean = 0.53; and 14, mean = 0.65) which were used mainly to analyze E. guineensis germplasm. Interestingly, the ability of E. oleifera-derived gSSRs to reveal allelic polymorphism and genetic diversity in the Elaeis genus was more efficient than by other tested marker systems. For instance, E. oleifera gSSRs generated more alleles (A) in both Elaeis species (means = 2.27–2.66; Table 3), compared to RFLP [11] and isozyme [10,31], which generated A < 2.0. Furthermore, the efficiency of the oleifera genomic SSRs in revealing heterozygosity was distinctively higher (mean H = 0.273) than in previous studies on E. guineensis using isozymes (H = 0.184, 10), RFLP (H = 0.135, 15; He = 0.199, 11) and AFLP (He = 0.117, 15).
Table 3

Summary of observed allele numbers (A), percentage polymorphic loci (P), observed and expected heterozygosity (Ho and He) and (F) for 14 loci across six oil palm populations.

CountryNAoP (%)Ho(SD)He(SD)Fis
E. oleifera
Colombia292.5650.00.200 (0.263)0.275 (0.325)0.273 *
Costa Rica343.0055.60.160 (0.223)0.253 (0.316)0.368 *
Panama342.8955.60.193 (0.264)0.310 (0.325)0.377 *
Honduras222.1750.00.102 (0.197)0.210 (0.260)0.514 *
Mean2.6652.80.164 (0.238)0.262 (0.307)0.383
E. guineensis
Deli dura102.0744.40.118 (0.171)0.260 (0.282)0.546 *
Nigeria102.4750.00.321 (0.362)0.329 (0.305)0.024
Mean2.2747.20.220 (0.267)0.295 (0.294)0.285

P = Percentage of polymorphic loci (0.95 criterion); F = Inbreeding coefficient (Wright’s 1965: 1- [Ho/He]).

significant deviation from HWE at P <0.01.

By focusing only on analysis carried out on E. oleifera collections, H revealed by E. oleifera SSR markers (0.262) was slightly lower than those generated by E. guineensis EST-SSR markers (14; H = 0.286). As such, E. oleifera gSSR markers are also additional promising tools for characterizing E. oleifera collections, although the A and H values are lower than those generated by E. guineensis gSSRs (16; A = 0.535 and H = 0.69). The differences revealed by both the genomic SSR markers were possibly due to the number of samples analyzed and populations evaluated. Billotte et al. [16] analyzed 21 E. oleifera samples (1–2 samples per country), whereas 119 E. oleifera samples (22–34 samples per country) were analyzed in this study. Furthermore, the genomic library utilized in this study was constructed from hypo-methylated regions, the chances of the employed E. oleifera gSSR markers being closely located within the conserved coding regions are higher than in the sequences obtained from a conventional genomic library. This could also explain the lower diversity observed in this study. Regarding the genetic variation between the two Elaeis species, the heterozygosity in the E. oleifera germplasm varied from 0.102 to 0.200 (mean H = 0.164), while E. guineensis generated H from 0.118 (Deli dura) to 0.321 (Nigeria) (mean H = 0.220). E. oleifera generally had lower diversity, compared to E. guineensis, with higher H revealed by the Colombian and Panama palms (H = 0.200; H = 0.193 respectively). The higher H in both collections could be due to human assisted movement of palm samples that was probably accelerated during and subsequent to construction of the Panama Canal in 1914. Some palms could have been brought in from other South American countries, widening the genetic base of E. oleifera in these countries. Furthermore, the H value obtained for E. oleifera (mean = 0.262) was comparable to those obtained by RFLP (H = 0.225) and AFLP (H = 0.298) analyses in screening 241 E. oleifera accessions [15]. Among E. guineensis, H for the Nigerian samples (0.329) was lower than that obtained with E. guineensis EST-SSR markers [7,14], where the reported H values were 0.442 and 0.534 respectively. The F values were positive at all loci in all tested collections with mean F ranging from 0.024 (Nigeria) to 0.546 (Deli dura) (Table 3). This reflects the differences in the prospecting areas for the germplasm. The E. guineensis germplasm was collected over widespread areas in Africa, resulting in more heterogeneous collections compared to E. oleifera, which were mostly from scattered isolated populations across four South-Central American countries [32]. This may have encouraged inbreeding, resulting in a relatively homozygous genome for the E. oleifera collections. Furthermore, the extremely high F in Deli dura populations compared to the Nigerian and other E. oleifera germplasm supported the low level of genetic diversity of this advanced breeding population which had undergone several cycles of selfing. This also explains the low genetic diversity of Deli dura population (mean H = 0.260) generated by E. oleifera gSSR markers in this study, which was even lower than that revealed by E. guineensis EST-SSR markers (14; mean H = 0.340).

2.4. Genetic Relationship of the Genus Elaeis

The 18 informative E. oleifera genomic SSRs described in this study successfully grouped the six collections of oil palm into two distinct clusters: E. oleifera and E. guineensis (Figure 1). In general, the clusters supported the origins and geographical distributions of the palms, E. oleifera from Latin America and E. guineensis from Africa. Within E. oleifera, the collections from Costa Rica and Panama showed a very close relationship. This is not surprising as Costa Rica and Panama are neighboring countries. The collection from Honduras also fell into the same cluster as Costa Rica and Panama, again probably due to the close proximity of Honduras to Costa Rica and Panama. The collections from Colombia were clearly separate from the other three collections from Central America.
Figure 1

Unweighted pair group with arithmetic mean (UPGMA) dendrogram reflecting genetic relationship among the Elaeis genus revealed by 18 Elaeis oleifera gSSR markers.

2.5. Cross-Transferability of E. oleifera gSSR Markers

Eleven of the E. oleifera gSSR markers produced clear and prominent banding profiles in both E. guineensis and E. oleifera. These markers were further used to evaluate cross species/genera transferability in the Arecaceae taxa (Table 4). Successful amplification (transferability) of either similar or varying sized fragments was obtained with all the primers in the coconut palms and in at least one of the tested ornamental palms species. As such, the E. oleifera gSSR markers showed 100% transferability to E. guineensis and, more importantly, also perfect (100%) transferability in the tested Cocos nucifera samples. With the ornamental palms, the frequencies of transferability were Euterpe (72.7%) > Oenocarpus (63.6%) > Jessinia (54.5%) > Ptychosperma (54.5%) > Dictyosperma (45.5%) > Cyrtostachys (45.5%). Two markers: sMo00055 (di-repeats) and sMo00137 (penta-repeats), generated clear banding profiles of various sizes in all the samples analyzed, including the ornamental palms.
Table 4

Cross amplification of E. oleifera gSSR primers in various palm (Arecaceae) species.

GenusElaeisCocosOenocarpusEuterpeJesseniaPtychospermaCyrtostachysDictyosperma

SpeciesoleiferaguineensisNuciferamulticaulis-SpruceoleraceabatauaMacarthuriirenda Blumealbum

SSR locusColombiaCosta RicaNigeriaDeli duraCocos nucifera (Yellow)Cocos nucifera (Red)Cocos nucifera (Green)Oenocarpus multicaulis-SpruceEuterpe oleraceaJessenia batauaPtychosperma MacarthuriiCyrtostachys renda BlumeDictyosperma album
sMo00020191200184190210218219-NANA188NANA
sMo00027212210200200300300300226226224NA260200
sMo00055200200188188195195195190190190190190190
sMo00129222230204204178178178180NANANANANA
sMo00130192192176184176176176188188188-208184
sMo00134252252–260252252252–260252–260252–260-252–260-252–260--
sMo00137151151154–162162140138140150140151142146142
sMo00138190–206200184–198198208218208--NA186NANA
sMo00140214214204204184184184204204---204
sMo00141176176250–260250–260176176176NA260260NANANA
sMo00154160160238232160160160160160160160160NA

Figures given are base pair size of fragments: Missing sample; NA, not amplifiable/banding pattern not clear.

SSR primers developed for one species are known to often detect homologous sites in related species. The ability of sMo00055 and sMo00137 markers to amplify fragments with similar sizes indicates their efficiency in revealing sequence conservation among the species in the Arecaceae family. In general, cross species transferability differs highly among taxa, especially in flowering plants [33]. The transferability across related species and genus facilitates comparative genetic studies [34]. The successful rate of transfer for SSR has been reported to average 76.4% at the genus level and 35.2% at the family level [35]. The success rate for E. oleifera SSRs averaged 75% at the genus level, comparable to Phyllostachys Pubescens (75.3%), but lower than rice (90%) [36]. Furthermore, all the tested Elaeis-derived gSSR markers were able to amplify PCR products in Cocos, reflecting their capability in characterizing the three different Cocos samples tested. This also suggests the relatively close proximity of E. oleifera to coconut. The high cross-transferability of E. oleifera gSSR markers to Elaeis species and related genera suggests the potential application of these markers in comparative studies across members of the Arecaceae family. High cross-species conservation of SSR loci within genus has also been reported for Olea [37], Picea [38] and Pinus [39].

2.6. Sequence Variability and Molecular Basis of E. oleifera gSSR Markers Fragment Length Polymorphism

Three markers comprising various repeat types (sMo00055/di-repeat, sMo00137/tetra-repeat and sMo00138/penta-repeat) were used to determine the sequence variability in some of the species in Arecaceae family (E. oleifera, E. guinensis, Cocos nucifera, Jessinia bataua and Oenocarpus multicaulis). Amplified PCR fragments of these markers in selected individuals were cloned and sequenced. The amplicons of the three SSR markers were successfully cloned and sequenced. The sequences were aligned with the original sequence from which the primers were designed (Figure 2). In general, sMo00137 gave the highest sequence similarity among the samples analyzed, followed by sMo00138. sMo00055 showed the lowest similarity with highest number of bases interrupted in the flanking region.
Figure 2

ClustalW alignments of sequences obtained from PCR bands amplified by the E. oleifera gSSR markers (a) sMo00055; (b) sMo00137; and (c) sMo00138 in E. oleifera, E. guineensis, C. nucifera, Jessinia bataua and Oenocarpus multicaulis. Missing data refers to amplicons that were not successfully cloned.

Generally, the sequence data generated by the three selected E. oleifera gSSR loci (sMo00055, sMo00137 and sMo00138) revealed variable numbers of repeat motifs in the SSR regions within the tested samples. This further explains the primary basis of the observed fragment length polymorphism in the Arecaceae family screened in this study. The variations were mainly due to changes in the number of repeat motifs in the SSR region, combined with indels and base substitutions. Similar results were reported by Billotte et al. [16] and Ting et al. [14] who employed E. guineensis SSR and EST-SSR markers, respectively. However, looking specifically at sMo000555, the repeat motif observed in E. oleifera was missing in E. guineensis, and the repeat number was very low in coconut and one of the ornamental palms. Although this locus was successfully amplified in all samples, the lack of repeat conservation in some samples suggests that the amplified fragments may not represent functional SSRs in those species. Nevertheless, the ability of E. oleifera genomic SSRs (sMo00137 and sMo00138) to reveal high inter-species and inter-genera transferability (>90%) supports the close phylogenetic relationship between the species and genera.

3. Experimental Section

3.1. Plant Materials and gSSR Source

The oil palm germplasm collections used in this study are maintained at the MPOB Research Station, Kluang, Johor. A total 149 spear leaves (one per palm) were harvested from the palms in Table 5. Cross-transferability of the E. oleifera gSSR was tested on three coconut (Cocos nucifera) samples and six ornamental palms (Euterpe oleracea, Jessinia bataua, Oenocarpus multicaulis, Ptychosperma macarthurii, Cyrtostachys renda and Dictyosperma album). Genomic DNA was extracted and purified from each spear leaf using the modified CTAB method described by Doyle and Doyle [40]. The E. oleifera genomic library was constructed using GeneThresher™ Technology (17) and the genomic clones and sequences stored at MPOB’s Biological Resource Centre (MBRC).
Table 5

Palms (family Arecaceae) analyzed using Elaeis oleifera gSSR markers.

GenusSpeciesFull NameOriginNo. of Palms
ElaeisOleiferaElaeis oleiferaColombia29
Costa Rica34
Panama34
Honduras22
Sub-total119
ElaeisguineensisElaeis guineensisNigeria10
Deli dura10
Sub-total20
CocosNuciferaCocos nuciferaSolomon Islands3
EuterpeOleraceaEuterpe oleraceaSouth America2
JesseniaBatauaJessenia batauaMart. South America1
Oenocarpusmulticaulis-spruceOenocarpus multicaulis SpruceNorth-western South America,1
PtychospermamacarthuriiPtychosperma macarthuriiNortheastern Australia1
Cyrtostachysrenda BlumeCyrtostachys renda BlumeMalaysia, Indonesia1
DictyospermaAlbumDictyosperma albumMauritius1
Sub-total10
Total149

3.2. SSR Identification and Primer Design

A total of 2000 E. oleifera genomic sequences were assembled using the CAP3 assembly program [18] with default parameters. The file containing the sequences was submitted in a FASTA formatted text file. Identification and localization of the SSR markers were performed using MISA software as described by Thiel et al. [26]. The search criteria were: mononucleotides ≥10 repeat units, di-nucleotides ≥7 repeat units and tri-, tetra-, penta- and hexa-nucleotides ≥5 repeat units respectively. Interrupted compound SSRs were also selected where the interval bases interrupting two SSRs were ≤10 repeat units. The relative frequency and distribution of the repeat types in the genomic sequences were estimated. Primer pairs were designed flanking the identified SSRs using PRIMER 3 [41]; all the primers were synthesized by Invitrogen ™ USA.

3.3. SSR Analysis

The forward primer was 5′ end-labeled in 1 μL reaction containing 4.5 μM forward primer, 0.1 μL γ-33p dATP (GE Healthcare Biosciences, UK, 3000Ci/mmol) and 1U T4 polynucleotide kinase (Invitrogen™ USA) for 1 hour at 37 °C. The PCR reaction was subsequently carried out in 10 μL of 1 μL 10X PCR buffer (buffer composition-MgCl2), 15 mM MgCl2, 1 mM dNTPs, 5 μM unlabeled reverse primer, 1 μL labeled forward primer, 0.5 U Taq DNA polymerase and 50 ng template DNA. PCR was performed in a Perkin Elmer 9600 thermocycler as follows: denaturation at 95 °C for 3 min, 35 cycles at 95 °C for 30 s, 52–56 °C for 30 s (depending on the primers requirement), 72 °C for 30 s and a final extension at 72 °C for 5 min. The PCR reaction was stopped by addition of 10 μL formamide dye (0.3% bromophenol blue, 0.3% xylene cyanol, 10 mM EDTA pH 8.0, 97.5% deionized formamide). A total 5 μL of the mixture was denatured at 90 °C for 3 min, chilled on ice, and separated in a 6.0% polyacrylamide gel containing 7 M urea in 0.5 X TBE buffer at constant power of 1600V for 3 hours. The gel was then dried and exposed to X-ray film (Kodak) for 3–4 days at −80 °C. The size of each allele was determined using the 100–330 bp AFLP DNA ladder (Invitrogen™ USA).

3.4. Data Analysis

Only fragments that could be clearly scored were used in the data analysis. The genotyped data were analyzed using POPGENE version 1.32 [42]. The genetic diversity parameters analyzed for included: percentage of polymorphic loci (0.95 criterion) (P), expected and observed heterozygosity (H and H) in the collections used and fixation indices (F). Chi squared tests were performed for each locus for deviation of the genotypes from the Hardy-Weinberg equilibrium (HWE). The allelic polymorphism information content (PIC) for each gSSR marker and distance matrix [43] between the populations were calculated using the PowerMarker V3.25 software [44]. The unweighted pair-group method with arithmetic averaging (UPGMA) [45] dendrogram was constructed from the distance matrix [43] imported from PowerMarker V3.25 using MEGA4 [46].

3.5. Cross-Transferability Amplification

Eleven E. oleifera gSSR markers that produced clear banding profiles in both E. guineensis and E. oleifera samples were further used to study cross species and genus amplification within Arecaceae family. The markers were tested against three Cocos nucifera varieties and six ornamental palms (Table 5). The SSR analysis, as described above, was carried out at least twice to confirm the transferability of the primers.

3.6. Sequencing of Cloned SSR-PCR Products for Alignment and Phenetic Analysis

The amplicons generated by three selected E. oleifera gSSR markers (sMo00055, sMo00137 and sMo00138) in the E. oleifera, E. guineensis, Cocos nucifera, Jessinia bataua and Oenocarpus multicaulis samples were excised from the agarose gel and purified. The purified fragments were cloned into pCR2.1-TOPO vector (TOPO TA cloning kit, Invitrogen™ USA) and sequenced using the ABI PRISM 377 automated DNA sequencer. The sequences were aligned and compared using CLUSTALW multiple sequence alignment tool employing BIOEDIT sequence alignment editor version 7.0.0 [47]. The sequences were also compared to the original genomic sequence containing the SSR.

4. Conclusions

A set of E. oleifera gSSR markers developed were found to be valuable genetic resources for understanding the genetic diversity of E. oleifera and E. guineensis. The study indicates that E. oleifera-derived SSR markers were more efficient in revealing the genetic diversity of E. oleifera than E. guineensis EST-SSR markers. The sequence data showed their ability to amplify DNA, not only in the two oil palm species, but also in coconut and other selected ornamental palms, thus verifying the ability of SSRs to amplify across species and genera in the Arecaceae family. Furthermore, the variability in allele sizes and sequences among the species reflected the mutational processes that had taken place at both the repeat and flanking regions. An expanded study using all the available SSR markers on a larger set of samples (from both species of oil palm) would provide a clearer picture on the genetic diversity of the germplasm available at MPOB
  25 in total

1.  Primer3 on the WWW for general users and for biologist programmers.

Authors:  S Rozen; H Skaletsky
Journal:  Methods Mol Biol       Date:  2000

2.  Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes.

Authors:  Michele Morgante; Michael Hanafey; Wayne Powell
Journal:  Nat Genet       Date:  2002-01-22       Impact factor: 38.330

3.  PowerMarker: an integrated analysis environment for genetic marker analysis.

Authors:  Kejun Liu; Spencer V Muse
Journal:  Bioinformatics       Date:  2005-02-10       Impact factor: 6.937

4.  SSR mining in oil palm EST database: application in oil palm germplasm diversity studies.

Authors:  Ngoot-Chin Ting; Noorhariza Mohd Zaki; Rozana Rosli; Eng-Ti Leslie Low; Maizura Ithnin; Suan-Choo Cheah; Soon-Guan Tan; Rajinder Singh
Journal:  J Genet       Date:  2010-08       Impact factor: 1.166

5.  An integrated SSR and RFLP linkage map of Sorghum bicolor (L.) Moench.

Authors:  D Bhattramakki; J Dong; A K Chhabra; G E Hart
Journal:  Genome       Date:  2000-12       Impact factor: 2.166

6.  QTL detection by multi-parent linkage mapping in oil palm (Elaeis guineensis Jacq.).

Authors:  N Billotte; M F Jourjon; N Marseillac; A Berger; A Flori; H Asmady; B Adon; R Singh; B Nouy; F Potier; S C Cheah; W Rohde; E Ritter; B Courtois; A Charrier; B Mangin
Journal:  Theor Appl Genet       Date:  2010-02-25       Impact factor: 5.699

7.  Isolation and mapping of microsatellite markers specific for the D genome of bread wheat.

Authors:  E Pestsova; M W Ganal; M S Röder
Journal:  Genome       Date:  2000-08       Impact factor: 2.166

Review 8.  Construction of a genetic linkage map in man using restriction fragment length polymorphisms.

Authors:  D Botstein; R L White; M Skolnick; R W Davis
Journal:  Am J Hum Genet       Date:  1980-05       Impact factor: 11.025

9.  Single-copy, species-transferable microsatellite markers developed from loblolly pine ESTs.

Authors:  Cherdsak Liewlaksaneeyanawin; Carol E Ritland; Yousry A El-Kassaby; Kermit Ritland
Journal:  Theor Appl Genet       Date:  2004-03-26       Impact factor: 5.699

10.  Oil palm (Elaeis guineensis Jacq.) tissue culture ESTs: identifying genes associated with callogenesis and embryogenesis.

Authors:  Eng-Ti L Low; Halimah Alias; Soo-Heong Boon; Elyana M Shariff; Chi-Yee A Tan; Leslie Cl Ooi; Suan-Choo Cheah; Abdul-Rahim Raha; Kiew-Lian Wan; Rajinder Singh
Journal:  BMC Plant Biol       Date:  2008-05-29       Impact factor: 4.215

View more
  10 in total

1.  Morphological and molecular characterization of an Elaeis oleifera (H.B.K) Cortes germplasm collection located in Ucayali, Peru.

Authors:  Alina Camacho-Villalobos; Fernando Serna; Jhofre Flores; Hector Flores; Paulo Manrique; Jorge Bendezu
Journal:  PLoS One       Date:  2021-05-06       Impact factor: 3.240

2.  Chromosome identification in oil palm (Elaeis guineensis) using in situ hybridization with massive pools of single copy oligonucleotides and transferability across Arecaceae species.

Authors:  Noorhariza Mohd Zaki; Trude Schwarzacher; Rajinder Singh; Maria Madon; Corey Wischmeyer; Nordiana Hanim Mohd Nor; Muhammad Azwan Zulkifli; J S Pat Heslop-Harrison
Journal:  Chromosome Res       Date:  2021-10-16       Impact factor: 5.239

3.  Genome-Wide Characterization of Simple Sequence Repeat (SSR) Loci in Chinese Jujube and Jujube SSR Primer Transferability.

Authors:  Jing Xiao; Jin Zhao; Mengjun Liu; Ping Liu; Li Dai; Zhihui Zhao
Journal:  PLoS One       Date:  2015-05-22       Impact factor: 3.240

4.  Mixed model approach for IBD-based QTL mapping in a complex oil palm pedigree.

Authors:  Sébastien Tisné; Marie Denis; David Cros; Virginie Pomiès; Virginie Riou; Indra Syahputra; Alphonse Omoré; Tristan Durand-Gasselin; Jean-Marc Bouvet; Benoît Cochard
Journal:  BMC Genomics       Date:  2015-10-15       Impact factor: 3.969

5.  In Silico Approach for Characterization and Comparison of Repeats in the Genomes of Oil and Date Palms.

Authors:  Jaire Alves Ferreira Filho; Lucas Soares de Brito; André Pereira Leão; Alexandre Alonso Alves; Eduardo Fernandes Formighieri; Manoel Teixeira Souza Júnior
Journal:  Bioinform Biol Insights       Date:  2017-04-18

6.  Genetic diversity of Elaeis oleifera (HBK) Cortes populations using cross species SSRs: implication's for germplasm utilization and conservation.

Authors:  Maizura Ithnin; Chee-Keng Teh; Wickneswari Ratnam
Journal:  BMC Genet       Date:  2017-04-19       Impact factor: 2.797

7.  Development of novel EST-SSR markers in the macaúba palm (Acrocomia aculeata) using transcriptome sequencing and cross-species transferability in Arecaceae species.

Authors:  Bárbara Regina Bazzo; Lucas Miguel de Carvalho; Marcelo Falsarella Carazzolle; Gonçalo Amarante Guimarães Pereira; Carlos Augusto Colombo
Journal:  BMC Plant Biol       Date:  2018-11-12       Impact factor: 4.215

8.  Development and validation of whole genome-wide and genic microsatellite markers in oil palm (Elaeis guineensis Jacq.): First microsatellite database (OpSatdb).

Authors:  Kalyana Babu B; Mary Rani K L; Sarika Sahu; R K Mathur; Naveen Kumar P; Ravichandran G; Anitha P; Bhagya H P
Journal:  Sci Rep       Date:  2019-02-13       Impact factor: 4.379

9.  Transferability, development of simple sequence repeat (SSR) markers and application to the analysis of genetic diversity and population structure of the African fan palm (Borassus aethiopum Mart.) in Benin.

Authors:  Mariano Joly Kpatènon; Kolawolé Valère Salako; Sylvain Santoni; Leila Zekraoui; Muriel Latreille; Christine Tollon-Cordet; Cédric Mariac; Estelle Jaligot; Thierry Beulé; Kifouli Adéoti
Journal:  BMC Genet       Date:  2020-12-03       Impact factor: 2.797

10.  Analyses of hypomethylated oil palm gene space.

Authors:  Eng-Ti L Low; Rozana Rosli; Nagappan Jayanthi; Ab Halim Mohd-Amin; Norazah Azizi; Kuang-Lim Chan; Nauman J Maqbool; Paul Maclean; Rudi Brauning; Alan McCulloch; Roger Moraga; Meilina Ong-Abdullah; Rajinder Singh
Journal:  PLoS One       Date:  2014-01-30       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.