Literature DB >> 24718264

Whole mitochondrial and plastid genome SNP analysis of nine date palm cultivars reveals plastid heteroplasmy and close phylogenetic relationships among cultivars.

Jamal S M Sabir1, Dhivya Arasappan2, Ahmed Bahieldin3, Salah Abo-Aba4, Sameera Bafeel1, Talal A Zari1, Sherif Edris3, Ahmed M Shokry5, Nour O Gadalla6, Ahmed M Ramadan5, Ahmed Atef1, Magdy A Al-Kordy6, Fotoh M El-Domyati3, Robert K Jansen7.   

Abstract

Date palm is a very important crop in western Asia and northern Africa, and it is the oldest domesticated fruit tree with archaeological records dating back 5000 years. The huge economic value of this crop has generated considerable interest in breeding programs to enhance production of dates. One of the major limitations of these efforts is the uncertainty regarding the number of date palm cultivars, which are currently based on fruit shape, size, color, and taste. Whole mitochondrial and plastid genome sequences were utilized to examine single nucleotide polymorphisms (SNPs) of date palms to evaluate the efficacy of this approach for molecular characterization of cultivars. Mitochondrial and plastid genomes of nine Saudi Arabian cultivars were sequenced. For each species about 60 million 100 bp paired-end reads were generated from total genomic DNA using the Illumina HiSeq 2000 platform. For each cultivar, sequences were aligned separately to the published date palm plastid and mitochondrial reference genomes, and SNPs were identified. The results identified cultivar-specific SNPs for eight of the nine cultivars. Two previous SNP analyses of mitochondrial and plastid genomes identified substantial intra-cultivar ( = intra-varietal) polymorphisms in organellar genomes but these studies did not properly take into account the fact that nearly half of the plastid genome has been integrated into the mitochondrial genome. Filtering all sequencing reads that mapped to both organellar genomes nearly eliminated mitochondrial heteroplasmy but all plastid SNPs remained heteroplasmic. This investigation provides valuable insights into how to deal with interorganellar DNA transfer in performing SNP analyses from total genomic DNA. The results confirm recent suggestions that plastid heteroplasmy is much more common than previously thought. Finally, low levels of sequence variation in plastid and mitochondrial genomes argue for using nuclear SNPs for molecular characterization of date palm cultivars.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24718264      PMCID: PMC3981771          DOI: 10.1371/journal.pone.0094158

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Date palm (Phoenix dactylifera L., Arecaceae) is the primary crop in many countries in western Asia and northern Africa [1]. This species is the oldest domesticated fruit-bearing tree with archaeological records dating back to 4000–5000 years ago in southern Iraq [2]–[3]. The cultivation of date palm enabled the development of the oasis system that allowed human expansion into the deserts of Asia and northern Africa [1]. The economic importance of date palm is due largely to its nutritionally valuable fruit, which contains 44–80% carbohydrates, 0.2–0.5% fat, 2.3–5.6% protein and 6–12% dietary fiber [4]–[5]. Numerous medicinal uses have also been attributed to date palms, including treatment for intestinal ailments, colds, sore throat, toothaches, fever, gonorrhea, and cough [6]–[8]. In view of the huge economic value of date palm, it is no surprise that there has been intense interest in breeding programs to enhance fruit production. However, there are several impediments to using traditional breeding practices for genetic improvement of new cultivars. Date palms are propagated either from seed or vegetative offshoots. For both approaches, extremely slow growth of seedlings and offshoots does not allow the use of classical breeding techniques; it takes 8–10 years before plants produce fruit. Propagation with seeds is unsuitable for commercial production because half of the progeny are males and there is currently no way to sex date palm plants at an early stage of development. The exact number of named date palm cultivars is uncertain but estimates as low as 340 and as high as 5000 have been reported [5], [9]. In the past, female cultivars have been identified by morphology of the fruits, including size, color, shape, and taste. Many of the named cultivars have local names that are based on geographic location resulting in names that may not be genetically distinct. During the past decade, there have been numerous attempts to use molecular markers to characterize date palm biodiversity but most of these studies have relied on fragment data, such as RAPD, ISSR, SSR, and AFLP approaches, e.g., [10]–[18]. Although these methods have some merit, they are not as reliable in characterizing genetic diversity and identifying cultivars as more recent genomic approaches [19]. The advent of next generation sequencing has generated a surge of interest in using genomic approaches to characterize date palm cultivars. The publication of the mitochondrial [20], nuclear [21]–[22], and plastid [23]–[24] genome sequences of date palm provides reference genomes to examine SNPs for identifying cultivar diversity and genetic relationships among cultivars. Three recent studies utilized a whole genome approach to detect SNPs in the mitochondrial and plastid genomes of one or three common date palm cultivars [20], [23]–[24]. All of these studies were limited by the number of cultivars examined (3 or less) and issues concerning how to deal with the high percentage of the plastid genome that is also present as insertions in the mitochondrial genome (46.5%). All three studies concluded that there was considerable number of polymorphic sites in both the mitochondrial and plastid genes among and within the three cultivars examined. Two of these three studies [23]–[24] indicated that single plants were used for DNA isolation. In these cases, if intra-cultivar ( = intra-varietal) polymorphisms were present this would be unusual for plastid genomes because heteroplasmy has been considered to be rare [25], although more recent studies have suggested that it may be more common [26]–[27]. In this study, we sequenced mitochondrial and plastid genomes for nine additional date palm cultivars from Saudi Arabia. The four questions of our investigation are: (1) Is there heteroplasmy in the mitochondrial and plastid genomes?; (2) What is the effect of plastid DNA transfer to the mitochondrion on organellar SNP analyses? (3) Are organellar SNPs useful for identification of date palm cultivars?; and (4) What are the phylogenetic relationships among cultivars?

Materials and Methods

Sampling and DNA Isolation

Approximately 500 mg of field-collected leaf tissue from a single plant of each of the nine cultivars (Table 1) of Phoenix dactylifera was collected from Hada El-Sham Station, King Abdulaziz University, Saudi Arabia and frozen in liquid nitrogen. Isolation of total genomic DNA was performed using the modified procedure of Gawel and Jarret [28]. RNA contaminants were removed by adding 10 mg/ml of RNase A (Sigma, USA) to the DNA samples followed by incubation at 37°C for 30 min. Estimation of the DNA concentration was performed by measuring optical density at 260 nm according to the equation: DNA concentration (ug/ml) = OD260×50× dilution factor. Purified DNA samples were sent to Beijing Genomics Institute (BGI), Shenzhen, China for sequencing.
Table 1

Date palm cultivars examined.

CultivarGeographic locationAbbreviationSexFruit shape1 Fruit color1 Accession number
Sukkariat Al-MadinahAl-MadinahSUK-AFemaleOvalBrownSRR974792
Dekhaini Al-RiyadhAl-RiyadhDEKFemaleCylindricalYellowSRR974793
Ajwa Al-MadinahAl-MadinahAJWFemaleOvalRedSRR974754
Perny Al-RiyadhAl-RiyadhPERFemaleOvalBrownSRR974758
Sukkariat QassimQassimSUK-QmaleOvalBrownSRR974794
Rabia Al-MadinahAl-MadinahRABmaleOvalBrownSRR974795
Shalaby Al-MadinahAl-MadinahSHAmaleCylindricalYellowSRR974796
Moshwaq Al-RiyadhAl-RiyadhMOS-AmaleCylindricalYellowSRR974797
Moshwaq Hada Al-ShamHada Al-ShamMOS-HmaleCylindricalYellowSRR974798
KhalasReference genome from GenBankKHA-P; KHA-MNINININC_013991.2 – plastid; NC_016740.1 - mito

NI  =  Not included for reference genome; 1Fruit shape and color for male plants is based on these features from female plants of the same cultivar.

NI  =  Not included for reference genome; 1Fruit shape and color for male plants is based on these features from female plants of the same cultivar.

Genome sequencing, mapping of reads to reference, and SNP analysis

Total genomic DNA was sequenced using the Illumina HiSeq 2000 platform at BGI. For each species, about 60 million 100 bp paired-end reads were generated from a sequencing library with 500 bp inserts. The raw data was processed in two steps: adapter sequences in reads were trimmed and then reads that contained more than 50% low quality bases (quality value ≤ 5) were removed. The remaining sequencing reads from the nine samples were aligned separately to the date palm plastid (NC_013991) and mitochondrial (NC_016740.1) reference genomes using BWA (http://bio-bwa.sourceforge.net/). Reads were then run through samtools version mpileup (http://samtools.sourceforge.net/) and bcftools pipelines to identify SNPs that are unique to the mitochondria or plastid genomes. Only SNPs with a read depth of ≥ 10, mapping quality≥20, and SNP quality≥15 were retained. Initially, all reads were included in the mapping but a separate mapping was performed after filtering out of all reads that aligned to both plastid and mitochondrial genomes; only filtered reads were used in all subsequent SNP comparisons.

Alignment and phylogenetic analyses

Ten mitochondrial and plastid genomes (nine from this paper and one from GenBank, Table 1) were aligned with MAFFT [29]. These alignments were used to generate Maximum Likelihood trees using the PhyML plugin in Geneious 6.0.5 (Biomatters Ltd.). Congruence between trees generated from mitochondrial and plastid SNPs was examined using the incongruence length difference test (ILD) implemented in PAUP*4.0b10 [30].

Results

Mapping of reads to reference genomes

The number of reads generated for each sample ranged from 66.72 to 77.87 million. Mapping of the reads to the mitochondrial genome (NC_016740.1) covered 100% of the genome for all nine cultivars. The number of reads mapped to the mitochondrial genome varied from 854,270–1,495,892 depending on the cultivar, which included 1.41–2.07% of the total reads (Table 2). Mapping of the reads to the reference plastid genome (NC_013991.2) resulted in 99.61–100% coverage of the genome. The number of reads mapped to the plastid genome for the nine cultivars ranged from 751,281–1,153,632, which represents 0.96–1.52% of the total reads (Table 3).
Table 2

Summary of alignment results to the mitochondrial reference genome (NC_016740.1).

CultivarTotal reads (million)Number of reads mappedNumber of reads mapped after filteringCoverageFiltered coverage% reads mapped% reads mapped after filtering
SUK-A72.231,495,892985,5214182762.07%1.36%
DEK77.871,096,970712,3833071991.41%0.91%
AJW75.381,484,793972,1984152721.97%1.29%
PER74.241,112,452683,6813111911.50%0.92%
SUK-Q66.72854,270454,8972391271.28%0.68%
RAB75.81,167,048627,4193261761.54%0.83%
SHA72.351,049,708645,1572941801.45%0.89%
MOS-A70.31,016,158637,7602841781.45%0.91%
MOS-H74.951,127,157692,7003151941.50%0.92%
Table 3

Summary of alignment results to the plastid reference genome (NC_013991.2).

Cultivar NameTotal reads (million)Number of reads mappedNumber of reads mapped after filteringCoverageFiltered coverage% reads mapped% reads mapped after filtering
SUK-A72.231,014,305503,9341,2806361.40%0.70%
DEK77.87751,281366,6949484630.96%0.47%
AJW75.381,040,624528,0291,3136661.38%0.70%
PER74.24853,777425,0061,0785361.15%0.57%
SUK-Q66.72847,495448,1221,0705661.27%0.67%
RAB75.81,153,632614,0031,4567751.52%0.81%
SHA72.35841,371436,8201,0625511.16%0.60%
MOS-A70.3792,442414,0441,0005231.13%0.59%
MOS-H74.95892,144457,6871,1265781.19%0.61%
Since 10.3% of the 715,001 bp mitochondrial genome represents plastid insertions [20], the reads that mapped to both genomes were removed and the remaining reads were mapped to the reference plastid and mitochondrial genomes to avoid generating false SNPs that represent DNA sequences that were transferred from plastid genome to the mitochondrial genome. For the mitochondrial genome, this reduced the number of mapped reads to 627,419–985,521 for the nine cultivars, which represented 53–66% of the reads mapped before filtering (Table 2). In the case of the plastid genome, the number of reads mapped was reduced to 366,694–614,003 or 49–53% of the reads mapped before filtering (Table 3). Filtering out reads that mapped to both genomes reduced the number of SNPs detected in both the mitochondrial and plastid genomes and it also reduced the read depth coverage for each SNP.

Mitochondrial SNPs

The number of mitochondrial SNPs detected for each of the nine Saudi Arabian date palm cultivars relative to the reference genome ranged from 18–25 for a total of 188 SNPs (Figure 1A). For the most part, mitochondrial SNPs were homogeneous since all reads at each SNP position had either the reference or the alternate nucleotide (Table 4). There were 15 SNPs that showed polymorphisms but in these cases the majority of the reads matched either the reference or the alternate nucleotide (Table 4). The 188 SNPs were located at 37 different sites in the mitochondrial genome. Most SNPs were shared among cultivars with only 14 unique to a single cultivar (Tables 4). The number of shared SNPs was 16 for all nine cultivars, two for eight cultivars, one for five cultivars, one for three cultivars, and three for two cultivars. Only five of the nine cultivars had unique mitochondrial SNPs that could be used as a marker for their identification (Figure 1A). All but one of the SNPs was located in intergenic spacer regions (Figure 2). The one exception was a nonsynonymous substitution in the matR gene at coordinate 559,552 in the cultivar Moshwaq Al-Riyadh (MOS-A).
Figure 1

Number of total and unique SNPs detected for each of the nine Saudi Arabian date palm cultivars.

(A) mitochondrial, (B) plastid.

Table 4

Mitochondrial SNPs sorted by position in the genome.

CultivarPositionReferenceAlternateQualityRead depthDepth referenceDepth alternateLocation
SUK-A117,620GA7821020IGS
DEK117,620GA6113011IGS
AJW117,620GA6914013IGS
PER117,620GA7214013IGS
SUK-Q117,620GA6510010IGS
RAB117,620GA8116014IGS
SHA117,620GA8121021IGS
MOS-A117,620GA7312010IGS
MOS-H117,620GA6213011IGS
SHA* 130,703 C T 34 20 9 7 IGS
SHA* 130,707 C T 36 24 14 7 IGS
SHA* 130,715 G A 95 32 22 8 IGS
SUK-A157,036CT12357054IGS
DEK157,036CT12345042IGS
AJW157,036CT10653052IGS
PER157,036CT, G10037035IGS
SUK-Q157,036CT761009IGS
RAB157,036CT9121020IGS
SHA157,036CT10030028IGS
MOS-A157,036CT8628024IGS
MOS-H157,036CT10132028IGS
SUK-A215,792AC2222650250IGS
DEK215,792AC2221960181IGS
AJW215,792AC2221780165IGS
PER*215,792AC2221601153IGS
SUK-Q215,792AC20684075IGS
RAB215,792AC2181230113IGS
SHA215,792AC2221180109IGS
MOS-A215,792AC2221320127IGS
MOS-H215,792AC2221420131IGS
SUK-Q* 260,494 A T 18 115 97 15 IGS
MOS-H* 329,782 G A 60 21 16 5 IGS
SUK-A349,157AT7516016IGS
DEK349,157AT7015015IGS
PER349,157AT7315014IGS
MOS-A 350,750 C A 222 125 0 121 IGS
MOS-H* 452,157 C G 38 159 129 22 IGS
SUK-A457,989CA6612012IGS
AJW457,989CA6423023IGS
PER457,989CA7918016IGS
SUK-Q457,989CA6610010IGS
RAB457,989CA5211011IGS
SHA457,989CA6319019IGS
MOS-A457,989CA6817017IGS
MOS-H457,989CA7517017IGS
SUK-A457,994AT4612012IGS
DEK457,994AT4210010IGS
AJW457,994AT5229029IGS
PER457,994AT6320019IGS
SUK-Q457,994AT5511011IGS
RAB457,994AT4814014IGS
SHA457,994AT6622021IGS
MOS-A457,994AT4918018IGS
MOS-H457,994AT6222021IGS
SUK-A458,029AC6215013IGS
DEK458,029AC6411010IGS
AJW458,029AC7130030IGS
PER458,029AC6320020IGS
SUK-Q458,029AC6017017IGS
RAB458,029AC4516016IGS
SHA458,029AC6320020IGS
MOS-A458,029AC6719018IGS
MOS-H458,029AC5421021IGS
SUK-A458,036CA6913013IGS
AJW458,036CA7230029IGS
PER458,036CA7920018IGS
SUK-Q458,036CA7017016IGS
RAB458,036CA6116016IGS
SHA458,036CA6020020IGS
MOS-A458,036CA7919019IGS
MOS-H458,036CA7821021IGS
SUK-A464,552CG22262057IGS
DEK464,552CG22252052IGS
AJW464,552CG22258053IGS
PER464,552CG22257051IGS
SUK-Q464,552CG18731031IGS
RAB464,552CG22237034IGS
SHA464,552CG22233032IGS
MOS-A464,552CG22238035IGS
MOS-H464,552CG19346041IGS
SUK-A475,318AT1731100105IGS
DEK475,318AT1701090101IGS
AJW475,318AT1891070101IGS
PER475,318AT16393088IGS
SUK-Q475,318AT12928026IGS
RAB475,318AT13957055IGS
SHA475,318AT15262058IGS
MOS-A475,318AT15256054IGS
MOS-H475,318AT11959056IGS
SUK-A475,346GT2191390135IGS
DEK475,346GT2201040102IGS
AJW475,346GT2221470145IGS
PER475,346GT20798097IGS
SUK-Q475,346GT18447047IGS
RAB475,346GT19667066IGS
SHA475,346GT19787087IGS
MOS-A475,346GT20675073IGS
MOS-H475,346GT21271069IGS
SHA 482,322 A C 93 11 0 11 IGS
SUK-A503,021AC14917012IGS
DEK503,021AC12211011IGS
MOS-A* 559,552 C T 26 169 138 26 NS-matR
SUK-Q* 571,857 G A 37 168 137 24 IGS
SHA* 572,726 G A 22 194 152 31 IGS
SUK-A587,016GA14014010IGS
PER587,016GA1021006IGS
SUK-Q* 590,324 C T 16 131 108 18 IGS
SUK-Q* 590,624 T G 17 104 87 16 IGS
PER* 629,408 G T 46 106 29 17 IGS
SUK-A632,571AC88105096IGS
DEK632,571AC9191075IGS
AJW632,571AC7689079IGS
PER632,571AC7865060IGS
SUK-Q632,571AC6845039IGS
RAB632,571AC8564056IGS
SHA632,571AC6751048IGS
MOS-A632,571AC6247039IGS
MOS-H632,571AC7570064IGS
SUK-A642,650GT9950045IGS
DEK642,650GT14947036IGS
AJW642,650GT14643030IGS
PER642,650GT14146036IGS
SUK-Q642,650GT10420014IGS
RAB642,650GT13624021IGS
SHA642,650GT11939031IGS
MOS-A642,650GT12841034IGS
MOS-H642,650GT12440027IGS
SUK-A642,669TG21051050IGS
DEK642,669TG22249048IGS
AJW642,669TG22243042IGS
PER642,669TG22246045IGS
SUK-Q642,669TG22220020IGS
RAB642,669TG22226024IGS
SHA642,669TG22241037IGS
MOS-A642,669TG22242041IGS
MOS-H642,669TG22241040IGS
SUK-A642,689AT10750049IGS
DEK642,689AT13746045IGS
AJW642,689AT13042041IGS
PER642,689AT13446046IGS
SUK-Q642,689AT14119019IGS
RAB642,689AT13728026IGS
SHA642,689AT14143040IGS
MOS-A642,689AT15743042IGS
MOS-H642,689AT15341041IGS
DEK642,706GT753909IGS
SHA642,706GT633407IGS
SUK-A642,707AC273905IGS
DEK642,707AC583908IGS
PER642,707AC24.34005IGS
SHA642,707AC663407IGS
MOS-A642,707AC513706IGS
SUK-A*658,617TG2222941263IGS
DEK658,617TG2182060188IGS
AJW658,617TG2223260290IGS
PER658,617TG2192090184IGS
SUK-Q658,617TG1951310120IGS
RAB658,617TG2201870168IGS
SHA658,617TG2222060185IGS
MOS-A658,617TG2161800165IGS
MOS-H658,617TG2222020187IGS
SUK-A711,571TG105108098IGS
DEK711,571TG9881072IGS
AJW711,571TG10394086IGS
PER*711,571TG8772163IGS
SUK-Q711,571TG7738031IGS
RAB711,571TG9145043IGS
SHA711,571TG, A7952045IGS
MOS-A711,571TG11058053IGS
MOS-H711,571TG11070063IGS
SUK-A711,576AC1201160104IGS
DEK711,576AC11585080IGS
AJW711,576AC11197090IGS
PER711,576AC10474066IGS
SUK-Q711,576AC12541035IGS
RAB711,576AC10147046IGS
SHA711,576AC10263054IGS
MOS-A711,576AC12961057IGS
MOS-H711,576AC12173066IGS
SUK-A711,612TG6962061IGS
DEK711,612TG7549046IGS
AJW711,612TG7759057IGS
PER711,612TG7246044IGS
SUK-Q711,612TG7620020IGS
RAB711,612TG6925025IGS
SHA711,612TG6538038IGS
MOS-A711,612TG8733032IGS
MOS-H711,612TG7642042IGS

SNPs that are unique to an individual cultivar are in bold. * indicate polymorphic SNPs; IGS  =  intergenic spacer; NS  =  nonsynonymous.

Figure 2

Number of mitochondrial and plastid SNPs in intergenic spacers, introns and protein coding genes.

For those SNPs in coding regions the number that results in synonymous versus non-synonymous substitutions is indicated.

Number of total and unique SNPs detected for each of the nine Saudi Arabian date palm cultivars.

(A) mitochondrial, (B) plastid.

Number of mitochondrial and plastid SNPs in intergenic spacers, introns and protein coding genes.

For those SNPs in coding regions the number that results in synonymous versus non-synonymous substitutions is indicated. SNPs that are unique to an individual cultivar are in bold. * indicate polymorphic SNPs; IGS  =  intergenic spacer; NS  =  nonsynonymous.

Plastid SNPs

The number of plastid SNPs ranged from one to eight per cultivar with a total of 30 among the nine date palm cultivars and all but two cultivars (DEK and SHA) had at least one unique SNP (Table 5; Figure 1B). One half of the SNPs (15) were present in a single cultivar with two shared by four cultivars, two by two cultivars, and one by three cultivars. All plastid SNPs were heterogeneous as evidenced by the fact that both the reference and alternate nucleotides were present in some of the reads (Table 5). For most SNPs the number of reads for each nucleotide were very similar indicating that date palm plastid genomes are heteroplasmic, especially since single plants were sampled for each cultivar. The 30 plastid SNPs were located in 20 different positions in the genome with 13 in genes, two in introns, and five in intergenic spacers (Table 5, Figure 2). For the genic SNPs six resulted in nonsynonymous changes and seven were synonymous substitutions (Figure 2).
Table 5

Plastid SNPs sorted by position in the genome.

CultivarPositionReferenceAlternateQualityRead DepthDepth ReferenceDepth AlternateLocation
SUK-A12,167AC219697S-atpA
DEK12,191AC58134916S-atpA
MOS-A12,191AC22140138S-atpA
PER12,191AC441721616S-atpA
SUK-A12,191AC431991720S-atpA
MOS-A 38,157 T G 52 11 4 5 S-psaB
MOS-A 38,160 C T 36 11 5 5 S-psaB
MOS-A 38,181 A C 56 13 6 7 S-psaB
PER 38,233 A G 21 17 11 6 NS-psaB
PER 38,608 G T 19.1 16 10 6 NS-psaB
AJW38,634GA361147S-psaB
MOS-A38,634GA461156S-psaB
PER38,634GA70211110S-psaB
SUK-A38,634GA551477S-psaB
PER 38,692 T G 49 11 4 5 NS-psaB
DEK40,739GA33271210NS-psaA
SHA40,739GA341557NS-psaA
AJW 40,783 C T 20 22 7 5 S-psaA
AJW40,785CT332295NS-psaA
DEK40,785CT2723116NS-psaA
RAB40,785CT19.120116NS-psaA
MOS-H 40,812 C T 16.1 15 4 5 NS-psaA
AJW 48,066 A C 23 14 7 6 I-trnL-UAA
MOS-A65,042AG491147IGS-petA:psbJ
SUK-A65,042AG15.11245IGS-petA:psbJ
MOS-A 65,045 A T 22 11 4 7 IGS-petA:psbJ
MOS-A 65,409 C G 19.1 38 30 7 IGS-petA:psbJ
SUK-Q 65,427 C T 30 33 25 7 IGS-petA:psbJ
PER 65,453 G A 58 42 26 13 IGS-petA:psbJ
RAB 79,175 G T 18.1 22 14 7 I-petD

SNPs that are unique to an individual cultivar are in bold. IGS  =  intergenic spacer; I  =  intron; NS  =  nonsynonymous; S  =  synonymous.

SNPs that are unique to an individual cultivar are in bold. IGS  =  intergenic spacer; I  =  intron; NS  =  nonsynonymous; S  =  synonymous.

Relationships among cultivars

Unrooted maximum likelihood (ML) trees were generated independently for mitochondrial and plastid SNPs to estimate relationships among the 10 cultivars of date palm (Figure 3). The mitochondrial tree (Figure 3A) was not well resolved and bootstrap support for resolved nodes was low. This is likely due to the fact that 30 of the 37 SNP positions were either unique to a single cultivar or shared by all nine cultivars relative to the reference (Table 4). Three features (sex, fruit shape, and fruit color) were plotted on the mitochondrial tree to determine if any of these characters corresponded to the relationships among cultivars (Figure 3A). The only feature that showed some correspondence with the tree topology was sex, with three of the four female plants examined grouping together.
Figure 3

Maximum likelihood trees of mitochondrial SNPs for 10 date palm cultivars.

(A), plastid (B), and combined (C). Numbers below each node represent bootstrap values for 1000 replicates. Cultivar abbreviations are provided in Table 1. Cultivar acronyms in red and black are female and male plants, respectively. Fruit shape is indicated and acronym names are color coded by fruit color (yellow, red, and brown).

Maximum likelihood trees of mitochondrial SNPs for 10 date palm cultivars.

(A), plastid (B), and combined (C). Numbers below each node represent bootstrap values for 1000 replicates. Cultivar abbreviations are provided in Table 1. Cultivar acronyms in red and black are female and male plants, respectively. Fruit shape is indicated and acronym names are color coded by fruit color (yellow, red, and brown). The ML tree for the plastid SNPs was also not well-resolved or supported, however, bootstrap values for two nodes were slightly higher than those in the mitochondrial tree (Figure 3B). The low resolution and support was due to the small number of SNPs that are shared among a subset of the cultivars (Table 5). The topology of the plastid tree is largely incongruent with the mitochondrial tree and there is no correspondence between the tree topology and sex, fruit shape, or fruit color. However, ILD test for incongruence resulted in a p value = 0.63 indicating that the trees are not significantly incongruent. Therefore a combined analysis of mitochondrial and plastid SNPs was performed. The resulting ML tree topology was more resolved and better supported than either of the individual trees, however, there was still no correspondence between the ML tree topology and any of the three key features (Figure 3C).

Discussion

Heteroplasmy of SNPs

Three previous studies within and among one or three date palm cultivars reported intra- and inter-cultivar organellar SNPs [20], [23]–[24]. The detection of intra-cultivar SNPs suggests heteroplasmy in both mitochondrial and plastid genomes. Comparison of three date palm cultivars for mitochondrial SNPs using a combined Solid/454 sequencing approach from total genomic DNA revealed 347–378 intra-cultivar and 56–97 inter-cultivar SNPs [20]. However, the mitochondrial comparison did not account for the fact that 10.3% (73,691 bp or 46.5% of the plastid genome) of this genome represents DNA transferred from the plastid. Thus, it is likely that the high levels of intra-cultivar mitochondrial SNPs reported by Fang et al. [20] are due to the fact that sequences from the plastid genome mapped to the mitochondrial genome. In our SNP analysis of nine Saudi cultivars, filtering out all reads that mapped to both the mitochondrial and plastid genomes eliminated this artifact as evidenced by the fact that only 15 of the 188 mitochondrial SNPs remained polymorphic within individual cultivars (versus all of them before filtering), and in most of these cases the majority of the mapped reads matched either the reference or the alternate nucleotide (Table 4). Thus, intra-cultivar heteroplasmy in the mitochondrial genome of date palms is much less extensive than previously reported. We are not suggesting that intra-cultivar heteroplasmy does not exist, especially since it is recognized that heteroplasmy in plant mitochondria is common [31]–[32]. Straub et al. [33] raised concerns about reports of heteroplasmy in plastid genomes when performing next generation sequencing of total genomic DNA due to the transfer of plastid sequences to the nucleus and mitochondrion. Two previous studies examined SNPs in plastid genomes of date palms, one focused only on inter-cultivar variation [24] and the other on intra-cultivar variation [23]. Yang et al. [23] reported that all 78 SNPs are intra-cultivar, 16 in intergenic spacers and 62 in 23 different genes; in protein-coding genes 29 were synonymous substitutions and 31 were nonsynonymous. Yang et al. [23] utilized four adjustments in an attempt to eliminate false plastid SNPs caused by contamination of nuclear and mitochondrial sequences: (1) only count SNPs where the number of aligned reads is>50; (2) only count SNPs where the percentage of the reads with the minor variant is>10%; (3) exclude SNPs in regions where there are gaps in the alignment; and (4) eliminate SNPs in regions of overlapping homopolymer runs. The first adjustment will not take care of the problem of plastid DNA that has been transferred to the mitochondria because it is well known that read depth for plastid sequences is much higher due to the higher copy number of plastids [33]. So, one would predict that most SNPs in the mitochondrion that are in regions with inserted plastid DNA will have a much higher read depth because many plastid sequences would assemble to these mitochondrial regions. The fourth modification will only correct for errors associated with the well-characterized issue of homopolymer runs using the 454 sequencing platform. We took a much more conservative approach to testing for plastid heteroplasmy by eliminating all reads that mapped to both the plastid and mitochondrial genomes. Our setting of read depth for each SNP at≥10 would greatly reduce the chances of detecting mitochondrial sequences that have been transferred to the nucleus in the mitochondrial SNPs because the read depth of nuclear sequences is so much lower than either mitochondrial or plastid sequences [33]. Although this stringent constraint greatly reduced the number of SNPs detected and their read depth, all remaining plastid SNPs show heteroplasmy (Table 5). Furthermore, similar read depths for the reference and variant plastid SNPs (Table 5) support their location in the plastid genome as opposed to the nucleus, providing further support for occurrence of plastid heteroplasmy. Thus, it is clear that date palm plastid genomes are heteroplasmic, however, caution is recommended for SNP analyses using next generation sequencing of total genomic DNA. The traditional view has been that heteroplasmy in plastids is uncommon [25] but several examples of this phenomenon have been detected across flowering plants, including in Actinidia [26], Coreopsis [34], Cynomorium [35], Epilobium [36], Medicago [37], [38], Gossypium [39], Oenothera [40] Oryza [41], Passiflora [42], Pelargonium [43], and Senecio [27]. Thus, heteroplasmy is more common than previously thought and it likely went undetected because of the paucity of molecular studies that examined intra-individual variation. Two different mechanisms have been suggested for the development of heteroplasmy in plastids. The more commonly suggested explanation is biparental inheritance in which each parent transmits organelles to the zygote, an inheritance mode that occurs in approximately one fifth of angiosperms [44]–[47]. The other mechanism occurs in plants with uniparental plastid inheritance in which plastid sorting in the parent is incomplete resulting in heteroplasmic gametes. In the case of date palm, incomplete sorting is the likely mechanism for heteroplasmy since plastid genomes are considered to have maternal inheritance [44]. We expect that many more cases of plastid heteroplasmy will be revealed as more genomic investigations of single plants are performed.

Challenges of organellar SNP analysis caused by DNA transfers

Integration of plastid DNA into the mitochondrial genome can cause difficulties in utilizing organellar genomes for SNP analyses. In the previous studies of date palm organellar SNPs heteroplasmy was greatly overestimated because 10.3% of the mitochondrial genome represents plastid DNA transfers. The transfer of plastid DNA to the mitochondrion is a common phenomenon with 1–12% of published angiosperm mitochondrial genomes representing plastid DNA [48]. There are several approaches to dealing with this issue. Isolation of purified plastid or mitochondrial DNA would avoid this problem but it is often not possible to obtain sufficient plant material and/or isolate organellar DNA from many species. The most common approach for genomic SNP analyses is to sequence total genomic DNA and align these reads to a reference genome. Although it is well known that the depth of coverage for plastid reads is much higher than mitochondrial or nuclear reads, it is not likely that read depth could resolve this problem. Yang et al. [23] attempted this approach in the date palm investigation but they still overestimated the levels of intra-cultivar heterogeneity. We took a more stringent approach by removing all reads that mapped to both the mitochondrial and plastid genomes to attain a more realistic estimate of organellar SNPs among date palm cultivars. Although we are confident that we did not overestimate the number of intra-individual SNPs, the number of SNPs detected in both organellar genomes was greatly reduced due to the elimination of a large number of reads. In the case of date palm, nearly one half of the plastid genome (73,691 bp) has been transferred to the mitochondrial genome so the SNP analysis only sampled 53.5% of the plastid genome. Filtering reads that map to both organellar genomes is preferable to reporting erroneous SNPs caused by transfer of plastid DNA to the mitochondrion. Another issue with using total genomic DNA for SNP analyses from genome sequence data is the prevalence of both plastid and mitochondrial DNA in the nucleus, which is commonly referred to as NUMTS (nuclear mtDNA) or NUPTs (nuclear ptDNA). In flowering plants, it is well known that large fragments of DNA from both of these genomes are transferred to the nucleus [49], and the proportion varies considerably among different species [50]. However, since the depth of reads for nuclear sequences is so much lower than for mitochondrial or plastid reads, read depth can be used to eliminate overestimation of the number of organellar SNPs.

Cultivar identification and phylogenetic relationships

Date palm cultivar identification is complicated by the fact that there are so many named cultivars, and most of these are characterized by fruit size, color, shape, and taste. This has resulted in different cultivar names for the same morphological type in different countries. Also, reliance on characters that are only present on female plants has caused considerable confusion since it takes 8–10 years before plants flower. Thus, there has been an increasing effort to utilize molecular markers to define cultivars, and most of these studies have used fragment data from RAPD, ISSR, and AFLP comparisons. These approaches are problematic in terms of producing a well-characterized molecular signature for each cultivar, largely because of their limited repeatability. Even though the SNP comparison of the mitochondrial and plastid genomes was limited by cross compartment DNA transfer, our results were successful in detecting unique SNPs for eight of the nine cultivars examined (Figure 1). The main limitations of the organellar approach are the high levels of sequence conservation in these genomes and the need to eliminate regions of transferred plastid sequences to avoid erroneous SNP identification, which reduces the amount of sequence data available for cultivar identification. Two recent comparison of date palm SNPs in the nuclear genome provided much more data. Comparison of four cultivars by Al-Dous et al. [21] revealed over 3.5 million SNPs in 381 Mb and Al-Mssallem et al. [9] identified 3.85 to 6.63 SNPs per kb among 11 cultivars. Although transfer of mitochondrial and plastid DNA to the nucleus may complicate this approach, the huge number of SNPs in the nuclear genome makes this genome much more attractive for future characterization of date palm cultivars. Phylogenetic analyses of mitochondrial and plastid SNPs generated incongruent tree topologies that provided only limited resolution among cultivars with low support values (Figure 3). This result is not surprising in view of the fact that a considerable portion of the data was filtered out of the analysis due to the transfer of 46.5% of the plastid genome to the mitochondrion. Expanded cultivar sampling is not likely to improve the situation. Only a few previous studies have utilized organellar genome sequences for SNP analyses within species [51]–[55], and in all cases the low level of variation detected limited the utility of this approach for population studies. In view of the much higher number of nuclear SNPs in date palms [9], [21], future phylogenetic analyses among cultivars should utilize this genome.
  31 in total

1.  The mechanism of the mixed inheritance of chloroplast genes in Pelargonium : Evidence from gene frequency distributions among the progeny of crosses.

Authors:  R A Tilney-Bassett; C W Birky
Journal:  Theor Appl Genet       Date:  1981-01       Impact factor: 5.699

2.  A case of chloroplast heteroplasmy in kiwifruit (Actinidia deliciosa) that is not transmitted during sexual reproduction.

Authors:  J Chat; S Decroocq; V Decroocq; R J Petit
Journal:  J Hered       Date:  2002 Jul-Aug       Impact factor: 2.645

3.  Rice chloroplast DNA molecules are heterogeneous as revealed by DNA sequences of a cluster of genes.

Authors:  E Moon; T H Kao; R Wu
Journal:  Nucleic Acids Res       Date:  1987-01-26       Impact factor: 16.971

4.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

5.  Examination of the cytoplasmic DNA in male reproductive cells to determine the potential for cytoplasmic inheritance in 295 angiosperm species.

Authors:  Quan Zhang; Yang Liu
Journal:  Plant Cell Physiol       Date:  2003-09       Impact factor: 4.927

6.  Beginnings of fruit growing in the old world.

Authors:  D Zohary; P Spiegel-Roy
Journal:  Science       Date:  1975-01-31       Impact factor: 47.728

7.  Extensive intraindividual variation in plastid rDNA sequences from the holoparasite Cynomorium coccineum (Cynomoriaceae).

Authors:  Miguel A García; Erica H Nicholson; Daniel L Nickrent
Journal:  J Mol Evol       Date:  2004-03       Impact factor: 2.395

8.  The complete chloroplast genome of 17 individuals of pest species Jacobaea vulgaris: SNPs, microsatellites and barcoding markers for population and phylogenetic studies.

Authors:  Leonie Doorduin; Barbara Gravendeel; Youri Lammers; Yavuz Ariyurek; Thomas Chin-A-Woeng; Klaas Vrieling
Journal:  DNA Res       Date:  2011-03-28       Impact factor: 4.458

9.  Complete Arabis alpina chloroplast genome sequence and insight into its polymorphism.

Authors:  Christelle Melodelima; Stéphane Lobréaux
Journal:  Meta Gene       Date:  2013-11-15

10.  Genome sequence of the date palm Phoenix dactylifera L.

Authors:  Ibrahim S Al-Mssallem; Songnian Hu; Xiaowei Zhang; Qiang Lin; Wanfei Liu; Jun Tan; Xiaoguang Yu; Jiucheng Liu; Linlin Pan; Tongwu Zhang; Yuxin Yin; Chengqi Xin; Hao Wu; Guangyu Zhang; Mohammed M Ba Abdullah; Dawei Huang; Yongjun Fang; Yasser O Alnakhli; Shangang Jia; An Yin; Eman M Alhuzimi; Burair A Alsaihati; Saad A Al-Owayyed; Duojun Zhao; Sun Zhang; Noha A Al-Otaibi; Gaoyuan Sun; Majed A Majrashi; Fusen Li; Jixiang Wang; Quanzheng Yun; Nafla A Alnassar; Lei Wang; Meng Yang; Rasha F Al-Jelaify; Kan Liu; Shenghan Gao; Kaifu Chen; Samiyah R Alkhaldi; Guiming Liu; Meng Zhang; Haiyan Guo; Jun Yu
Journal:  Nat Commun       Date:  2013       Impact factor: 14.919

View more
  14 in total

Review 1.  Strengthening desert plant biotechnology research in the United Arab Emirates: a viewpoint.

Authors:  Sanjay Gairola; Khawla I Al Shaer; Eman K Al Harthi; Kareem A Mosa
Journal:  Physiol Mol Biol Plants       Date:  2018-05-30

2.  Comparative analysis of single nucleotide polymorphisms in the nuclear, chloroplast, and mitochondrial genomes in identification of phylogenetic association among seven melon (Cucumis melo L.) cultivars.

Authors:  Qianglong Zhu; Peng Gao; Shi Liu; Sikandar Amanullah; Feishi Luan
Journal:  Breed Sci       Date:  2016-10-18       Impact factor: 2.086

Review 3.  CRISPR/Cas9: A Practical Approach in Date Palm Genome Editing.

Authors:  Muhammad N Sattar; Zafar Iqbal; Muhammad N Tahir; Muhammad S Shahid; Muhammad Khurshid; Abdullatif A Al-Khateeb; Suliman A Al-Khateeb
Journal:  Front Plant Sci       Date:  2017-08-23       Impact factor: 5.753

4.  The complete chloroplast genome of Colobanthus apetalus (Labill.) Druce: genome organization and comparison with related species.

Authors:  Piotr Androsiuk; Jan Paweł Jastrzębski; Łukasz Paukszto; Adam Okorski; Agnieszka Pszczółkowska; Katarzyna Joanna Chwedorzewska; Justyna Koc; Ryszard Górecki; Irena Giełwanowska
Journal:  PeerJ       Date:  2018-05-23       Impact factor: 2.984

5.  Sequencing of organellar genomes of Gymnomitrion concinnatum (Jungermanniales) revealed the first exception in the structure and gene order of evolutionary stable liverworts mitogenomes.

Authors:  Kamil Myszczyński; Piotr Górski; Monika Ślipiko; Jakub Sawicki
Journal:  BMC Plant Biol       Date:  2018-12-03       Impact factor: 4.215

Review 6.  The Promise of Molecular and Genomic Techniques for Biodiversity Research and DNA Barcoding of the Arabian Peninsula Flora.

Authors:  Kareem A Mosa; Sanjay Gairola; Rahul Jamdade; Ali El-Keblawy; Khawla Ibrahim Al Shaer; Eman Khalid Al Harthi; Hatem A Shabana; Tamer Mahmoud
Journal:  Front Plant Sci       Date:  2019-01-21       Impact factor: 5.753

7.  Complete Plastid Genome of the Recent Holoparasite Lathraea squamaria Reveals Earliest Stages of Plastome Reduction in Orobanchaceae.

Authors:  Tahir H Samigullin; Maria D Logacheva; Aleksey A Penin; Carmen M Vallejo-Roman
Journal:  PLoS One       Date:  2016-03-02       Impact factor: 3.240

8.  Genome Sequences of Populus tremula Chloroplast and Mitochondrion: Implications for Holistic Poplar Breeding.

Authors:  Birgit Kersten; Patricia Faivre Rampant; Malte Mader; Marie-Christine Le Paslier; Rémi Bounon; Aurélie Berard; Cristina Vettori; Hilke Schroeder; Jean-Charles Leplé; Matthias Fladung
Journal:  PLoS One       Date:  2016-01-22       Impact factor: 3.240

9.  The complete chloroplast genome of Primulina and two novel strategies for development of high polymorphic loci for population genetic and phylogenetic studies.

Authors:  Chao Feng; Meizhen Xu; Chen Feng; Eric J B von Wettberg; Ming Kang
Journal:  BMC Evol Biol       Date:  2017-11-07       Impact factor: 3.260

Review 10.  Genomic Insights into Date Palm Origins.

Authors:  Muriel Gros-Balthazard; Khaled Michel Hazzouri; Jonathan Mark Flowers
Journal:  Genes (Basel)       Date:  2018-10-17       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.