Assessment of phylogenetic relationships is an important component of any successful crop improvement programme, as wild relatives of the crop species often carry agronomically beneficial traits. Since its domestication in East Africa, Eleusine coracana (2n = 4x = 36), a species belonging to the genus Eleusine (x = 8, 9, 10), has held a prominent place in the semi-arid regions of India, Nepal and Africa. The patterns of variation between the cultivated and wild species reported so far and the interpretations based upon them have been considered primarily in terms of nuclear events. We analysed, for the first time, the phylogenetic relationship between finger millet (E. coracana) and its wild relatives by species-specific chloroplast deoxyribonucleic acid (cpDNA) polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and chloroplast simple sequence repeat (cpSSR) markers/sequences. Restriction fragment length polymorphism of the seven amplified chloroplast genes/intergenic spacers (trnK, psbD, psaA, trnH-trnK, trnL-trnF, 16S and trnS-psbC), nucleotide sequencing of the chloroplast trnK gene and chloroplast microsatellite polymorphism were analysed in all nine known species of Eleusine. The RFLP of all seven amplified chloroplast genes/intergenic spacers and trnK gene sequences in the diploid (2n = 16, 18, 20) and allotetraploid (2n = 36, 38) species resulted in well-resolved phylogenetic trees with high bootstrap values. Eleusine coracana, E. africana, E. tristachya, E. indica and E. kigeziensis did not show even a single change in restriction site. Eleusine intermedia and E. floccifolia were also shown to have identical cpDNA fragment patterns. The cpDNA diversity in Eleusine multiflora was found to be more extensive than that of the other eight species. The trnK gene sequence data complemented the results obtained by PCR-RFLP. The maternal lineage of all three allotetraploid species (AABB, AADD) was the same, with E. indica being the maternal diploid progenitor species. The markers specific to certain species were also identified.
Assessment of phylogenetic relationships is an important component of any successful crop improvement programme, as wild relatives of the crop species often carry agronomically beneficial traits. Since its domestication in East Africa, Eleusine coracana (2n = 4x = 36), a species belonging to the genus Eleusine (x = 8, 9, 10), has held a prominent place in the semi-arid regions of India, Nepal and Africa. The patterns of variation between the cultivated and wild species reported so far and the interpretations based upon them have been considered primarily in terms of nuclear events. We analysed, for the first time, the phylogenetic relationship between finger millet (E. coracana) and its wild relatives by species-specific chloroplast deoxyribonucleic acid (cpDNA) polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and chloroplast simple sequence repeat (cpSSR) markers/sequences. Restriction fragment length polymorphism of the seven amplified chloroplast genes/intergenic spacers (trnK, psbD, psaA, trnH-trnK, trnL-trnF, 16S and trnS-psbC), nucleotide sequencing of the chloroplast trnK gene and chloroplast microsatellite polymorphism were analysed in all nine known species of Eleusine. The RFLP of all seven amplified chloroplast genes/intergenic spacers and trnK gene sequences in the diploid (2n = 16, 18, 20) and allotetraploid (2n = 36, 38) species resulted in well-resolved phylogenetic trees with high bootstrap values. Eleusine coracana, E. africana, E. tristachya, E. indica and E. kigeziensis did not show even a single change in restriction site. Eleusine intermedia and E. floccifolia were also shown to have identical cpDNA fragment patterns. The cpDNA diversity in Eleusine multiflora was found to be more extensive than that of the other eight species. The trnK gene sequence data complemented the results obtained by PCR-RFLP. The maternal lineage of all three allotetraploid species (AABB, AADD) was the same, with E. indica being the maternal diploid progenitor species. The markers specific to certain species were also identified.
The wild relatives of crop species often carry beneficial alleles that are effective against various biotic and abiotic stresses. They hold the key to successful crop improvement programmes through introgression of desired genes from wild to cultivated crop species (Dida and Devos 2006). In this context, the assessment of phylogenetic relationships at the inter-specific level and the identification of gene pools are considered important. Based on chloroplast deoxyribonucleic acid (cpDNA) diversity, we obtained information on the molecular phylogeny of finger millet (Eleusine coracana)
vis-à-vis its wild relatives.The genus Eleusine is a member of the tribe Eragrosteae, subfamily Chloridoideae and the family Poaceae. It is a small genus of nine species, which includes six diploid (2n = 2x = 16, 18, 20) and three polyploid (2n = 4x = 36, 38) species (Bisht and Mukai 2002; Liu ). These species are widely distributed in the tropical and subtropical regions of Africa, Asia and South America (Phillips 1972). East Africa is considered to be the centre of diversity for the genus (Bisht and Mukai 2002).Eleusine coracana (2n = 4x = 36), commonly known as finger millet or ragi, is the only economically important species of the genus. After sorghum and pearl millet, finger millet ranks third in cereal production in the semi-arid regions of the world (Bisht and Mukai 2002). It is widely cultivated in the arid and semi-arid regions of East Africa, India, Nepal and many other Asian countries for its grain and fodder value (Verma 2009). The grain is widely used for preparing bread, cakes, soup, puddings, porridge and fermented beverages (Hilu and deWet 1976; Chandrashekar 2010; Neves 2011). Finger millet is a rich source of essential amino acids and polyphenols, and it has comparatively higher levels of calcium and iron than other known cereals (Barbeau and Hilu 1993; Chandrashekar 2010). It has a number of medicinal properties as well, particularly in controlling blood sugar levels in diabeticpatients (Duke and Wain 1981; Chandrashekar 2010; Pradhan ).The assessment of genomic relationships between E. coracana and its allied species has been a subject of comprehensive investigations at the morphological, cytogenetic, biochemical and DNA level. Chromosome research has demonstrated the significant role of polyploidy and aneuploidy in the evolution of the genus (Chennaveeraiah and Hiremath 1974; Hiremath and Chennaveeraiah 1982; Hiremath and Salimath 1991; Bisht and Mukai 2000, 2001, , 2002). Biochemical, nuclear and cpDNA markers have provided valuable insight into relationships, and on the origin of the crop species (Hilu ; Hilu 1988, 1995; Hilu and Johnson 1992, 1997; Werth , 1994; Salimath ; Neves ; Dida , 2008; Liu ). The 2n number and the genomic formula proposed on the basis of earlier studies are given in Table 1.
Table 1.
Plant materials used in the present study. aDr. Mathews M. Dida, Kenya; USDA, United States Department of Agriculture, USA; ILRI, International Livestock Research Institute, Ethopia; NBPGR, National Bureau of Plant Genetic Resources, India. bBisht and Mukai (2002).
Species
Accession number
Sourcea
2n
Genomeb formula
Growth habit
E. coracana
PI 482594PI 462778PI 462779
USDAUSDAUSDA
36
AABB
Annual
E. africana
EC 541535EC 541536PI 226270PI 315700
Dida, KenyaDida, KenyaUSDAUSDA
36
AABB
Annual
E. tristachya
PI 477078PI 331791
USDAUSDA
18
AA
Annual
E. indica
PI 442480
USDA
18
AA
Annual
E. floccifolia
PI 196853
USDA
18
BB
Perennial
E. multiflora
PI 226067
USDA
16
CC
Annual
E. jaegeri
PI 273888
USDA
20
DD
Perennial
E. kigeziensis
11121079
ILRIILRI
38
AADD
Perennial
E. intermedia
S. No 116
ILRI
18
AB
Perennial
D. aegyptium
IC-285214
NBPGR
Plant materials used in the present study. aDr. Mathews M. Dida, Kenya; USDA, United States Department of Agriculture, USA; ILRI, International Livestock Research Institute, Ethopia; NBPGR, National Bureau of Plant Genetic Resources, India. bBisht and Mukai (2002).In spite of the enormous amount of information, there still exists considerable disagreement over the identification of the diploid ancestors of the three polyploid species and the level of speciation and evolutionary relationships among the nine species of Eleusine (Phillips 1972, 1995; Hilu and Johnson 1997; Lye 1999; Bisht and Mukai 2000, 2001, 2002; Devarumath ; Neves ; Liu ). Eleusine coracana (2n = 4x = 36) (AABB) is considered to be an allotetraploid and has been domesticated from its wild progenitor, Eleusine africana (2n = 4x = 36) (AABB) (Chennaveeraiah and Hiremath 1974; Hilu and deWet 1976; Hilu ; Hilu 1988, 1995; Hiremath and Salimath 1992; Werth ; Hilu and Johnson 1997; Bisht and Mukai 2000, 2001, ; Devarumath ; Neves ; Dida , 2008). The genetic maps generated by Dida showed that E. coracana and E. africana are allotetraploids. There is also strong evidence to suggest that Eleusine indica (2n = 2x = 18) (AA) is the maternal genome donor of E. coracana and E. africana (Hilu 1988; Hiremath and Salimath 1992; Hilu and Johnson 1997; Bisht and Mukai 2000, 2001; Neves ). Bisht and Mukai (2000, 2001, 2002) indicated that Eleusine floccifolia (2n = 2x = 18) could be the BB donor species of E. coracana. This has, however, been refuted by others (Hiremath and Salimath 1992; Neves ; Devarumath ; Liu ). According to these authors, the BB genome donor species remains unidentified and may possibly be extinct. Eleusine kigeziensis (2n = 4x = 36 or 38) (AADD) is the third tetraploid species of the genus. Eleusine indica (2n = 2x = 18) (AA) and E. jaegeri (2n = 2x = 20) (DD) are proposed to be the wild progenitors of E. kigeziensis (Bisht and Mukai 2002; Devarumath ). On the contrary, Neves proposed E. kigeziensis to be autotetraploid, with E. indica being closely related to E. kigeziensis but not the direct genome donor to E. kigeziensis. Liu have concluded that all three tetraploids (E. coracana, E. africana and E. kigeziensis) are of allotetraploid origin. They suggested independent origins of E. kigeziensis and E. africana—E. coracana. They are of the view that both events may have involved the diploids E. indica and E. tristachya as maternal parents, but the paternal parents remain unidentified. Eleusine indica and E. tristachya are considered to be very similar and the degree of relationship between the two remains unresolved (Hiremath and Chennaveeraiah 1982; Hiremath and Salimath 1991; Hilu and Johnson 1992; Werth ; Bisht and Mukai 2001, ; Neves ; Liu ).In the present study, the cpDNA restriction site pattern variation of seven amplified chloroplast genes/intergenic spacers, the chloroplast trnK gene sequence and cp microsatellite polymorphism were investigated for the first time in all nine Eleusine species, with the objective of constructing the phylogeny of the genus Eleusine and identifying the maternal genome donors of the polyploid species including E. coracana. Both direct sequencing of the trnK gene and polymerase chain reaction–restriction fragment length polymorphism (PCR–RFLP) of cpDNA regions and chloroplast simple sequence repeats (cpSSRs) are considered to be very good markers for detecting cpDNA variation. The chloroplast trnK gene (which also contains the matK gene within the trnK gene intron) sequence has been effectively used for the construction of grass phylogenies (Hilu and Alice 1999; Hilu ). The matK gene sequence is one of the seven loci widely utilized for the DNA barcoding of plants (Babbar ). Specific chloroplast genes and/or intergenic spacers can be amplified (Taberlet ; Demesure ; Tsumura , 1996; Dhingra and Folta 2005; Heinze 2005). The amplicons can be directly sequenced or restriction endonuclease digested (PCR–RFLP or cleaved amplified polymorphic sequence), and the occurrence of microsatellites (cpSSRs) in the chloroplast genome has been widely utilized for species identification, reconstruction of phylogenetic relationships, taxonomic studies and the identification of maternal parents in polyploids (Tsumura , 1996; Weising and Gardner 1999; Ishii and McCouch 2000; Lakshmi ; Parani , 2001; Komatsu ; Provan ; Kishimoto ; Nwakanma ; Zhu ; Asadi Abkenar , 2008; Van Droogenbroeck ; Ibrahim ; Angioi ; Sehgal ; Jena ; Liu ; Poczai ).
Methods
Plant materials
Seeds of the Eleusine species and one outgroup species (Dactyloctenium aegyptium) utilized in the present study were obtained from the United States Department of Agriculture (USDA) (Beltsville, MD, USA), the International Livestock Research Institute (ILRI) (Addis Ababa, Ethiopia), Dr Mathews M. Dida (Maseno University, Maseno, Kenya) and the National Bureau of Plant Genetic Resources (NBPGR) (New Delhi, India). The accession numbers and source of the seed material are given in Table 1. The seeds were grown under controlled conditions in the experimental field of the Department of Botany, University of Delhi.
DNA extraction
The total genomic DNA of the Eleusine species and D. aegyptium was isolated from fresh young leaves using the cetyl trimethyl ammonium bromide method as described by Murray and Thompson (1980) with some modifications. Instead of a CsCl–ethidium bromide ultracentrifugation step, the DNA was purified by phenol–chloroform extraction.
PCR amplification of chloroplast genes/intergenic spacers, their digestion and data analysis
Seven chloroplast genes and intergenic spacers (trnS–psbC, psaA, 16S, trnK, psbD, trnL–trnF and trnH–trnK) were amplified using previously published universal forward and reverse primers (Table 2). Amplification was carried out in 100 μL reaction mixtures containing 80 ng of template DNA, 0.1 mM dNTPs (Amersham Biosciences, UK), 2 mM MgCl2 (Bangalore Genei, India), 1.3 μM each forward and reverse primer (Bangalore Genei), 2.5 U of Taq DNA polymerase (Bangalore Genei) and 10 μL of 10× assay buffer [100 mM Tris pH 9.0, 500 mM KCl, 0.1 % gelatin (Bangalore Genei)]. DNA amplification was performed in a My cycler (Bio-Rad, USA) programmed to 36 cycles each of 1 min (5 min for the first cycle) at 94 °C for template DNA denaturation, 1 min at the annealing temperature (63 °C for psaA, 55 °C for trnS–psbC, trnK and psbD, 60 °C for 16S, and 50 °C for trnL–trnF and trnH–trnK), and 2 min at 72 °C for primer extension, followed by a final extension cycle of 15 min at 72 °C.
Table 2.
List of chloroplast genes/intergenic spacers amplified in the present study. aSizes in source.
List of chloroplast genes/intergenic spacers amplified in the present study. aSizes in source.Seven genes and intergenic spacers amplified from the chloroplast genome were separately restricted with 31 four-, five- and six-base cutter restriction endonucleases (AluI, AvaI, AccI, AfaI, BamHI, BalI, BglI, BglII, ClaI, DraI, EcoRV, EcoRI, HaeIII, HinfI, HindIII, HincII, MspI, KpnI, MboI, MluI, PstI, PvuII, SacI, SalI, SpeI, SphI, SmaI, SspI, TaqI, XhoI and XbaI). The 12.20 μL reaction mix contained 10 μL of the amplified gene product, 1.2 μL of enzyme buffer and 10 U of the restriction enzyme. After gentle mixing the mixture was incubated overnight at 37 °C (except for TaqI, which was incubated at 60 °C). The digestion was terminated by adding 1.5 μL of 10× loading buffer. The digested products were fractionated on 1.5 % agarose gels containing ethidium bromide (0.05 μg mL−1) in 0.5× TBE buffer. A DNA ladder mix was loaded alongside the digested products to serve as size markers. After agarose gel electrophoresis, the gel was photographed in ultraviolet light. Reproducibility of the patterns was tested by repeating all the reactions at least twice.For PCR–RFLP analysis, the presence (1) or absence (0) of a restriction fragment was recorded. Total and mean character differences between pairs of species were calculated using PAUP* 4.0 (Swofford 2002). Nei and Li's coefficient of genetic distance (Nei and Li 1979) was calculated between each pair of species after the optimality criterion was set to DISTANCE. Cluster analysis was carried out using the unweighted pair-group method using arithmetic averages (UPGMA) (Sneath and Sokal 1973) and neighbour-joining (NJ) (Saitou and Nei 1987) methods. Bootstrap values were calculated from 100 replicates using the BOOTSTRAP command in PAUP.
PCR amplification, cloning and sequencing of the trnK gene, and data analysis
The trnK gene was amplified from the nine Eleusine species and one outgroup as described above. The amplification products were separated on a 1 % agarose gel, excised from the gel and purified using a QIA quick gel extraction kit (Qiagen, Germany). The purified amplification products were cloned and sequenced with an ABI PRISM 377 automated DNA sequencer (Applied Biosystems, Foster City, CA, USA). BLAST similarity searches were performed using the National Centre for Biotechnology Information (NCBI) BLASTN algorithm to confirm the identity of the trnK sequences. The nucleotide sequences have been submitted to GenBank with accession numbers KF357736–KF357745. The nucleotide sequence of the partial trnK gene from Acrachne racemosa available in GenBank (accession number JN681616.1) was also utilized for phylogenetic analysis. The consensus sequence for the trnK gene from the nine Eleusine taxa and one outgroup (D. aegyptium) was generated using the software CLC Sequence viewer 6.1.The sequences were aligned using Clustal-X, version 1.8 (Thompson ). Phylogenetic analyses were carried out with the software MEGA 5 (Tamura ). Pairwise sequence divergence rates between species were calculated using the maximum composite likelihood method. Phylogeny construction was carried out using NJ, minimum evolution (ME), maximum likelihood (ML) and maximum parsimony (MP) methods. Neighbour-joining and ME trees were obtained using the maximum composite likelihood criterion while ML and MP trees were constructed using the nearest-neighbour-interchange (NNI) and tree-bisection-reconnection algorithms, respectively. In all the analyses, all positions containing gaps and missing data were eliminated from the dataset (complete deletion option). Support values of the internal branches of NJ, ME, ML and MP trees were evaluated by bootstrap (500 replicates).
PCR amplification of chloroplast microsatellites, polyacrylamide gel electrophoresis and sequencing
A total of eight primer pairs (Table 3) were used for amplification of the chloroplast microsatellites. Amplification was carried out in 25 mL reaction mixtures containing 2.5 mL of 10× reaction buffer, 1.5 mM MgCl2, 200 mM dNTPs, 200 nM each primer, 0.5 U of Taq DNA polymerase (Bangalore Genei) and 25 ng of template DNA. DNA amplification was performed in a MyCycler™ (Bio-Rad) programmed to initial denaturation at 94 °C for 5 min followed by 35 cycles each of 1 min at 94 °C, 1 min at 55 °C (65 °C for ccmp2 and 53 °C for ccmp9) and 1 min at 72 °C, followed by a final extension cycle of 5 min at 72 °C.
Table 3.
List of chloroplast microsatellites amplified in the present study.
Oryza sativa cv Nipponbare (Ishii and McCouch 2000)
List of chloroplast microsatellites amplified in the present study.An equal volume (10 mL) of formamide dye (98 % formamide, 10 mM EDTA, 0.026 g of bromophenol blue, 0.026 g of xylene cyanol) was added to each amplified product. The samples were heated for 5 min at 94 °C and immediately placed on ice. A total of 2.5 mL of each sample was loaded on a 6 % polyacrylamide gel (19 : 1 acrylamide : bisacrylamide, 7.5 M urea and 1× TBE buffer), and electrophoresis was conducted at 55 W and 55 °C for ∼2 h.For silver staining, the gel was fixed in 10 % (v:v) acetic acid for 30 min. It was subsequently rinsed three times in de-ionizedwater (2 min per rinse). The gel was then kept for staining for 30 min in a 2 L solution containing 2 g of silver nitrate and 3 mL of 37 % formaldehyde (Promega, USA). The stained plate was rinsed with de-ionizedwater for 20 s and developed in a prechilled (10 °C) developer (2 L) solution containing 60 g of sodium carbonate, 3 mL of 37 % formaldehyde and 400 µL of sodium thiosulfate (10 mg mL−1). When bands became visible, the gel was immediately transferred to 10 % acetic acid solution to stop further reaction. The gel was finally rinsed with distilled water and air dried.The PCR products of eight primer–template combinations were characterized by direct sequencing. A PCR was performed in 100 µL volumes as described above. An aliquot of the PCR product was checked by agarose gel electrophoresis, and the remainder was purified through a QIA quick PCR clean up kit (Qiagen). Nucleotide sequencing was performed using an ABI PRISM 377 automated DNA sequencer (Applied Biosystems). The nucleotide sequences have been submitted to GenBank with accession numbers KF357730–KF357735.
Results
Chloroplast PCR–RFLP
Robust amplification products were obtained for all seven genes/intergenic spacers (trnS–psbC, psaA, 16S, trnK, psbD, trnL–trnF and trnH–trnK) from the nine Eleusine species and an outgroup species, D. aegyptium. For each gene/intergenic spacer, the amplified products appeared to be monomorphic on a 1.5 % agarose gel across the species. The approximate size of the PCR product for trnS–psbC, psaA, 16S, trnK, psbD, trnL–trnF and trnH–trnK was 1620, 2380, 1480, 2510, 1090, 1090 and 2270 bp, respectively. Aliquots of PCR products were digested separately with 31 four-, five- and six-cutter restriction endonucleases. Of the 217 amplification product–enzyme combinations, 137 did not show internal restriction sites in any of the 10 species investigated. In the remaining 80 amplification product–enzyme combinations, 1–5 restriction sites within the amplification product were obtained. Of these, 57 combinations revealed no polymorphism among the 10 species. The remaining 23 (trnK/AfaI, trnK/MspI, trnK/DraI, trnK/SpeI, trnK/SspI, trnK/KpnI, trnK/TaqI, trnK/BamHI, trnL–trnF/AluI, trnL–trnF/BglII, trnL–trnF/DraI, trnL–trnF/MboI, psaA/AluI, psaA/TaqI, psaA/MspI, psbD/HaeIII, psbD/MboI, psbD/TaqI, trnS–psbC/AfaI, trnS–psbC/AvaI, trnS–psbC/HaeIII, 16S/MboI, trnH–trnK/AluI) combinations were phylogenetically very informative (Table 4; Fig. 1). A total of 282 bands were scored for cluster analyses. No variation was observed across different accessions of the same taxon. The polymorphism in all the profiles was the result of site mutations.
Table 4.
Grouping of Eleusine and outgroup species based on restriction fragment patterns of seven cpDNA amplified genes/intergenic spacers.
Species
trnS–psbC
16S
psbD
psaA
trnK
trnH–trnK
trnL–trnF
AfaI
AvaI
HaeIII
MboI
HaeIII
MboI
TaqI
AluI
MspI
TaqI
AfaI
DraI
MspI
SpeI
SspI
KpnI
TaqI
BamHI
AluI
AluI
BglII
DraI
MboI
E. coracana
A1
B1
C1
D1
E1
F1
G1
H1
I1
J1
K1
L1
M1
N1
O1
P1
Q1
R1
S1
T1
U1
V1
X1
E. africana
A1
B1
C1
D1
E1
F1
G1
H1
I1
J1
K1
L1
M1
N1
O1
P1
Q1
R1
S1
T1
U1
V1
X1
E. tristachya
A1
B1
C1
D1
E1
F1
G1
H1
I1
J1
K1
L1
M1
N1
O1
P1
Q1
R1
S1
T1
U1
V1
X1
E. indica
A1
B1
C1
D1
E1
F1
G1
H1
I1
J1
K1
L1
M1
N1
O1
P1
Q1
R1
S1
T1
U1
V1
X1
E. floccifolia
A2
B1
C1
D2
E1
F2
G1
H2
I1
J1
K2
L1
M1
N1
O2
P1
Q1
R1
S1
T1
U1
V1
X1
E. intermedia
A2
B1
C1
D2
E1
F2
G1
H2
I1
J1
K2
L1
M1
N1
O2
P1
Q1
R1
S1
T1
U1
V1
X1
E. kigeziensis
A1
B1
C1
D1
E1
F1
G1
H1
I1
J1
K1
L1
M1
N1
O1
P1
Q1
R1
S1
T1
U1
V1
X1
E. jaegeri
A2
B1
C1
D2
E1
F2
G1
H2
I2
J1
K2
L1
M1
N1
O2
P1
Q1
R1
S1
T1
U1
V1
X1
E. multiflora
A2
B1
C1
D1
E2
F2
G1
H1
I1
J2
K1
L2
M2
N2
O2
P1
Q1
R1
S1
T1
U1
V1
X1
D. aegyptium
A2
B2
C2
D1
E3
F2
G2
H1
I1
J1
K2
L1
M3
N1
O1
P2
Q2
R2
S2
T2
U2
V2
X2
Combinations
I
VIII
VIII
III
IV
I
VIII
III
V
VI
VII
VI
IV
VI
II
VIII
VIII
VIII
VIII
VIII
VIII
VIII
VIII
Figure 1.
Restriction fragment size patterns of the amplified trnK gene with AfaI (A), the amplified trnS–psbC intergenic spacer with AfaI (B), the amplified trnL–trnF gene with MboI (C) and the amplified 16S gene with MboI (D) in E. coracana (lane 1), E. africana (2), E. tristachya (3), E. indica (4), E. floccifolia (5), E. intermedia (6), E. kigeziensis (7), E. multiflora (8), E. jaegeri (9) and D. aegyptium (10). Marker DNA (M). The size of the fragments in base pairs is indicated on the left.
Grouping of Eleusine and outgroup species based on restriction fragment patterns of seven cpDNA amplified genes/intergenic spacers.Restriction fragment size patterns of the amplified trnK gene with AfaI (A), the amplified trnS–psbC intergenic spacer with AfaI (B), the amplified trnL–trnF gene with MboI (C) and the amplified 16S gene with MboI (D) in E. coracana (lane 1), E. africana (2), E. tristachya (3), E. indica (4), E. floccifolia (5), E. intermedia (6), E. kigeziensis (7), E. multiflora (8), E. jaegeri (9) and D. aegyptium (10). Marker DNA (M). The size of the fragments in base pairs is indicated on the left.Genetic distance (Nei and Li's) among the nine Eleusine species and one outgroup ranged from 0.0000 to 0.01857 (Table 5). Based on the restriction site data of the amplified gene/intergenic products, UPGMA and NJ dendrograms were generated. Both showed similar topologies. They resolved into two major clades. Clade I, supported by 100 % bootstrap support, contained E. kigeziensis, E. indica, E. tristachya, E. coracana and E. africana. Clade II was comprised of E. jaegeri, E. floccifolia, E. intermedia and E. multiflora. Eleusine floccifolia and E. intermedia were more closely related to each other (supported by a bootstrap value of 100 %) than either of the species was to E. jaegeri. Dactyloctenium aegyptium was the most diverged species among the analysed species (Fig. 2).
Table 5.
Genetic distance (Nei and Li's coefficient) matrix generated from cpDNA PCR–RFLP data of nine Eleusine species and an outgroup, D. aegyptium.
1
2
3
4
5
6
7
8
9
10
1
E. coracana
2
E. africana
0.00000
3
E. tristachya
0.00000
0.00000
4
E. indica
0.00000
0.00000
0.00000
5
E. floccifolia
0.00479
0.00479
0.00479
0.00479
6
E. intermedia
0.00479
0.00479
0.00479
0.00479
0.00000
7
E. kigeziensis
0.00000
0.00000
0.00000
0.00000
0.00479
0.00479
8
E. multiflora
0.00650
0.00650
0.00650
0.00650
0.00790
0.00790
0.00650
9
E. jaegeri
0.00655
0.00655
0.00655
0.00655
0.00170
0.00170
0.00655
0.00827
10
D. aegyptium
0.01443
0.01443
0.01443
0.01443
0.01815
0.01815
0.01443
0.01692
0.01857
Figure 2.
Unweighted pair-group method using arithmetic averages dendrogram based on the restriction fragment data of amplified chloroplast gene/intergenic spacers. Numbers at the nodes represent bootstrap probability values out of 100 replicates.
Genetic distance (Nei and Li's coefficient) matrix generated from cpDNA PCR–RFLP data of nine Eleusine species and an outgroup, D. aegyptium.Unweighted pair-group method using arithmetic averages dendrogram based on the restriction fragment data of amplified chloroplast gene/intergenic spacers. Numbers at the nodes represent bootstrap probability values out of 100 replicates.
trnK sequence data
The trnK gene codes for tRNA-LysUUU. In rice, the chloroplast trnK gene is 2576 bp long, of which the exon is only 72 bp long. The exon is divided into two parts by a long intron of ∼2500 bp. The 5′ exon consists of 37 bp and the 3′ exon consists of 35 bp. The matK gene, which codes for maturaseK, is 1536 bp in length and is located within the chloroplast trnK intron. In the present study, the trnK gene was sequenced from the nine Eleusine species and D. aegyptium. The aligned trnK gene sequence of Eleusine corresponded to position 45 at the 5′ end to position 2531 at the 3′ end of rice. The length of the trnK gene in the nine Eleusine species ranged between 2463 (E. jaegeri) and 2467 bp (E. tristachya and E. floccifolia). The length of trnK in A. racemosa and D. aegyptium was 2466 and 2472 bp, respectively [see]. The GC content of the trnK gene sequence varied from 32.3 to 33 % with an average of 32.6 %. The nucleotide frequencies were 0.162 for G, 0.164 for C, 0.324 for A and 0.350 for T. The dataset including alignment gaps and missing data comprised 2486 nucleotide positions, out of which 2254 were conserved, 214 were variable and 57 were parsimony informative sites.The trnK sequence data were correlated with the polymorphic PCR–RFLP profiles of the trnK gene. Eight amplicon–enzyme combinations for the trnK gene showed polymorphic profiles which revealed that polymorphism in all the profiles was the result of site mutations. The entire trnK gene sequence from the taxa analysed showed 214 single-nucleotide polymorphisms (SNPs). A total of 10 SNPs in the restriction sites were responsible for the eight polymorphic profiles. The polymorphic profiles produced were due to gain or loss of the restriction sites caused by SNPs. The consensus trnK gene sequence was generated from the taxa analysed and the locations of the polymorphic sites were marked on it (Fig. 3).
Figure 3.
Consensus sequence of the trnK gene for nine Eleusine species and one outgroup (D. aegyptium) showing major restriction sites.
Consensus sequence of the trnK gene for nine Eleusine species and one outgroup (D. aegyptium) showing major restriction sites.Pairwise sequence divergence ranged from 0.003 to 0.058 (Table 6). Dendrograms were generated using four different methods (NJ, ML, ME and MP). The phylogeny reconstruction through the NJ method resulted in an optimal tree with a sum of branch length of 0.095. In the NJ bootstrap consensus tree, all the Eleusine species were grouped into two distinct clades with 100 % bootstrap support. Clade I consisted of E. coracana, E. africana, E. indica, E. kigeziensis and E. tristachya. Eleusine coracana and E. africana grouped together in one subclade with a bootstrap support of 95 %, and E. indica and E. kigeziensis grouped together in another subclade with a bootstrap support of 54 % within Clade I. Eleusine tristachya did not group within any of the two subclades and thus was the most diverged among the five species of Clade I with a bootstrap support of 98 %. Clade II consisted of E. intermedia, E. floccifolia, E. jaegeri and E. multiflora. Eleusine intermedia and E. floccifolia were more closely related to each other than to E. jaegeri with a bootstrap support of 54 %. Eleusine multiflora represented the most diverged species within Clade II. Acrachne racemosa and D. aegyptium were the most diverged species among all the species (Fig. 4A).
Table 6.
Genetic divergence (maximum composite likelihood method) from trnK sequence data for nine Eleusine species and two outgroups, A. racemosa and D. aegyptium.
1
2
3
4
5
6
7
8
9
10
11
E. floccifolia
E. intermedia
0.008
E. jaegeri
0.026
0.025
E. multiflora
0.010
0.009
0.027
E. coracana
0.012
0.012
0.026
0.012
E. africana
0.013
0.013
0.028
0.013
0.004
E. kigeziensis
0.014
0.014
0.030
0.014
0.007
0.008
E. indica
0.011
0.010
0.026
0.010
0.003
0.004
0.005
E. tristachya
0.011
0.011
0.026
0.011
0.005
0.006
0.007
0.003
A. racemosa
0.029
0.028
0.046
0.029
0.030
0.030
0.030
0.027
0.027
D. aegyptium
0.039
0.038
0.058
0.041
0.040
0.041
0.043
0.039
0.040
0.030
Figure 4.
Dendrograms based on the sequence data of the trnK gene of Eleusine and outgroup taxa. (A) Neighbour-joining bootstrap consensus tree, (B) ML bootstrap consensus tree and (C) MP consensus tree. Numbers at the nodes represent bootstrap probability values out of 500 replicates.
Genetic divergence (maximum composite likelihood method) from trnK sequence data for nine Eleusine species and two outgroups, A. racemosa and D. aegyptium.Dendrograms based on the sequence data of the trnK gene of Eleusine and outgroup taxa. (A) Neighbour-joining bootstrap consensus tree, (B) ML bootstrap consensus tree and (C) MP consensus tree. Numbers at the nodes represent bootstrap probability values out of 500 replicates.The robustness of the present data lies in the fact that the dendrograms based on ME, ML and MP analysis showed almost the same topology as that of the NJ tree (Fig. 4). Maximum parsimony analysis resulted in most parsimonious trees (Fig. 4C) with a tree length of 232. The bootstrap consensus tree had a consistency index of 0.943, a retention index of 0.859 and a rescaled consistency index of 0.811.
Chloroplast microsatellite polymorphism and sequence data
Eight consensus primer pairs were used to amplify cpDNA microsatellites from the nine species (Table 3). The primer pairs were able to amplify a product in all 10 species (Fig. 5). A major band was produced for each primer–template combination. The typical stuttering phenomenon which usually occurs upon amplification of mononucleotide- and dinucleotide-type microsatellites was observed. Size variation for the amplified products was only observed for two cpSSR loci. Two size variants were detected for ccmp2 and ccmp5. For ccmp2, all but one (E. tristachya) species showed a 197-bp allele. In E. tristachya, it was 200 bp. In the case of ccmp5, E. jaegeri showed a band of 146 bp, while each of the remaining nine species had a 145-bp allele (Fig. 5).
Figure 5.
Silver-stained polyacrylamide gel electrophoresis patterns generated by ccmp5 (A) and ccmp6 (B) in E. coracana (1), E. africana (2), E. tristachya (3), E. indica (4), E. floccifolia (5), E. intermedia (6), E. kigeziensis (7), E. multiflora (8), E. jaegeri (9) and D. aegyptium (10). The size of the fragments in base pairs is indicated on the left.
Silver-stained polyacrylamide gel electrophoresis patterns generated by ccmp5 (A) and ccmp6 (B) in E. coracana (1), E. africana (2), E. tristachya (3), E. indica (4), E. floccifolia (5), E. intermedia (6), E. kigeziensis (7), E. multiflora (8), E. jaegeri (9) and D. aegyptium (10). The size of the fragments in base pairs is indicated on the left.To test the presence of a repeat motif in Eleusine, nucleotide sequencing was carried out for amplified products from E. coracana with all eight cpSSR primer sets. RCt5 and RCt8 could not be successfully sequenced. ccmp2, ccmp5, ccmp6, RCt3, RCt4 and RCt7 showed sequence sizes of 197, 145, 96, 113, 131 and 126 bp, respectively (Tables 3 and 7). Repeat motifs were identified in all six amplification products at the expected positions but the degree of conservation was variable (Table 7). Among the repeat motifs, RCt4 and RCt7 were highly conserved, ccmp5 and ccmp6 were moderately conserved, and ccmp2 and RCt3 were least conserved. In the case of ccmp2 and RCt3, the number of mononucleotide repeats was reduced.
Table 7.
Nucleotide sequences of cpDNA of E. coracana amplified using six cpSSR primer pairs. The location of repeats is underlined.
Nucleotide sequences of cpDNA of E. coracana amplified using six cpSSR primer pairs. The location of repeats is underlined.
Discussion
The restriction site variation identified in the amplified chloroplast gene/intergenic spacers has provided new insights into the origin and evolution of the three polyploid species, and the genetic relationships between the cultivated and wild Eleusine species. Species-specific markers were also identified. None of the amplicon–enzyme combinations could discriminate between E. coracana (AABB, 2n = 4x = 36), E. africana (AABB, 2n = 4x = 36), E. tristachya (AA, 2n = 2x = 18), E. indica (AA, 2n = 2x = 18) and E. kigeziensis (AADD, 2n = 4x = 38) on the one hand, and between E. floccifolia (BB, 2n = 2x = 18) and E. intermedia (AB, 2n = 2x = 18) on the other hand. Although the maximum number of polymorphic and phylogenetically informative markers was obtained in eight trnK amplicon–enzyme combinations, the amount of variation that can be tapped with the PCR–RFLP method was lower than the variation tapped with sequencing of the trnK gene. A total of 214 SNPs were found in the trnK gene sequence, of which only 10 were causative to 8 polymorphic profiles. This clearly demonstrates that sequencing of the trnK gene is more informative than the PCR–RFLP method. Moreover, the nucleotide sequence of the chloroplast trnK gene fine-tuned the species relationships. The nine species of Eleusine were grouped into two clades, in both PCR–RFLP and trnK gene sequence data analyses. The trnK sequence data clearly showed that the three tetraploids were closer to E. indica than to E. tristachya, as concluded from PCR–RFLP data.The present observations based on restriction site variation and the trnK gene sequence of cpDNA are congruent with the conclusions reached by various authors based on the multiple marker nuclear DNA assay, chloroplast markers and chromosome research. The strong affinities between E. coracana, E. africana and E. indica have been highlighted in earlier studies (Hilu and Johnson 1992, 1997; Hilu 1995; Bisht and Mukai 2000, 2001, ; Neves ; Dida , 2008; Liu ) as well. A number of studies based on chromosome research (Hiremath and Chennaveeraiah 1982; Hiremath and Salimath 1991), 2C DNA content (Hiremath and Salimath 1991), crossability data (Salimath ) and ribosomal DNA polymorphism (Hilu and Johnson 1992; Werth ; Bisht and Mukai 2000), ITS sequence data (Neves ) and cpDNA (Hilu and Johnson 1997; Neves ; Liu ) also support the close affinity between E. indica and E. tristachya. The allotetraploid E. kigeziensis exhibits the same maternal lineage as that of the two other allotetraploid species, E. coracana and E. africana. The five species share some common morphological features, such as 1–3 nerved lower glumes with a winged keel, 3–7 nerved upper glumes with a more or less winged keel, the presence of 1–3 subsidiary nerves adjacent to the central nerve of the lemma and the generally winged keel of the palea (Phillips 1972).The present data unambiguously support the view of Chennaveeraiah and Hiremath (1974), Hilu (1988), Hiremath and Salimath (1992), Hilu and Johnson (1997), Bisht and Mukai (2000, 2001, ) and Neves that E. africana is the wild progenitor of the cultivated species, E. coracana, and that E. indica is one of the diploid progenitors of the two polyploid species (Liu ). The present results further indicate that E. indica with the AA genome might be the maternal parent for all three tetraploid species, viz. E. coracana, E. africana and E. kigeziensis. Morphologically, E. kigeziensis resembles E. indica. It is considered to be a hybrid between E. indica and one of the perennial species (Phillips 1972). Salimath (1990) and Bisht and Mukai (2002) have also proposed E. indica as one of the genome donors of E. kigeziensis.The close affinity in the cpDNA PCR–RFLP profiles and trnK sequences between E. floccifolia and E. intermedia is also supported by the occurrence of the same chromosome number (2n = 18), 2C DNA values (Hiremath and Chennaveeraiah 1982; Hiremath and Salimath 1991) and ITS sequence data (Neves ). Close association between E. jaegeri, E. floccifolia and E. intermedia suggested in earlier studies (Hiremath and Salimath 1991; Hilu and Johnson 1992; Liu ) is further supported by our data. Eleusine jaegeri in all combinations grouped together with E. floccifolia and E. intermedia except in the psaA–MspI combination, where it shows a unique profile vis-à-vis the remaining species. In the case of trnK sequence analysis also these three species are found to be closely associated. Eleusine multiflora also behaves in the same fashion. The species exhibits unique profiles vis-à-vis the remaining species in trnK–DraI, trnK–MspI, trnK–SpeI, psaA–TaqI and psbD–HaeIII combinations, with the result that it represents the most diverged taxon among all the Eleusine species. These amplicon–enzyme combinations can therefore be used as species-specific markers for E. jaegeri and E. multiflora. The position of E. multiflora within the genus Eleusine is questionable mainly on the basis of its unusual inflorescence morphology [considered to be intermediate between Eleusine and Acrachne (Phillips 1972; Clayton and Renvoize 1986)], distinct chemical composition (Hilu ), molecular data (Hilu and Johnson 1992; Hilu 1995), chromosome number (2n = 16), 2C DNA value and other cytogenetic features (Mysore and Baird 1997; Bisht and Mukai 2000) including the present results. Dactyloctenium aegyptium in restriction site analysis, and A. racemosa and D. aegyptium in trnK sequence analysis indicate that they are the most diverged species vis-à-vis
Eleusine species analysed and thus represented as outgroups.Chloroplast microsatellites are known to have potential for phylogenetic studies (Powell , , 1996; Provan , 2001; Ishii and McCouch 2000; Angioi ). The present cpSSR data on Eleusine species, however, were not helpful as very little polymorphism was obtained for various amplified microsatellites. Only two cpSSR loci (ccmp2 and ccmp5) were found to be polymorphic, displaying a total of four alleles. In the case of ccmp2, the variant allele was found in E. tristachya, while for ccmp5 the variant allele was found in E. jaegeri. Overall, the sequences seem to be highly conserved. This is in strong contrast to the observations made in the other genera such as Glycine, Oryza and Hordeum, where different species within the genus and even within different subspecies displayed characteristic haplotypes (Powell , 1996; Provan , 1999). The reason for not obtaining polymorphism in the present study could be attributed to the fact that it is very difficult to design universal primers for microsatellites that show widespread polymorphism at the inter-specific level and at the same time are able to amplify across a broad range of plant genera. Although there are primers available that amplify across wide-ranging taxa, these rarely show widespread inter-specific polymorphism. This could be due to the contrasting requirements in a short stretch of DNA of extreme stability and sequence conservation for priming sites and consistently high levels of polymorphism in the intervening region (Provan ). Second, size homoplasy may not necessarily mean identical intervening microsatellite regions; it may also be due to variation in flanking regions. It is also possible that back mutations have taken place, which might result in identical size but not identical sequence (Bryan ).
Conclusions
Our results based on RFLP of the seven amplified chloroplast genes/intergenic spacers, and the trnK gene sequence in the nine diploid and allotetraploid Eleusine species and two outgroup species resulted in well-resolved phylogenetic trees. The maternal genome donor (E. indica, 2n = 2x = 18) of the allotetraploid (2n = 4x = 36, 2n = 2x = 38) Eleusine species, and the phylogenetic relationships between cultivated E. coracana (2n = 4x = 36) and wild species could be successfully resolved. The species-specific markers were also identified. The two diploid species E. indica and E. tristachya could not be resolved separately by PCR–RFLP of seven chloroplast genic/intergenic spacers, as not a single site change could be scored. However, the trnK gene sequence clearly demonstrated that E. indica is more closely related to all three allotetraploids as compared with E. tristachya. Therefore, E. indica is most likely the maternal parent to all three allotetraploids. Eleusine multiflora (2n = 2x = 16) was found to be the most diverged among all the species. The explicit identification of the maternal parent and that of the immediate wild progenitor of finger millet will be immensely useful for future genetic improvement and biotechnological programmes of the crop species.
Sources of Funding
This work was supported by the Council of Scientific and Industrial Research (CSIR), Government of India and the National Academy of Sciences, India (NASI).
Contributions by the Authors
R.A. was involved in planning and performing all the experiments, data analyses and manuscript writing. N.A. was involved in the data analysis. R.T. was involved in relevant research discussions. S.N.R. was involved in the planning and supervision of all the experimental work and in writing the manuscript. All authors have seen and agreed to the submitted manuscript.
Conflicts of Interest Statement
None declared.
Accession Numbers
The nucleotide sequences of the trnK gene from the nine Eleusine species and one outgroup D. aegyptium have been submitted to GenBank with accession numbers KF357736–KF357745. The nucleotide sequences of six cpSSRs isolated from E. coracana have been submitted to GenBank with accession numbers KF357730–KF357735.
Supporting Information
The following additional information is available in the online version of this article –File 1. Sequence data matrix of the aligned partial trnK gene region of cpDNA of nine Eleusine species and two outgroups, A. racemosa and D. aegyptium. Nucleotide sequences are displayed 5′–3′. Dots indicate the same nucleotide as in E. coracana; dashes indicate gaps.