Literature DB >> 28817640

Evaluation of multilocus marker efficacy for delineating mangrove species of West Coast India.

Ankush Ashok Saddhe1, Rahul Arvind Jamdade2, Kundan Kumar1.   

Abstract

The plant DNA barcoding is a complex and requires more than one marker(s) as compared to animal barcoding. Mangroves are diverse estuarine ecosystem prevalent in the tropical and subtropical zone, but anthropogenic activity turned them into the vulnerable ecosystem. There is a need to build a molecular reference library of mangrove plant species based on molecular barcode marker along with morphological characteristics. In this study, we tested the core plant barcode (rbcL and matK) and four promising complementary barcodes (ITS2, psbK-psbI, rpoC1 and atpF-atpH) in 14 mangroves species belonging to 5 families from West Coast India. Data analysis was performed based on barcode gap analysis, intra- and inter-specific genetic distance, Automated Barcode Gap Discovery (ABGD), TaxonDNA (BM, BCM), Poisson Tree Processes (PTP) and General Mixed Yule-coalescent (GMYC). matK+ITS2 marker based on GMYC method resolved 57.14% of mangroves species and TaxonDNA, ABGD, and PTP discriminated 42.85% of mangrove species. With a single locus analysis, ITS2 exhibited the higher discriminatory power (87.82%) and combinations of matK + ITS2 provided the highest discrimination success (89.74%) rate except for Avicennia genus. Further, we explored 3 additional markers (psbK-psbI, rpoC1, and atpF-atpH) for Avicennia genera (A. alba, A. officinalis and A. marina) and atpF-atpH locus was able to discriminate three species of Avicennia genera. Our analysis underscored the efficacy of matK + ITS2 markers along with atpF-atpH as the best combination for mangrove identification in West Coast India regions.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28817640      PMCID: PMC5560660          DOI: 10.1371/journal.pone.0183245

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Plant DNA barcoding is more complex than animal DNA barcoding and it often requires more than one locus approach. The mitochondrial cytochrome oxidase I (COI) gene fragment is considered as the universal animal barcode. Plant mitochondrial COI was excluded from the barcoding, due to the low substitution rates [1-3]. Later, the Consortium for the Barcode of Life (CBOL) evaluated 7 leading candidate DNA regions (matK, rbcL, trnH–psbA spacer, atpFatpH spacer, rpoB, rpoC1, and psbK–psbI spacer) [4]. The CBOL recommended two-locus combinations of rbcL and matK as the core plant barcode complemented with trnH-psbA intergenic spacer based on the parameters of recoverability, sequence quality, and levels of species discrimination, CBOL [4-6]. China Plant Barcode of Life recommended the internal transcribed spacer (ITS) as an additional candidate plant DNA barcode [7]. Comparative studies of seven markers psbA-trnH, matK, rbcL, rpoC1, ycf5, ITS2, and ITS from medicinal plant species were performed. Authors recommended that ITS2 is the best potential marker which discriminated 92.7% plants at the species level in more than 6600 plant samples [8]. The potential discriminating DNA barcode varies from one botanical family to other. The plastid marker matK can differentiate more than 90% of species in the Orchidaceae (Orchid family) but less than 49% in the Myristicaceae (nutmeg family) [9-10]. However, identification of 92 species from 32 genera using multilocus markers (coding regions (rpoB, rpoC1, rbcL, matK and 23S rDNA) and non-coding (trnH-psbA, atpFatpH, and psbK–psbI) could achieve 69%-71% with several combinations [3]. More than two loci can improve the plant identification success rate; a recent example of the flora of Canada revealed 93% success in species identification with rbcL and matK, while the addition of the trnH-psbA intergenic spacer achieved discrimination up to 95% [11]. rbcL and matK loci showed poor discrimination in species-rich genera and complex taxa of Lysimachia, Ficus, Holcoglossum, and Curcuma [12-15]. The lowest discriminatory power was observed in closely related groups of Lysimachia with rbcL (26.5–38.1%), followed by matK (55.9–60.8%) and combinations of core barcodes (rbcL + matK) had discrimination of 47.1–60.8% [15]. Beside all these markers, several plastid regions such as ycf1, atpF-H, psbK-psbI, ropC1, rpoB, and trnL-trnF were frequently evaluated as plant barcode. However, the application of DNA barcoding has been hindered owing to the difficulty in distinguishing closely related species, especially in recently diverged taxa. Mangroves are unique component of the coastal ecosystem of the world with a niche distribution in tropical and subtropical climates [16]. They are adapted to the local environment like fluctuated water level, salinity and anoxic condition through special features such as aerial breathing and extensive supporting roots, buttresses, salt-excreting leaves and viviparous propagules [17-18]. Plant mangrove species comprise 70 species belonging to about 20 families and 27 genera [19-20]. The West Coast of India is more or less steeply shelved, lack major deltas, river estuaries and dominated by sandy and rocky substratum. The West Coast also harbors one of the world’s biodiversity hotspot of Western Ghats in India. It includes the states of Gujarat, Maharashtra, Goa, Karnataka, and Kerala, which harbors 37 species (25 genera under 16 families). The most dominant mangrove species found along the West Coast of India are Rhizophora mucronata, R. apiculata, Bruguiera gymnorrhiza, B. parviflora, Sonneratia alba, S. caseolaris, Cariops tagal, Heretiera littoralis, Xylocarpus granatum, X. molluscensis, Avicennia officinalis, A. marina, Excoecaria agallocha and Lumnitzera racemosa [21]. In the previous study, we reported the efficacy of single and concatenation of rbcL and matK marker which resolved Acanthus, Excoecaria, Aegiceras, Kandelia, Ceriops and Bruguiera genus perfectly, but were unable to delimit species-rich genera such as Rhizophora, Avicennia and Sonneratia [17]. In the present work, we comprehensively evaluated the potential of ITS2, concatenated ITS2+matK, atpF-atpH, psbK-psbI and ropC1markers for 14 mangroves species. The evaluation was based on genetic distance, diagnostic nucleotide characters, Neighbour-joining (NJ) Kimura 2 Parameter (K2P) tree, TaxonDNA, Automated Barcode Gap Discovery (ABGD), Poisson tree process (PTP) and Generalized mixed Yule- Coalescent model (GMYC) analysis.

Material and methods

Ethics statement

The mangrove samples were collected from different parts of Goa, west coast region, with the permission from the Principal Chief Conservator of Forest, Goa Forest Department, Goa, India. Further, none of the species are endangered or protected species.

Mangrove plant sampling

In the present study, a total of 44 specimens of mangroves belonging to 14 species, 9 genera and 5 families were collected from Goa region, West Coast of India with geographical co-ordinates latitude of 15.5256° N and longitude of 73.8753° E. The selected genera of mangroves such as Rhizophora, Bruguiera, Avicennia, and Sonneratia each represented by at least two species and Aegiceras, Excoecaria, Ceriops, Kandelia, Acanthus genus were represented by single species. Mangrove species were identified based on morphological keys [22] and mounted on herbarium sheets, photographed and deposited at the Botanical Survey of India, Western Regional Centre, Pune, India as barcode vouchers [17]. The well-identified voucher specimens along with their taxonomic information, collection details, and GenBank accession numbers were described in Table 1. For each specimen, leaf tissue was collected in the field, labeled and stored in -80° C for further analysis.
Table 1

Details of the mangrove species.

S. No.SpecimenVoucher No.Accession No. of ITS2
A
1Avicennia officinalisAAS-100-02KU876892, KU876893
2Avicennia marinaAAS-110-12KU876889, KU876890, KU876891
3Avicennia albaAAS-120-22KU876886, KU876887, KU876888
4Acanthus ilicifoliusAAS-230-32KY250442, KY250443
5Bruguiera cylindricaAAS-130-32KU876894, KU876895, KU876896
6Bruguiera gymnorrhizaAAS-140-42KU876897, KU876898, KU876899
7Rhizophora mucronataAAS-150-52KU876910, KU876911, KY250446
8Rhizophora apiculataAAS-160-62KU876908, KU876909, KY250445
9Kandelia candelAAS-190-92KU876906, KU876907, KY250444
10Ceriops tagalAAS-200-02KU876900, KU876901, KU876902
11Excoecaria agallochaAAS-180-82KU876903, KU876904, KU876905
12Aegiceras corniculatumAAS-170-73KU876881, KU876882, KU876883,KU876884
13Sonneratia caseolarisAAS-220-22KY250450, KY250451
14Sonneratia albaAAS-210-12KY250447, KY250448, KY250449
B
S. No.SpecimenatpF-atpHpsbK-psbIrpoC1
1Avicennia officinalisKY754573, KY754574, KY754575KY754564, KY754565, KY754566KY754187, KY754188, KY754189
2Avicennia marinaKY754570, KY754571, KY754572KY754561, KY754562, KY754563KY754184, KY754185, KY754186
3Avicennia albaKY754567, KY754568, KY754569,

Details of the mangrove species with accession numbers used in the present study for ITS2, atpF-atpH, psbK-psbI and rpoC1 with voucher number and GenBank accession numbers.

Details of the mangrove species with accession numbers used in the present study for ITS2, atpF-atpH, psbK-psbI and rpoC1 with voucher number and GenBank accession numbers.

DNA extraction

Genomic DNA was isolated from mangrove species by modified cetyl-trimethyl ammonium bromide (CTAB) protocol [17]. Leaf tissue was homogenized in liquid nitrogen and CTAB buffer containing 2% PVP-30 and 1% β-mercaptoethanol was mixed. The suspension was incubated at 60°C for 60 min and centrifuged at 14000 rpm for 10 min at room temperature. It was further extracted with equal volume of chloroform: isoamyl alcohol (24:1) and precipitated with cold isopropanol (-20°C) and ammonium acetate. The precipitated DNA was washed with 70% ethanol and finally dissolved in TE buffer. Quantity and quality of the DNA samples were confirmed by agarose gel electrophoresis and Nanodrop (Thermo Scientific, USA).

PCR and sequencing

PCR amplification of ITS2, atpF-atpH, psbK-psbI and rpoC1 were carried out in the 50-μl reaction mixture containing 10-20ng of template DNA, 200 μM of dNTPs, 0.1 μM of each primer and 1 unit of Taq DNA polymerase (Thermo Scientific, USA). The reaction mixture was amplified in Bio-Rad (T100 model) thermal cycler with temperature profile for ITS2 (94°C for 4 min; 35 cycles of 94°C for 30 sec, 56°C for 30 sec, 72°C for 1 min; final extension 72°C for 10 min), atpF-atpH (94°C for 1 min; 35 cycles of 94°C for 30 sec, 50°C for 40 sec, 72°C for 40 sec; final extension 72°C for 5 min), psbK-psbI (94°C for 5 min; 35 cycles of 94°C for 30 sec, 55°C for 30 sec, 72°C for 45 sec; final extension 72°C for 10 min), rpoC1 (94°C for 5 min; 35 cycles of 94°C for 30 sec, 55°C for 30 sec, 72°C for 45 sec; final extension 72°C for 10 min). The amplified products were separated by agarose gel (1.2%) electrophoresis and stained with ethidium bromide. The primers used for amplification were listed (Supporting information S1 Table). PCR products were purified as per manufacturer’s instruction (Chromous Biotech) and further sequencing reactions were carried out using the Big Dye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems) and analyzed on ABI 3500xL Genetic Analyzer (Applied Biosystems).

Data analysis

Sequence assembly and alignment were performed in Codon Code Aligner v.3.0.1 (Codon Code Corporation) and MEGA 6.0.6 respectively [23]. All sequences were submitted to Barcode of Life Data Systems (BOLD) database under the project code IMDB with their taxonomic and sampling details (doi:10.5883/DS-IMDBNG) [24]. Nucleotide diagnostic characters of mangrove species were analyzed based on the BOLD system. Further, matK and ITS2 sequences were concatenated using DNASP v5.10 tool and analyzed in MEGA 6 [25]. NJ trees were constructed using MEGA 6.0 and Kimura 2 parameter (K2P) genetic distance model with node support based on 1000 bootstrap replicates.

TaxonDNA

TaxonDNA v1.6.2 analysis for species identification with ‘Best Match’ and ‘Best Closest Match’ method was performed [17, 26]. The threshold (T) was set at 95%. All the results above the threshold (T) were treated as ‘incorrect’. Similarly, if all matches of the query sequence were below threshold (T), the barcode assignment was considered to be the ‘correct’ identification. If the matches of the query sequences were good and corresponded to a mixture of species, then it was treated as ambiguous identification.

Automated Barcode Gap Discovery (ABGD)

The ABGD, is a web server based distance method, which can partition the sequences into potential species based on the barcode gap whenever the divergence within the same species is smaller than organisms from different species [27-29]. The ABGD analysis was performed with two relative gap width (X = 1.0, 1.5) and three distance metrics (Jukes-Cantor, K2P, and p-distance) with default parameters.

General Mixed Yule-coalescent (GMYC)

The GMYC method requires a fully resolved ultrametric tree for analysis. This Bayesian tree was built using BEAST v1.8 [30-31]. Input file (XML) for BEAST was compiled in BEAUti v1.83 with an HKY+G molecular evolutionary model for the ITS2 dataset and GTR+G for concatenated dataset of matK+ITS2. These models were derived using PartitionFinder V1.1.1. Tree prior was set to Yule Process and the length of Markov chain Monte Carlo (MCMC) chain was 40,000,000 generation and sampling was performed at every 4000 step. However, all other settings were kept as default. Convergence of the BEAST runs to the posterior distribution. The adequacy of sampling (based on the Effective Sample Size (ESS) diagnostic) was assessed with Tracer v1.4. After removing the first 20% of the samples as burn-in, all other runs were combined to generate posterior probabilities of nodes from the sampled trees using TreeAnnotator v1.7.4. Estimation of the number of species included in the tree was analyzed using GMYC with single and multiple thresholds in R by the APE and SPLITS packages [27, 30–36].

Poisson Tree Process model (PTP)

The PTP model is a tree-based method that differentiates specimen into populations and species level using coalescence theory [27-29] The RaxML tree was constructed using CIPRES portal and input data was generated for bPTP analysis. The calculations were conducted on the bPTP web server (http://species.h-its.org), with the following parameters (500,000 MCMC generations, thinning 100 and burn-in 25%).

Results

Sequence analysis

A total of 148 sequences (44 rbcL, 43 matK, 40 ITS2, 9 atpF-atpH, 6 psbK-psbI and 6 rpoC1) were acquired from 44 specimens of mangrove belonging to 14 species, 9 genera, and 5 families. The sequences (rbcL: 510bp, matK: 712bp, ITS2: 445bp, atpF-atpH: 511bp, psbK-psbI: 360bp and rpoC1: 451bp) with few insertions and deletions, without stop codon, along with the specimen collection details were submitted to the Barcode of Life Data Systems (BOLD) in form of a project ‘IMDB’ (dx.doi.org/10.5883/DS-IMDBNG). These sequences were submitted to the NCBI GenBank through BOLD systems and their accession numbers were obtained (Table 1). The scatter plot represented the number of individuals in each species against their maximum intra-specific distances, as a test for sampling bias (Fig 1). Previous evaluation of DNA barcode using rbcL and matK demonstrated 47.72% and 72.09% efficiency in resolving mangrove taxa respectively. The matK sequence region showed better efficiency than the rbcL for resolution of mangrove taxa [17]. In the present study, matK along with ITS2 and few supplementary markers (atpF-atpH, psbK-psbI and rpoC1) were used for the species identification of the cryptic mangrove taxa. Sequence analysis was performed to estimate the average GC content of the corresponding locus. The average GC content observed was 63.11%, 42.7%, 35.18%, 31.22% and 44.6% for ITS2, matK+ITS2, atpF-atpH, psbK-psbI and rpoC1 locus respectively.
Fig 1

Scatter plot.

The scatter plot represents the number of individuals in each species against their maximum intra-specific distances, as a test for sampling bias. (a) ITS2 locus (b) atpF-atpH locus (c) psbK-psbI locus and (d) rpoC1 locus.

Scatter plot.

The scatter plot represents the number of individuals in each species against their maximum intra-specific distances, as a test for sampling bias. (a) ITS2 locus (b) atpF-atpH locus (c) psbK-psbI locus and (d) rpoC1 locus.

Genetic divergence analysis

The genetic distances were calculated for individual barcode marker by K2P model on the BOLD system. The mean intraspecific distance for ITS2, atpF-atpH, psbK-psbI and rpoC1 was calculated as 1.85%, 0.11%, 1.63% and 0.37% respectively. While mean intrageneric distance for ITS2, atpF-atpH, psbK-psbI and rpoC1was calculated as 5.8%, 1.03%, 2.16% and 0.3% respectively (Table 2). Higher intraspecific distances (>2%) for ITS2 were observed in 19.51% individuals and S. alba exhibited highest intraspecific distance of 16.75%. While lower intrageneric distances (<2%) for ITS2 were observed in 50.98% individuals and A. marina showed the lowest intrageneric distance of 0%. Higher intraspecific distances for matK+ITS2 were observed in 9.30% individuals and S. alba exhibited the highest distance of 4.01%. While lower intrageneric distances were observed in almost 90.69% individuals (Table 2). In some species intraspecific distance was higher than the intrageneric distance (Fig 2A and 2B). Six species (A. alba, A. officinalis, A. marina, B. cylindrica, B. gymnorrhiza and R. mucronata) were resolved with ITS2, while in concatenation of matK+ITS2, error rates were minimized in two species (A. officinalis and A. marina). Avicennia genus in the former and current analysis has revealed low resolution. To resolve this cryptic genus, we used few supplementary markers such as atpF-atpH, psbK-psbI and rpoC1. Avicennia genus showed intraspecific distance ranging from 0%-1.0% with almost all barcode markers, with highest intraspecific distance (>2%) was observed in psbK-psbI (3.85%) (Fig 2B, Table 3). While lower intrageneric distance (<2%) was observed in nearly all barcode markers, except for psbK-psbI (4.94%).
Table 2

Distance summary.

BarcodeLevelNTaxaComparisonsMin Dist(%)Mean Dist(%)Max Dist(%)SE Dist(%)
ITS2Species40143901.8516.750.1
Genus2544505.835.140.25
Family2821335.7212.3540.260.08
matK + ITS2Species39143700.514.020.02
Genus2444301.767.840.05
Family2821333.357.3919.890.03
atpF-atpHSpecies93900.110.60.02
Genus91270.391.031.620.02
psbK-psbISpecies62601.633.850.27
Genus6190.962.164.940.14
rpoC1Species6260.220.370.670.03
Genus61900.30.670.02

Summary distribution of sequence divergence at the species, genus and family level is summarized (Distance summary result—BOLD system). N—Number of sequences.

Fig 2

Genetic distance.

Distribution of intra and inter specific K2P mean divergence (arranged in ascending order). (a) ITS2 and ITS2+matK (concatenated) are represented by grey and black colors respectively. (b) For atpF-atpH, psbK-psbI and rpoC1 markers maximum intraspecific distance and minimum interspecific distance to nearest neighbor are represented by a bar.

Table 3

Mean divergence of Avicennia genus.

atpF-atpHpsbK-psbIrpoC1
Max.IntraspecificMinInterspecificNNMax. IntraspecificMin InterspecificNNMax. IntraspecificMin InterspecificNN
A. officinalis0.390.83.850.960.670
A. marina00.390.320.960.450
A. alba0.60.39NANANANA

Distribution of intra and inter specific K2P mean divergence for atpF-atpH, psbK-psbI and rpoC1 are represented in table for Avicennia genus. NN-Nearest Neighbor, Max-Maximum, Min-Minimum.

Genetic distance.

Distribution of intra and inter specific K2P mean divergence (arranged in ascending order). (a) ITS2 and ITS2+matK (concatenated) are represented by grey and black colors respectively. (b) For atpF-atpH, psbK-psbI and rpoC1 markers maximum intraspecific distance and minimum interspecific distance to nearest neighbor are represented by a bar. Summary distribution of sequence divergence at the species, genus and family level is summarized (Distance summary result—BOLD system). N—Number of sequences. Distribution of intra and inter specific K2P mean divergence for atpF-atpH, psbK-psbI and rpoC1 are represented in table for Avicennia genus. NN-Nearest Neighbor, Max-Maximum, Min-Minimum. Diagnostic character based delineation of mangrove species was done using four barcode markers (ITS2, atpF-atpH, psbK-psbI and rpoC1) along with concatenated matK+ITS2 with minimum 3 specimens per species. Highest diagnostic characters were observed in ITS2 for Excoecaria agallocha (34) and Aegiceras corniculatum (35), whereas single diagnostic character was observed in the species of Avicennia genera followed by Bruguiera cylindrica (Table 4). In concatenated matK+ITS2, highest diagnostic characters were observed in Aegiceras corniculatum (96) and Excoecaria agallocha (60). However, all species of Avicennia genera revealed diagnostic characters, but Bruguiera gymnorrhiza failed to exhibit any diagnostic character. The supplementary marker rpoC1 failed to show any diagnostic character in Avicennia, while atpF-atpH and psbK-psbI exhibited diagnostic characters (Table 4).
Table 4

Diagnostic characters of mangrove taxa.

BarcodeGroup Name (sequences)Diagnostic CharactersDiagnostic or Partial CharactersPartial CharactersPartial orUninformative Characters
matK+ ITS2Aegiceras corniculatum (6)96301
Avicennia alba (3)8001
Avicennia marina (3)5011
Bruguiera cylindrica (3)2000
Bruguiera gymnorrhiza (3)0100
Ceriops tagal (3)5200
Excoecaria agallocha (3)60303
Kandelia candel (3)12011
Rhizophora apiculata (3)20123
Rhizophora mucronata (3)6000
ITS2Aegiceras corniculatum (4)35400
Avicennia alba (3)1010
Avicennia marina (3)1010
Avicennia officinalis (3)0000
Bruguiera cylindrica (3)1000
Bruguiera gymnorrhiza (3)0000
Ceriops tagal (3)4100
Excoecaria agallocha (3)34201
Kandelia candel (3)5011
Rhizophora apiculata (3)2001
Rhizophora mucronata (3)6100
atpF-atpHAvicennia alba (3)0000
Avicennia marina (3)4000
Avicennia officinalis (3)2000
psbK-psbIAvicennia marina (3)30540
Avicennia officinalis (3)30113
rpoC1Avicennia marina (3)0010
Avicennia officinalis (3)0000

Identification of diagnostic nucleotides for each of the 14 mangrove taxa recovered from the BOLD system. Based on their utility for mangrove taxa delineating referred as diagnostic characters, diagnostic or partial character, partial characters and partial or uninformative characters.

Identification of diagnostic nucleotides for each of the 14 mangrove taxa recovered from the BOLD system. Based on their utility for mangrove taxa delineating referred as diagnostic characters, diagnostic or partial character, partial characters and partial or uninformative characters.

Taxonomic assignment

Altogether 40 DNA barcodes from ITS2 and matK+ITS2 were used for species delineation. The Neighbour-Joining (K2P) trees constructed with bootstrap support (1000) and bootstrap values of >60 exhibited substantial resolution among the OTUs corresponding to their genera except for A. marina and A. officinalis (Supporting information S1 Fig).

Species identification based on barcoding gap

The initial partition of ITS2, K2P with X = 1.0, prior maximal distance P = 0.021 produced consistent 12 operational taxonomic units (OTUs). S. alba was split into 3 groups, while members of Rhizophora and Avicennia were merged (Fig 3; Supporting information S2 Table). Whereas, recursive partitioning with P = 0.00167, produced inconsistently18 OTUs, of which A. alba, A. officinalis, and B. cylindrica showed split, while B. gymnorrhiza was clustered perfectly (Fig 4A). In concatenated matK+ITS2, at X = 1.0 for all three metrics, OTUs ranged from 4–11 in the initial partition, but recursive partition tends to exhibit inconsistent OTUs (Fig 4B).
Fig 3

Automated partition.

The automatic partition by ABGD with three metrics (JC69, K2P and p-distance) and two X-values (X = 1, 1.5) for (a) ITS2 (initial partition 1,2 and Recursive partition 3 and 4); (b) matK (initial partition 1,2 and Recursive partition 3 and 4); (c) ITS2+matK (initial partition 1,2 and Recursive partition 3 and 4);(d) atpF-atpH and psbK-psbI (initial partition 1,2 for atpF-atpH and Initial partition 3 and 4 for psbK-psbI).

Fig 4

Bayesian phylogenetic tree.

Bayesian phylogenetic tree of (a) ITS2 and (b) matK+ITS2 gene. Vertical boxes on the right indicate the clades detected by the coalescent-based GMYC, PTP, the distance-based ABGD and TaxonDNA methods.

Automated partition.

The automatic partition by ABGD with three metrics (JC69, K2P and p-distance) and two X-values (X = 1, 1.5) for (a) ITS2 (initial partition 1,2 and Recursive partition 3 and 4); (b) matK (initial partition 1,2 and Recursive partition 3 and 4); (c) ITS2+matK (initial partition 1,2 and Recursive partition 3 and 4);(d) atpF-atpH and psbK-psbI (initial partition 1,2 for atpF-atpH and Initial partition 3 and 4 for psbK-psbI).

Bayesian phylogenetic tree.

Bayesian phylogenetic tree of (a) ITS2 and (b) matK+ITS2 gene. Vertical boxes on the right indicate the clades detected by the coalescent-based GMYC, PTP, the distance-based ABGD and TaxonDNA methods. When relative gap width was increased from X = 1.0 to X = 1.5, suddenly OTUs in ITS2 for initial partition was dropped to maximum 7, while recursive partition showed an increase in OTUs, up to 16 at P = 0.001. The initial partition for matK+ITS2, with X = 1, P = 0.0129 produced 11 OTUs. Avicennia and Bruguiera members were merged, while S. alba showed split. In recursive partition, with P = 0.001, A. alba, B. cylindrica, B. gymnorrhiza were resolved perfectly, while A. officinalis, A. marina along with R. apiculata and R. mucronata remained merged. The initial partition with an atpF-atpH barcode, JC and K2P metrics with (X = 1, 1.5) showed 3 OTUs (P = 0.0027) without any recursive partition except (X = 1.5, P = 0.00278, 1 OTU). With atpF-atpH, at X = 1.5 initial partition with P = 0.00278, 3 OTUs were produced in A. alba, A. officinalis, and A. marina. Similarly, psbK-psbI showed 4 OTUs (P = 0.001) in an initial partition for JC and K2P metrics at X = 1 and p-distance had only 2 OTUs with 1 OTU in the recursive partition. At X = 1.5, only JC and p-distance were able to partition data. JC the initial partition at P = 0.001 produced 4 OTUs, while at P = 0.0046, produced 2 OTUs. Metrics p-distance predicted 2 OTUs in an initial partition and 1 OTU in the recursive partition. Barcode locus rpoC1 at X = 1 with JC and K2P metrics showed initial partition of 2 OTUs and the recursive partition at P = 0.00278 predicted 1 OTU.

Species identification and assignment based on TaxonDNA

The single barcode marker ITS2 produced a moderate rate of correct identification using BM (87.8%) and BCM (75.6%) than the concatenated matK+ITS2 using BM (89.74%), and BCM (84.61%) (Table 5). ITS2 barcode produced 13 clusters at 3% threshold, of which 5 species (A. corniculatum, A. ilicifolius, E. agallocha, K. candel and C. tagal) were the perfect match. Whereas, Avicennia, Rhizophora and Bruguiera species were clumped into 3 clusters, while S. alba and S. caseolaris were split into 5 clusters. As compared to single barcode marker (ITS2), concatenated (matK+ITS2) markers at 3% threshold produced 11 clusters, where S. caseolaris was successfully resolved. Single barcode atpF-atpH demonstrated 100% correct identification in both BM and BCM method for Avicennia genera with 3 clusters. psbK-psbI locus identified 50% Avicennia species in BM and BCM methods, however, rpoC1 showed lowest identification rate of about 33.33% (Table 5).
Table 5

TaxonDNA analysis.

BarcodesNo. of SequencesBest Match (%)Best Closest match (%)T (%)No of ClusterMatch /Mismatch
CAIncCAIncNo match
ITS24087.82.439.7575.62.439.7512.1931410/4
ITS2 + matK3989.72.567.684.62.567.65.123116/8
atpF-atpH9100001000000.333/0
psbK-psbI6500505005000.841/1
rpoC1633.3366.66033.3366.600310/2

TaxonDNA is an alignment-based method based on sequence distance matrices. Percentage of correct/incorrect/ambiguous assignment of a taxon is compared using the molecular operating taxonomic unit (MOTU). The species-specific clustering was performed using match and mismatch criteria. T -Threshold; C–Correct; A–Ambiguous; Inc–Incorrect.

TaxonDNA is an alignment-based method based on sequence distance matrices. Percentage of correct/incorrect/ambiguous assignment of a taxon is compared using the molecular operating taxonomic unit (MOTU). The species-specific clustering was performed using match and mismatch criteria. T -Threshold; C–Correct; A–Ambiguous; Inc–Incorrect.

Species identification and assignment based on GMYC and PTP

The single threshold GMYC (sGMYC) model generated through BEAST using the ultrametric phylogenetic tree resulted in an identification of 9 Maximum Likelihood (ML) clusters for ITS2 with confidence interval (CI) of 4–9 and 14 ML entities with CI of 4–18 (Threshold time: -0.013035). Similarly, with matK+ITS2, 10 ML clusters with CI of 4–10 and 14 ML entities with CI of 4–16 (Threshold time: -0.005793) were identified. The resulting ML entities in ITS2 exhibited 5 species merged in 2 OTUs, while in matK+ITS2 only 4 species were merged with exception of A. alba. Also, splitting of two species (S. alba and S. caseolaris) formed additional OTUs (Fig 4A and 4B). The multiple threshold methods (mGMYC) gave two threshold time for ITS2 (-0.013035 and -0.005441) resulting into 9 clusters (CI:4–9) and 17 ML entities (CI:4–17). matK+ITS2 gave threshold time of -0.010859 and -0.004847, resulting into 9 clusters (CI:5–11) and 13 ML entities (CI:5–16). However, multiple thresholds overestimated the number of species, so we took a more conservative approach to consider only the results obtained from the single threshold (sGMYC) method. In GMYC, apart from other metrics, three unresolved species R. apiculata, R. mucronata and A. alba were distinctly resolved. In addition to the above methods used for taxonomic evaluation, maximum likelihood (ML) based approach was added to get an additional perspective towards the species delineation through Poissons Tree Process (PTP). The ML analysis exhibited 10 OTUs with ITS2, where Avicennia, Bruguiera, Rhizophora, Ceriops, and Kandelia genera were merged while S. alba and S. caseolaris were split (Fig 4A and 4B). With matK+ITS2, 11 OTUs were formed by merging of Avicennia, Bruguiera and Rhizophora genera and S. alba was split.

Discussion

There is no consensus regarding perfect plant DNA barcode, however few of plastid and nuclear coding (rbcL, matK, rpoB, and rpoC1) and non-coding (trnH-psbA, ITS2, psbK-psbI and atpF-atpH) marker fulfilled the required criteria [3, 9, 37]. The rbcL and matK are considered as core barcode, which can be further complemented with trnH-psbA and ITS2 as plant barcode suggested by China Plant BOL [4, 7]. We employed these markers for molecular identification of mangrove plant species. In our earlier report, we have tested potential barcode candidates rbcL and matK individual as well as concatenated rbcL+matK, which demarcated most of the species such as A. ilicifolius, E. agallocha, A. corniculatum, K. candel, C. tagal, B. cylindrica and B. gymnorrhiza. An initial analysis was performed based on traditional barcode methods (Barcode gap analysis and NJ tree with the K2P method) [17]. Individual, as well as concatenated rbcL and matK barcode demarcated almost all mangroves species except for Rhizophora, Sonneratia and Avicennia genera [17]. The Plant CBOL group (2009) reported that only 72% species were resolved using combined rbcL and matK. A similar result was observed after combining rbcL and matK from closely related species of Curcuma [13]. Moreover, Avicennia genera with three species, of which A. alba, was resolved perfectly using matK but A. officinalis and A. marina lumped together and unable to resolve at the species level. Low resolution using DNA barcode regions has been documented in many other plants such as the genus Araucaria (32%), Solidago (17%) and Quercus (0%) [38]. A high percentage of bidirectional reads were critical for a successful plant barcoding system, given the low amount of variation that separates many plant species [3-4]. The risk of misassignment can be anticipated due to sequencing error or incomplete bidirectional reads. We observed the significant quality of PCR amplification and sequencing ranged from 95% to 100% in all tested markers. However, for ITS2 barcode, we performed many amplifications and sequencing attempt for S. alba, S. caseolaris, and A. ilicifolius mangroves taxa. Sequencing of S. alba and S. caseolaris resulted in a mixed and low-quality chromatogram with unidirectional success. The possible explanation for this kind of situation can be underscored by the presence of either ITS as multiple copies or pseudogene or/and fungal ITS contamination in plant [39]. Species identification success rate using rbcL+matK is higher, whereas rbcL sequence recovery ranged from 90–100% [4, 38, 40]. Hence, CBOL group recommends rbcL primers to possess universality for land plants. As reported by CBOL, the matK region showed sequencing success of 90% [4]. The matK marker provided 88% sequencing success, with the use of 10 primer pair combinations [3]. Very few reports are available on the DNA barcoding of the mangrove taxa [17, 41]. Lower genetic distances were observed based on K2P among mangrove taxa particularly genera Rhizophora, Sonneratia, Avicennia, and Bruguiera based on rbcL, matK and ITS barcode [41]. Genetic distance ranged from 0.01 to 0.25 for rbcL gene, 0.01 to 0.89 for matK and 0.01–0.508 for ITS locus [41]. Similar results were observed in our studies, for rbcL and matK the genetic distance ranged from 0–0.68% and 0–1.32% respectively [17]. The discrimination power of proposed DNA barcode by CBOL Plant Working Group may vary in different plant group [12, 42–43]. Depending on the taxon, the use of additional markers may be needed for discrimination [4]. For single barcode ITS2, ABGD (K2P, X = 1), Taxon DNA (T = 3%) and GMYC produced consistent OTUs with corresponding results. Additionally, GMYC resolved R. apiculata, R. mucronata, and A. alba species. Overall highest taxon assignment was observed as 57.14% in GMYC and taxon resolution was up to 42.85% in ABGD, TaxonDNA, and PTP barcoding methods. However, the resolution of Chlorella-like species (microalgae) produced by GMYC, PTP, ABGD and character-based barcoding methods were variables based on several marker studies such as rbcL, ITS, and tufA [27]. Single ITS2 with PTP analysis was not able to resolve C. tagal and K. candel, which was further improved in the matK+ITS2 analysis. Analysis following the above methods, species delimitation through PTP and GMYC was utilized, due to their robustness in the absence of barcoding gap [44]. Even though they are based on different algorithms, both methods calculated the point of transition between species and population [27]. The GMYC method has a theoretically strong background and requires ultrametric gene tree that takes more time to analyse data. In contrast, the PTP is a recently developed method as an alternative to GMYC which requires non- ultrametric gene tree and consumes less time [44-45]. Both the methods revealed sort of identical results, however, the two analyses differed in resolution. In both the methods, five species (B. cylindrica, B. gymnorrhiza, A. officinalis and A. marina) in GMYC and seven species (B. cylindrica, B. gymnorrhiza, A. alba, A. officinalis, A. marina, R. apiculata and R. mucronata) in PTP were merged into single OTUs, potentially indicating low intraspecific diversity. It reflected that there are many overlooked/cryptic species present within the mangroves. When we performed ABGD with relative gap width X = 1.5 for K2P method, S. alba, and S. caseolaris species were demarcated, while rest of the mangrove species were split. At a relative gap width (X = 1) about seven species of the mangrove s were merged into single OTU and observed that the ABGD tends to lump species by increasing the number of merged OTUs [29]. Beside this, we also observed inconsistency of OTUs count during initial partition to recursive partition. Recursive partitioning recognizes more OTUs than initial ones, showing their superior capability to deal with variation in sample sizes of the species under study [29]. Moreover, TaxonDNA with a lower threshold value (0.3%) demarcated B. cylindrica and B. gymnorrhiza. The possible explanation for this might be due to lack of barcode gap resulting in merged OTUs, which can be optimized by analyzing more than 5 sequences per species, and we have used 3 sequences per species [28]. In TaxonDNA analysis, for rbcL threshold (T) was observed 0%, a similar result was recorded for rbcL in the Zingiberaceae family [46]. However, the threshold (T) for Indian Zingiberaceae family members was recorded as 0.20% for rbcL and 0% for rpoB and accD [43]. Avicennia is the most diverse mangrove genus, comprising eight species, out of which three are endemic to the Atlantic-East Pacific (AEP) region and five are endemic in the Indo-West Pacific region (IWP) [47]. A recent systematic revision of Avicennia based on morphological characters formed three groups: (1) A. marina; (2) A. officinalis and A. integra; and (3) A. rumphiana and A. alba [47]. In the current study, we have included A. marina, A. officinalis, and A. alba species, which were resolved with other barcode markers. Two plastid spacers such as psbK-psbI and atpF-atpH are recommended as potential plant DNA barcodes based on the flora of the Kruger National Park South Africa as a model system [48]. Similarly, we used three markers (atpF-atpH, psbK-psbI and rpoC1) for cryptic genera Avicennia and further evaluated with ABGD and TaxonDNA barcode methods. Both the methods consistently resolved all three Avicennia species using an atpF-atpH marker. Similarly, phylogenetic reconstruction of Avicennia genera based on trnT-trnD intergenic spacer region and the psbA gene revealed that A. marina is sister to the A. officinalis/A. integra and A. alba is genetically distinct [47].

Conclusions

In the present study, we tested core DNA barcode rbcL, matK, ITS2, atpF-atpH, psbK-psbI and rpoC1 to resolve mangroves species. Individual, as well as concatenated matK+ITS2 are helpful to demarcate mangroves at the species level. Single barcode matK is sufficient to resolve A. ilicifolius, A. corniculatum, E. agallocha, Ceriops tagal, K. candel, B. cylindrica and B. gymnorrhiza. ITS2 was able to discriminate R. apiculata and R. mucronata species based on GMYC method, while A. alba was resolved by concatenation of matK+ITS2. A cryptic genus Avicennia was delimitated based on the atpF-atpH single barcode. In the present work, the foundation work was done towards DNA barcoding of mangroves plant genera, such as Rhizophora, Avicennia, Acanthus, Kandelia, Ceriops, Bruguiera, Aegiceras and Excoecaria. Compiled mangroves barcoding result had some limitations, most of which are due to the low mangrove taxa sample coverage. Further, there is a need to explore additional mangroves taxa which will improve mangrove species identification for practical conservation.

List of primers used in the current study.

(DOCX) Click here for additional data file.

Automated Barcode Gap Discovery web server based analysis of all barcodes (matK, ITS2, matK+ITS2, atpF-atpH, psbK-psbI and rpoC1 using two relative gap width (X = 1 and 1.5) and three different matrices such as JC, K2P, and p-simple distance.

(DOCX) Click here for additional data file.

Neighbor-joining tree (Kimura 2 Parameter distance using bootstrap value of 1000 replicates) matK+ITS2 concatenated NJ (K2P).

(DOCX) Click here for additional data file.
  36 in total

1.  Biological identifications through DNA barcodes.

Authors:  Paul D N Hebert; Alina Cywinska; Shelley L Ball; Jeremy R deWaard
Journal:  Proc Biol Sci       Date:  2003-02-07       Impact factor: 5.349

Review 2.  Ribosomal ITS sequences and plant phylogenetic inference.

Authors:  I Alvarez; J F Wendel
Journal:  Mol Phylogenet Evol       Date:  2003-12       Impact factor: 4.286

3.  Evaluation of six candidate DNA barcoding loci in Ficus (Moraceae) of China.

Authors:  H-Q Li; J-Y Chen; S Wang; S-Z Xiong
Journal:  Mol Ecol Resour       Date:  2012-04-27       Impact factor: 7.090

4.  DNA barcoding of the recently evolved genus Holcoglossum (Orchidaceae: Aeridinae): a test of DNA barcode candidates.

Authors:  Xiao-Guo Xiang; Hao Hu; Wei Wang; Xiao-Hua Jin
Journal:  Mol Ecol Resour       Date:  2011-07-04       Impact factor: 7.090

5.  ABGD, Automatic Barcode Gap Discovery for primary species delimitation.

Authors:  N Puillandre; A Lambert; S Brouillet; G Achaz
Journal:  Mol Ecol       Date:  2011-08-29       Impact factor: 6.185

6.  Selecting barcoding loci for plants: evaluation of seven candidate loci with species-level sampling in three divergent groups of land plants.

Authors:  Michelle L Hollingsworth; Alex Andra Clark; Laura L Forrest; James Richardson; R Toby Pennington; David G Long; Robyn Cowan; Mark W Chase; Myriam Gaudeul; Peter M Hollingsworth
Journal:  Mol Ecol Resour       Date:  2009-01-31       Impact factor: 7.090

7.  Testing DNA barcoding in closely related groups of Lysimachia L. (Myrsinaceae).

Authors:  Cai-Yun Zhang; Feng-Ying Wang; Hai-Fei Yan; Gang Hao; Chi-Ming Hu; Xue-Jun Ge
Journal:  Mol Ecol Resour       Date:  2011-10-04       Impact factor: 7.090

8.  Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species.

Authors:  Shilin Chen; Hui Yao; Jianping Han; Chang Liu; Jingyuan Song; Linchun Shi; Yingjie Zhu; Xinye Ma; Ting Gao; Xiaohui Pang; Kun Luo; Ying Li; Xiwen Li; Xiaocheng Jia; Yulin Lin; Christine Leon
Journal:  PLoS One       Date:  2010-01-07       Impact factor: 3.240

9.  A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region.

Authors:  W John Kress; David L Erickson
Journal:  PLoS One       Date:  2007-06-06       Impact factor: 3.240

10.  Combining and Comparing Coalescent, Distance and Character-Based Approaches for Barcoding Microalgaes: A Test with Chlorella-Like Species (Chlorophyta).

Authors:  Shanmei Zou; Cong Fei; Jiameng Song; Yachao Bao; Meilin He; Changhai Wang
Journal:  PLoS One       Date:  2016-04-19       Impact factor: 3.240

View more
  5 in total

1.  A Cyclic Disulfide Diastereomer From Bioactive Fraction of Bruguiera gymnorhiza Shows Anti-Pseudomonas aeruginosa Activity.

Authors:  Nilesh Lakshman Dahibhate; Sanjeev K Shukla; Kundan Kumar
Journal:  Front Pharmacol       Date:  2022-06-02       Impact factor: 5.988

2.  Establishing community-wide DNA barcode references for conserving mangrove forests in China.

Authors:  Xiaomeng Mao; Wei Xie; Xinnian Li; Suhua Shi; Zixiao Guo
Journal:  BMC Plant Biol       Date:  2021-12-04       Impact factor: 4.215

3.  Diversity investigation by application of DNA barcoding: A case study of lepidopteran insects in Xinjiang wild fruit forests, China.

Authors:  Jinyu Zhan; Yufeng Zheng; Qing Xia; Jin Wang; Sibo Liu; Zhaofu Yang
Journal:  Ecol Evol       Date:  2022-03-07       Impact factor: 2.912

4.  Multilocus marker-based delimitation of Salicornia persica and its population discrimination assisted by supervised machine learning approach.

Authors:  Rahul Jamdade; Khawla Al-Shaer; Mariam Al-Sallani; Eman Al-Harthi; Tamer Mahmoud; Sanjay Gairola; Hatem A Shabana
Journal:  PLoS One       Date:  2022-07-27       Impact factor: 3.752

5.  Selection of reference genes for quantitative real-time PCR analysis in halophytic plant Rhizophora apiculata.

Authors:  Ankush Ashok Saddhe; Manali Ramakant Malvankar; Kundan Kumar
Journal:  PeerJ       Date:  2018-07-12       Impact factor: 2.984

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.