Literature DB >> 21687792

Species identification of marine fishes in china with DNA barcoding.

Junbin Zhang1.   

Abstract

DNA barcoding is a molecular method that uses a short standardized DNA sequence as a species identification tool. In this study, the standard 652 base-pair region of the mitochondrial cytochrome oxidase subunit I gene (COI) was sequenced in marine fish specimens captured in China. The average genetic distance was 50-fold higher between species than within species, as Kimura two parameter (K2P) genetic distances averaged 15.742% among congeners and only 0.319% for intraspecific individuals. There are no overlaps of pairwise genetic variations between conspecific and interspecific comparisons apart from the genera Pampus in which the introgressive hybridization was detected. High efficiency of species identification was demonstrated in the present study by DNA barcoding. Due to the incidence of cryptic species, an assumed threshold is suggested to expedite discovering of new species and biodiversity, especially involving biotas of few studies.

Entities:  

Year:  2011        PMID: 21687792      PMCID: PMC3108176          DOI: 10.1155/2011/978253

Source DB:  PubMed          Journal:  Evid Based Complement Alternat Med        ISSN: 1741-427X            Impact factor:   2.629


1. Introduction

Fishes are important animal protein sources for human beings, and they are frequently used in complementary and alternative medicine/traditional medicine (CAM/TM). The delimitation and recognition of fish species is not only of interest for taxonomy and systematics, but also a requirement in management of fisheries, authentication of food products, and identification of CAM/TM materials [1-3]. Due to the complexity and limitations of morphological characters used in traditional taxonomy, several PCR-based methods of genotype analysis have been developed for the identification of fish species, particularly for eggs, larvae, and commercial products. Sequence analysis of species-specific DNA fragments (often mitochondrial or ribosomal genes) and multiplex PCR of species-conserved DNA fragments are efficient for fish species identification [4-10]. However, these molecular methods are limited to particular known species and are not easily applicable to a wide range of taxa. Therefore, Hebert et al. advocated using a standard DNA sequence that is DNA barcoding to identify species and uncover biological diversity [11, 12]. For many animal taxa, sequence divergences within the 5′ region of the mitochondrial cytochrome oxidase subunit I (COI) gene were much greater between species than within them, and this in turn suggests that the approach is widely applicable across phylogenetically distant animal groups [12, 13]. To date, some published papers explicitly address that COI barcodes effectively discriminate different species for a variety of organisms [14-23]. However, several scientists express concerns that species identification based on variations of single mitochondrial gene fragment may remain incorrect or ambiguous assignments, particularly in cases of possible mitochondrial polyphyly or paraphyly [24, 25]. In the current study, we test the efficacy of DNA barcoding in marine fishes of China. The sea area of China is part of the Indo-West Pacific Ocean, which is regarded as the center of the world's marine biodiversity [26]. Highly species-rich biotas are particularly attractive to test the reliability and efficiency of DNA barcoding.

2. Material and Methods

The majority of fish specimens were captured with the drawl net at 20 localities along the coast of China (collection information available at http://www.barcodinglife.org/). A total of 329 specimens from one hundred species of fish were collected. Vouchers were deposited in the South China Sea Institute of Oceanography, Chinese Academy of Sciences, and all specimens were preserved in 70% ethanol. Tissue samples were dissected from the dorsal muscle, and genomic DNA was extracted according to the standard Barcode of Life protocol [27]. Firstly, fragments of the 5′ region of the mitochondrial COI gene were PCR-amplified using C_FishF1t1/ C_FishR1t1 primer cocktails [28]. The cocktail C_FishF1t1 contained two primers (FishF2_t1/VF2_t1), and C_FishR1t1 also contained two primers (FishR2_t1/ FR1d_t1). All PCR primers were tailed with M13 sequences to facilitate sequencing of products. The nucleotide sequences of the primers were FishF2_t1: *5′-TGTAAAACGACGGCCAGTCGACTAATCATAAAGATATCGGCAC-3′. VF2_t1: *5′-TGTAAAACGACGGCCAGTCAACCAACCACAAAGACATTGGCAC-3′. FishR2_t1:**5′-CAGGAAACAGCTATGACACTTCAGGGTGACCGAAGAATCAGAA-3′. FR1d_t1: **5′-CAGGAAACAGCTATGACACCTCAGGGTGTCCGAARAAYCARAA-3′. *The M13F primer sequence is underlined;**the M13R primer sequence is underlined. PCR reactions were carried out in 96-well plates using Mastercycler Eppendorf gradient thermal cyclers (Brinkmann Instruments, Inc.). The reaction mixture of 825 μl water, 125 μl 10× buffer, 62.5 μl MgCl2 (25 mM), 6.25 μl dNTP (10 mM), 6.25 μl each primer (0.01 mM), and 6.25 μl Taq DNA polymerase (5 U/μl) was prepared for 96 wells of each plate, in which each well contained 10.5 μl mixture and 2 μl genomic DNA. Thermocycling comprised an initial step of 2 min at 95°C and 35 cycles of 30 sec at 94°C, 40 sec at 52°C, and 1 min at 72°C, with a final extension at 72°C for 10 min. Amplicons were visualized on 2% agarose E-Gel 96-well system (Invitrogen). PCR products were amplified again with the primers M13F (5′-TGTAAAACGACGGCCAGT-3′) and M13R (5′-CAGGAAACAGCTATGAC-3′), respectively, using the BigDye Terminator v.3.1 Cycle Sequencing Kit (Applied Biosystems, Inc.). Thermocycling conditions were as follows: an initial step of 2 min at 96°C and 35 cycles of 30 sec at 96°C, 15 sec at 55°C, and 4 min at 60°C. Sequencing was performed on an ABI 3730 capillary sequencer according to manufacturer's instructions. For specimens that failed to yield sequences using the primer combinations above, a second round of PCR using the alternative C_VF1LFt1/ C_VR1LRt1 primer combination was carried out. C_VF1LFt1 consisted of four primers (VF1_t1/VF1d_t1/LepF1_t1/VFli_t1), and C_VR1LRt1 also comprised four primers (VR1_t1/VR1d_t1/LepR1_t1/VRli_t1) [28]. VF1_t1: *5′-TGTAAAACGACGGCCAGTTCTCAACCAACCACAAAGACATTGG-3′. VF1d_t1: *5′-TGTAAAACGACGGCCAGTTCTCAACCAACCACAARGAYATYGG-3′. LepF1_t1: *5′-TGTAAAACGACGGCCAGTATTCAACCAATCATAAAGATATTGG-3′. VFli_t1: *5′-TGTAAAACGACGGCCAGTTCTCAACCAACCAIAAIGAIATIGG-3′. VR1_t1: **5'-CAGGAAACAGCTATGACTAGACTTCTGGGTGGCCRAARAAYCA-3′. VR1d_t1: **5′-CAGGAAACAGCTATGACTAGACTTCTGGGTGGCCAAAGAATCA-3′. LepR1_t1: **5′-CAGGAAACAGCTATGACTAAACTTCTGGATGTCCAAAAAATCA-3′. VRli_t1: **5′-CAGGAAACAGCTATGACTAGACTTCTGGGTGICCIAAIAAICA-3′. *The M13F primer sequence is underlined;**the M13R primer sequence is underlined. The thermocycling protocol used was 1 min at 95°C and 35 cycles of 30 sec at 94°C, 40 sec at 50°C, and 1 min at 72°C, with a final extension at 72°C for 10 min. Sequecing PCR and sequencing followed above procedure. DNA sequences were aligned with SEQSCAPE v.2.5 software (Applied Biosystems, Inc.). Sequence divergences were calculated using the Kimura two parameter (K2P) distance model [29], and unrooted NJ trees based on K2P distances were created in MEGA software [30]. In the chosen taxonomic group, phylogenetic analysis was carried out in PAUP 4.010b using the maximum parsimony (MP) method, with 1,000 replications of the full heuristic search. The following categories of K2P distances were calculated: intraspecific distances (S), interspecies within the congener (G), and interspecies from different genus but within intrafamily (F). These values were plotted using the boxplot representation of R. Boxplots [31] in SPSS 11.5 software (SPSS Inc., Chicago, IL, USA). Only for families containing 2 or more genera, separate boxplot was constructed for the sake of comparisons among taxonomic categories. Boxplots describe median (central bar), interquartile range (IQR: between upper (Q3) and low (Q1) quartile), values lying within 1.5× IQR beneath Q1 or 1.5× above Q3 (“whiskers”), and extreme values (outliers). Mann-Whitney tests were performed between S, G, and F distributions to estimate the overlap among taxonomic ranks.

3. Results

A total of 329 specimens were analyzed, from which 321 sequences (all >500 bp) belonging to 121 species (another species was identified to the genus level) were ultimately obtained (GenBank accession numbers: EF607296-EF607616). These species cover the majority of fishes living in the coastline of the South China Sea. All sequences were aligned with a consensus length of 652 bp, and no insertions, deletions, or stop codons were observed in any sequence. However, multiple haplotypes were detected for some species. Except for Acentrogobius caninus, Scomber japonicus, Terapon jarbua, Upeneus sulphureus, Elops hawaiensis, Gymnothorax pseudothyrsoideus, Dendrophysa russelii, and Pennahia anea (which reached the maximum value of 2.02%), intraspecific genetic distances were generally below 1%, and some decreased to zero (between some intraspecific individuals of Thryssa setirostris, Parapercis ommatura, Scatophagus argus, etc.). The mean intraspecies K2P (Kimura two-parameter) distance was 0.319%; the distance increased sharply to 15.742% among individuals of congeneric species. Overall, the average genetic distance among congeneric species is nearly 50-fold higher than that among individuals within species. For the higher taxonomic ranks (family, order, and class), mean pairwise genetic distances increased gradually and reached 20.199%, 24.656%, and 25.225%, respectively (Table 1). Standard errors for K2P genetic distances were small, and values of the mean and median were close within different taxonomic ranks (Table 1). This indicates fluctuations of K2P genetic distances tend to be convergent (Figures 1 and 2).
Table 1

Genetic divergences (percentage, K2P distance) within various taxonomic levels. Data are based on 321 sequences (>500 bp) from 122 species.

Comparisons withinTaxaNumber of comparisonsMeanMedianMinimumMaximums.e.#
Species1214530.3190.15002.021*0.018
Genus8539715.74216.4900.154**25.1890.292
Family5584820.19919.85011.53234.3330.134
Order151788124.65612.92339.6270.024
Class22926225.22515.73040.8000.016

* Pennahia anea; ** Pampus argenteus versus Pampus cinereus.

#Standard error.

Figure 1

Box plots of K2P distances. IQR: interval into which the “central” 50% of the data fall. Black bar in the box indicates the median. Circle: “mild outlier” and asterisks: “extreme outliers”. Extreme outliers are discussed in the text.

Figure 2

Boxplot distributions of S, G, and F. Intra-species (S), interspecies among congeneric species (G), and intergenera but intrafamily (F) K2P distances for different families.

In the unrooted NJ (neighbour-joining) tree (Figure 3), three specimens of Pampus argentenus were grouped together and contained within the cluster of Pampus cinereus. These Pampus argentenus specimens were collected in the same site off the west coast of the South China Sea, and were difficult to identify because of their complex morphological characteristics (available at http://www.barcodinglife.org/). They possessed combined characteristics of Pampus cinereus and Pampus argentenus: the asymmetrical tail of Pampus cinereus and silver color of Pampus argentenus. If the suspicious congeneric K2P distances in the genera Pampus are excluded (the extreme outliers in Figure 1), the pairwise genetic divergences among congeneric species are above 10%. There are no overlaps between intraspecific and congeneric K2P distances within the same family (Figure 3).
Figure 3

Neighbor-joining (NJ) tree of COI sequences. Scale: 5% K2P distance. The first numbers following species names are the process IDs, and the latter are the sample IDs.

At the species level, all COI sequences clustered in monophyletic species units. At the family level, there were paraphyletic clusters for three families (Carangidae, Gobiidae, and Ariidae) (Figure 3), though over 98% of specimens fell into the expected division of families. Intrafamily K2P distances (F) were generally higher than congeneric (G) distances, which were definitely higher than intraspecific (S) distances (Table 1, all Mann-Whitney tests were highly significant, P value <10−6). However, overlaps between F and G distances were observed in Clupeidae, Carangidae, Mullidae, and Muraenesocidae.

4. Discussion

In morphological taxonomy, characters are delimited usually without any explicit criteria for character selection or coding, and morphological data sets have the potential to be quite arbitrary. For example, morphologists do not generally report their criteria for including or excluding characters, and when criteria are given, they vary considerably among studies [32]. Thus, it is not surprising that there are so many synonyms for organisms [33], and an objective, rigorous species delimitation according to explicit criteria is therefore necessary for many taxonomic studies [34]. While DNA barcoding provides taxonomic identification for a specimen, the accuracy of such an assignment depends on whether species are monophyletic with respect to sequence variations of the COI gene. That is, individuals of a given species are more closely related to all other conspecifics than to any member of other species. Except for the hybridized specimens in the genus Pampus, there are no overlaps between genetic variations of S and G (Figure 1). The factors responsible for deviations from taxonomic monophyly may be varied and complex [35]; one potential cause of species-level polyphyly is the occasional mating between distinct species, resulting in hybrid offspring carrying a mixture of genes from both parent species. Furthermore, mitochondrial genes are generally subjected to introgression more frequently than nuclear ones, and introgression also leads to phylogenetic paraphyly [35-38], like the hybridization between Pampus argentenus and Pampus cinereus in this study. In such cases, combinations of morphological and genotypic data are needed for species assignment of hybrids. Biological mechanisms, water dynamics, or historical events may cause deep genetic structuring of populations in marine species [26, 39]. Many explanations for genetic population structuring on local and regional scales involve behaviors such as the adoption of pelagic early life stages and movement over broad geographic ranges, and these factors are theoretically associated with gene flow [40-42]. For many marine fishes, there is a lack of phylogeographic structure among populations [43, 44]; in this study, for individuals from long distance localities, some intraspecific genetic variations reduced to zero within families Carangidae, Sciaenidae, and Mullidae. However, some pairwise K2P distances exceeded 1.00% within the coastal species such as Acentrogobius caninus, Scomber japonicus, Terapon jarbua, Upeneus sulphureus, Elops hawaiensis, Gymnothorax pseudothyrsoideus, and Dendrophysa russelii. It implied that biological mechanisms were responsible for the fluctuation of intraspecific genetic divergences in marine fishes. The neighbor-joining method was originally employed in this study for species identification, but some phylogenetic information was also revealed by the dendrogram, and over 98% of specimens were allocated into different families without polyphyly/paraphyly in the NJ tree (Figure 3). However, DNA barcoding is independent of the way the taxonomy has been built, and it cannot be regarded as the “taxonomic” tag [45]. DNA barcoding is no substitute for taxonomy Ebach and Holdrege [46], and a great deal of work is needed to bring about the reconciliation between traditional and molecular taxonomy. It is unfeasible to build the phylogeny of fishes only based on mitochondrial DNA fragments. Polyphyly/paraphyly in the NJ tree probably results from “bad taxonomy” when named species fail to identify the genetic limits of separate evolutionary entities, particularly for perplexing taxa involving cryptic species [47]. If we cannot set a threshold of the genetic variation in species delimitation, we find ourselves sunk in the dilemma facing new or cryptic species. On the one hand, the morphological taxonomy cannot give a definite identification. On the other hand, we cannot claim that it may be a new species based on molecular analysis without the species delimitation. An assumed threshold is helpful to expedite discovery of new species and biodiversity, especially in dealing with little-studied biotas, although a single, uniform threshold for species delimitation seems arbitrary because rates of molecular evolution vary widely within and among lineages [24, 25, 48].
  24 in total

1.  Biological identifications through DNA barcodes.

Authors:  Paul D N Hebert; Alina Cywinska; Shelley L Ball; Jeremy R deWaard
Journal:  Proc Biol Sci       Date:  2003-02-07       Impact factor: 5.349

2.  DNA barcoding is no substitute for taxonomy.

Authors:  Malte C Ebach; Craig Holdrege
Journal:  Nature       Date:  2005-04-07       Impact factor: 49.962

3.  DNA barcodes distinguish species of tropical Lepidoptera.

Authors:  Mehrdad Hajibabaei; Daniel H Janzen; John M Burns; Winnie Hallwachs; Paul D N Hebert
Journal:  Proc Natl Acad Sci U S A       Date:  2006-01-17       Impact factor: 11.205

4.  Quality control for terms and definitions in ontologies and taxonomies.

Authors:  Jacob Köhler; Katherine Munn; Alexander Rüegg; Andre Skusa; Barry Smith
Journal:  BMC Bioinformatics       Date:  2006-04-19       Impact factor: 3.169

5.  DNA barcoding Australia's fish species.

Authors:  Robert D Ward; Tyler S Zemlak; Bronwyn H Innes; Peter R Last; Paul D N Hebert
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2005-10-29       Impact factor: 6.237

6.  Ten species in one: DNA barcoding reveals cryptic species in the neotropical skipper butterfly Astraptes fulgerator.

Authors:  Paul D N Hebert; Erin H Penton; John M Burns; Daniel H Janzen; Winnie Hallwachs
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-01       Impact factor: 11.205

7.  Molecular discrimination of garfish Hyporhamphus (Beloniformes) larvae in southern Australian waters.

Authors:  C J Noell; S Donnellan; R Foster; L Haigh
Journal:  Mar Biotechnol (NY)       Date:  2001-11       Impact factor: 3.619

8.  Identifying Canadian freshwater fishes through DNA barcodes.

Authors:  Nicolas Hubert; Robert Hanner; Erling Holm; Nicholas E Mandrak; Eric Taylor; Mary Burridge; Douglas Watkinson; Pierre Dumont; Allen Curry; Paul Bentzen; Junbin Zhang; Julien April; Louis Bernatchez
Journal:  PLoS One       Date:  2008-06-18       Impact factor: 3.240

9.  Comparing the efficacy of morphologic and DNA-based taxonomy in the freshwater gastropod genus Radix (Basommatophora, Pulmonata).

Authors:  Markus Pfenninger; Mathilde Cordellier; Bruno Streit
Journal:  BMC Evol Biol       Date:  2006-11-23       Impact factor: 3.260

10.  Molecular identification of birds: performance of distance-based DNA barcoding in three genes to delimit parapatric species.

Authors:  Mansour Aliabadian; Mohammad Kaboli; Vincent Nijman; Miguel Vences
Journal:  PLoS One       Date:  2009-01-07       Impact factor: 3.240

View more
  5 in total

Review 1.  Role of DNA barcoding in marine biodiversity assessment and conservation: An update.

Authors:  Subrata Trivedi; Abdulhadi A Aloufi; Abid A Ansari; Sankar K Ghosh
Journal:  Saudi J Biol Sci       Date:  2015-01-12       Impact factor: 4.219

2.  DNA barcoding of coastal ray-finned fishes in Vietnam.

Authors:  Pham The Thu; Wen-Chien Huang; Tak-Kei Chou; Nguyen Van Quan; Pham Van Chien; Fan Li; Kwang-Tsao Shao; Te-Yu Liao
Journal:  PLoS One       Date:  2019-09-19       Impact factor: 3.240

3.  Molecular characterization of marine and coastal fishes of Bangladesh through DNA barcodes.

Authors:  Md Sagir Ahmed; Sujan Kumar Datta; Tonmoy Saha; Zarif Hossain
Journal:  Ecol Evol       Date:  2021-04-07       Impact factor: 2.912

4.  Can DNA barcoding accurately discriminate megadiverse Neotropical freshwater fish fauna?

Authors:  Luiz H G Pereira; Robert Hanner; Fausto Foresti; Claudio Oliveira
Journal:  BMC Genet       Date:  2013-03-09       Impact factor: 2.797

5.  DNA barcoding of marine fish species from Rongcheng Bay, China.

Authors:  Lijuan Wang; Zhihao Wu; Mengxia Liu; Wei Liu; Wenxi Zhao; Hongjun Liu; Feng You
Journal:  PeerJ       Date:  2018-06-25       Impact factor: 2.984

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.