| Literature DB >> 28931084 |
Cecilia Conte-Grand1, Ralf Britz2, Neelesh Dahanukar3,4, Rajeev Raghavan5, Rohan Pethiyagoda6, Heok Hui Tan7, Renny K Hadiaty8, Norsham S Yaakob9, Lukas Rüber1,10.
Abstract
Snakehead fishes of the family Channidae are predatory freshwater teleosts from Africa and Asia comprising 38 valid species. Snakeheads are important food fishes (aquaculture, live food trade) and have been introduced widely with several species becoming highly invasive. A channid barcode library was recently assembled by Serrao and co-workers to better detect and identify potential and established invasive snakehead species outside their native range. Comparing our own recent phylogenetic results of this taxonomically confusing group with those previously reported revealed several inconsistencies that prompted us to expand and improve on previous studies. By generating 343 novel snakehead coxI sequences and combining them with an additional 434 coxI sequences from GenBank we highlight several problems with previous efforts towards the assembly of a snakehead reference barcode library. We found that 16.3% of the channid coxI sequences deposited in GenBank are based on misidentifications. With the inclusion of our own data we were, however, able to solve these cases of perpetuated taxonomic confusion. Different species delimitation approaches we employed (BIN, GMYC, and PTP) were congruent in suggesting a potentially much higher species diversity within snakeheads than currently recognized. In total, 90 BINs were recovered and within a total of 15 currently recognized species multiple BINs were identified. This higher species diversity is mostly due to either the incorporation of undescribed, narrow range, endemics from the Eastern Himalaya biodiversity hotspot or the incorporation of several widespread species characterized by deep genetic splits between geographically well-defined lineages. In the latter case, over-lumping in the past has deflated the actual species numbers. Further integrative approaches are clearly needed for providing a better taxonomic understanding of snakehead diversity, new species descriptions and taxonomic revisions of the group.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28931084 PMCID: PMC5606936 DOI: 10.1371/journal.pone.0184017
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Distance tree of snakehead barcodes.
Neighbour joining tree based on HKY distances of 777 channid coxI sequences. Individuals have been collapsed into 61 BINs and 29 potential new BINs (indicated as clades 1–29 and highlighted in dark grey); see Table 1. The Parachanna africana clade is highlighted in light grey. Numbers in brackets behind species names refer to number of individuals from this study / number of individuals from GenBank, followed by BIN or clade designation. BINs from BOLD not previously reported by [32] are indicated in brackets with "new" (BOLD accessed March 31, 2016). Red dots indicate clades containing misidentified or incomplete identified specimens; see Table 2 and the following comments: (a) Protopterus annectens BIN containing five lungfish specimens, (b) three Pa. africana and two Pa. obscura included, (c) one Pa. sp included, (d) two Pa. insignis included, (e) nine Channa sp. included, (f) one C. cf. marulius included, (g) ten C. marulius with accession numbers EU342199-EU342200, HM117192-HM117196, KJ937341, KJ937348, KJ937388 included. These might be C. pseuomarulius, but were not counted as misidentifications since this species was only recently revalidated by Britz et al. [48], (h) includes an exceptionally long branch for a single specimen (accession number JX983243 shown as a grey bar) probably due to sequencing errors, (i) one C. cf. nox included, (j) five C. marulius included, (k) one C. orientalis included, (l) BIN number is for unpublished C. orientalis in BOLD, (m) five C. orientalis included, (n) six C. orientalis included, (o) six C. orientalis included, (p) 17 C. orientalis included, (q) misidentification should be C. bleheri see Table 2, (r) one C. stewartii included, (s) one C. cf. stewartii included, (t) five C. barca included, (u) misidentification should be C. andrao see Table 2, (v) three C. gachua included, (w) BOLD ID showed 97.39% similarity with ACS3540, (x) five C. stewartii included, (y) BOLD ID showed 97.98% similarity with AAC3925, (z) one C. gachua included. For clades composed of misidentified specimens only, the original species designation was kept and these clades are underlined (ACB7513 and ACB8348).
Summary of channid species and their BIN and clade assignment following the clade order in Fig 1.
Species names according to [32] if different from this study are given, BINs reported by [32] are indicated and those species with multiple BINs in the [32] study and in this study are indicated.
| Clade number | Species name this study | Species name Serrao | BIN / clade number | BIN Serrao | BIN group Serrao | BIN group this study |
|---|---|---|---|---|---|---|
| 1 | AAL6055 | no | ||||
| 2 | AAF7843 | yes | ||||
| 3 | n/a | ABW0157 | no | 1 | ||
| 4 | n/a | AAF7842 | yes | |||
| 5 | n/a | clade 01 | no | 1 | ||
| 6 | n/a | AAI7246 | yes | 1 | ||
| 7 | n/a | clade 02 | no | 2 | ||
| 8 | n/a | clade 03 | no | 2 | ||
| 9 | n/a | AAW6833 | yes | 2 | 2 | |
| 10 | n/a | ABW0051 | yes | 2 | 2 | |
| 11 | n/a | ABW0048 | yes | |||
| 12 | n/a | ABW0047 | yes | |||
| 13 | n/a | ABW1866 | yes | |||
| 14 | n/a | AAI7162 | yes | |||
| 15 | n/a | AAD7592 | yes | |||
| 16 | n/a | ACS5422 | no | 3 | ||
| 17 | n/a | AAD2426 | yes | 3 | ||
| 18 | n/a | ACG5323 | yes | 3 | 4 | |
| 19 | n/a | AAE8814 | yes | 3 | 4 | |
| 20 | AAC6049 | yes | ||||
| 21 | n/a | clade 04 | no | 5 | ||
| 22 | n/a | ABW0012 | yes | 4 | 5 | |
| 23 | AAI7187 | yes | 4 | |||
| 24 | n/a | ABA8625 | no | 5 | ||
| 25 | n/a | AAW6834 | yes | 5 | 6 | |
| 26 | n/a | ACH5880 | yes | 5 | 6 | |
| 27 | n/a | ACH5881 | yes | 5 | 6 | |
| 28 | n/a | clade 05 | no | |||
| 29 | n/a | ABW1864 | yes | |||
| 30 | n/a | clade 06 | no | 7 | ||
| 31 | n/a | AAB2498 | yes | 6 | 7 | |
| 32 | n/a | ACB7973 | yes | 6 | 7 | |
| 33 | n/a | AAB2497 | yes | 6 | 7 | |
| 34 | n/a | AAF3770 | yes | |||
| 35 | n/a | ACS6326 | no | 8 | ||
| 36 | n/a | clade 07 | no | 8 | ||
| 37 | n/a | AAW6831 | yes | 8 | ||
| 38 | n/a | clade 08 | no | 9 | ||
| 39 | n/a | clade 09 | no | 9 | ||
| 40 | n/a | clade 10 | no | |||
| 41 | n/a | clade 11 | no | 10 | ||
| 42 | n/a | clade 12 | no | 11 | ||
| 43 | ABV9995 | yes | 7 | 10 | ||
| 44 | n/a | clade 13 | no | 10 | ||
| 45 | n/a | ACX6936 | no | 10 | ||
| 46 | n/a | clade 14 | no | 11 | ||
| 47 | ACA9095 | yes | 7 | 10 | ||
| 48 | AAC6050 | yes | 7 | 10 | ||
| 49 | ABA8489 | yes | 7 | 10 | ||
| 50 | n/a | ACH0185 | no | 10 | ||
| 51 | n/a | clade 15 | no | |||
| 52 | n/a | clade 16 | no | |||
| 53 | n/a | clade 17 | no | |||
| 54 | n/a | clade 18 | no | |||
| 55 | n/a | clade 19 | no | 12 | ||
| 56 | n/a | AAE1408 | yes | 12 | ||
| 57 | ACB7513 | yes | 12 | |||
| 58 | n/a | clade 20 | no | 12 | ||
| 59 | n/a | ACH1447 | no | |||
| 60 | n/a | AAF3792 | yes | |||
| 61 | n/a | ACG5458 | yes | |||
| 62 | ACH0210 | no | 13 | |||
| 63 | AAC6053 | yes | 13 | |||
| 64 | n/a | AAF3772 | yes | 8 | 14 | |
| 65 | n/a | AAF3764 | yes | 8 | 14 | |
| 66 | ACB8348 | yes | 9 | 15 | ||
| 67 | AAC3928 | yes | 9 | 15 | ||
| 68 | n/a | clade 21 | no | 10 | ||
| 69 | n/a | ACB7510 | yes | 9 | 10 | |
| 70 | n/a | clade 22 | no | 10 | ||
| 71 | n/a | clade 23 | no | 10 | ||
| 72 | n/a | ACS3403 | no | 10 | ||
| 73 | n/a | clade 24 | no | 10 | ||
| 74 | n/a | ABV9996 | yes | 9 | 10 | |
| 75 | n/a | clade 25 | no | 10 | ||
| 76 | n/a | ACQ3951 | no | 10 | ||
| 77 | n/a | ACI8494 | no | 10 | ||
| 78 | n/a | ABV9969 | yes | 9 | 10 | |
| 79 | n/a | ABV9993 | yes | 9 | 10 | |
| 80 | n/a | ABV9994 | yes | 9 | 10 | |
| 81 | n/a | clade 26 | no | 10 | ||
| 82 | n/a | clade 27 | no | 10 | ||
| 83 | n/a | ACP4442 | no | 10 | ||
| 84 | n/a | clade 28 | no | 10 | ||
| 85 | ACS3540 | no | 10 | |||
| 86 | n/a | AAC3925 | yes | 9 | 10 | |
| 87 | n/a | clade 29 | no | 10 | ||
| 88 | n/a | ACM5826 | no | 10 | ||
| 89 | n/a | AAC3927 | yes | 9 | 10 | |
| 90 | AAC3926 | yes | 9 | |||
| n/a | n/a | ACE8403 | yes | 1 | ||
| n/a | ABW0050 | yes |
(a)wrong species assignment in BOLD and GenBank; not used by Serrao et al. [32]
(b)reassigned by BOLD to BIN ABW0157.
(c)not used in this study.
Fig 2Snakehead chronogram.
BEAST analysis using a coalescence prior. Species delimitations based on BIN, GMYC single, GMYC multiple, and PTP thresholds are indicated by black bars. The same figure with species names is given in S4 Fig.
Channid misidentifications.
Summary of channid misidentifications and incompletely identified channid specimens with their BIN assignment, GenBank accession numbers and number of specimens found in GenBank, with how many specimens have been used and / or generated by [32].
| Speciemen ID wrong | Specimen ID correct | Category | BOLD:BIN | GenBank accession numbers | Information sequences | Comment |
|---|---|---|---|---|---|---|
| misidentified | ACB7513 | HM117177-HM117181 | 5/5/0 | q | ||
| misidentified | AAF3764 | KJ847147-KJ847151 | 5/0/0 | t | ||
| Channa marulioides | incomplete ID | AAC6049 | KJ937378 | 1/1/1 | f | |
| incomplete ID | ACH1447 | KF511545 | 1/0/0 | n/a | ||
| Channa asiatica? | incomplete ID | ACH5881 | LR1804 | 1/0/0 | i | |
| incomplete ID | AAC6053 | KJ937384 | 1/1/1 | s | ||
| misidentified | AAC3928 | EU342197-EU342198, KJ937393 | 3/2/1 | v | ||
| misidentified | ACB8348 | HM117187-HM117191 | 5/5/0 | u | ||
| misidentified | AAC3926 | KJ937367 | 1/1/1 | z | ||
| misidentified | ACB7973 | KF430019, FJ459472-FJ459475 | 5/0/0 | j | ||
| misidentified | ABV9995 | KJ937374 | 1/1/1 | k | ||
| misidentified | ACA9095 | JN245991, JX105470, JX105472-JX105474 | 5/5/0 | m | ||
| misidentified | AAC6050 | FJ459480-FJ459484, KJ937436 | 6/6/1 | n | ||
| misidentified | ABA8489 | JQ667514, JX983245-JX983249 | 6/1/0 | o | ||
| misidentified | ACH0185 | KF742420, KF742438, KJ847117-KJ847131 | 17/0/0 | p | ||
| incomplete ID | ABW0048 | KJ937350, KJ937357, KJ937398, KJ937405-KJ937406, KJ937439, KJ937447, KJ937452, KJ937454 | 9/9/9 | e | ||
| misidentified | BW0050 | KJ937355 | 1/1/1 | n/a | ||
| incomplete ID | ACH0210 | KF742419 | 1/0/0 | r | ||
| misidentified | ACS3540 | KJ847152-KJ847156 | 5/0/0 | x | ||
| misidentified | AAF7843 | KJ937351, KJ937391, KJ937418 | 3/3/3 | b | ||
| misidentified | AAF7842 | AP006042, NC_022480 | 2/0/0 | d | ||
| misidentified | AAF7843 | HM880234, KJ937453 | 2/1/1 | b | ||
| incomplete ID | ABW0157 | KJ937414 | 1/1/1 | c | ||
| misidentified | AAL6055 | HM882951-HM882954, HQ927824 | 5/0/0 | a |
(a)number of specimens / used by [32] / generated by [32].
(b)not used in this study, only used in S6 Fig.
Fig 3Summary of species delimitation.
Cumulative number of channid species from the year 1758 to 2016 and results of channid species numbers estimated by different species delimitation methods. For the PTP analyses the species number is given (dot) and the mean and minimum and maximum range is given based on the analyses of 100 ML bootstrap trees. GMYCs and GMYCm the number of species and the confidence interval are given. For the bGMYC analyses, the mean and minimum and maximum range is given. The horizontal lines indicate different species counts: a) 38 valid species, b) plus seven undescribed species included in this study (see Fig 1): Channa sp. Assam, C. sp. Bhutan foothills, C. sp. Rakhine Yoma, Channa sp. Northeast India, Channa sp. Tenasserim, Channa sp. Mogaung and P. sp DRCongo c) plus an additional eight potential species based on a conserved estimate of additional intraspecific diversity within C. bankanensis (two clades in total), C. marulius (three clades in total), C. striata (four clades in total), C. gachua (at least three clades), totaling a conservative estimate of 53 channid species.
Fig 4Barcode gap analysis.
Dotplot illustrating the presence/absence of local barcode gaps. For each individual the maximum intraspecific distance is plotted against the minimum interspecific distance. The "species" groupings were A) current taxonomy with the exception of species with prominent intraspecific clades that were split into distinct units (Channa bankanensis, C. gachua, C. marulius, C. striata), B) according to the 90 BINs identified in this study. The slope 1:1 is indicated by a red line.