| Literature DB >> 31273217 |
Erwan Delrieu-Trottin1,2,3,4, Jeffrey T Williams5, Diane Pitassy6, Amy Driskell7, Nicolas Hubert8, Jérémie Viviani9,10,11, Thomas H Cribb12, Benoit Espiau9,13, René Galzin9,13, Michel Kulbicki14, Thierry Lison de Loma9,13, Christopher Meyer15, Johann Mourier9,13,16, Gérard Mou-Tham14, Valeriano Parravicini9,13, Patrick Plantard9,13, Pierre Sasal9,13, Gilles Siu9,13, Nathalie Tolou9,13, Michel Veuille13,17, Lee Weigt7, Serge Planes18,19.
Abstract
The emergence of DNA barcoding and metabarcoding opened new ways to study biological diversity, however, the completion of DNA barcode libraries is fundamental for such approaches to succeed. This dataset is a DNA barcode reference library (fragment of Cytochrome Oxydase I gene) for 2,190 specimens representing at least 540 species of shore fishes collected over 10 years at 154 sites across the four volcanic archipelagos of French Polynesia; the Austral, Gambier, Marquesas and Society Islands, a 5,000,000 km2 area. At present, 65% of the known shore fish species of these archipelagoes possess a DNA barcode associated with preserved, photographed, tissue sampled and cataloged specimens, and extensive collection locality data. This dataset represents one of the most comprehensive DNA barcoding efforts for a vertebrate fauna to date. Considering the challenges associated with the conservation of coral reef fishes and the difficulties of accurately identifying species using morphological characters, this publicly available library is expected to be helpful for both authorities and academics in various fields.Entities:
Mesh:
Year: 2019 PMID: 31273217 PMCID: PMC6609690 DOI: 10.1038/s41597-019-0123-5
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Overview of data generation. From collection of specimen to the validation of data generation.
Fig. 2Sampling localities across French Polynesia. The number of sampling sites and the number of specimens collected are displayed for each archipelago. Several sampling localities may be represented by a single dot due to the geographic scale of French Polynesia. Map data: Google, DigitalGlobe.
Overview of the dataset.
| BOLD project | Geographical location | No. of specimen collected | No. of species collected | Sampling effort (No. of sampling days/No. of sites) |
|---|---|---|---|---|
| AUSTR | Austral Islands | 560 | 263 | 12/25 |
| GAMBA | Gambier Islands | 705 | 290 | 18/53 |
| MARQ | Marquesas Islands | 386 | 182 | 18/41 |
| MOH | Marquesas Islands | 190 | 107 | 5/11 |
| MOOP | Society Islands | 42 | 27 | 4/4 |
| SCIL | Society Islands | 309 | 213 | 8/20 |
Number of specimens and species collected for each scientific expedition. Sampling effort expressed in number of sampling days and number of sites.
Fig. 3Species diversity and distribution of genetic distance across the DNA barcode library. (a) Species diversity by family for the four archipelagoes sampled; (b) Distribution of maximum intraspecific distances (K2P, percent); (c) Distribution of nearest neighbor distances (K2P, percent); (d) Relationship between maximum intraspecific and nearest neighbor distances. Points above the diagonal line indicate species with a barcode gap.
Potential new species detected in the dataset.
| Sample ID | Family | Genus | Species | No. of specimens |
|---|---|---|---|---|
| AUST-419 | Antennariidae |
| sp1 | 1 |
| AUST-570, GAM-355 | Antennariidae |
| sp2 | 2 |
| MARQ-022, MOH-087 | Antennariidae |
| sp3 | 2 |
| SCIL-293 | Antennariidae |
| sp4 | 1 |
| MARQ-105, MARQ-106, MARQ-107 | Apogonidae |
| sp | 3 |
| MARQ-380, MARQ-381, MOH-068 | Apogonidae |
| sp | 3 |
| AUST-142, AUST-143 | Apogonidae |
| sp | 2 |
| AUST-470, AUST-471 | Apogonidae |
| sp | 2 |
| MARQ-177, MARQ-208, MOH-040 | Blenniidae |
| sp | 3 |
| AUST-242, GAM-791, GAM-792, MARQ-139, MARQ-140 | Blenniidae |
|
| 5 |
| GAM-278, GAM-279, SCIL-017, SCIL-021, SCIL-058, SCIL-288, SCIL-322 | Blenniidae |
|
| 7 |
| AUST-407, AUST-408, AUST-409 | Blenniidae |
| sp | 3 |
| MARQ-184, MARQ-187, MARQ-378, MARQ-379 | Blenniidae |
| sp | 4 |
| MOOP-028 | Chlopsidae |
| sp | 1 |
| AUST-600 | Congridae |
| sp1 | 1 |
| MARQ-318 | Congridae |
| sp2 | 1 |
| MARQ-314, MARQ-315, MARQ-316 | Congridae |
| sp | 3 |
| MARQ-397, MOH-062 | Creediidae |
| sp | 1 |
| AUST-427, AUST-538, AUST-539 | Creediidae |
|
| 3 |
| AUST-413 | Gobiesocidae |
| sp | 1 |
| AUST-159, AUST-305 | Gobiesocidae |
| sp1 | 2 |
| AUST-532, AUST-533, AUST-534 | Gobiesocidae |
| sp | 3 |
| AUST-082 | Gobiidae |
| sp | 1 |
| GAM-374, GAM-375 | Gobiidae |
| sp | 2 |
| MARQ-430, MOH-129, MOH-211 | Gobiidae |
| sp | 3 |
| AUST-303, GAM-379 | Gobiidae |
| sp1 | 1 |
| GAM-697 | Gobiidae |
| sp2 | 1 |
| SCIL-038 | Gobiidae |
| sp3 | 1 |
| AUST-346, AUST-347 | Gobiidae |
| sp | 2 |
| MARQ-097, MARQ-098 | Gobiidae |
| sp | 2 |
| SCIL-240 | Gobiidae |
| sp | 1 |
| AUST-566, AUST-567, AUST-568, GAM-364 | Gobiidae |
| sp | 4 |
| MARQ-363, MARQ-364, MARQ-365 | Gobiidae |
| sp1 | 3 |
| MOOP-007 | Gobiidae |
| sp2 | 1 |
| SCIL-237 | Gobiidae |
| sp | 1 |
| AUST-032 | Gobiidae |
| sp | 1 |
| MOOP-047 | Gobiidae |
| sp | 1 |
| AUST-446 | Gobiidae |
| sp1 | 1 |
| MARQ-435, MARQ-436 | Gobiidae |
| sp2 | 2 |
| GAM-32 | Gobiidae |
| sp1 | 1 |
| GAM-33 | Gobiidae |
| sp2 | 1 |
| MOOP-002 | Gobiidae |
| sp3 | 1 |
| AUST-423, AUST-424, AUST-425, AUST-544 | Isonidae |
| sp1 | 4 |
| GAM-521, GAM-522 | Isonidae |
| sp2 | 2 |
| MOH-079 | Lethrinidae |
| sp | 1 |
| AUST-383, GAM-176, GAM-177, GAM-178, MOH-200, SCIL-299 | Moringuidae |
| sp1 | 6 |
| SCIL-300 | Moringuidae |
| sp2 | 1 |
| MARQ-505 | Muraenidae |
| sp1 | 1 |
| SCIL-335 | Muraenidae |
| sp2 | 1 |
| AUST-573, GAM-709 | Ophidiidae |
| sp | 2 |
| AUST-297, AUST-298, AUST-299, AUST-300, GAM-761 | Pempheridae |
| sp1 | 5 |
| MARQ-063, MARQ-165, MARQ-166, MARQ-167, MARQ-276, MARQ-382, MOH-178, MOH-179 | Pempheridae |
| sp2 | 8 |
| AUST-308, AUST-309, AUST-310, AUST-311 | Pomacentridae |
| sp | 4 |
| MOOP-018, MOOP-019, MOOP-034, MOOP-035 | Pseudochromidae |
| sp | 4 |
| GAM-599, GAM-600 | Scorpaenidae |
| sp1 | 2 |
| MOH-137, MOH-151 | Scorpaenidae |
| sp2 | 2 |
| GAM-56, GAM-569, GAM-574, GAM-58 | Scorpaenidae |
| sp1 | 4 |
| MARQ-328, SCIL-114 | Scorpaenidae |
| sp2 | 1 |
| MARQ-046, MOH-027 | Syngnathidae |
| sp | 1 |
| MARQ-321 | Synodontidae |
| sp | 1 |
| AUST-011, AUST-048 | Tripterygiidae |
| sp1 | 1 |
| AUST-360, AUST-057, AUST-058, AUST-059 | Tripterygiidae |
| sp2 | 4 |
| GAM-68, GAM-69, GAM-123, GAM-124, GAM-125 | Tripterygiidae |
| sp3 | 3 |
| GAM-002 | Tripterygiidae |
| sp4 | 1 |
| SCIL-112, SCIL-133, SCIL-156 | Tripterygiidae |
| sp5 | 3 |
Specimens which were identified only to the genus level and which represent potentially new species waiting to be described. Number of specimens included in each Barcode Index Number (BIN).
Potential cryptic species.
| BINs | Taxa | No. of specimens |
|---|---|---|
| BOLD:AAF8427 |
| 2 |
| BOLD:ABW7007 |
| 4 |
| BOLD:ACE7901 |
| 1 |
| BOLD:ACX1964 |
| 1 |
| BOLD:ABW8494 |
| 2 |
| BOLD:AAF5636 |
| 1 |
| BOLD:AAF5637 |
| 4 |
| BOLD:AAD2580 |
| 2 |
| BOLD:AAD9019 |
| 6 |
| BOLD:ACD1956 |
| 5 |
| BOLD:AAD1050 |
| 1 |
| BOLD:AAA6306 |
| 9 |
| BOLD:AAC6155 |
| 5 |
| BOLD:ACC5235 |
| 3 |
| BOLD:AAC8364 |
| 5 |
| BOLD:AAF0704 |
| 3 |
| BOLD:AAL6561 |
| 2 |
| BOLD:ACD1820 |
| 1 |
| BOLD:AAE0976 |
| 2 |
| BOLD:AAB3862 |
| 6 |
| BOLD:ACD1989 |
| 3 |
| BOLD:ACD1988 |
| 3 |
| BOLD:AAX7961 |
| 1 |
| BOLD:AAB4082 |
| 1 |
| BOLD:ABZ7996 |
| 7 |
| BOLD:ACC5180 |
| 10 |
| BOLD:AAD3038 |
| 9 |
| BOLD:AAB4821 |
| 4 |
| BOLD:ACK9118 |
| 1 |
| BOLD:ACP9778 |
| 1 |
| BOLD:AAJ8783 |
| 2 |
| BOLD:AAC7090 |
| 11 |
| BOLD:ABZ0285 |
| 2 |
| BOLD:ACC5053 |
| 1 |
| BOLD:ACC5109 |
| 1 |
| BOLD:ACD1642 |
| 1 |
| BOLD:AAU1965 |
| 2 |
Species with number of specimens collected displaying taxonomic paraphyly most likely representing undescribed cryptic species. Sample ID includes sampling location (AUST: Austral Islands, GAMB: Gambier Islands, MARQ and MOH: Marquesas Islands, SCIL and MOOP: Society Islands).
Species displaying either incomplete lineage sorting or shallow inter-species divergence.
| Family | Species | Mean Intra-Sp | Max Intra-Sp | Nearest Neighbour | Nearest Species | Distance to NN |
|---|---|---|---|---|---|---|
| Acanthuridae |
| 0.08 | 0.15 | AUSTR453-13 |
| 0 |
| Holocentridae |
| 0.28 | 0.62 | SCILL065-15 |
| 0 |
| Monacanthidae |
| 0.36 | 0.62 | SCILL083-15 |
| 0 |
| Tetraodontidae |
| 0 | 0 | MOH030-16 |
| 0 |
| Mullidae |
| 0.52 | 0.52 | AUSTR089-13 |
| 0.17 |
| Pomacentridae |
| 0 | 0 | SCILL209-15 |
| 0.31 |
| Labridae |
| 0 | 0 | SCILL040-15 |
| 0.46 |
| Apogonidae |
| N/A | 0 | SCILL142-15 |
| 0.93 |
| Tetraodontidae |
| 0.21 | 0.31 | MARQ456-12 |
| 1.1 |
| Pomacentridae |
| 0.15 | 0.15 | GAMBA844-12 |
| 1.24 |
| Monacanthidae |
| 0.15 | 0.31 | GAMBA711-12 |
| 1.4 |
| Pomacentridae |
| 0.08 | 0.15 | AUSTR222-13 |
| 1.56 |
| Holocentridae |
| 0.68 | 1.1 | SCILL104-15 |
| 1.57 |
| Acanthuridae |
| 0 | 0 | AUSTR376-13 |
| 1.72 |
| Apogonidae |
| 0.23 | 0.31 | GAMBA657-12 |
| 1.88 |
| Chaetodontidae |
| 0.08 | 0.15 | SCILL269-15 |
| 1.88 |
| Chaetodontidae |
| 0.1 | 0.15 | GAMBA555-12 |
| 1.88 |
Mean and Maximum intra-Species distances (Mean Intra-Sp and Max Intra-Sp), and Kimura 2 Parameter distances from the Nearest Neighbour (NN).
| Design Type(s) | population data analysis objective • biodiversity assessment objective |
| Measurement Type(s) | fish |
| Technology Type(s) | taxonomic diversity assessment by targeted gene survey |
| Factor Type(s) | Species • geographic location |
| Sample Characteristic(s) | Actinopterygii • French Polynesia • ocean biome |