| Literature DB >> 33216894 |
Stéphane Teletchea1, Fabrice Teletchea2.
Abstract
Teleost fishes show the most outstanding reproductive diversity of all vertebrates. Yet to date, no one has been able to decisively explain this striking variability nor to perform large-scale phylogenetic analyses of reproductive modes. Here, we describe STrategies Of REproduction in FISH (STOREFISH) 2.0, an online database easing the sharing of an original data set on reproduction published in 2007, enriched with automated data extraction and presentation to display the knowledge acquired on temperate freshwater fish species. STOREFISH 2.0 contains the information for 80 freshwater fish species and 50 traits from the analysis of 1219 references. It is anticipated that this new database could be useful for freshwater biodiversity research, conservation, assessment and management. Database URL: www.storefish.org.Entities:
Year: 2020 PMID: 33216894 PMCID: PMC7678788 DOI: 10.1093/database/baaa095
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Data processing and feature extraction workflow. Traits defined as numbers were extracted using python’s regular expression, text entries were extracted semi-automatically by keywords extraction. Incomplete or difficult data extraction were curated by experts.
Life history trait definitions in STOREFISH [from Teletchea et al. (31)]
| Life stage | Trait Nb | Trait | Primary data | Type | Secondary data | Annotation |
|---|---|---|---|---|---|---|
| Egg | 1 | Oocyte diameter (mm) | Average diameter of the unfertilized egg, i.e. mature, fully yolked ovarian oocyte ready to be or just released (in mm) | Numerical | One value | regexp |
| Egg | 2 | Egg size after water-hardening (mm) | Average diameter of the fertilized egg after swelling, due to water uptake (in mm) | Numerical | One value | regexp |
| Egg | 3 | Egg buoyancy | Egg buoyancy of the fertilized swollen egg, being either demersal (sinks to the bottom), semi-pelagic (remains for a short period in the water column, then sinks) or pelagic (floats on or near the surface of the water and could derive for a long distance, several kilometres), scored as a three-state variable [demersal, semi-pelagic and pelagic] | Categorical | Demersal | regexp + human curation |
| Egg | 4 | Egg adhesiveness | Egg adhesiveness of the fertilized egg after water-hardening: eggs, which are slightly sticky only prior to water-hardening, were considered as non-adhesive, scored as a binary variable [non-adhesive, adhesive] | Categorical | Adhesive | regexp + human curation |
| Egg | 5 | Incubation time (days) | Average number of days required in natural conditions from fertilization to hatching, period designed as ‘incubation’ | Numerical | One value | regexp |
| Egg | 6 | Temperature for incubation (°C) | Average temperature, in °C, encountered during the incubation period of egg | Numerical | One value | regexp |
| Egg | 7 | Degree-days for incubation (°C/day) | Average degree-days (temperature × time, with temperature in °C and time in days), required for the entire incubation of egg | Numerical | One Value | regexp |
| Larvae | 8 | Initial larval size | Average larval size upon hatching (in mm) | Numerical | One value | regexp |
| Larvae | 9 | Larvae behaviour | Behaviour of the larvae prior to exogenous feeding, being either demersal (remain near the bottom) or pelagic (swim actively near the surface), scored as a binary variable [demersal, pelagic] | Categorical | Demersal | Human curation |
| Larvae | 10 | Reaction to light | Larvae during the first days after hatching are either negatively (photophobic) or positively (photopositive) attracted to light, scored as a binary variable [photophobic, photopositive] | Categorical | Photophobic | Human curation |
| Larvae | 11 | Temperature during larval development | Average temperature, in °C, encountered until the post-larvae stage (i.e. while starting exogenous feeding) | Numerical | One value | regexp |
| Larvae | 12 | Sibling intracohort cannibalism | Intra-specific predation of members of the same cohort during the post-larval growth, scored as a binary variable [absent, present] | Categorical | Absent | regexp |
| Larvae | 13 | Full yolk-sac resorption | Average period of time, in degree-days, required for the complete resorption of the yolk-sac | Numerical | One value | Human curation |
| Larvae | 14 | Onset of exogeneous feeding | Average period of time, in degree-days, required for the beginning of the exogenous feeding | Numerical | One value | Human curation |
| Life Stage | Trait Nb | Trait | Primary data | Type | Secondary data | Annotation |
| Female | 15 | Age at sexual maturity | Average age at maturity (in years) | Numerical | One value | regexp |
| Female | 16 | Length at sexual maturity | Average total body length at maturity (in cm) | Numerical | One value | regexp |
| Female | 17 | Weight at sexual maturity | Average body weight at maturity (in kg) | Numerical | One value | regexp |
| Female | 18 | Female sexual dimorphism | Females of some species develop secondary sexual characters during the breeding season, scored as a binary variable [absent, present] | Categorical | Absent | Human curation |
| Female | 19 | Relative fecundity | Average number of eggs (in thousands) per kilogram of body weight | Numerical | One value | regexp |
| Female | 20 | Absolute fecundity | Average number of eggs (in thousands) recorded by individual female | Numerical | One value | regexp |
| Female | 21 | Oocyte development | Synchronous (all oocytes present within the ovary are at the same stage of development), group-synchronous (at least two distinct populations of oocytes at different development stages) and asynchronous (oocytes at all stages of development), scored as a three-state variable | Categorical | Synchronous | Human curation |
| Female | 22 | Onset of oogenesis | Defined as the months in the year when an initial significant inflexion and increase in the GSI is observed. This period corresponds to the onset of the active vitellogenesis, mainly endogenous | Categorical | One month | Human curation |
| Female | 23 | Intensifying oogenesis activity | This is defined as the months in the year when the GSI displays the largest increase. This period mainly corresponds to the end of the active vitellogenesis, prior to the final maturation of oocytes (i.e. oocyte meiotic resumption and ovulation) | Categorical | One month | Human curation |
| Female | 24 | Maximum GSI value | This corresponds to the average maximal GSI value (%) observed within the entire reproductive cycle | Numerical | One value | regexp |
| Female | 25 | Oogenenesis duration | This corresponds to the total duration of the oogenesis process from the initial significant inflexion and increase in GSI until ovulation (in months) | Numerical | One value | regexp |
| Female | 26 | Resting period | This corresponds to the duration of gonadal quiescence between two consecutive reproductive cycles when the ovaries are recovering from the spawning act (in months) | Numerical | One value | regexp |
| Male | 27 | Age at sexual maturity | Average age at maturity (in years) | Numerical | One value | regexp |
| Male | 28 | Length at sexual maturity | Average total body length at maturity (in cm) | Numerical | One value | regexp |
| Male | 29 | Weight at sexual maturity | Average body weight at maturity (in kg) | Numerical | One value | regexp |
| Male | 30 | Male sexual dimorphism | Males belonging to certain species develop secondary sexual characters during the breeding season, scored as a binary variable [absent, present] | Categorical | Absent | regexp |
| Male | 31 | Onset of spermatogenesis | This is defined as the months in the year when a significant increase in the GSI is observed. This period mainly corresponds to the initial proliferation of spermatogonia through repeated mitotic divisions, and primary spermatocytes differentiation | Categorical | One month | Human curation |
| Male | 32 | Main spermatogenesis activity | This is defined as the months in the year when the GSI displays a sharp increase. This period mainly corresponds to the transformation of spermatocytes into mature spermatozoa (including the spermiogenesis process) and prior to spermiation | Categorical | One month | Human curatin |
| Male | 33 | Maximum GSI value | This corresponds to the average maximal GSI value (%) observed within the entire reproductive cycle | Numerical | One value | regexp |
| Male | 34 | Spermatogenesis duration | This corresponds to the total duration of the spermatogenesis process from the initial proliferation of spermatogonia to spermiation (in months) | Numerical | One value | regexp |
| Male | 35 | Resting period | This corresponds to the duration of gonadal quiescence between two consecutive reproductive cycles when the testis are recovering from the spawning act (in months) | Numerical | One value | Human curation |
| Life Stage | Trait Nb | Trait | Primary data | Type | Secondary data | Annotation |
| Spawning conditions | 36 | Spawning migration distance | Average distance run by adults to get to the spawning grounds (in km) | Numerical | One value | regexp |
| Spawning conditions | 37 | Spawning migration period | Months in the year when some species display extensive spawning run | Categorical | Months | Human curation |
| Spawning conditions | 38 | Homing | Accurate returning behaviour of some teleosts to their natal areas to spawn, scored as a binary variable [absent, present] | Categorical | Absent | Human curation |
| Spawning conditions | 39 | Spawning season | Usual months of the presence of spawners on the spawning ground | Categorical | January–March | Human curation |
| Spawning conditions | 40 | Spawning period duration | Period of time when spawners are present on the spawning ground (in weeks) | Numerical | One value | regexp |
| Spawning conditions | 41 | Spawning temperature | Average temperature observed during the spawning period (°C) | Numerical | One value | regexp |
| Spawning conditions | 42 | Spawning water type | Kind of water frequented during the spawning season, could be either stagnant water such as ponds or lakes (with no or slight current) or rivers and streams (with much more current), scored as a binary character [stagnant water, flowing or turbulent water] | Categorical | Stagnant water | Human curation |
| Spawning conditions | 43 | Spawning depth | Average depth at which spawning occurs (in m) | Numerical | One value | regexp |
| Spawning conditions | 44 | Spawning substrate | Teleost species either scatter their eggs in the water column (pelagophils), or deposit their eggs (i) on a rock or gravel bottom (lithophils), (ii) on plants (phytophils), (iii) on roots or grass above the sandy bottom or on the sand itself (psammophils) or (iv) into gill cavity of mussels (ostracophils). This character is mainly derived from Balon (1975) classification of reproductive guilds of teleost fishes, and scored as a five-state variable [pelagophils, lithophils, phytophils, psammophils, ostracophils] | Categorical | Pelagophils | Human curation |
| Spawning conditions | 45 | Spawning site preparation | Teleost species scatter their eggs either in the water column, directly over the substrates, or within a nest, which is a depression dug into the substrate by either the male, the female or both parents, scored as a five-state variable [open water/substratum scatter, substrate chooser, nest built by male, nest built by female, nest built by both parents] | Categorical | Open water/substratum scatter | Human curation |
| Spawning conditions | 46 | Nyctemeral period of oviposition | Main period during the day when mass spawning occurs, scored as a four-state variable [night, dawn, day, dusk] | Categorical | Night | Human curation |
| Spawning conditions | 47 | Mating system | Teleost species display three main kind of mating system: monogamous (one male and one female), polygamous (an individual, either the male or the female, has several mates), and promiscuity (both sexes have multiple partners within a single season), scored as a four-state variable [monogamy, polygyny, polyandry, promiscuity] | Categorical | Monogamy | Human curation |
| Spawning conditions | 48 | Spawning release | Teleost species display three main kind of egg release during the breeding season: total (all eggs are shed at the same time), fractional (several batches of eggs are released at intervals, usually over several days or weeks, but the potential breeding season fecundity is fixed before spawning, also known as determinate fecundity) or multiple (several batches of eggs are shed more than once through a long spawning season, and there is a recruitment to the stock of spawnable oocytes during the entire spawning season, also known as underminate fecundity), scored as three-state variable [total, fractional, multiple] | Categorical | Total | Human curation |
| Spawning conditions | 49 | Parity | Teleost species are either iteroparous (most individuals survive after the spawning act, i.e. several reproductive cycles during a lifetime) or semelparous (most or all individuals die, i.e. only one reproductive cycle during a lifetime), scored as a binary variable [semelparous, iteroparous] | Categorical | Semelparous | Human curation |
| Spawning conditions | 50 | Parental care | Association between one or both parents and offspring that enhances offspring development and survival (e.g. males of some species guard and aerate their eggs and larvae for several weeks), scored as a four-state variable [no care, male parental care, female parental care, biparental care] | Categorical | No care | Human curation |
Each trait was characterized as number, text, or hybrid according to the comment indicated in the initial version of the database. The Annotation column indicates the method used to transform data comments.
Examples of transformation of primary data into secondary data.
| Trait(db id) | Trait type | Primary data | Processing type | Secondary data | Expert curation |
|---|---|---|---|---|---|
|
| |||||
| Oocyte diameter | Numerical | 1 mm | Regexp (single value) | 1 mm | Not needed |
| Oocyte diameter (9223) | Numerical | 0.4–0.5 mm | Regexp (interval) | 0.45 mm | Not needed |
| Oocyte diameter (9293) | Numerical | The mean diameter of ripe eggs in ovaries of females in Upper Beverley Lake was 0.529 mm, signigficantly larger than 0.477 mm in Lower Beverley Lake | Regexp (first value) | 0.529 mm |
|
| Oocyte diameter (9054) | Numerical | 5.186 ± 0.263 mm for oocytes at the final maturation stage [For broodstock population cultured in a fish farm in Southern Chile] | Regexp (mean) | 5.186 mm | Not needed |
| Oocyte diameter (9046) | Numerical | Big eggs | No data found | No data |
|
| Reaction to light (11 339) | Categorical | Photophobic | Keyword | Photophobic | Not needed |
| Spawning season (10 792) | Categorical | From late September to early October | Keyword | [‘September’, ‘October’] | Not needed |
| Reaction to light (11 339) | Categorical | Photophobic | Keyword | Photophobic | Not needed |
| Sibling intracohort cannibalism (11 587) | Categorical | At 28–35 days, cannibalism occured indepedently in all 12 tanks. The mean age at first cannibalism was 32 days (SD = 1.5 days) which occured at a mean length of 30.3 mm (SD = 4.3 mm) | Keywords | Present |
|
|
| |||||
| Onset of oogenesis (7857) | Categorical | A slight increase from November until March | Keywords | [‘March’, ‘November’] |
|
|
| |||||
|
|
|
|
|
|
|
| Oocyte diameter (9331) | Numerical | Mean ooocyte diameter from all females at the second sampling (20 March) was 838 ± 18 µm. When oocytes completed FOM and were ovulated (1131 ± 20 µm in diameter) | Regexp (mean) | 838 µm |
|
Figure 2.STOREFISH home page. Accessible data are available by species, traits or by reference. A search-as-you-type box allows to rapidly find a species of interest.
Figure 3.Comparison of trait enrichment between Perca fluviatilis (442 primary data, resulting in 237 secondary data) and Alburnus alburnus (146 primary data, resulting in 102 secondary data).
Figure 4.Example of detailed database statistics for numerical traits (oocyte diameter, left) and categorical traits (reaction to light, right). The graphs are interactive online, discrete values are shown upon mouse hovering on the displayed data. Void categories indicate there is no primary data, No category indicates that no category was detected, ambiguous values indicate that multiple categories were found for one species, with no category being more representative of the secondary than any other.