Literature DB >> 35356563

A novel index to aid in prioritizing habitats for site-based conservation.

Anthony Basooma1, Herbert Nakiyende1, Mark Olokotum1, John S Balirwa1, Winnie Nkalubo1, Laban Musinguzi1, Vianny Natugonza2.   

Abstract

Funding biodiversity conservation strategies are usually minimal, thus prioritizing habitats at high risk should be conducted. We developed and tested a conservation priority index (CPI) that ranks habitats to aid in prioritizing them for conservation. We tested the index using 1897 fish species from 273 African inland lakes and 34 countries. In the index, lake surface area, rarity, and their International Union for Conservation of Nature (IUCN) Red List status were incorporated. We retrieved data from the Global Biodiversity Information Facility (GBIF) and IUCN data repositories. Lake Nyasa had the highest species richness (424), followed by Tanganyika (391), Nokoué (246), Victoria (216), and Ahémé (216). However, lakes Otjikoto and Giunas had the highest CPI of 137.2 and 52.1, respectively. Lakes were grouped into high priority (CPI > 0.5; n = 56) and low priority (CPI < 0.5; n = 217). The median surface area between priority classes was significantly different (W = 11,768, p < .05, effect size = 0.65). Prediction accuracy of Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) for priority classes were 0.912 and 0.954, respectively. Both models exhibited lake surface area as the variable with the highest importance. CPI generally increased with a decrease in lake surface area. This was attributed to less ecological substitutability and higher exposure levels of anthropogenic stressors such as pollution to a species in smaller lakes. Also, the highest species richness per unit area was recorded for high-priority lakes. Thus, smaller habitats or lakes may be prioritized for conservation although larger waterbodies or habitats should not be ignored. The index can be customized to local, regional, and international scales as well as marine and terrestrial habitats.
© 2022 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd.

Entities:  

Keywords:  Africa; GBIF; IUCN; biodiversity loss; multiple stressors

Year:  2022        PMID: 35356563      PMCID: PMC8956788          DOI: 10.1002/ece3.8762

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


INTRODUCTION

Human‐induced multiple stressors are exacerbating global biodiversity loss (Dobson, 1992; Dudgeon et al., 2006; Reid et al., 2019) and persistently altering ecosystem to function and provide services such as flood mitigation and food (Hooper et al., 2005; O’Connor & Crowe, 2005). These stressors, including climate change, habitat degradation, pollution, species invasions, and overexploitation, exhibit synergistic impacts on the ecosystems (Brook, 2008; Craig et al., 2017; Dudgeon et al., 2006; Hermoso et al., 2009), thereby increasing the complexity in managing and monitoring the ecosystem integrity (Craig et al., 2017). New communities are created within ecosystems (Pandolfi et al., 2020); loss of species spatial insurance is observed (Thompson et al., 2017); and ecological interactions within ecosystem are impeded, affecting species survival and ecosystem functioning and consequently increasing biodiversity loss (De Bernardi, 1981). Freshwater biodiversity loss has outpaced both terrestrial and marine ecosystems (Collen et al., 2014; Dudgeon et al., 2006; WWF, 2020). Contrarily, efforts to reduce defaunation are mostly directed towards the two latter ecosystems (Abell, 2002; Tickner et al., 2020), which are poor surrogates for conserving freshwater biodiversity (Darwall et al., 2011). Furthermore, due to high levels of endemism, freshwater species have less ecological substitutability when habitats are lost, fragmented, polluted, or invaded by exotic species (Abell, 2002; Arthington et al., 2016; Dudgeon et al., 2006). In addition, because freshwater ecosystems provide myriad ecosystem services to humans (McIntyre et al., 2016), they are predisposed to drastic and intermittent reclamation, increased pollution, and overexploitation (Hermoso et al., 2009). As a result, freshwater species have reduced by 76% in the last 50 years compared with 39% decline for marine and terrestrial populations (WWF, 2020). Of the 29,500 freshwater species assessed by the International Union for Conservation of Nature (IUCN), 27% are threatened with extinction and megafauna populations have declined by 88% from 1970 to 2012 (Tickner et al., 2020). Freshwater biodiversity has consistently continued to decline (Hermoso et al., 2016), despite the proliferation in development and use of different conservation management strategies such as conservation planning tools, priority indices, and protected areas (Hermoso et al., 2016). The anomaly is alluded to the inability to manage freshwater protected areas, political interference, poor sensitization, and poor delineation (Bastin et al., 2019; Hermoso et al., 2016; Holland et al., 2012). Also, the ecological integrity is mostly assessed at species level rather than the whole ecosystem (Vié et al., 2008), albeit habitat loss and degradation being the major catalyst to freshwater biodiversity loss (Dudgeon et al., 2006; Vié et al., 2008). Furthermore, conservation priorities are mostly inclined to large waterbodies because of their high species richness, endemism, and threatened species (Sayer et al., 2018). Likewise, most studies are skewed to large waterbodies (Biggs et al., 2017), despite the significance of small waterbodies as refugia for threatened species (Biggs et al., 2017; Olwa et al., 2020). Besides species richness, ecosystem parameters such as surface area need to be incorporated in designing freshwater conservation strategies (Grzybowski & Glińska‐Lewczuk, 2019). Also, ecosystem competing interests and costs should be considered (Dudgeon et al., 2006; Nieto et al., 2017). For effective conservation, the catchment may need to be included when delineating areas for conservation (Dudgeon et al., 2006; Nieto et al., 2017), except that there is high cost associated with conserving even small catchment (Dudgeon et al., 2006). In this context, priority‐based approaches are necessary to rank different habitats or waterbodies at high risk of degradation, but also practically viable for conservation (Howard et al., 2018). Elsewhere, prioritization indices have been implemented for caves in Brazilian Atlantic Rain Forest (Souza Silva et al., 2014); forests between Atlantic forest and Cerrado (de Mello et al., 2016), and rivers in the Mediterranean basin (Hermoso et al., 2009) to ensure preferential ecosystem selection for conservation. Freshwater ecosystems have not been widely considered as most of the studies have been waterbody or habitat specific, with limited information to rank them for site‐based conservation. The paucity of data on most taxa, including fishes, had in the past curtailed broader‐scale distribution analysis. However, recently, substantial amounts of data on the occurrences of different taxa have been made freely available in online repositories such as the Global Biodiversity Information Facility portal and International Union for Conservation of Nature (IUCN). This study aims to construct and test a novel conservation priority index on the fish species distribution from African lakes. We apply two model ensemble methods, eXtreme gradient boosting (XGBoost) and Random Forest (RF), to determine the most important variables in ranking the lakes for conservation. The index will aid in preferentially selecting ecosystems for site‐based conservation, especially where resources are limiting.

METHODS

Study area, data acquisition, and processing

We considered fish species records from all the lakes in Africa found in the Global Biodiversity Information Facility (GBIF, 2020). We retrieved the records with the occ_download_get function in rgbif package (Chamberlain et al., 2020), and genera names were changed in conformity with FishBase nomenclature (Froese & Pauly, 2021), which is based on Van Oijen (1996). We used the coordinates to correctly reference the records that were outside the geographic range described in FishBase. For instance, Haplochromis eduardii records that were found in Lake Albert in GBIF data were moved to Lake Edward, where the species is endemic (Froese & Pauly, 2021). Records with incomplete scientific epithets such as Oreochromis spp. and Thoracochromis spp. were discarded. Records without a lake or waterbody of origin, but with coordinates, were geo‐referenced using Google Earth Pro or used habitat descriptions, verbatim locality, and location remarks (Figure 1).
FIGURE 1

Data filtering pipeline for records obtained from GBIF (2020). GBIF, Global Biodiversity Information Facility

Data filtering pipeline for records obtained from GBIF (2020). GBIF, Global Biodiversity Information Facility To avoid duplication of lakes, we changed all lake nomenclature to their English names; for example, changing “lac Edouard” to Lake Edward. However, presently accepted names were maintained for lakes whose names have changed over time. To retrieve the species IUCN conservation status from IUCN Red List database (www.iucnredlist.org), we used iucn_summary and iucn_status functions in the taxize R package (Chamberlain et al., 2020). The species were classified according to IUCN Red List for threatened species (IUCN, 2012). Species with IUCN status of LR/nt were changed to near threatened (NT). The data retrieved were processed through a pipeline and only 36,541 (5.16%) records were retained for analysis (Figure 1). We used a species accumulation curve to evaluate whether most of the fish species from African lakes were represented in our data in order to test the conservation priority index (Figure 3). Fish species richness was determined for each lake. Both waterbody relative rarity and total IUCN weighting were computed for each species found in a particular lake and country. The distribution of analyzed fish species records was mapped (Figure 2).
FIGURE 3

Species accumulation curve for the species obtained from the lakes in Africa

FIGURE 2

Map showing the georeferenced fish species 42,588 records from 273 inland Africa lakes (Data obtained from GBIF, 2020)

Map showing the georeferenced fish species 42,588 records from 273 inland Africa lakes (Data obtained from GBIF, 2020)

Evaluating species relative rarity among waterbodies or habitats

Conservation strategies, laws, policies, and binding targets are mostly drawn at regional and international conventions such as Convention of Biological Diversity (CBD), and Ramsar Convention, Convention on International Trade in Endangered Species of Wild Fauna and Flora. However, national authorities or parties are vital in implementing or enforcing the treaties, laws, and policies such as CBD (1992). Thus, the species relative rarity, and consequently the conservation priority index, were computed at a national level. We computed the species relative rarity (SRR) of each species as the ratio of the number of waterbodies where the species was found in a particular country to total number of waterbodies with fish species data and correctly georeferenced in that country (Equation 1). We introduced an arbitrary value of 1 to convert the ratio to be highest for the rarest species and approach 0 if a species is found in most of the waterbodies (Equation 1). We assumed that if a species is found in many waterbodies, it has a greater area of occupancy, and thus not threatened by a single stressor. In IUCN, such a species is listed as least concern (IUCN, 2012). Furthermore, a species that is threatened but found in many waterbodies or habitats would have a low relative rarity compared with a least concern species found in only one lake or waterbody or habitat. The relative rarity weight for a waterbody was computed as the sum of rarity weights for the species in that waterbody scaled to the total number of species in the same waterbody (Equation 1). Species relative rarity (SRR) for a species was computed as: Ws is the total number of lakes where the species was observed in particular country; and Wt is the total number of correctly referenced lakes obtained in particular country. The conservation priority index for the waterbody will be 0 if only one waterbody is considered in a particular country. Relative rarity for each waterbody is the summation of the relative rarity for each species found in that waterbody. Relative rarity for waterbody was computed as follows:

Incorporating waterbody or habitat surface area in the conservation priority index

Implementation of conservation strategies is often constrained by financial resources. Also, most countries invest less in biodiversity conservation compared with other sectors such as agriculture, industrialization, and infrastructure development (Bayon et al., 2000). Thus, before delineating any lake or habitat for conservation, its surface area should be considered to ensure unit costs per unit area are known. In developing this index, the unit cost could not be established, thus surface area was incorporated as a proxy of the financial implications if the waterbody or habitat is prioritized for conservation. Indirectly, the competing interests such as food provision are implied because most of the commercial fishing activities such as trawling are conducted in larger waterbodies. Rivers were not included in the index because the hydro‐geomorphological transformations such as damming create distinct ecosystems along the river course that would require to conserve a particular section independently. In the study, for each correctly referenced waterbody, we collated its surface area from literature (Burgis & Symoens, 1987; Ogutu‐Ohwayo et al., 1999; Olowo et al., 2004; Schofield & Chapman, 1999; Vanden Bossche & Bernacsek, 1990). Where it was not possible to get this information from literature, we used the coordinates to locate the lake on Google Earth and approximated its surface area. This method was applied to lakes Gawa, Kabaleka, Wamala, Nakabale, Owapet, Kirimira, Gashana, Birengero, Igombe, Blue Lake, Carumbo, Chahafi, Rwanyakizinga, Mirayi, Maxai, Sheba Kelbia, Avanga, Chidya, Nkuruba, and Kabaka. The surface area of lakes Natuali, Chankaranga, Okurachere, Kasunju, and Mutabyo could not be determined from both literature and Google Earth, and these lakes were discarded. These are mostly minor lakes that have not been georeferenced on Google Earth. We cross‐referenced all records in lakes with the country of origin; for example, where records from Lake Victoria had country of origin different from Uganda, Kenya, or Tanzania were discarded or georeferenced using Google Earth to determine the country or lake of origin. For shared lakes, such as Tanganyika, Victoria, Chala, Nyasa, Cyohoha, Albert, Kivu, Mweru, Rweru, Jipe, Edward, Turkana, and Kariba, the surface area was approximated for the portion shared in each country. We considered the country‐specific surface area because the conservation funding would possibly be determined by the area of national jurisdiction (CBD, 1992).

Incorporating species' IUCN Red List status in the conservation priority index

The IUCN Red List for threatened species is the most comprehensive and detailed databases that have evaluated the species’ threat levels worldwide (Vié et al., 2008). The species evaluations are conducted at national, regional. or international levels to inform policies, laws, and targets (Vié et al., 2008). Most indices or conservation strategies are designed to cater for threatened species, while excluding data deficient, least concern, near threatened, and not evaluated species. This anomaly predisposes mostly not evaluated (NE) and data deficient (DD) species to become extinct unknowingly. Thus, IUCN suggested that DD and NE species should be considered as critically endangered until their status is known (IUCN, 2012). For least concern species, managers have put less effort to monitor the trends of their stock sizes. For example, species such as Labeo victorianus and Oreochromis esculentus which once dominated the commercial stocks in Lake Victoria (Cadwalladr, 1965) are now critically endangered (FishBase team RMCA & Geelhand, 2016). Thus, in this index, all species were considered and weights were given based on the threat level: highest and lowest weights assigned to extinct and least concern conservation categories, respectively. The weights were assigned as follows: ET = 7, EXw = 6, CR = 5, DD = 5, NE = 5, EN = 4, VU = 3, NT = 2, and LC = 1. We computed a conservation score (Cwt) as the product of total number of species in a given IUCN Red List category and weight assigned to that threat category, summed across all the IUCN Red List categories (Equation 3). The conservation priority index (CPI) was then formulated as a product of the conservation score (IUCN total weights) and relative rarity for each species, summed across all of the species within a lake, divided by the area of the lake, and a scaling constant to account for the number of categories (Equation 4). Here, we used lake area as a penalty based on several assumptions. (1) The cost of conservation is generally higher for large lakes compared to small lakes. (2) Biodiversity generally have limited room for adaptation to stressors in small waterbodies than large waterbodies. (3) Conservation actions are likely to be more effective in small lakes than in large waterbodies. These assumptions aimed at controlling for the size of the waterbody so that the index is not necessarily higher for larger lakes. Where, for each waterbody: Aw is the total surface area of the waterbody. Cwt and RRw are the IUCN total weights and relative rarity for a particular waterbody. The value of 8 was a scaling constant indicating the total number of IUCN categories considered. Thus, the scaling constant depends on the IUCN categories the user considers in the index.

Significance of the variables on the performance of the index

We used Pearson correlation to determine the relationship among species richness, IUCN total weighting, surface area, and waterbody relative rarity. To determine the relative importance of the variables in the conservation priority index (CPI), we arbitrarily classified the waterbody as high priority (CP1 > 0.5, n = 56) and low priority (CPI < 0.5, n = 217). We compared two ensemble machine learning algorithms: Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) to develop model classification predictions for the priority classes (high and low). We converted the categorical variable (country where the lakes were found) into dummy variables (numerical codes) using One‐Hot Encoding, and this variable was included in the model to account for national variations in species composition and conservation status. The pre‐processed data were randomly partitioned into training (70%) and testing (30%). Random Forest (RF) training model was tuned with tuneRF function in randomForest package with a step factor of 2, and an improve rate of 0.05 (Liaw & Wiener, 2002). Partial dependence plots were used to indicate how the index varied with changes in the index parameters, namely conservation score (IUCN total weights, waterbody relative rarity, country, and surface area). For XGBoost, parameters such as mglogloss (evaluation metric) and softprob (objective) were included in the watch list prior to constructing the best model. The model performance was measured using metrics including recall (sensitivity), specificity, precision, F1‐score, and accuracy (Yokoyama & Yamaguchi, 2020). For both models, variable importance plot was determined. We used both XGBoost (tree combinations at the start) and RF (independent tree building) to compare the predictability accuracy and identify the best algorithms to classify the data. Both algorithms are suitable for variances on the input data, and can handle overfitting with variation on the hyperparameter tuning. We used the Wilcoxon signed‐rank test to compare the differences in the median surface area, IUCN total weights, and the waterbody relative rarity for the two priority classes (high and low). The effect size was computed with the wilcox_effsize function from rstatix package.

RESULTS

Species composition, richness, and relationship among index variables

A total of 1897 species were recorded from 273 lakes in 34 countries of Africa. Uganda had the highest number of lakes assessed (39), Democratic Republic of Congo (26), and Rwanda (25). The species accumulation curve increased at a low rate after 20 lakes (Figure 3). Lake Nyasa had the highest species richness (424), followed by lakes Tanganyika (391), Nokoué (246), Victoria (216), and Ahémé (216). In lakes Gashanga, Kingiri, and Saka, only one fish species was observed in the assessed GBIF data. Of 1897 species, 1269 (66.9%) were least concern, 209 (11.0%) not evaluated, 160 (8.4%) data deficient, 81 (4.1%) vulnerable, 92 (4.8%) critically endangered, 45 (2.4%) endangered, 39 (2.1%) near threatened, and 2 (0.1%) were extinct. Species accumulation curve for the species obtained from the lakes in Africa Malawi side of Lake Nyasa had the highest IUCN weighting of 760 followed by Lake Victoria‐ Uganda (609); Lake Tanganyika‐Tanzania (480); and Lake Tanganyika‐DRC (458). Lake Nyasa‐ Malawi, had the highest relative rarity (362.5) followed by Lake Tanganyika‐Tanzania (257.2); Lake Tanganyika‐DRC (243.3); and Lake Tanganyika‐Zambia (179) (Appendix 1). In contrast, Lakes Otjikoto and Guinas had the highest species richness per unit surface area of 800 and 303 individuals/km2, respectively (Table 1). Strong positive correlation was observed between IUCN total weights and species richness (r = .944, df = 271, p < .001), and relative rarity and species richness (r = .966, df = 271, p < .001). However, moderate relationship was observed between surface area and richness (r = .564, df = 271, p < .001), and IUCN total weights (r = .689, df = 271, p < .001) and relative rarity (r = .625, df = 271, p < .001) (Figure 4).
TABLE 1

Model classification performance for index priority classes in testing data (n = 91)

ModelAccuracyRecall/SensitivitySpecificityPrecisionF1 Score
RF0.9120.9330.8130.9580.954
XGBoost0.93410.9470.8750.9720.959

Abbreviations: RF, Random Forest; XGBoost, eXtreme Gradient Boosting.

FIGURE 4

Correlation coefficient of variables incorporated in the conservation priority index (***p value < .001)

Model classification performance for index priority classes in testing data (n = 91) Abbreviations: RF, Random Forest; XGBoost, eXtreme Gradient Boosting. Correlation coefficient of variables incorporated in the conservation priority index (***p value < .001)

Conservation priority index, classification of lakes, and model predictions

Lakes Otjikoto and Guinas from Namibia had the highest conservation priority index of 137.5 and 52.1, respectively, followed by Lake Nkuruba in Uganda at 34.1 (Appendix 1). CPI for lakes Ngami, Piso, and Faguibine was zero (Appendix 1). After classifying the lake into 2 priority conservation classes, of 273 lakes, 56 (20.5%) were of high conservation priority class and 217 (79.5%) with low priority (Appendix 1). Uganda had the highest number of lakes with high priority (14), followed by Cameroon, Madagascar, and South Africa (6) (Appendix 1). The median surface area (km2) for low priority class (64) was significantly higher than that of high priority class (1) (Wilcoxon signed rank: W = 11,768, p < .05, n1 = 217, n2 = 56, effect size = 0.65). The median IUCN total weights and waterbody relative rarity for both low and high priority classes were not significantly different (IUCN: W = 5840, p = .66 and RRw: W = 5522, p = .29). XGBoost model had the highest model classification performance with an accuracy of 0.934 (95% CI: 0.86, 0.98) for test data compared with 0.921 (95% CI: (0.834, 0.96) for Random Forest (RF) (Table 1). However, both models had similar F1 score of 0.959 for XGBoost and 0.954 for Random Forest (RF). The partial dependence plots with test data using RF for priority classes varied depending on the variables (Figure 5). When the surface area of the lake was small, the model predicted a high priority class (Figure 5a). In contrast to surface area, the increase in both IUCN total weights and waterbody relative rarity led to prediction of high priority class for the lakes by RF model (Figure 5b,c). The model prediction for priority classes versus the 34 country codes was not presented in plots.
FIGURE 5

Partial dependence plots for testing data of the variables on predicting high priority classifications (a: Lake surface area (km2), b: IUCN total weights; c: waterbody relative rarity)

Partial dependence plots for testing data of the variables on predicting high priority classifications (a: Lake surface area (km2), b: IUCN total weights; c: waterbody relative rarity) In both models, surface area had the highest variable importance (19.2% mean decrease in accuracy and 29.3% mean decrease in Gini for RF) and 72.7% Gain for XGBoost (Figures 6 and 7). In RF, surface area was followed by IUCN total weights (6.44%), rarity (4.4%), and least for country codes. Similarly, in XGBoost, surface area was followed by IUCN total weights (20.8%), waterbody relative rarity, and country codes. In both models, the country variable was converted into dummy codes and only three country codes were significant and displayed in the plot (Figures 6 and 7).
FIGURE 6

Variable importance: Gain for XGBoost (code22: Uganda; code32: Rwanda; and code23: Nigeria)

FIGURE 7

Variable importance (mean decrease in accuracy and mean decrease in Gini) for the Random Forest model (Country codes: code 23: Uganda; Code 15: Kenya; Code 17: Madagascar; Code 32: Rwanda; and Code 1: Angola)

Variable importance: Gain for XGBoost (code22: Uganda; code32: Rwanda; and code23: Nigeria) Variable importance (mean decrease in accuracy and mean decrease in Gini) for the Random Forest model (Country codes: code 23: Uganda; Code 15: Kenya; Code 17: Madagascar; Code 32: Rwanda; and Code 1: Angola)

DISCUSSION

Species occurrences, rarity, and relationship among index variables

A total of 1897 fish species from 36,541 occurrence records were included in this index, representing 64% of all known freshwater fish species in Africa (Eschmeyer, 2005). The species accumulation curve approached asymptote, which suggested that most of the species were considered in the experimentation of the index (Gotelli & Colwell, 2001). Uganda followed by DRC had the highest number of lakes considered in the index, which could be attributed to extensive open source data sharing in the GBIF compared to other countries (PBES Technical Support Unit on Knowledge & Data, 2020). Some data points shared in the portal were filtered out in the index because key attributes such as waterbody name, geolocation, site description, habitat description, and verbatim locality were not provided, which reduced the number of occurrences considered in the index. Lakes Nyasa, Victoria, Tanganyika, Nokoué, and Ahémé had the highest species richness, while lakes including Kingiri, Saka, and Gashanga had the least. However, lakes Giunas and Otijkoto had the highest species richness per unit surface area. Also, strong positive correlations were observed among the index variables. The strong relationship between species richness and lake surface area agreed to the biogeographical principles that the larger the habitat, the more species it harbors (Rosenzweig, 1995). Differences in species richness and relative rarity among waterbodies are attributed to geomorphological, abiotic, and biotic factors (Brown et al., 2007). Also, isolated inland waterbodies favor rapid allopatric speciation and adaptive radiation because the species are exposed to different evolutionary pressures (Basiita et al., 2018). Lakes Nyasa, Victoria, and Tanganyika are endowed with diversity haplochromine cichlids, which has been attributed to the rapid speciation rates associated with habitat heterogeneity and disruptive sexual selection (Salzburger et al., 2005; Seehausen, 2000). The haplochromine lineages are endemic to Lake Tanganyika (Salzburger et al., 2005). The geomorphological barriers, for example, the Murchison Falls along Victoria Nile hindered fish migration to Lake Kyoga from Albert (Basiita et al., 2018). The falls along River Semiliki prevent fish passage to Lake Albert from Edward (Acere & Mwene‐Beyanga, 1990). Similarly, Lake Victoria and Kyoga were previously separated by the Owen and Bujagali falls (Basiita et al., 2018), and a sandbar separated Lake Nabugabo and Victoria (Stager et al., 2005). These biogeographical barriers may have led to allopatric speciation. For example, Lake Nabugabo which was once connected to Lake Victoria had five endemic species (van Alphen et al., 2004; Ogutu‐Ohwayo, 1993). The high species richness of Lake Nokoué could be attributed to its high habitat and seasonal variability; for example, the estuarine, freshwater, and marine habits (Lalèyè et al., 2003). The lake is connected to Atlantic Ocean, Porto‐Novo lagoon (30 km2), and the Ouémé Delta (Lalèyè et al., 2003). These habitats support a diversity of species within the lake. However, Lalèyè et al. (2003) identified only 51 species from Lake Nokoué while understanding its spatial and seasonal ichthyofaunal distribution. GBIF holds data from different sources, and thus the 216 species obtained from the data could be attributed to different data sources. Lake Nokoué, similar to lakes Nyasa, Victoria, and Tanganyika is threatened by anthropogenic pressure due to intense settlement around them. According to the Global Nature Fund (2019), along the Cotonou channel to the Atlantic Ocean was surrounded by about 700,000 people who depended on the lake for food and water. In the study, 66.9% of the species were classified as least concern under the global IUCN Red List for threatened species, which accounts for the low conservation priority indices. Although most of the species were not endangered, localized threats to the individual species cannot be ignored. National Red Lists should be developed to ensure effective monitoring of the species threat status. For example, species such as Mormyrus kanuume is exploited around Lake Victoria as bait in the Lates niloticus fishery (Bassa, 2018), which would threaten its population at national and regional levels. In Uganda, a National Red List was produced for all other taxa except fish (MTWA, 2018). Differences were observed in the species threat levels between the global and national lists; for example, Oreotragus oreotragus (Klipspringer), Otomys typus (Northern Groove toothed Rat), Scotophilus leucogaster (Northern Lesser House Bat and White‐bellied Yellow Bat) were regarded as least concern by global IUCN Red List but evaluated as vulnerable in Uganda National Red List (MTWA, 2018). In South Africa, Kniphofia leucocephala (Red‐hot Pokers) are not evaluated under the global IUCN Red List but critically endangered in South Africa (SANBI, 2010).

Conservation priority index and model predictions

Lakes Giunas and Otjikoto had the highest conservation priority index. The lakes harbor Tilapia guinasana—a critically endangered species introduced from Lakes Giunas to Otjikoto (Skelton, 1978). Generally, the restricted size of the lakes increases the vulnerability of their biota to sudden changes in ecological pressures such as alien species invasion and habitat degradation (Irish, 1991; Skelton, 1978). For example, T. guinasana was introduced to Lake Otjikoto from Giunas and invaded the niches for populations of Pseudocrenilabrus philander (Irish, 1991). Both lakes are threatened by habitat degradation and pollution because they are surrounded by agricultural fields (Irish, 1991). Lake Otjikoto was declared a national monument in Namibia, and the only underwater museum in the world because ammunitions were dumped in the lake after World War 1 (van Rooyen, 2010). In addition, both lakes are the only permanent lakes in Namibia (ATFALCO, 2012) and thus requires urgent conservation measures both nationally and globally. Noticeably, the Ramsar Secretariat (2002) noted that the two sinkhole lakes exclusively qualify because of the composition of threatened fish species. Although lakes Nyasa, Victoria, Tanganyika, and Nokoué had the highest species richness, their CPI were low. The index considered surface area as a proxy for the cost of implementing a biodiversity strategy that would be applied on the lake. According to article 5a of the Convention on Biological Diversity multilateral treaty, each contracting party shall “develop national strategies, plans or programmes for the conservation and sustainable use of biological diversity or adapt for this purpose existing strategies plans or programmes which shall reflect, inter alia, the measures set out in this Convention relevant to the Contracting Party concerned.” Therefore, the cost involved in conserving Lake Tanganyika, which is shared by four countries, would require harmonized national strategies. Also, because the surface area is large, ecological variabilities are available for the species to seek refugia if its native habitat or niche is affected or invaded by a predator. For example, in Lake Victoria, rocky dwelling Paralabidochromis species were not highly affected by the invasion of Lates niloticus (Balirwa et al., 2003). In small lakes or habitats, due to less ecological variability, the species are highly exposed to ecological threats. Also, it requires less costs to conserve small habitats and local community conservation approach can easily be applied. The CPI of the lake was zero if it was only one lake or habitat assessed in a particular country, thus no comparisons could be made for priority selection.

Classification and prediction of habitat priority

Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) had similar model prediction accuracy, sensitivity (recall), F1 score, precision, and specificity for both training and testing data. After classification of the lakes into high and low priority, the partial dependence plots from RF models showed that high priority classes were predicted at smaller surface area of the lake. The higher the surface, the lower the priority for conservation. Partial dependence plots are vital in determining the relationship between the variables and the predicted probabilities of the classes (Cutler et al., 2007). Surface area was ranked as the variable with highest importance in the index by both models. Although the species rarity, richness, and threat status are vital biodiversity conservation science, the ability and success of managing the habitat will depend on the costs required to implement the strategies. Biodiversity funding mostly in developing countries and world over is still minimal (Bayon et al., 2000), despite the 33 trillion US dollars that is averagely generated from ecosystem services annually (Costanza et al., 1997). The IUCN key biodiversity areas (KBA) are sites vital to protect global biodiversity dependent on key trigger species (IUCN, 2016). However, priority areas for conservation such as protected area can lie beyond key biodiversity areas (IUCN, 2016). Similar to study, IUCN (2016) noted that for conservation priority areas, the cost, connectivity, and evolutionary history should be considered.

CONCLUSION

The aim of this study was to construct a novel conservation priority index to aid in selection of a habitat for site‐based conservation. The index was designed and tested on 1897 fish species from 273 Africa inland lakes in 34 countries. We applied two model ensemble methods, eXtreme gradient boosting (XGBoost) and Random Forest (RF), to determine the most important variables in ranking the lakes for conservation. Results showed that lake surface area was the most important variable for ranking habitats for site‐based conservation. While species richness is generally higher for large lakes compared to small ones, this study suggests that smaller waterbodies need to be prioritized for because of the low habitat heterogeneity, low ecological substitutability for the species, and higher levels of exposure to human‐induced threats in small waterbodies compared to large systems. For large systems with vast habitat heterogeneity, fish species can easily seek refugia in other habitats. This index can be applied at local, national, and regional scale for other taxa, and can aid in preferentially selecting ecosystems for site‐based conservation, especially where resources are limiting. This index can be greatly affected by incorrect identification of the species. Also, species need to be correctly geolocated to avoid over‐ or underweighting/ranking of the waterbody, habitat, or any ecosystem.

CONFLICT OF INTEREST

The authors declare no conflict of interest.

AUTHOR CONTRIBUTIONS

Anthony Basooma: Conceptualization (lead); Data curation (lead); Formal analysis (lead); Investigation (lead); Visualization (lead); Writing – original draft (lead); Writing – review & editing (lead). Herbert Nakiyende: Conceptualization (equal); Methodology (equal); Writing – review & editing (equal). Mark Olokotum: Conceptualization (equal); Investigation (equal); Methodology (equal); Validation (equal); Writing – review & editing (equal). John S. Balirwa: Conceptualization (equal); Methodology (equal); Writing – review & editing (equal). Winnie Nkalubo: Investigation (equal); Methodology (equal); Writing – review & editing (supporting). Laban Musinguzi: Conceptualization (equal); Investigation (equal); Methodology (equal); Validation (equal); Writing – review & editing (equal). Vianny Natugonza: Conceptualization (equal); Investigation (equal); Methodology (equal); Supervision (lead); Validation (equal); Writing – review & editing (equal).

OPEN RESEARCH BADGES

This article has earned an Open Data Badge for making publicly available the digitally‐shareable data necessary to reproduce the reported results. The data is available at [https://doi.org/10.5061/dryad.4b8gthtcx].
CountryLakeSpecies richnessSurface area (km2)Richness/SA (ind/km2)IUCN total weightsRelative rarityCPI
AngolaCuanavale300.953254205
Salia Kuembo140.56251871.9
Cuito Source60.4912102.30.9
Calundo284.273818.30.8
Carumbo62.13183.50.7
BeninHlan431.923489.80.9
CameroonEjagham80.4916367.28.3
Bemin60.5810305.45.8
Barombi Koto273.389122.32.8
Barombi17536013.41.2
Soden51.334940.6
Manengouba20.37521.80.6
CongoBlue150.121251911.815.8
Yangala101.825127.30.6
DRCLukushi232.6692819.51.1
Fwa122.13612110.6
GabonKayaus131.1411214.90.8
Nguene253.7674110.90.6
MadagascarSarodrano30.13072.27.2
Ravelobe90.3625216.45.1
Djabala20.12070.94.2
Andjavibe30.311081.81.7
Amparihibe41.383112.30.6
Andrapongy104.692296.90.6
MalawiChikukutu222.59269.70.6
MozambiqueMaxai60.4115164.83.9
NamibiaOtjikoto40.005800102137.5
Guinas20.006630360.852.1
NigeriaIsemu350.122923921.525.5
Kware170.17100178.86.4
RwandaKisantu80.93985.90.8
Sierra LeoneKwako20120259.51.5
South AfricaShazibe80.180154.19.2
Makhawulani241243120.23.2
Kuzilonde110.8313236.92.5
Bhangazi191193011.82.1
Mgobezeleni201.28163512.82.1
Shengeza91.227193.90.9
TanzaniaNala30.221472.43.5
Malimbe40.271582.92.4
Chala‐TZ42.62163.20.7
UgandaNkuruba30.0215071.834.1
Kayugi120.2548237.47.4
Manywa80.2532174.55.1
Gigate201.7125812.32.9
Kayanja171.2143310.52.3
Naragaga131.987338.21.4
Agu24466215.61.4
Nawampasa32849221.91.1
Kawi24555815.61
Kabaka's10.11010.81
Kasodo40.71682.30.9
Gawa31.422112.20.7
Kabaleka51.14473.30.6
Kimira82.214164.10.5
ZambiaChunga40.192141.61.1
AngolaDilolo318.90.159310.007
BeninNokoué246495.023391110.41
Lake Ahémé2161002.1628688.80.152
Toho379.523.8874670.121
BotswanaNgami421540.2735400
Burkina FasoBam7200.35700
BurundiRweru‐BI15800.188265.80.018
Tanganyika‐BI18626000.072289135.80.01
Cyohoha‐BI9550.164131.30.004
Rugwero101000.1151.80.003
CameroonDouloumi77.790.89976.30.101
Nyos11.580.63310.90.071
Ossa639.270.153104.90.026
Mbakaou35000.00672.60.001
ChadIro51100.04552.50.003
Chad713500.005113.50.001
CongoKobambi123.243.704128.30.322
Telle15200.752712.20.139
Youbi43.51.143430.107
Cayo418.90.212830.042
Côte d'IvoireKossou301780.1693111.50.008
Ayame141100.127153.50.005
DRCNdaraga33.730.80472.80.222
Mukamba36.70.44832.30.044
Lac Nkolentulu125.30.04510.024
Tumba757650.09810768.60.016
Mulenda8620.12986.60.013
Yandja192170.0882517.10.013
Kisale293000.0973424.20.012
Upemba455300.0855639.10.012
Kabamba131260.1031410.80.012
Zimbambo51890.026104.50.006
Edward‐DRC3316500.028030.90.006
Mai‐Ndombe5723050.02510152.80.005
Mweru‐DRC6219500.0328756.30.005
lac Pempere118.90.05310.80.005
Tshangalele6362.50.017125.50.004
Tanganyika‐DRC25614,8000.017458243.60.004
Kisangolungwe103260.031108.30.003
Albert‐DRC3424200.0146331.60.003
Lualaba42260.01863.40.003
Kivu‐DRC2113700.0153319.50.003
Mbula‐Matari126590.01812110.002
de N'Zilo12000.00510.90.001
Mobutu15300010.90
Mocro115,1000110
EgyptTimsah20141.4294115.70.304
Idku1462.780.22321100.031
Bitter Lake92500.036217.50.009
Mariout7106.10.06694.70.008
Tismah1140.07110.60.006
Karun102330.043147.40.006
Manzala4213600.0316133.90.005
Burrullus114620.024177.50.003
Waadi El‐Raiyan Lakes11130.00910.60.001
Zietoon11200.00810.50.001
Nasser943570.00297.20
EthiopiaAfrera21000.0281.80.009
Awasa51290.03993.30.007
Ziway124400.027238.90.005
Tana2721560.0138422.50.004
Hayq1230.04310.70.004
Chamo43170.01383.10.002
Abaya811620.0071660.001
Ganjule43280.01242.50.001
Langano22300.00921.10.001
Turkana‐ET314300.00232.70
GabonNzile214.664.506299.70.322
Ndeguelie207.212.774288.40.185
Kebanda123.253.692202.40.181
Ingoyo2211.31.9473010.30.145
Menguegne134.143.14173.90.119
Ayem2012.41.613288.40.107
Nkonié1922.40.848357.30.082
Ezanga2468.50.353613.30.034
Avanga1331.20.417214.20.026
Azingo1958.50.325316.60.024
Anengue20880.227328.30.018
Onangue462880.1662290.016
GhanaBosomtwe12490.245207.50.034
Volta6485020.0087146.30.001
Aby Lagoon14240.00210.80
Volta lake18502010.50
KenyaSare851.6226.20.427
Chala‐KE22.60.76961.80.262
Kanyaboli12150.8288.90.17
Jipe‐KE4240.167123.30.055
Nakuru3520.058122.30.021
Magadi41000.0493.10.009
Victoria‐KE10841000.02630895.50.008
Naivasha31500.02112.30.007
Baringo71680.04275.70.004
Turkana‐KE4761400.0086941.60.001
Jilore24440.00561.50.001
LiberiaPiso141030.1362600
MadagascarKinkony161000.163812.70.038
Itasy7350.2115.30.027
Alaotra79000.008215.20.002
MalawiChiuta361990.1815318.80.018
Malombe324500.0714520.20.008
Nyasa‐MW41224,4000.017760326.50.003
Chilwa166000.0272260.002
Kingiri10.293.448100
MaliFaguibine115900.0191100
MozambiqueMachane53.251.538112.80.248
Piti2727.30.9895621.50.211
Chualo1613.31.2032013.40.16
Chingute89.480.844145.30.117
Pave69.490.63283.60.057
Chicunga626.30.22894.60.035
Chicamba151600.0941711.50.01
Poelela51130.04473.50.005
Nyasa‐MZ14064000.022198126.60.004
Cabora‐Bassa227390.00121.80
NamibiaLiambezi3117980.0173322.50.002
Nyasa4329,6000.0017632.30
NigeriaToidi70.937.52772.80.37
Kainji5712430.04662350.004
RwandaRukira20.5421.60.41
Mpanga219.52.21128160.272
Kilimbi53.471.44192.80.183
Birengero43.161.26662.80.179
Sake1214.30.839238.80.149
Bilira55.40.926930.122
Rumira32.21.364320.114
Mirayi13.750.26750.60.093
Hago1216.10.745168.80.085
Rwanyakizinga1423.60.5932210.20.083
Rweru‐RW9200.451460.057
Mugesera10390.256237.20.056
Ihema28860.3264421.90.051
Karago11.180.84710.50.051
Lokondo226.10.077101.80.044
Luhondo326.10.115102.30.042
Gashanga12.10.47610.60.036
Mohasi1034.10.293127.60.034
Cyohoha‐RW4190.21142.60.017
Bulera2540.03751.40.01
Kivu‐RW4110000.0416234.90.007
Lohondo126.10.03810.80.004
Mutanda1220.04510.50.003
Chahafi126.60.03810.50.002
SenegalGuiers581700.3416200
Sierra LeoneMabesi123.10.043100
South AfricaCubhu134.622.814277.80.419
Mpungwini112.674.12118.90.416
Mzingazi19111.7274512.20.349
Teza82.623.053143.70.297
Jeffreys Bay2920.791.3954326.90.24
Nhlabane75.861.195103.90.11
Kuhlange1531.60.4752811.40.08
Sibaya27640.4225118.90.07
St Lucia Lake683500.19414257.70.043
Funduzi6150.463.90.032
Mentz934.520.261126.30.028
Chrissie3180.16732.10.014
South SudanYirol117.131.543115.30.094
Shambe1030.30.33143.30.025
No251000.252913.30.02
SudanNubia118920.0121100
TanzaniaLutamba31.71.76532.20.164
Jipe‐TZ4150.267163.20.103
Ilamba28.10.24761.70.083
Igombe311.70.25672.20.049
Chidya270.28621.40.025
Kitere27.90.25321.30.02
Nyamagoma81080.074126.70.012
Kitangiri21050.01991.40.007
Babati1210.048110.006
Burigi1700.01450.60.005
Tanganyika‐TZ27213,5000.02480257.20.004
Nyasa‐TZ10455690.01917798.80.004
Sagara53620.014133.60.003
Manyara54700.011124.10.003
Natron610400.006195.40.002
Victoria‐TZ12433,7000.004428116.90.002
Rukwa3957600.0076035.20.001
Eyasi410500.004112.70.001
Sulunga210290.00251.50
TogoTogo7640.109700
TunisiaSebhika Kelbia21240.016200
UgandaNakabale1071.429385.90.458
Dalaja21.71.17661.20.345
Nabugabo42241.757631.30.311
Lemwa14101.4387.50.286
Nakuwa1682269.70.25
Opeta40420.95210028.10.22
Nyaguo27330.8186517.70.172
Mburo1210.41.1542270.165
Owapet65.231.147103.40.147
Nabisojo761.167133.20.131
Nakivale8260.308185.50.067
Meito414.640.273102.20.052
Nyamusigire44.40.90941.70.049
Kachiira1336.30.358217.90.048
George562500.2249246.80.04
Saka11.10.90910.30.038
Bisina45349.310.12911832.10.032
Kijanebolola9420.214175.40.031
Edward‐UG436750.06410736.10.018
Bunyonyi4570.0782.70.014
Kwania315080.0617121.80.013
Kyoga581821.60.03213145.30.007
Albert‐UG6228500.02212354.10.005
Wamala101870.0531250.004
Victoria‐UG17731,0000.006609158.70.002
ZambiaChila71773.30.406
Wakawaka104.592.179105.30.143
Ishiba77.50.933113.30.113
Tanganyika‐ZM21320000.107347179.40.019
Mweru‐ZM8127000.0310658.50.004
Kariba‐ZM5027000.0197635.30.003
Bangweulu6815,1000.00510047.30.001
ZimbabweKyle52490.0292.80.003
Kariba‐ZW5227000.0198237.80.003
Chivero226.320.07620.50.002
Mutirikwi1900.011100
  12 in total

1.  Random forests for classification in ecology.

Authors:  D Richard Cutler; Thomas C Edwards; Karen H Beard; Adele Cutler; Kyle T Hess; Jacob Gibson; Joshua J Lawler
Journal:  Ecology       Date:  2007-11       Impact factor: 5.499

2.  Cave conservation priority index to adopt a rapid protection strategy: a case study in Brazilian Atlantic rain forest.

Authors:  Marconi Souza Silva; Rogério Parentoni Martins; Rodrigo Lopes Ferreira
Journal:  Environ Manage       Date:  2014-12-21       Impact factor: 3.266

3.  Emerging threats and persistent conservation challenges for freshwater biodiversity.

Authors:  Andrea J Reid; Andrew K Carlson; Irena F Creed; Erika J Eliason; Peter A Gell; Pieter T J Johnson; Karen A Kidd; Tyson J MacCormack; Julian D Olden; Steve J Ormerod; John P Smol; William W Taylor; Klement Tockner; Jesse C Vermaire; David Dudgeon; Steven J Cooke
Journal:  Biol Rev Camb Philos Soc       Date:  2018-11-22

4.  Linking freshwater fishery management to global food security and biodiversity conservation.

Authors:  Peter B McIntyre; Catherine A Reidy Liermann; Carmen Revenga
Journal:  Proc Natl Acad Sci U S A       Date:  2016-10-24       Impact factor: 11.205

Review 5.  Freshwater biodiversity: importance, threats, status and conservation challenges.

Authors:  David Dudgeon; Angela H Arthington; Mark O Gessner; Zen-Ichiro Kawabata; Duncan J Knowler; Christian Lévêque; Robert J Naiman; Anne-Hélène Prieur-Richard; Doris Soto; Melanie L J Stiassny; Caroline A Sullivan
Journal:  Biol Rev Camb Philos Soc       Date:  2005-12-12

6.  Out of Tanganyika: genesis, explosive speciation, key-innovations and phylogeography of the haplochromine cichlid fishes.

Authors:  Walter Salzburger; Tanja Mack; Erik Verheyen; Axel Meyer
Journal:  BMC Evol Biol       Date:  2005-02-21       Impact factor: 3.260

7.  Global patterns of freshwater species diversity, threat and endemism.

Authors:  Ben Collen; Felix Whitton; Ellie E Dyer; Jonathan E M Baillie; Neil Cumberlidge; William R T Darwall; Caroline Pollock; Nadia I Richman; Anne-Marie Soulsby; Monika Böhm
Journal:  Glob Ecol Biogeogr       Date:  2013-07-03       Impact factor: 7.144

8.  The role of macroinvertebrates for conservation of freshwater systems.

Authors:  Carolina Nieto; Ximena M C Ovando; Rafael Loyola; Andrea Izquierdo; Fátima Romero; Carlos Molineri; José Rodríguez; Paola Rueda Martín; Hugo Fernández; Verónica Manzo; María José Miranda
Journal:  Ecol Evol       Date:  2017-06-15       Impact factor: 2.912

9.  Inland surface waters in protected areas globally: Current coverage and 30-year trends.

Authors:  Lucy Bastin; Noel Gorelick; Santiago Saura; Bastian Bertzky; Grégoire Dubois; Marie-Josée Fortin; Jean-Francois Pekel
Journal:  PLoS One       Date:  2019-01-17       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.