| Literature DB >> 34347806 |
Virginia Ahedo1, Débora Zurro2, Jorge Caro1, José Manuel Galán1.
Abstract
The transition to agriculture is regarded as a major turning point in human history. In the present contribution we propose to look at it through the lens of ethnographic data by means of a machine learning approach. More specifically, we analyse both the subsistence economies and the socioecological context of 1290 societies documented in the Ethnographic Atlas with a threefold purpose: (i) to better understand the variability and success of human economic choices; (ii) to assess the role of environmental settings in the configuration of the different subsistence economies; and (iii) to examine the relevance of fishing in the development of viable alternatives to cultivation. All data were extracted from the publicly available cross-cultural database D-PLACE. Our results suggest that not all subsistence combinations are viable, existing just a subset of successful economic choices that appear recurrently in specific ecological systems. The subsistence economies identified are classified as either primary or mixed economies in accordance with an information-entropy-based quantitative criterion that determines their degree of diversification. Remarkably, according to our results, mixed economies are not a marginal choice, as they constitute 25% of the cases in our data sample. In addition, fishing seems to be a key element in the configuration of mixed economies, as it is present across all of them.Entities:
Year: 2021 PMID: 34347806 PMCID: PMC8336859 DOI: 10.1371/journal.pone.0254539
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Focal time periods of the societies documented in the EA.
| Focal time period | % of EA societies |
|---|---|
| Before 1800 | 3 |
| 1800–1899 | 27 |
| 1900–1950 | 66 |
| After 1950 | 2 |
Variables selected for the present study from the different sources available at D-PLACE.
| Dataset | Variables | |||
|---|---|---|---|---|
| D-PLACE variable ID | Short name | Description | Data type | |
| Ethnographic Atlas (Ethnology, 1962; Gray, 1998; Murdock, 1967) | EA001 | % dependence on gathering | Percentage of dependence on the gathering of wild plants and small land fauna, relative to other subsistence activities. | Ordinal |
| EA002 | % dependence on hunting | Percentage of dependence on hunting, including trapping and fowling, relative to other subsistence activities. | Ordinal | |
| EA003 | % dependence on fishing | Percentage of dependence on fishing, including shellfishing and the pursuit of large aquatic animals, relative to other subsistence activities. | Ordinal | |
| EA004 | % dependence on husbandry | Percentage of dependence on animal husbandry, relative to other subsistence activities. | Ordinal | |
| EA005 | % dependence on agriculture | Percentage of dependence on agriculture, relative to other subsistence activities. | Ordinal | |
| EA028 | Agriculture intensity | Intensity of cultivation. Levels: no agriculture, casual, extensive/shifting, horticulture, intensive, intensive irrigated. | Categorical | |
| EA029 | Major crop type | Principal type of crop cultivated. Levels: no agriculture, non-food, vegetables, tree-fruits, roots/tubers, cereals. | Categorical | |
| EA030 | Settlement patterns | The prevailing type of settlement pattern. Levels: nomadic, seminomadic, semisedentary, impermanent, dispersed homesteads, hamlets, villages/towns, complex permanent. | Categorical | |
| EA033 | Jurisdictional hierarchy beyond local community and political complexity | The number of jurisdictional levels beyond the local community, with 1 representing the theoretical minimum (none/autonomous band or villages) and 4 representing the theoretical maximum (villages nested within parishes, districts, provinces, and a complex state). This variable also provides a measure of political complexity, ranging from 1 for stateless societies, through 2 or 3 for petty and larger paramount chiefdoms or their equivalent, to 4 or 5 for large states. Imposed colonial regimes are excluded. | Ordinal | |
| EA039 | Plow cultivation | Indicates whether or not animals are employed in plow cultivation, and whether plow cultivation is aboriginal or dates to the post-contact period. Levels: absent, not aboriginal but present, present. | Categorical | |
| EA040 | Type of domestic animals | The predominant type of animals kept. Levels: absence or near absence, pigs, sheep/goats, equine, deer, camelids, bovine. | Categorical | |
| EA041 | Milking | Indicates whether or not domestic animals milked. Levels: absence or near absence, more than sporadically. | Categorical | |
| EA202 | Population size | Population of ethnic group as a whole. Note that source differs by society; EA bibliography is source where possible, otherwise Ember [ | Continuous | |
| Jenkins et al. [ | AmphibianRichness | Amphibian richness | Number of coexisting amphibian species. | Continuous |
| BirdRichness | Bird richness | Number of coexisting bird species. | Continuous | |
| MammalRichness | Mammal richness | Number of coexisting mammal species. | Continuous | |
| Kreft and Jetz [ | VascularPlantsRichness | Vascular plant richness | Number of coexisting vascular plant species. | Continuous |
| Moderate Resolution Imaging Spectroradiometer [ | AnnualNetPrimaryProductionVariance | Variance in net primary production per month | Variance in net primary production per month. | Continuous |
| MonthlyMeanNetPrimaryProduction | Net primary production per month (grams of carbon uptake per square meter of land per month) | Net primary production per month (grams of carbon uptake per square meter and land per month). | Continuous | |
| NetPrimaryProductionConstancy | Net primary production constancy | Colwell’s [ | Continuous | |
| NetPrimaryProductionContingency | Net primary production contingency | Colwell’s [ | Continuous | |
| NetPrimaryProductionPredictability | Net primery production predictability | Colwell’s [ | Continuous | |
| Terrestrial Ecoregions of the World | Biome | Biome | Classification by Dinerstein et al [ | Categorical |
| Baseline Historical (1900–1949), CCSM ecoClimate model [ | AnnualMeanTemperature | Mean value of monthly temperature across the year | Mean value of monthly temperature across the year | Continuous |
| AnnualPrecipitationVariance | Variance in monthly precipitation means | Variance in montly precipitation means | Continuous | |
| AnnualTemperatureVariance | Variance in monthly temperature means | Variance in monthly temperature means | Continuous | |
| MonthlyMeanPrecipitation | Mean monthly precipitation in ml/m2/month | Mean monthly precipitation in ml/m2/month | Continuous | |
| PrecipitationConstancy | Precipitation constancy | Colwell’s [ | Continuous | |
| PrecipitationContingency | Precipitation contingency | Colwell’s [ | Continuous | |
| PrecipitationPredictability | Precipitation predictability | Colwell’s [ | Continuous | |
| TemperatureConstancy | Temperature constancy | Colwell’s [ | Continuous | |
| TemperatureContingency | Temperature contingency | Colwell’s [ | Continuous | |
| TemperaturePredictability | Temperature predictability | Colwell’s [ | Continuous | |
Results from PCA on the subsistence dataset.
| PC Dim. | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| Eigenvalues | 2.624 | 1.043 | 0.842 | 0.492 | 0.000 |
| % of var. | 52.477 | 20.851 | 16.841 | 9.831 | 0.000 |
| Cumulative % of var. | 52.477 | 73.328 | 90.169 | 100.000 | 100.000 |
The first four dimensions explain all the variance in the data.
Fig 1PCA graph of variables for dimensions 1:2 and 3:4 respectively.
Fig 2ODIs of the subsistence dataset (on the left) and a random dataset generated from it (on the right).
Note that according to the legend colour red means high similarity (low dissimilarity) and blue high dissimilarity. After comparison with the random dataset, it becomes clear that our subsistence data have structure.
Fig 3Map with the subsistence clusters obtained for k = 7.
The different societies have been placed in accordance with their latitude and longitude and coloured according to the cluster they belong to. The different world regions have been coloured to represent the different biomes. Two legends provided: the upper right one presents the detail of the average percentages of dependence on gathering, hunting, fishing, husbandry and agriculture of each of the clusters. The lower right one is the legend of the biomes. Map source–[150]. Biome cartography is licensed under CC-BY 4.0 (https://ecoregions2017.appspot.com/).
Fig 4Map with 9 of the subsistence clusters obtained for k = 15.
The different societies are placed in accordance with their latitude and longitude and coloured according to the cluster they belong to. Only those subsistence strategies with a significant level of agriculture and/or husbandry have been considered. Two legends provided: The upper right one presents the detail of the average percentages of dependence on gathering, hunting, fishing, husbandry and agriculture of each of the clusters. The lower right one is the legend of the biomes. Map source–[150]. Biome cartography is licensed under CC-BY 4.0 (https://ecoregions2017.appspot.com/).
Summary table for k = 7.
| Clusters’ average strategies (Mean values per variable and cluster) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cluster nb | Gathering (%) | Hunting (%) | Fishing (%) | Husbandry (%) | Agriculture (%) | Entropy | SD | Limit 15 | Limit 10 | Interpretation |
| 2 | 16,40 | 69,82 | 8,50 | 2,64 | 2,64 | 0,95 | 28,42 | 2 | 2 | HGF–Hunters |
| 5 | 4,74 | 8,21 | 5,91 | 19,67 | 61,47 | 1,14 | 23,93 | 2 | 2 | Agriculture(62%) + Husb.(20%) |
| 4 | 4,43 | 9,51 | 6,77 | 61,89 | 17,40 | 1,15 | 23,92 | 2 | 2 | Husbandry(62%) + Agric.(17%) |
| 3 | 18,16 | 26,36 | 49,42 | 2,73 | 3,33 | 1,22 | 19,27 | 3 | 3 | HGF–Fishers |
| 1 | 44,00 | 33,64 | 16,00 | 2,64 | 3,73 | 1,24 | 18,33 | 3 | 3 | HGF–Gatherers |
| 6 | 4,77 | 6,16 | 28,20 | 10,30 | 50,57 | 1,25 | 19,49 | 2 | 3 | AgroFishing |
| 7 | 15,21 | 24,55 | 16,46 | 3,80 | 39,99 | 1,42 | 13,40 | 4 | 4 | WSE: HGF + Agriculture |
It includes cluster number, each cluster’s average strategy (the average of the percentages of dependence on gathering, hunting, fishing, husbandry and agriculture across all societies in the cluster), their entropy, standard deviation, the number of variables with a percentage of dependence equal or greater than 15% and 10%, and a succinct interpretation of the cluster. Note that the table has been sorted in ascending order of entropy.
Fig 5Ridgeline plot of the entropy distributions of each cluster for k = 7.
Recall that the distributions have been sorted in ascending order of entropy along the vertical axis.
Fig 6Breiman’s individual variable importance averaged across the 100 imputed datasets.
One standard deviation error bar.
Fig 7Group variable importance averaged across the 100 imputed datasets.
One standard deviation error bar.
Fig 8Sunburst of the different subsistence strategies identified via hierarchical clustering for k = 2, 7, 15.
A short description is provided for each of the clusters. Note that the size of the circular sectors is proportional to the number of societies falling under each cluster.