| Literature DB >> 33270680 |
Balázs Bodó1, Dániel Antal2, Zoltán Puha3.
Abstract
Library Genesis is one of the oldest and largest illegal scholarly book collections online. Without the authorization of copyright holders, this shadow library hosts and makes more than 2 million scholarly publications, monographs, and textbooks available. This paper analyzes a set of weblogs of one of the Library Genesis mirrors, provided to us by one of the service's administrators. We reconstruct the social and economic factors that drive the global and European demand for illicit scholarly literature. In particular, we test if lower income regions can compensate for the shortcomings in legal access infrastructures by more intensive use of illicit open resources. We found that while richer regions are the most intensive users of shadow libraries, poorer regions face structural limitations that prevent them from fully capitalizing on freely accessible knowledge. We discuss these findings in the wider context of open access publishing, and point out that open access knowledge, if not met with proper knowledge absorption infrastructures, has limited usefulness in addressing knowledge access and production inequalities.Entities:
Year: 2020 PMID: 33270680 PMCID: PMC7714232 DOI: 10.1371/journal.pone.0242509
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Daily aggregate download volumes.
Country level statistics for the first 20 countries by aggregate download volume.
| Country name | Total downloads | download per DAY PER million | per capita download rank | |
|---|---|---|---|---|
| 1 | United States | 1683353 | 39 | 49 |
| 2 | India | 1272124 | 7 | 131 |
| 3 | Germany | 765170 | 69 | 19 |
| 4 | United Kingdom | 594925 | 68 | 21 |
| 5 | China | 580808 | 3 | 158 |
| 6 | Iran, Islamic Republic of | 563798 | 53 | 35 |
| 7 | Italy | 469676 | 57 | 30 |
| 8 | Canada | 369962 | 77 | 17 |
| 9 | Indonesia | 341269 | 10 | 119 |
| 10 | Spain | 327326 | 52 | 37 |
| 11 | Turkey | 323204 | 30 | 63 |
| 12 | Brazil | 307376 | 11 | 112 |
| 13 | France | 290734 | 32 | 59 |
| 14 | Greece | 237657 | 163 | 3 |
| 15 | Mexico | 200792 | 12 | 108 |
| 16 | Australia | 200109 | 62 | 24 |
| 17 | Russian Federation | 196087 | 10 | 118 |
| 18 | Netherlands | 189747 | 83 | 14 |
| 19 | Vietnam | 179758 | 14 | 101 |
| 20 | Egypt | 169421 | 14 | 102 |
Fig 2Geographical distribution of download locations aggregated over the total observation period.
Fig 3a, b. Country-level and regional variance of the dependent variable.
Global models I.
(DV: download per capita).
| Model 1 | Model 2 | Model 3 | |
|---|---|---|---|
| (Intercept) | -5.26e+03 | 2.5 | 2.5 |
| (3.22e+03) | (0.0188) | (0.893) | |
| log(population per million) | -83.1 | -0.0181 | -0.0181 |
| (125) | (0.000537) | (0.0255) | |
| log(gdp) | 712 | 0.531 | 0.531 |
| (376) | (0.00205) | (0.0972) | |
| broadband_subscribers | 1.95e+04 | 2.82 | 2.82 |
| (3.36e+03) | (0.0134) | (0.635) | |
| N | 190 | 190 | 190 |
| Null deviance | 4.51e+09 | 9.26e+05 | 9.26e+05 |
| res.deviance | 2.48e+09 | 3.86e+05 | 3.86e+05 |
*** p < 0.001;
** p < 0.01;
* p < 0.05.
Global models II.
(DV: download per capita).
| Model 4 | Model 5 | Model 6 | |
|---|---|---|---|
| (Intercept) | 2.44 | 8.15 | 7.12 |
| (1.67) | (0.267) | (0.358) | |
| log(population per million) | -0.0926 | -0.326 | -0.341 |
| (0.0875) | (0.0719) | (0.0672) | |
| log(gdp) | 0.583 | ||
| (0.181) | |||
| Broadband_subscribers | 1.61 | ||
| (1.43) | |||
| Tertiary_education_enrollment_ratio | 0.0023 | 0.0102 | 0.0297 |
| (0.0043) | (0.00444) | (0.0061) | |
| Expenditure_tertiary_education_per_student | -1.37e-05 | 9.11e-06 | -2.49e-06 |
| (1.57e-05) | (1.78e-05) | (1.56e-05) | |
| Percentage_of_GDP_spending_on_R&D | 0.0875 | 0.148 | 1.56 |
| (0.103) | (0.11) | (0.323) | |
| H_index | -0.000156 | 0.000743 | 0.000896 |
| (0.000491) | (0.000446) | (0.000447) | |
| Tertiary_education_enrollment_ratio: Percentage_of_GDP_spending_on_R&D | -0.0211 | ||
| (0.00476) | |||
| N | 86 | 86 | 86 |
| Null deviance | 4.7e+05 | 4.7e+05 | 4.7e+05 |
| res.deviance | 1.97e+05 | 2.53e+05 | 1.91e+05 |
*** p < 0.001;
** p < 0.01;
* p < 0.05.
Global models III.
Random effects model by continent (DV: download per capita).
| log(1+ gdp_scaled) | (Intercept) | Population_per_million (scaled) | broadband_subscribers (scaled) | |
|---|---|---|---|---|
| Africa | 1.1025678 | 8.248046 | -0.3200811 | 0.2780263 |
| Americas | 0.2314024 | 7.858342 | -0.3200811 | 0.2780263 |
| Asia | 0.5302122 | 7.981479 | -0.3200811 | 0.2780263 |
| Europe | 0.1876811 | 8.645233 | -0.3200811 | 0.2780263 |
| North_America | 0.3898888 | 8.455814 | -0.3200811 | 0.2780263 |
| Oceania | 1.2764129 | 7.672324 | -0.3200811 | 0.2780263 |
Global models IV.
GDP random effects model by income category (DV: download per capita rounded, quasipoission).
| log(1+gdp_scaled) | (Intercept) | Population per million (scaled) | Broadband subscribers (scaled) | |
|---|---|---|---|---|
| High_income | -0.003871827 | 0.30688303 | -0.1200297 | 0.3071157 |
| Upper_middle_income | 0.105214527 | -0.06604125 | -0.1200297 | 0.3071157 |
| Lower_middle_income | 0.397072298 | -0.07077302 | -0.1200297 | 0.3071157 |
| Low_income | 0.348602516 | -0.17360155 | -0.1200297 | 0.3071157 |
Global models V.
R&D and education random effects model by income category (DV: download per capita rounded, quasipoission).
| Tertiary education enrollment ratio (scaled) | Percentage of GDP Spending on R&D (scaled) | (Intercept) | Population per million (scaled) | Broadband subscribers (scaled) | |
|---|---|---|---|---|---|
| High income | 0.010932241 | -0.05358038 | 8.543931 | -0.2996871 | 0.4264995 |
| Upper middle income | -0.001022486 | 0.35217880 | 8.230012 | -0.2996871 | 0.4264995 |
| Lower middle income | 0.050455057 | 1.63851738 | 8.360397 | -0.2996871 | 0.4264995 |
| Low income | 2.797771598 | 1.45719062 | 9.857220 | -0.2996871 | 0.4264995 |
Fig 4European download locations.
European models I.
(DV: download per capita).
| Model 7 | Model 8 | Model 9 | Model `10 | Model 11 | |
|---|---|---|---|---|---|
| (Intercept) | 6.438 | 6.295 | -0.143 | 6.353 | 4.050 |
| (0.794) | (0.838) | (2.457) | (0.802) | (1.110) | |
| log(GDP purchasing power parity) | 0.247 | 0.242 | 0.175 | 0.258 | 0.490 |
| (0.077) | (0.081) | (0.075) | (0.078) | (0.105) | |
| % of R&D personnel and researchers in the workforce | 0.697 | 0.683 | 0.570 | 0.702 | |
| (0.057) | (0.063) | (0.068) | (0.055) | ||
| % of the population that used the internet for online banking | -0.011 | -0.015 | -0.009 | -0.003 | |
| (0.003) | (0.003) | (0.003) | (0.005) | ||
| % of the population that used the internet for online shopping | -0.006 | ||||
| (0.003) | |||||
| log(disposable income) | 0.792 | ||||
| (0.280) | |||||
| R&D expenditure | -0.070 | 0.059 | |||
| (0.053) | (0.076) | ||||
| null.deviance | 2990524.371 | 2990524.371 | 2990524.371 | 2990524.371 | 2990524.371 |
| deviance | 1415805.433 | 1507337.393 | 1343129.996 | 1396838.896 | 2455507.137 |
*** p < 0.001;
** p < 0.01;
* p < 0.05.
European models II.
(DV: download per researcher).
| Model 12 | Model 13 | Model 14 | |
|---|---|---|---|
| (Intercept) | 5.530 | 8.034 | 6.486 |
| (2.240) | (2.546) | (1.147) | |
| log(GDP purchasing power parity) | 0.161 | 0.174 | 0.175 |
| (0.071) | (0.078) | (0.113) | |
| log(disposable income) | 0.148 | -0.143 | |
| (0.255) | (0.291) | ||
| educational attainment | 0.008 | -0.000 | |
| (0.008) | (0.009) | ||
| R&D expenditure | -0.253 | -0.310 | -0.155 |
| (0.079) | (0.088) | (0.901) | |
| % of the population that used the internet for online banking | -0.018 | ||
| (0.004) | |||
| % of the population that used the internet for online shopping | -0.006 | ||
| (0.004) | |||
| log(GDP purchasing power parity): R&D expenditure | -0.024 | ||
| (0.084) | |||
| null.deviance | 495798.787 | 495798.787 | 495798.787 |
| deviance | 362061.621 | 398536.668 | 414631.765 |
*** p < 0.001;
** p < 0.01;
* p < 0.05.
European models III.
(DV: download count).
| Model 15 | |
|---|---|
| (Intercept) | 8.546 |
| (0.159) | |
| GDP purchasing power parity | 1.050631e-05 |
| (1.278794e-06) | |
| % of R&D personnel and researchers in the workforce | 0.918 |
| (0.116) | |
| GDP purchasing power parity: % of R&D personnel and researchers in the workforce | -3.479490e-06 |
| (7.194871e-07) | |
| null.deviance | 7192467.382 |
| deviance | 3556373.970 |
*** p < 0.001;
** p < 0.01;
* p < 0.05.
Fig 5Interaction effects between GDP_PPS and researcher employment percentage (DV: Download count).
Fig 6Random forest feature importance of EUROSTAT+EUROBAROMETER (DV: Count per capita, number of runs: 100).
European models IV: Eurobarometer variables (DVs: Download per capita, download per researcher).
| Model 16 DV:count per capita | Model 17a DV:count per researcher | Model 17b DV:count per researcher | |
|---|---|---|---|
| (Intercept) | 6.204 | 8.160 | 6.905 |
| (0.843) | (0.813) | (0.886) | |
| log(GDP purchasing power parity) | 0.261 | 0.006 | 0.053 |
| (0.080) | (0.078) | (0.082) | |
| % of R&D personnel and researchers in the workforce | 0.673 | ||
| (0.056) | |||
| % of population who visited a public library at least once a year | -1.116 | -1.391 | |
| (0.415) | (0.428) | ||
| % of population who not visited public libraries more often because of perceived low-quality local supply | 3.963 | ||
| (0.886) | |||
| null.deviance | 2553172.101 | 350303.560 | 350303.560 |
| Deviance | 1061645.079 | 324682.437 | 312052.421 |
*** p < 0.001;
** p < 0.01;
* p < 0.05.