| Literature DB >> 32097430 |
Maddalena Favaretto1, Eva De Clercq1, Christophe Olivier Schneble1, Bernice Simone Elger1.
Abstract
METHODS: Thirty-nine interviews were performed with Swiss and American researchers involved in Big Data research in relevant fields. The interviews were analyzed using thematic coding.Entities:
Year: 2020 PMID: 32097430 PMCID: PMC7041862 DOI: 10.1371/journal.pone.0228987
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Keywords for candidate selection.
| Keywords for Systematic Web Search |
|---|
| 1. Big Data |
| 2. Internet |
| 3. Social Media |
| 4. (Data) Linkage |
| 5. Neural Networks |
| 6. Machine Learning |
| 7. Computational/Computer Based |
| 8. Prediction |
| 9. Data Mining |
| 10. Algorithms |
| 11. Data Analytics |
| 12. Deep Learning |
| 13. Profiling |
| 14 Scoring System |
| 15. (Algorithmic) Modelling |
| 16. Network Analysis |
| 17. Informatics/ Bioinformatics |
Relevant questions from the interview guide.
| Sample questions |
|---|
| Are you currently working on any Big Data research project? |
| Which one(s) of your research project(s) would you consider as involving Big Data methods or related to Big Data? |
| What do you think is the main difference between Big Data research and more traditional research in your field? |
| How would you define Big Data? |
Demographics.
| Sociology (S) | Psychology (P) | Data Science (D) | Total | |
|---|---|---|---|---|
| CH Researchers | 9 | 6 | 5 | 20 |
| US Researchers | 12 | 5 | 2 | 19 |
| Professors | 20 | 9 | 5 | 34 |
| Postdocs/Senior researchers | 1 | 2 | 2 | 5 |
| Participants' self-involvement in a Big Data Project | ||||
| Yes | 15 | 6 | 6 | 27 |
| No | 1 | 3 | 0 | 4 |
| Uncertain | 2 | 5 | 1 | 8 |
Definitions.
| 1.1 Several Vs definition | Definition based on the traditional attributes of Big Data (Volume, Velocity, Variety, Veracity …) | P27CH-D; P29CH-D; P32CH-D; P33CH-S; P35CH-S. |
| 1.2 Volume | Vast amounts of data | P39CH-S; P2US-S; P9US-S; P13US-P; P14US-P; P17US-P; P20US-S |
| 1.3 Variety | Heterogeneous data, both structured and unstructured | P30CH-S; P34CH-D |
| 1.4 Complexity | Very complex data compared to data that is traditionally collected in research | P5CH-S; P19US-S |
| 1.5 Impact | Data that has a huge impact and value for society | P21US-S |
| 2.1 Source of Data | Data that comes from digital technologies | P25CH-P; P26CH-P; P23CH-S; P2US-S; P22US-P |
| 2.1.1 The Human Component | Data that is generated from people | P22CH-P; P24CH-P; P37CH-S; P38CH-S P11US-P; P12US-S; P17US-P; P19US-S; |
| 2.3 Collection | Data collected with no purpose or with no informed consent | P9CH-P; P24CH-P; P26CH-S P30CH-S; P31CH-D; P38CH-S; P3US-S; P4US-P; P5US-S; |
| 2.4 Data Processing | Data that needs sophisticated computational processes to be analyzed | P30CH-S; P37CH-S; P2US-S; P6US-S; P16US-S; P18US-D; P19US-S; P34CH-D |
| 2.5 Problem Solving Tool | Method that is capable of answering questions | P28CH-S; P29CH-D; P30CH-S; P31CH-D; P8US-D; |