Literature DB >> 23923099

Google Scholar is not enough to be used alone for systematic reviews.

Dean Giustini1, Maged N Kamel Boulos.   

Abstract

BACKGROUND: Google Scholar (GS) has been noted for its ability to search broadly for important references in the literature. Gehanno et al. recently examined GS in their study: 'Is Google scholar enough to be used alone for systematic reviews?' In this paper, we revisit this important question, and some of Gehanno et al.'s other findings in evaluating the academic search engine.
METHODS: The authors searched for a recent systematic review (SR) of comparable size to run search tests similar to those in Gehanno et al. We selected Chou et al. (2013) contacting the authors for a list of publications they found in their SR on social media in health. We queried GS for each of those 506 titles (in quotes ""), one by one. When GS failed to retrieve a paper, or produced too many results, we used the allintitle: command to find papers with the same title.
RESULTS: Google Scholar produced records for ~95% of the papers cited by Chou et al. (n=476/506). A few of the 30 papers that were not in GS were later retrieved via PubMed and even regular Google Search. But due to its different structure, we could not run searches in GS that were originally performed by Chou et al. in PubMed, Web of Science, Scopus and PsycINFO®. Identifying 506 papers in GS was an inefficient process, especially for papers using similar search terms.
CONCLUSIONS: Has Google Scholar improved enough to be used alone in searching for systematic reviews? No. GS' constantly-changing content, algorithms and database structure make it a poor choice for systematic reviews. Looking for papers when you know their titles is a far different issue from discovering them initially. Further research is needed to determine when and how (and for what purposes) GS can be used alone. Google should provide details about GS' database coverage and improve its interface (e.g., with semantic search filters, stored searching, etc.). Perhaps then it will be an appropriate choice for systematic reviews.

Entities:  

Keywords:  MeSH Keywords: Google Scholar; PubMed; information retrieval; searching; systematic reviews

Year:  2013        PMID: 23923099      PMCID: PMC3733758          DOI: 10.5210/ojphi.v5i2.4623

Source DB:  PubMed          Journal:  Online J Public Health Inform        ISSN: 1947-2579


Introduction

Since its debut in 2004, Google Scholar (GS) has been viewed in the field of biomedical research as a flawed but useful tool in searching the scientific literature [1,2]. GS is widely-recognized as an excellent source of grey literature in biomedicine [3-5]. Despite its broad coverage, GS is considered ill-designed for expert searching [6]. One librarian said that “… plug-in-the-keyword-and-hope-for-the-best tools like Google Scholar are poor choices for serious search questions such as clinical queries, bibliographic reviews, comprehensive literature searches, or other questions that require a more sophisticated approach” [7]. Expert searchers were admonished to use trusted databases such as the Cochrane Library, PubMed and Embase when literature reviews were required (i.e., for grants, clinical trials and systematic reviews) [8]. The early buzz of GS eventually ebbed and was replaced by detailed comparisons against other tools such as PubMed and Scirus [9]. A consensus seemed to emerge that GS was not as current as PubMed and some expert searchers placed it a year behind or more [7]. Searchers also noticed that PubMed and Google Scholar fulfilled different purposes [10]. In head-to-head comparisons with curated databases, GS was deemed inadequate for subject searching and did not offer what expert searchers wanted to see in a literature database. MEDLINE, produced by the US National of Medicine in Bethesda, Maryland, has been the gold standard for structured searching (especially via Ovid’s Interface) for decades. While its place in biomedical searching seems secure, some researchers have argued that GS is a better choice for some retrieval queries, especially in browsing for articles and locating highly-cited papers [11]. In recognition of its speed and familiar interface, one editorial asked Google to think about creating a subset of GS for evidence-based medicine. But that would require transparency from Google about GS, and they were not about to produce a list of journal suppliers and grey literature that were crawled to create the database. Searchers were left to surmise its scope and make guesses as to what was in it [12]. Google Scholar is a useful tool to help researchers locate in seconds relevant papers from billions of pages across the Web [13] (and in many cases directly retrieve the full text of those papers). For that, it is highly-valued and useful, and every expert searcher should use it for that purpose. Allied to its easy-to-use interface, GS is a time-saver for quick searches especially compared to similar searches on PubMed, which can be unwieldy. In any case, knowing the strengths and weaknesses of GS will help researchers decide when and how to use it. Google has created a useful tool with links to articles and grey literature. But GS was already deemed unsuitable for literature reviews due to its limited search (filtering and qualifiers) functionality; its inability to draw on the power of the MeSH vocabulary (used in MEDLINE/PubMed) was cited as a critical flaw [14,15]. In 2013, French researchers, Gehanno et al., published a study that asked a simple question to which most expert searchers thought they knew the answer: ‘Is Google scholar enough to be used alone for systematic reviews?’ [16] The authors state that GS’ coverage has improved and ask whether its “coverage is high enough to be used alone in systematic reviews”. In other words, the authors ask whether GS might replace MEDLINE and other bibliographic databases to perform costly, time-intensive searches for systematic reviews. The clearly-stated question and conclusions of Gehanno et al. are examined in this paper; we ask whether Google Scholar has improved enough over the years to be used alone in systematic reviews.

Methods

The authors searched for a systematic review that was comparable in size to Gehanno et al. We selected a recent study in our area of expertise (health/public health informatics), Chou et al. (2013), and contacted the authors for a list of the 506 publications they found in their SR on social media in health. To test Google Scholar’s ability to locate articles from an existing systematic review, we searched for all of the publications found by Chou et al. [17]. We tested whether the 500+ articles that formed the basis of Chou et al.’s SR were indexed by GS. Since we knew what we were looking for, and were not testing GS’ ability to produce relevant documents, our searches were straightforward title searches. Chou et al. provided us with an Excel spreadsheet of the titles of papers (n=514) that comprised their systematic review. After correcting for minor errors, we looked for 506 unique items occurring either as simple citations or full-text links to papers within GS. We checked for the presence of these 506 publications by querying GS for the title of each study (in quotes ""), one by one. When a search failed to retrieve the required article, or produced too many results to browse, we opted to use Google’s allintitle: command to increase our precision and search accuracy by limiting our search to the titles of articles. Some papers that were not found in GS were later searched and found in regular Google Search. Our results were double-checked title-by-title for completeness and accuracy against those listed by Chou et al. Secondly, we tried to replicate Chou et al.’s search strategy and keywords (as detailed in [17]) in GS. We queried GS for: health* AND ("social media" OR "new media" OR "participatory media" OR "user-generated content" OR Facebook OR MySpace OR Twitter OR YouTube OR "Second Life" OR LinkedIn OR wiki* OR blog* OR "Web 2.0" OR "online social network" OR "social networking"). We set query conditions as follows: year range as 2004-2011; include citations. It should be noted that Google uses stemming technology instead of asterisks, so those asterisks in the above query are ignored/not needed in GS. Due to the different database structure and search syntax used in GS, our searches for these 506 papers using Chou et al.’s original search strategy and keywords yielded unmanageable results of approximately >750,000 items (as at 5 February 2013). Using Google’s allintitle: command reduced our search results considerably to a collection of <450 items, but this was not a full subset of Chou et al.’s 506 items. Multiple attempts and combinations of keywords (and syntax) are needed if one were to find (discover) in GS the 476 (out of 506) papers cited in Chou et al.’s systematic review without already knowing their full titles.

Results

Even though GS produced records for ~95% of those papers as cited by Chou et al. (n=476/506), numerous iterative searches were required to find all of them. In GS, we could not build search sets effectively or transfer results to a spreadsheet or reference manager. GS made our work more difficult as citations had to be managed one at a time. Due to its rudimentary structure, we could not run the search strings as used by Chou et al. in PubMed, Web of Science, Scopus and PsycINFO®. GS did not understand these expert search strings and was unable to translate them in any coherent way using its auto-correct feature. A few of the 30 papers that we could not find in GS (see Table I) were found in PubMed and even regular Google Search. Identifying each paper was inefficient, especially where two papers used similar keywords or metadata. GS’ ability to search into the full-text of papers combined with PageRank’s algorithm is useful, and helps with browsing. These features on their own do not compensate for GS’ obvious problems with searchability (discoverability) and database quality.
Table I

Articles missing in Google Scholar (as at 5 February 2013) and their original database sources where Chou et al. found them. The last two articles in the table were only (indirectly) retrievable when we dug deep in search results, but not as direct/first search result hits or via GS allintitle: command.

Authors Title Source title Year Database where Chou et al. found article
White, J.Everything you always wanted to know about stress (but were afraid to ask) or trying to reach the 'hard to reach'Clinical Psychology Forum2011Scopus
Gannon, KE; Moreno, MADisplay of risk and protective health behaviors on incoming freshmen's Facebook profilesPediatric Research2009Web of Science
Horrigan, BJNIH and Wikimedia Foundation Collaborate to Improve Online Health InformationExplore-The Journal of Science and Healing2009Web of Science
Hwang, K; Etchegaray, J; Bernstam, E; Thomas, EPredictors of intention to share educational health information via online social network tiesJournal of General Internal Medicine2010Web of Science
Pemu, PE; Quarshie, AQ; Josiah-Willock, R; Ojutalayo, FO; Alema-Mensah, E; Ofili, EOSocio-demographic Psychosocial and Clinical Characteristics of Participants in e-HealthyStrides (c): An Interactive ehealth Program to Improve Diabetes Self-Management SkillsJournal of Health Care for the Poor and Underserved2011Web of Science
[No authors listed]Why blog on about mental health?Mental Health Today2006MEDLINE
Gronstedt, A.Second Life produces real training resultsT+D (Training + Development)2007Scopus
Hawn, C.Report from the field: Take two aspirin and tweet me in the morning: How twitter, Facebook, and other social media are reshaping health careHealth Affairs2009Scopus
Malvey, D., Alderman, B., Todd, A.D.Blogging and the health care managerHealth Care Manager2009Scopus
Russell, J.Web 2.0 technology: How is it impacting your employer brand?Nursing Economics2009Scopus
Strongin, R.Health reform in 140 charactersMedical Device and Diagnostic Industry2010Scopus
Tan, L.Psychotherapy 2.0: MySpace® blogging as self-therapyAmerican Journal of Psychotherapy2008Scopus
[Anonymous]Web 2.0, Health and InformaticsMethods of Information in Medicine2009Web of Science
Arikan, Y; Benker, TInternet and Social Media Impacts on Turkish Healthcare Professionals' Reaching Health and Drug Side Effect-Related InformationDrug Safety2011Web of Science
Benker, T; Arikan, YTurkish Patients' Use of Internet and Social Media for Healthcare and Drug Side Effect InformationDrug Safety2011Web of Science
Botelho, RMotivate healthy habits (part II): using web 2.0 & 3.0 technologies to generate social movementsSwiss Medical Weekly2009Web of Science
Evans, WD; McLeod, C; Thomas, SLSocial Media Marketing and Health Behaviours: Industry Strategies, Consumer Behaviours, and Public Health ResponsesAnnals of Behavioral Medicine2011Web of Science
Grinfeld, MJ; Hensel, BK; Cassidy, JT; Walker, SE; Parker, JCA new media solution to coordination of care for juvenile arthritis: The JAHelp.org advocacy-oriented health care access projectArthritis and Rheumatism2006Web of Science
Hamm, KM; Simeonov, IM; Heard, SEUsing Technology To Harness and Organize Expertise in the Development of Health Education Materials: How a Wiki Can Help You CollaborateClinical Toxicology2009Web of Science
Hartland, D; Duffton, R; Home, J; D'Aguilar, C; Berktay, L; Tomkinson, A; et al.Health promotion (HP) and health outcomes: impacts of old and new media campaigns on referral patterns for HIV testing: implications for the National HIV Saving Lives CampaignHIV Medicine2011Web of Science
Hartoonian, N; Ormseth, S; Bantum, EO; Owen, JProcess and outcome evaluation of a social-networking website for health promotionAnnals of Behavioral Medicine2008Web of Science
Kane, I; Walkosz, B; Giese, BDOSOMETHINGONTHE.NET: Health Marketing for New MediaAnnals of Behavioral Medicine2010Web of Science
Kondro, WHealth and environment blogCanadian Medical Association Journal2011Web of Science
Nocker, G; Schachinger, ATrends of future health communication and promotion via-Web 2.0 /Social MediaEuropean Journal of Public Health2010Web of Science
Ojcius, DTracking public health via TwitterNature Reviews Microbiology2011Web of Science
Paek, HJ; Hove, T; Jeong, HJ; Kim, MPeer or expert? The persuasive impact of YouTube public service announcement producersInternational Journal of Advertising2011Web of Science
Toth-Cohen, SThe garden of healthy aging: collaborative project development in the virtual world of Second LifeGerontologist2009Web of Science
Wapner, JThe healthy type - The therapeutic value of blogging becomes a focus of studyScientific American2008Web of Science
Truccolo, I.; Bufalino, R.; Annunziata, M.A.; Caruso, A.; Costantini, A.; Cognetti, G.; et al.National Cancer Information Service in Italy: An information points network as a new model for providing information for cancer patientsTumori2011Scopus
Bastida, RUse of collaborative web-based technology in mental health - Wiki use in practiceInternational Journal of Mental Health Nursing2008Web of Science
PubMed is clear that its database is built on a foundation of medical subject headings or MeSH terms, and each field in its 23 million citations is searchable. GS builds its structure on a simple interface design, vast interdisciplinary content and link popularity (which papers are cited most often). On the positive side, GS achieved a high percentage (95%) of “known-items” from Chou et al. but not all. Papers not found in GS were unique items from the four curated databases mentioned by Chou et al., PubMed, Web of Science, Scopus and PsycINFO® (Table I). GS is not flexible, precise or indexed (enough) to be used alone for systematic reviews. Its ‘keyword search' capability, allied to Google’s PageRank, is a poor replacement for controlled vocabulary searching and its interface does not provide enough flexibility to accommodate search filters, wildcards and expert search hedges, all of which are required for systematic reviews. We particularly noted the lack of a GS search filtering option to limit the scope of search results ‘by discipline’ such as ‘health and medicine’, since GS is catering for, and indexing articles from a very wide range of disciplines, and the same keywords can sometimes retrieve irrelevant, non-health-and-medicine-related articles. In this modest study, identifying 506 papers among results and multiple screens was akin to searching for a needle in a haystack – painful, prickly and a time waster. Gehanno et al.’s search for 738 papers from 29 systematic reviews was similarly onerous but they, like us, knew what they were looking for [16]. This is a critical point in both studies: searching for known items is a much simpler exercise than trying to locate (or discover) those papers in the first place. GS’ broad, undocumented corpus produces a lot of noise (and irrelevant hits) in its results, making it an unsuitable exclusive choice for systematic review searching.

Conclusions

Is Google Scholar enough to be used for systematic review searching? No. Contrary to Gehanno et al.’s conclusions that GS “could even be used alone” [16], we found that GS was not up to the required search standard for systematic reviews. Despite its high sensitivity and vast coverage, GS was unable to locate all known-items cited in a previously-completed systematic review. We were able to retrieve most (but not all) of the papers used by Chou et al. [17] in their systematic review, because we already knew their titles and were searching for them one by one. But would we have been able to discover them as easily if we did not already know their exact titles? Based on our results, the answer was ‘no’ (when we tried to replicate Chou et al.’s search strategy in GS and queried GS for the topics of those papers [instead of their titles]). GS can sometimes be less precise than PubMed and similar bibliographic databases, returning hundreds or thousands of results, many of them irrelevant, thus requiring extensive human filtering of the results [5,18]. Furthermore, GS’ changing content, unknown updating practices and poor reliability make it an inappropriate sole choice for systematic reviewers. As searchers, we were often uncertain that results found one day in GS had not changed a day later and trying to replicate searches with date delimiters in GS did not help. Papers found today in GS did not mean they would be there tomorrow. In summary, GS could not be viewed on par with tools such as MEDLINE, Embase and the Web of Science. Gray et al. said it best that "Google scholar's value to the sciences" may be that it can be used “for initial & supplemental information gathering" [11]. Google Scholar’s shortcomings, while not insignificant, should not exclude it from being used in systematic reviews [18]. On the contrary, we argue that further investigation is needed to determine when and how (and for what subjects, or disciplines) GS can be used for systematic review searching. Until then, its engineers should provide full details about its database coverage and aim to improve its interface search capabilities (e.g., indexing, semantic search filters, stored searching, etc.). Only then will it be equal to the demands of thorough, replicable searches as required by systematic reviews.
  9 in total

1.  How Google is changing medicine.

Authors:  Dean Giustini
Journal:  BMJ       Date:  2005-12-24

2.  Google Scholar: A source for clinicians?

Authors:  Jim Henderson
Journal:  CMAJ       Date:  2005-06-07       Impact factor: 8.262

3.  Comparing test searches in PubMed and Google Scholar.

Authors:  Mary Shultz
Journal:  J Med Libr Assoc       Date:  2007-10

4.  Medical literature searches: a comparison of PubMed and Google Scholar.

Authors:  Eva Nourbakhsh; Rebecca Nugent; Helen Wang; Cihan Cevik; Kenneth Nugent
Journal:  Health Info Libr J       Date:  2012-06-19

Review 5.  Web 2.0 for health promotion: reviewing the current evidence.

Authors:  Wen-ying Sylvia Chou; Abby Prestin; Claire Lyons; Kuang-yi Wen
Journal:  Am J Public Health       Date:  2012-11-15       Impact factor: 9.308

6.  Comparison of PubMed and Google Scholar literature searches.

Authors:  Michael E Anders; Dennis P Evans
Journal:  Respir Care       Date:  2010-05       Impact factor: 2.258

7.  Literature search on risk factors for sarcoma: PubMed and Google Scholar may be complementary sources.

Authors:  Giuseppe Mastrangelo; Emanuela Fadda; Carlo R Rossi; Emanuele Zamprogno; Alessandra Buja; Luca Cegolon
Journal:  BMC Res Notes       Date:  2010-05-10

8.  The excitement of Google Scholar, the worry of Google Print.

Authors:  Marcus A Banks
Journal:  Biomed Digit Libr       Date:  2005-03-22

9.  Is the coverage of Google Scholar enough to be used alone for systematic reviews.

Authors:  Jean-François Gehanno; Laetitia Rollin; Stefan Darmoni
Journal:  BMC Med Inform Decis Mak       Date:  2013-01-09       Impact factor: 2.796

  9 in total
  23 in total

1.  Blame the Patient, Blame the Doctor or Blame the System? A Meta-Synthesis of Qualitative Studies of Patient Safety in Primary Care.

Authors:  Gavin Daker-White; Rebecca Hays; Jennifer McSharry; Sally Giles; Sudeh Cheraghi-Sohi; Penny Rhodes; Caroline Sanders
Journal:  PLoS One       Date:  2015-08-05       Impact factor: 3.240

Review 2.  A systematic review of barriers and facilitators to minority research participation among African Americans, Latinos, Asian Americans, and Pacific Islanders.

Authors:  Sheba George; Nelida Duran; Keith Norris
Journal:  Am J Public Health       Date:  2013-12-12       Impact factor: 9.308

3.  Impact of the transparent reporting of evaluations with nonrandomized designs reporting guideline: ten years on.

Authors:  Thomas Fuller; Jaime Peters; Mark Pearson; Rob Anderson
Journal:  Am J Public Health       Date:  2014-09-11       Impact factor: 9.308

4.  Citation searches are more sensitive than keyword searches to identify studies using specific measurement instruments.

Authors:  Suzanne K Linder; Geetanjali R Kamath; Gregory F Pratt; Smita S Saraykar; Robert J Volk
Journal:  J Clin Epidemiol       Date:  2014-12-29       Impact factor: 6.437

Review 5.  Measuring oral health literacy: a scoping review of existing tools.

Authors:  Virginia Dickson-Swift; Amanda Kenny; Jane Farmer; Mark Gussy; Sarah Larkins
Journal:  BMC Oral Health       Date:  2014-12-04       Impact factor: 2.757

6.  The Role of Google Scholar in Evidence Reviews and Its Applicability to Grey Literature Searching.

Authors:  Neal Robert Haddaway; Alexandra Mary Collins; Deborah Coughlin; Stuart Kirk
Journal:  PLoS One       Date:  2015-09-17       Impact factor: 3.240

Review 7.  A scoping review of the association between rural medical education and rural practice location.

Authors:  Jane Farmer; Amanda Kenny; Carol McKinstry; Richard D Huysmans
Journal:  Hum Resour Health       Date:  2015-05-06

8.  Web searching for systematic reviews: a case study of reporting standards in the UK Health Technology Assessment programme.

Authors:  Simon Briscoe
Journal:  BMC Res Notes       Date:  2015-04-16

9.  Google Scholar as replacement for systematic literature searches: good relative recall and precision are not enough.

Authors:  Martin Boeker; Werner Vach; Edith Motschall
Journal:  BMC Med Res Methodol       Date:  2013-10-26       Impact factor: 4.615

10.  The comparative recall of Google Scholar versus PubMed in identical searches for biomedical systematic reviews: a review of searches used in systematic reviews.

Authors:  Wichor M Bramer; Dean Giustini; Bianca Mr Kramer; Pf Anderson
Journal:  Syst Rev       Date:  2013-12-23
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.