A Cecile J W Janssens, M Gwinn.
Abstract
BACKGROUND: Finding eligible studies for meta-analyses and systematic reviews relies on keyword-based searching as the gold standard, despite its inefficiency. Searching based on direct citations is not sufficiently comprehensive. We propose a novel strategy that ranks articles on their degree of co-citation with one or more "known" articles before reviewing their eligibility.
Year: 2015 PMID: 26462491 PMCID: PMC4604708 DOI: 10.1186/s12874-015-0077-z
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1 Overview of the search method. a Indirect citations (co-citations). Bold circles represent articles known at the beginning of the search. Squares represent citing articles; the articles on their reference lists (co-citing articles) are represented by circles. Numbers within circles indicate the number of times an article is co-cited (dashed circles represent articles co-cited only once). b Direct citations. Bold circles represent articles known at the beginning of the search. Dashed squares represent citing articles; dashed circles represent articles on the known articles’ reference lists. Numbers within dashed squares and circles indicate the number of times an article cites or is cited by a known article
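The two counts described in the caption can be sketched as follows. This is a minimal illustration, not the authors' implementation: the input format (a dictionary mapping each citing article to its reference list) and both function names are assumptions, as is the rule that one citing article adds at most 1 to a co-citation count.

```python
from collections import Counter


def co_citation_counts(known, reference_lists):
    """Count how often each article is co-cited with a known article.

    known: set of ids for the articles known at the start of the search.
    reference_lists: dict mapping a citing article id to the ids on its
        reference list (hypothetical input format for this sketch).
    Assumption: each citing article adds at most 1 to an article's count,
    even if it cites several known articles.
    """
    counts = Counter()
    for refs in reference_lists.values():
        refs = set(refs)
        if refs & known:  # this article cites at least one known article
            counts.update(refs - known)  # the rest of its list is co-cited
    return counts


def direct_citation_counts(known, reference_lists):
    """Count how often each article cites, or is cited by, a known article."""
    counts = Counter()
    for citing, refs in reference_lists.items():
        refs = set(refs)
        if citing in known:
            counts.update(refs - known)  # cited by a known article
        else:
            hits = len(refs & known)
            if hits:
                counts[citing] += hits  # cites `hits` known articles
    return counts


# Invented toy data, for illustration only.
known = {"K1", "K2"}
reference_lists = {
    "C1": ["K1", "A", "B"],
    "C2": ["K2", "A"],
    "C3": ["A", "B"],  # cites no known article, so it is ignored
    "C4": ["K1", "K2", "B", "C"],
    "K1": ["R1", "R2"],  # reference list of a known article
}

co_cited = co_citation_counts(known, reference_lists)
# Rank by descending co-citation count; ties are broken by id.
ranked = sorted(co_cited.items(), key=lambda kv: (-kv[1], kv[0]))
direct = direct_citation_counts(known, reference_lists)
```

Here `ranked` lists the articles to screen in order of co-citation frequency, and `direct` corresponds to the numbers shown in panel b.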
Articles screened and retrieved in the replication of ten published meta-analyses
| First author | Original meta-analysis: articles screened | Original meta-analysis: studies included | All co-citations: articles screened | (%) | All co-citations: studies retrieved | (%) | All co-cited >1: articles screened | (%) | All co-cited >1: studies retrieved | (%) | Frequently co-cited: articles screened | (%) | Frequently co-cited: studies retrieved | (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Boothe [ | 17,500 | 8 | 5,595 | (32) | 8 | (100) | 913 | (5) | 8 | (100) | 109 | (1) | 8 | (100) |
| Frolkis [ | 9,151 | 12 | 967 | (11) | 10 | (83) | 224 | (2) | 7 | (58) | 108 | (1) | 6 | (50) |
| Oliver-Williams [ | 8,646 | 10 | 588 | (7) | 8 | (80) | 62 | (1) | 5 | (50) | 62 | (1) | 5 | (50) |
| Knoll [ | 2,365 | 21 | 7,638 | (323) | 19 | (90) | 1,719 | (73) | 18 | (86) | 132 | (6) | 11 | (52) |
| Stevanovic [ | 2,090 | 13 | 987 | (47) | 12 | (92) | 186 | (9) | 10 | (77) | 77 | (4) | 10 | (77) |
| De Vries [ | 1,194 | 9 | 8,388 | (703) | 9 | (100) | 1,924 | (161) | 9 | (100) | 124 | (10) | 8 | (89) |
| Crider [ | 1,154 | 5 | 1,006 | (87) | 5 | (100) | 120 | (10) | 5 | (100) | 120 | (10) | 5 | (100) |
| Herretes [ | 898 | 4 | 670 | (75) | 3 | (75) | 111 | (12) | 3 | (75) | 111 | (12) | 3 | (75) |
| Gharaibeh [ | 836 | 27 | 880 | (105) | 26 | (96) | 173 | (21) | 21 | (78) | 116 | (14) | 19 | (70) |
| Gu [ | 784 | 6 | 3,234 | (413) | 6 | (100) | 780 | (99) | 6 | (100) | 129 | (16) | 5 | (83) |
| Median | 1,642 | 10 | 997 | (81) | 9 | (94) | 205 | (11) | 8 | (82) | 110 | (8) | 7 | (76) |
Percentages are shown in parentheses; values greater than 100 indicate that more articles were selected for screening than in the original meta-analysis. “Frequently co-cited” refers to citations above a threshold in the ranked list that was chosen such that 100–150 articles needed to be screened (See Methods; Additional file 1: Figure S2)
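One possible reading of the "frequently co-cited" threshold is sketched below. The exact rule is given in the Methods, which are not reproduced here, so this is an assumption: the function (a hypothetical name) takes the lowest co-citation count at which no more than 150 articles remain to be screened. It does not enforce the lower bound of 100, which matches rows in the table where fewer than 100 articles were screened.

```python
def pick_threshold(counts, max_screen=150):
    """Return (threshold, n_to_screen) for the lowest co-citation count t
    such that at most max_screen articles are co-cited at least t times.
    Returns None if no threshold satisfies the limit.
    """
    for t in sorted(set(counts.values())):
        n = sum(1 for c in counts.values() if c >= t)
        if n <= max_screen:
            return t, n
    return None


# Synthetic counts sized like the Oliver-Williams row: 588 co-cited articles,
# 62 of them co-cited more than once. The individual frequencies are invented.
counts = {f"once-{i}": 1 for i in range(526)}
counts.update({f"multi-{i}": 2 + i % 3 for i in range(62)})
result = pick_threshold(counts)
```

With these synthetic counts the threshold is 2 and 62 articles remain, so the "All co-cited >1" and "Frequently co-cited" selections coincide, as they do in that row of the table.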
Characteristics of studies included in published meta-analyses that were not retrieved by citation-based literature search at each selection threshold
| | All co-citations | All co-cited >1 | Frequently co-cited |
|---|---|---|---|
| Retrieved | 106 | 92 | 80 |
| Missed | 9 (5) | 14 (6) | 12 (7) |
| Abstract | 2 (0) | 0 (0) | 1 (0) |
| Non-English language | 1 (0) | 6 (1) | 0 (0) |
| Old publication (<1975) | 2 (2) | 2 (0) | 1 (0) |
| Recent publication (2014) | 2 (2) | 1 (1) | 0 (0) |
| Other | 2 (1) | 5 (5) | 10 (7) |
| Total | 115 | 106 | 92 |
Legend: The ten meta-analyses included 115 studies, of which 106 were retrieved by our search. Of those, 92 were co-cited more than once and 80 appeared in the list of frequently co-cited articles. The headings of the table refer to the thresholds presented in Table 1. The numbers in parentheses indicate how many articles had direct connections with other articles in the meta-analysis, because they were either citing or cited by those articles. These numbers indicate whether the articles could have been found by adding a search for direct citations, as was done in Study 2. For example, five of the nine studies that were missed in the first selection were citing or cited by other articles included in the meta-analysis
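The cascade described in the legend can be checked with a few lines of arithmetic. The counts are copied from the table; the cumulative percentages are derived here and are not reported in the source.

```python
thresholds = ["All co-citations", "All co-cited >1", "Frequently co-cited"]
total = [115, 106, 92]      # studies entering each selection step
retrieved = [106, 92, 80]   # studies still present after the step
missed = [9, 14, 12]        # studies lost at the step

# Each step loses exactly total - retrieved studies.
step_consistent = all(t - r == m for t, r, m in zip(total, retrieved, missed))

# The studies retrieved at one step are the total for the next step.
cascade_consistent = retrieved[:-1] == total[1:]

# Derived: share of all 115 included studies still retrieved at each step.
cumulative = [round(100 * r / total[0]) for r in retrieved]

for name, share in zip(thresholds, cumulative):
    print(f"{name}: {share}% of the 115 included studies")
```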
Retrieval of articles that had no direct connections to the known articles
| Published meta-analysis | Number of articles without direct connections | Retrieved in: all co-citations | Retrieved in: all co-cited >1 | Retrieved in: frequently co-cited |
|---|---|---|---|---|
| Boothe [ | 1 | 1 | 1 | 1 |
| Frolkis [ | 8 | 7 | 5 | 4 |
| Oliver-Williams [ | 5 | 3 | 0 | 0 |
| Knoll [ | 16 | 14 | 14 | 7 |
| Stevanovic [ | 4 | 4 | 2 | 2 |
| De Vries [ | 4 | 4 | 4 | 2 |
| Crider [ | 1 | 1 | 1 | 1 |
| Herretes [ | 0 | 0 | 0 | 0 |
| Gharaibeh [ | 14 | 13 | 8 | 6 |
| Gu [ | 2 | 2 | 2 | 1 |
| Total | 55 | 49 | 37 | 24 |
The table summarizes data presented in Additional file 1: Figure S1. For example, in the meta-analysis of Boothe et al. [27], only one article included in the meta-analysis had no direct connection with either of the two known studies. That article was frequently co-cited and was thus identified at any of the three thresholds
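The notion of an article with "no direct connection" can be made concrete with a short sketch. The function name and the input format (a dictionary of reference lists) are assumptions made for illustration; the data below are invented.

```python
def without_direct_connection(included, known, reference_lists):
    """Return included studies that neither cite nor are cited by a known article.

    included: set of ids for the studies in the published meta-analysis.
    known: set of ids for the articles known at the start of the search.
    reference_lists: dict mapping an article id to the ids it cites
        (hypothetical input format for this sketch).
    """
    isolated = set()
    for study in included - known:
        cites_known = bool(set(reference_lists.get(study, ())) & known)
        cited_by_known = any(
            study in reference_lists.get(k, ()) for k in known
        )
        if not (cites_known or cited_by_known):
            isolated.add(study)
    return isolated


# Invented toy data: S1 cites K1, S2 is cited by K2, S3 has no direct link.
included = {"K1", "K2", "S1", "S2", "S3"}
known = {"K1", "K2"}
reference_lists = {"S1": ["K1"], "K2": ["S2"], "S3": ["X"]}
isolated = without_direct_connection(included, known, reference_lists)
```

Articles in `isolated` can only be reached through co-citations, which is the subset this table reports on.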
Number of articles screened and retrieved in Study 2
| First author | Original meta-analysis: articles screened | Original meta-analysis: studies included | Search 1 (indirect citations): citing articles | Search 1: articles screened | (%) | Search 1: studies retrieved | (%) | Search 1 + 2 (indirect and direct citations): articles screened | (%) | Search 1 + 2: studies retrieved | (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Mehrabi [ | 4,148 | 29 | 170 | 1,113 | (27) | 29 | (100) | 1,383 | (33) | 29 | (100) |
| Pathak [ | 543 | 6 | 1,437 | 584 | (108) | 6 | (100) | 886 | (163) | 6 | (100) |
| Viswanathan [ | 2,749 | 6 | 74 | 627 | (23) | 6 | (100) | 689 | (25) | 6 | (100) |
| Vrablik [ | 7,771 | 3 | 28 | 68 | (1) | 3 | (100) | 81 | (1) | 3 | (100) |
| vanWely [ | 894 | 18 | 106 | 444 | (50) | 18 | (100) | 615 | (69) | 18 | (100) |
| Schuit [ | 39 | 13 | 171 | 1,221 | (3,131) | 12 | (92) | 1,385 | (3,551) | 13 | (100) |
| Deng [ | 362 | 9 | 928 | 533 | (147) | 8 | (89) | 1,726 | (477) | 9 | (100) |
| Nwachuku [ | 464 | 15 | 62 | 404 | (87) | 13 | (87) | 502 | (108) | 15 | (100) |
| Gu [ | 764 | 19 | 104 | 719 | (94) | 16 | (84) | 908 | (119) | 19 | (100) |
| SanLorenzo [ | 3,529 | 19 | 67 | 296 | (8) | 15 | (79) | 468 | (13) | 19 | (100) |
| Al-Wassia [ | 166 | 7 | 8 | 32 | (19) | 4 | (57) | 52 | (31) | 7 | (100) |
| Elshaer [ | 750 | 30 | 35 | 210 | (28) | 21 | (70) | 235 | (31) | 29 | (97) |
| Mumme [ | 701 | 21 | 55 | 271 | (39) | 19 | (90) | 468 | (67) | 20 | (95) |
| Hazlewood [ | 1,463 | 35 | 897 | 861 | (59) | 28 | (80) | 3,162 | (216) | 33 | (94) |
| Sheyin [ | 221 | 17 | 40 | 180 | (81) | 16 | (94) | 392 | (177) | 16 | (94) |
| Yuan [ | 7,175 | 14 | 51 | 490 | (7) | 10 | (71) | 596 | (8) | 13 | (93) |
| Elmariah [ | 1,934 | 14 | 3,870 | 599 | (31) | 5 | (36) | 836 | (43) | 13 | (93) |
| Cheelo [ | 1,192 | 11 | 112 | 919 | (77) | 9 | (82) | 1,017 | (85) | 10 | (91) |
| Gu [ | 326 | 18 | 14 | 59 | (18) | 13 | (72) | 233 | (71) | 16 | (89) |
| Saleh [ | 1,480 | 14 | 49 | 964 | (65) | 12 | (86) | 1,055 | (71) | 12 | (86) |
| Emdin [ | 10,598 | 45 | 3,223 | 395 | (4) | 26 | (58) | 6,116 | (58) | 36 | (80) |
| Sayegh [ | 594 | 22 | 69 | 529 | (89) | 14 | (64) | 759 | (128) | 17 | (77) |
| Kamper [ | 6,189 | 41 | 96 | 857 | (14) | 28 | (68) | 1,227 | (20) | 31 | (76) |
| Taioli [ | 98 | 24 | 85 | 441 | (450) | 16 | (67) | 595 | (607) | 18 | (75) |
| Sharpe [ | 3,875 | 7 | 92 | 886 | (23) | 5 | (71) | 911 | (24) | 5 | (71) |
| Zhang [ | 468 | 7 | 221 | 140 | (30) | 5 | (71) | 198 | (42) | 5 | (71) |
| Siddiqui [ | 3,119 | 13 | 129 | 824 | (26) | 8 | (62) | 1,002 | (32) | 9 | (69) |
| Mair-Jenkins [ | 1,449 | 32 | 75 | 971 | (67) | 22 | (69) | 1,086 | (75) | 22 | (69) |
| Bonitsis [ | 795 | 52 | 117 | 937 | (118) | 30 | (58) | 1,489 | (187) | 34 | (65) |
| Williams [ | 1,976 | 19 | 21 | 95 | (5) | 10 | (53) | 186 | (9) | 12 | (63) |
| Souto [ | 4,527 | 23 | 580 | 913 | (20) | 12 | (52) | 1,372 | (30) | 14 | (61) |
| Zhen [ | 742 | 25 | 59 | 215 | (29) | 13 | (52) | 290 | (39) | 15 | (60) |
| Shan [ | 243 | 19 | 60 | 289 | (119) | 9 | (47) | 344 | (142) | 11 | (58) |
| Marcuzzi [ | 5,009 | 15 | 85 | 739 | (15) | 7 | (47) | 851 | (17) | 8 | (53) |
| Lipinski [ | 824 | 17 | 420 | 531 | (64) | 6 | (35) | 610 | (74) | 9 | (53) |
| Stevens [ | 400 | 6 | 62 | 536 | (134) | 3 | (50) | 551 | (138) | 3 | (50) |
| Bernstein [ | 1,837 | 53 | 98 | 376 | (20) | 19 | (36) | 505 | (27) | 22 | (42) |
| Avni [ | 5,365 | 103 | 104 | 698 | (13) | 29 | (28) | 1,259 | (23) | 39 | (38) |
| Kumar [ | 573 | 16 | 101 | 926 | (162) | 5 | (31) | 1,013 | (177) | 5 | (31) |
| Fazeli [ | 1,195 | 5 | 4 | 7 | (1) | 1 | (20) | 7 | (1) | 1 | (20) |
| Brydges [ | 11,628 | 33 | 63 | 347 | (3) | 4 | (12) | 391 | (3) | 6 | (18) |
| McNally [ | 2,453 | 88 | 45 | 374 | (15) | 6 | (7) | 399 | (16) | 9 | (10) |
| Median | 1,194 | 18 | 85 | 530 | (29) | 12 | (69) | 652 | (50) | 13 | (79) |
| Mean | 2,396 | 23 | 336 | 539 | (58)a | 13 | (65) | 901 | (90)a | 15 | (75) |
Values in parentheses are the number of articles screened or studies retrieved as percentages of the numbers in the original meta-analyses. a: Calculated after removing outlier [40]
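As a small worked example of the percentages in parentheses, the sketch below recomputes three of them from the counts in the table. The helper name is hypothetical. Values above 100 mean that the citation-based search selected more articles for screening than the original meta-analysis did.

```python
def pct_of_original(n, original):
    """Express n as a rounded percentage of the original meta-analysis count."""
    return round(100 * n / original)


# (articles screened originally, articles screened in search 1), from the table.
rows = {
    "Mehrabi": (4148, 1113),
    "Pathak": (543, 584),
    "Schuit": (39, 1221),
}

pcts = {
    author: pct_of_original(search1, original)
    for author, (original, search1) in rows.items()
}

# Meta-analyses for which search 1 required more screening than the original.
more_than_original = [author for author, p in pcts.items() if p > 100]
```

The Schuit row shows why a single extreme value can dominate a mean: 1,221 articles against an original 39 gives a percentage above 3,000.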
Fig. 2 Articles screened and studies retrieved in Study 2. a Number of articles screened for the published meta-analysis, compared with the number selected for screening by the new method (searches for indirect and direct citations combined). b Studies retrieved in Study 2 (searches for indirect and direct citations combined) as percent of the number of studies included in the published meta-analysis (numbered as in Fig. 2a)
Fig. 3 Articles screened and studies retrieved in Study 2 (indirect citations), in relation to the number of citing articles. a Number of articles screened. b Studies retrieved (percent)