A Cecile J W Janssens, Marta Gwinn, J Elaine Brockman, Kimberley Powell, Michael Goodman.
Abstract
BACKGROUND: We recently developed CoCites, a citation-based search method that is designed to be more efficient than traditional keyword-based methods. The method begins with identification of one or more highly relevant publications (query articles) and consists of two searches: the co-citation search, which ranks publications on their co-citation frequency with the query articles, and the citation search, which ranks publications on frequency of all citations that cite or are cited by the query articles.
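The two searches described above can be illustrated on a toy citation graph. This is a minimal sketch, not the CoCites implementation: the `references` mapping, the article labels, and both function names are hypothetical, and it assumes an article counts as co-cited when it appears in the reference list of any article that cites at least one query article.

```python
from collections import Counter

# Hypothetical toy data: each article maps to the set of articles it cites.
references = {
    "A": {"Q1", "X", "Y"},
    "B": {"Q1", "Q2", "X"},
    "C": {"Q2", "X", "Z"},
    "D": {"Y", "Z"},
    "Q1": {"R1", "R2"},
    "Q2": {"R1"},
}
query = {"Q1", "Q2"}


def co_citation_counts(references, query):
    """Count how many citing articles cite each article together with a query article."""
    counts = Counter()
    for refs in references.values():
        if refs & query:  # this article cites at least one query article
            for ref in refs - query:
                counts[ref] += 1
    return counts


def citation_counts(references, query):
    """Count direct citation links between each article and the query articles."""
    counts = Counter()
    for article, refs in references.items():
        if article in query:
            # Articles cited by a query article.
            for ref in refs - query:
                counts[ref] += 1
        else:
            # Articles that cite one or more query articles.
            n_links = len(refs & query)
            if n_links:
                counts[article] += n_links
    return counts


co_cited = co_citation_counts(references, query)
cited_or_citing = citation_counts(references, query)

# Rank from most to least frequent, as both searches do.
print(co_cited.most_common())
print(cited_or_citing.most_common())
```

In this toy graph, `X` ranks first in the co-citation search because three articles cite it alongside a query article, while `B` and `R1` rank first in the citation search because each is linked to both query articles.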
Keywords: Citation; Co-citation; Keywords; Literature search; Meta-analysis; Systematic review
Year: 2020 PMID: 32028894 PMCID: PMC7006380 DOI: 10.1186/s12874-020-0907-5
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1 Overview of the search method. Circles represent articles and lines are the citations between them. Arrows indicate the direction of the citation. Bold circles represent the query articles that are used to begin a search. Numbers in the circles indicate the co-citation or citation counts. Dashed circles represent articles that will not be screened for eligibility if the screening threshold is higher than one. Figure is adapted from [7] (distributed under the Creative Commons Attribution 4.0 International License)
Fig. 2 Flowchart for inclusion of systematic reviews. WOS: Web of Science
Articles screened and retrieved in replicating the results of literature searches in 250 published reviews
| | Articles screened, number | Articles retrieved, percentage |
|---|---|---|
| In published review | 794 (273, 2132) | 100 |
| All co-cited articles | 5151 (2709, 10,490) | 75.0 (58.2, 87.5) |
| Co-cited > 1 | 1119 (544, 2509) | 60.0 (45.2, 78.3) |
| Co-cited > 1%* | 696 (461, 978) | 56.1 (40.0, 75.0) |
| 100 Top-ranked** | 109 (103, 123) | 37.5 (22.5, 50.0) |
| Citing or cited by > 1 | 83 (38, 176) | 50.0 (17.9, 75.8) |
| Total | 873 (540, 1204) *** | 75.0 (50.0, 90.1) |
All values are median and inter-quartile range (IQR). *Co-cited more than once and in more than 1% of the citing articles. The articles retrieved from this search were used to run the citation search. **Median is higher than 100 because we included all articles that had the same co-citation frequency as the 100th article. ***Sum of results in the co-citation ‘co-cited > 1%’ and citation searches combined, without removing duplicates. See details in methods.
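The screening thresholds named in the table rows and footnotes can be sketched as simple filters over co-citation counts. This is a hedged illustration only: the function names, the toy counts, and the `n_citing` parameter (taken here to be the number of articles citing the query articles) are assumptions, not details confirmed by the source.

```python
def co_cited_more_than_once(counts):
    """Articles co-cited with the query articles more than once."""
    return {article for article, n in counts.items() if n > 1}


def co_cited_above_share(counts, n_citing, share=0.01):
    """Articles co-cited more than once AND in more than `share` of the citing articles."""
    return {
        article
        for article, n in counts.items()
        if n > 1 and n > share * n_citing
    }


def top_ranked_with_ties(counts, k=100):
    """The k top-ranked articles, plus any article tied with the k-th one."""
    ranked = sorted(counts.values(), reverse=True)
    if len(ranked) <= k:
        return set(counts)
    cutoff = ranked[k - 1]  # co-citation frequency of the k-th article
    return {article for article, n in counts.items() if n >= cutoff}


# Hypothetical co-citation counts and number of citing articles.
toy_counts = {"a": 5, "b": 3, "c": 3, "d": 1, "e": 2}
n_citing = 250

more_than_once = co_cited_more_than_once(toy_counts)
above_one_percent = co_cited_above_share(toy_counts, n_citing)
top_two = top_ranked_with_ties(toy_counts, k=2)
```

Because ties at the cutoff are kept, `top_ranked_with_ties` can return more than `k` articles, which is consistent with the footnote explaining why the median for the "100 Top-ranked" row exceeds 100.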
Fig. 3 Numbers of articles screened in published reviews versus numbers screened by CoCites’ co-citation and citation searches. Legend: Compared for reviews in which authors screened fewer than 500 articles (left, n = 95) and more than 500 articles (right, n = 155)
Factors that influence the percentage of articles included in each review that were retrieved by co-citation and citation searches combined
| | Number of reviews | Percentage of articles retrieved |
|---|---|---|
| Number of citing articles | ||
| < 20 | 7 | 37.5 (28.6, 60.0) |
| 20–50 | 26 | 63.3 (39.4, 85.7) |
| 50–100 | 48 | 65.0 (50.0, 87.5) |
| 100–200 | 74 | 80.0 (50.0, 90.1) |
| > 200 | 95 | 77.6 (47.4, 93.8) |
| Similarity index* | ||
| < 0.1 | 19 | 34.4 (15.4, 56.3) |
| 0.1–0.2 | 26 | 56.8 (36.1, 72.3) |
| 0.2–0.5 | 109 | 75.0 (51.7, 92.3) |
| ≥ 0.5 | 96 | 83.3 (60.4, 94.4) |
| Percentage of articles in PubMed | ||
| < 90 | 64 | 62.4 (41.3, 80.3) |
| 90–100 | 64 | 67.1 (49.4, 84.5) |
| 100 | 122 | 82.8 (59.6, 100.0) |
*Similarity index = number of co-citations between query articles / number of citations of the less-cited query article. IQR inter-quartile range
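The similarity index defined in this footnote can be computed from the sets of articles that cite each query article. The sketch below is illustrative: the function name, the toy sets, and the choice to return 0.0 when a query article has no citations are assumptions rather than details from the source.

```python
def similarity_index(citing_first, citing_second):
    """Co-citations between two query articles divided by the citation
    count of the less-cited query article."""
    co_citations = len(citing_first & citing_second)
    fewest_citations = min(len(citing_first), len(citing_second))
    if fewest_citations == 0:
        return 0.0  # assumption: undefined ratio is reported as zero
    return co_citations / fewest_citations


# Hypothetical sets of articles citing each of two query articles.
citing_q1 = {"A", "B", "C", "D"}
citing_q2 = {"C", "D", "E", "F", "G"}

index = similarity_index(citing_q1, citing_q2)
```

Here two articles cite both query articles and the less-cited query article has four citations, so the index is 2 / 4 = 0.5, which would place this pair in the highest category of the table.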
Percentage of retrieved articles by the number of citing articles and similarity index
| Number of citing articles | Similarity index* | Percentage of articles in PubMed | Number of reviews | Percentage retrieved, median (IQR) |
|---|---|---|---|---|
| > 20 | > 0.2 | 100 | 101 | 87.5 (68.0, 100.0) |
| > 100 | > 0.2 | 100 | 70 | 90.8 (77.6, 100.0) |
| > 20 | > 0.5 | 100 | 52 | 87.5 (77.4, 100.0) |
| > 100 | > 0.5 | 100 | 32 | 91.9 (82.2, 100.0) |
| > 20 | > 0.2 | > 90 | 150 | 83.3 (64.1, 96.5) |
| > 100 | > 0.2 | > 90 | 111 | 85.7 (66.7, 96.7) |
| > 20 | > 0.5 | > 90 | 71 | 87.5 (69.2, 95.2) |
| > 100 | > 0.5 | > 90 | 47 | 87.5 (76.9, 95.2) |
| > 20 | > 0.2 | All | 200 | 80.0 (60.0, 94.1) |
| > 100 | > 0.2 | All | 139 | 81.2 (64.3, 95.2) |
| > 20 | > 0.5 | All | 94 | 84.5 (61.5, 94.4) |
| > 100 | > 0.5 | All | 59 | 86.8 (66.7, 94.4) |
*Similarity index = number of co-citations between query articles / number of citations of the less-cited query article. IQR inter-quartile range
Factors that affect retrieval of individual articles
| | | Co-citation search | | Co-citation + citation searches | |
|---|---|---|---|---|---|
| | Total | Number retrieved | Percentage* | Number retrieved | Percentage* |
| Overall | 4261 | 1938 | 45.5 | 2674 | 62.8 |
| Times cited | |||||
| Not indexed | 591 | 157 | 26.6 | 181 | 30.6 |
| 0 | 365 | 1 | 0.3 | 176 | 48.2 |
| 1–5 | 787 | 104 | 13.2 | 385 | 48.9 |
| 6–9 | 411 | 150 | 36.5 | 237 | 57.7 |
| 10–19 | 658 | 406 | 61.7 | 480 | 72.9 |
| 20–49 | 862 | 616 | 71.5 | 691 | 80.2 |
| > 50 | 587 | 504 | 85.9 | 524 | 89.3 |
| Indexed in WOS** | |||||
| No | 703 | 184 | 26.2 | 235 | 33.4 |
| Yes | 3558 | 1754 | 49.3 | 2439 | 68.5 |
| Number of references | |||||
| Not indexed | 591 | 157 | 26.6 | 181 | 30.6 |
| < 5 | 97 | 44 | 45.4 | 45 | 46.4 |
| 5–9 | 108 | 65 | 60.2 | 75 | 69.4 |
| ≥ 10 | 3465 | 1672 | 48.3 | 2373 | 68.5 |
| Years since publication | |||||
| 0–1 | 544 | 44 | 8.1 | 285 | 52.4 |
| 1–2 | 487 | 118 | 24.2 | 243 | 49.9 |
| 2–5 | 1081 | 402 | 37.2 | 606 | 56.1 |
| 5–10 | 994 | 585 | 58.9 | 686 | 69.0 |
| > 10 | 1155 | 789 | 68.3 | 854 | 73.9 |
*Percentage of articles included in reviews that were retrieved, by category. Query articles for all 250 reviews (n = 500) were removed from the dataset. **All articles are in WOS but not all are indexed. See methods for details
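The per-category percentages in this table are the number of retrieved articles divided by the category total. As a small worked check, the sketch below recomputes them for two "Years since publication" rows using the counts printed in the table; the helper name and the tuple layout are illustrative choices.

```python
def percentage_retrieved(retrieved, total):
    """Percentage of included articles in a category that a search retrieved."""
    return round(100 * retrieved / total, 1)


# (category, total, retrieved by co-citation, retrieved by both searches),
# with counts copied from the table rows.
rows = [
    ("0–1", 544, 44, 285),
    ("1–2", 487, 118, 243),
]

percentages = {
    category: (
        percentage_retrieved(co_citation, total),
        percentage_retrieved(combined, total),
    )
    for category, total, co_citation, combined in rows
}
```

For the "0–1" row this gives 8.1 for the co-citation search and 52.4 for the two searches combined, and for the "1–2" row it gives 24.2 and 49.9.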