| Literature DB >> 23977157 |
Xingshun Qi1, Man Yang, Weirong Ren, Jia Jia, Juan Wang, Guohong Han, Daiming Fan.
Abstract
BACKGROUND: Finding duplicates is an important phase of systematic review. However, no consensus regarding the methods to find duplicates has been provided. This study aims to describe a pragmatic strategy of combining auto- and hand-searching duplicates in systematic review and to evaluate the prevalence and characteristics of duplicates. METHODS ANDEntities:
Mesh:
Year: 2013 PMID: 23977157 PMCID: PMC3748039 DOI: 10.1371/journal.pone.0071838
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Examples of type I duplicates (duplicates among databases).
| Example | Index paper | Database | Redundant papers | Database | Acceptable causes | Unacceptable causes |
| No. 1 | Ames PR. Recurrent abdominal thrombosis despite heparin thromboprophylaxis in a patient with transient eosinophilia. | PubMed | Ames PR | EMBASE | 1) The author's middle name was missing in PubMed. 2) The journal's name was spelt in full style in EMBASE, but in abbreviated style in PubMed. 3) The publication date was expressed in “year” style in EMBASE, but in “year month” style in PubMed. | None |
| No. 2 | Boylan JF, Klinck JR, Sandler AN, Arellano R, Greig PD, Nierenberg H, et al. Tranexamic acid reduces blood loss, transfusion requirements, and coagulation factor use in primary orthotopic liver transplantation. Anesthesiology. 1996 | PubMed | Boylan JF, Klinck JR, Sandler AN, Arellano R, Greig PD, Nierenberg H, et al. Tranexamic acid reduces blood loss, transfusion requirements, and coagulation factor use in primary orthotopic liver transplantation. Anesthesiology | Cochrane library (upper) EMBASE (lower) | 1) The publication date was expressed in “year” style in EMBASE and Cochrane library, but in “year month” style in PubMed. 2) The page was expressed in different styles between PubMed and EMBASE. | 1) The volume and page were missing in Cochrane library. |
| No. 3 |
| PubMed |
| Cochrane Library | 1) The journal's name was spelt in full style in EMBASE, but in abbreviated style in PubMed. 2) The publication date was expressed in “year” style in EMBASE, but in “year month” style in PubMed. | 1) The authors' order was wrong in Cochrane library. 2) The volume and page were missing in Cochrane library. |
| No. 4 | Bjorkman JA, Ilebekk A, Jern C. Release of tissue-type plasminogen activator (t-PA) in the splanchnic circulation of the anaesthetised pig during high sympathetic tone. | PubMed | Bjorkman JA, Ilebekk A, Jern C. Release of tissue-type plasminogen activator (t-PA) in the splanchnic circulation of the anaesthetised pig during high sympathetic tone. | EMBASE | 1) The journal's name was spelt in full style in EMBASE, but in abbreviated style in PubMed. | 1) The publication date was wrong in EMBASE. 2) The volume, issue, and page were missing in EMBASE. |
| No. 5 | Cappellini MD, Grespi E, Cassinerio E, Bignamini D, Fiorelli G. Coagulation and splenectomy: | PubMed | Cappellini MD, Grespi E, Cassinerio E, Bignamini D, Fiorelli G. Coagulation and splenectomy: | EMBASE | 1) The title was in uppercase in EMBASE, but in lowercase in PubMed. 2) The page was expressed in different styles. | 1) The journal's name was missing in EMBASE. 2) The volume was missing in EMBASE. |
| No. 6 | Choi JY, Lee JY, | PubMed | Choi JY, Lee JY, | EMBASE | 1) The journal's name was spelt in full style in EMBASE, but in abbreviated style in PubMed. 2) The publication date was expressed in “year” style in EMBASE, but in “year month” style in PubMed. | 1) The family and given names of authors were reversed in EMBASE. |
| No. 7 | Akin O, Dixit D, Schwartz L. Bland and tumor thrombi in abdominal malignancies: | PubMed |
| EMBASE | 1) The title was in uppercase in EMBASE, but in lowercase in PubMed. 2) The journal's name was spelt in full style in EMBASE, but in abbreviated style in PubMed. 3) The publication date was expressed in “year” style in EMBASE, but in “year month” style in PubMed. | 1) One author was wrongly added in EMBASE. |
| No. 8 |
| PubMed |
| EMBASE | 1) The journal's name was spelt in full style in EMBASE, but in abbreviated style in PubMed. 2) The publication date was expressed in “year” style in EMBASE, but in “year month” style in PubMed. | 1) All authors were wrongly listed in EMBASE. |
| No. 9 | Kimura K, Okuda K, Takara K, | PubMed | Kimura K, Okuda K, Takara K. Membranous obstruction of the portal vein. A case report. Gastroenterology. 1985;88(2):571– | EMBASE | 1) The publication date was expressed in “year” style in EMBASE, but in “year month” style in PubMed. | 1) Two authors were missing in EMBASE. 2) The page was wrong in EMBASE. |
| No. 10 | Ozbulbul NI, Yurdakul M, Tola M. | PubMed | Ozbulbul NI, Yurdakul M, Tola M. | EMBASE | 1) The journal's name was spelt in full style in EMBASE, but in abbreviated style in PubMed. 2) The publication date was expressed in “year” style in EMBASE, but in “year month” style in PubMed. | 1) Some words of the titles were missing in EMBASE. |
| No. 11 | Senzol | PubMed | Senzol | EMBASE | 1) The journal's name was spelt in full style in EMBASE, but in abbreviated style in PubMed. 2) The publication date was expressed in “year” style in EMBASE, but in “year month” style in PubMed. | 1) One author's name was wrongly spelt in EMBASE. |
Notes:
– All examples originated from the literatures regarding portal vein thrombosis.
– In every example, the same study was simultaneously recorded by two or three databases.
– All literatures were expressed in Vancouver reference type.
– Bold and italics formatting indicated the different styles between index and redundant paper(s).
– In every example, the reference recorded by PubMed database had more complete information.
Examples of type II duplicates (duplicate publications).
| Example | References | Databases | No. Pts | Target population | Study objectives | Publication type |
|
| Carr BI, Pancoska P, Branch RA. Tumor and liver determinants of prognosis in unresectable hepatocellular carcinoma: a | PubMed | 967 | Unresectable and untransplantable, biopsy-proven hepatocellular carcinoma | Survival analysis | Full-text |
| Carr BI, | PubMed | 967 | Unresectable and untransplantable, biopsy-proven hepatocellular carcinoma | Survival analysis | Full-text | |
|
| Ban D, Shimada K, Nara S, Esaki M, Sakamoto Y, | EMBASE | 45 | Tumor invasion of the first branch of the portal vein and tumor in the main portal trunk or the opposite-side portal branch | Efficacy of hepatectomy and tumor thrombectomy | Abstract |
| Ban D, Shimada K, | PubMed | 45 | Portal vein tumor thrombus extending to the first portal branch and main portal vein trunk | Efficacy of hepatectomy and tumor thrombectomy | Full-text | |
|
| Bartlett D, Lloyd C, Mirza D, McKiernan P, Newsome P. Nitisinone treatment reduces the need for iver transplantation in children with | EMBASE | 38 | Tyrosinaemia Type 1 | Efficacy of Nitisione | Abstract |
| Bartlett D | EMBASE | 38 | Tyrosinaemia Type 1 | Efficacy of Nitisione | Abstract |
Notes:
– All examples originated from the literature regarding portal vein thrombosis.
– In every example, the same study was published in two different journals.
– All literatures were expressed in Vancouver reference type.
– Bold and italics formatting indicated the different styles between index and redundant paper(s).
Figure 1Study flowchart of finding duplicates in the literatures regarding portal vein thrombosis (panel A) and Budd-Chiari syndrome (panel B).
Characteristics of duplicates in literatures regarding portal vein thrombosis and Budd-Chiari syndrome.
| Characteristics | Portal vein thrombosis | Budd-Chiari syndrome | |||
| Auto-searched duplicates | Hand-searched duplicates | Auto-searched duplicates | Hand-searched duplicates | ||
| No. total papers | 2399 | 1307 | 3275 | 2064 | |
| – Index papers | 1198 | 642 | 1635 | 1025 | |
| – Redundant papers | 1201 | 665 | 1640 | 1039 | |
|
| |||||
| – Type I duplicates | 2385 | 1046 | 3263 | 1790 | |
| – Type II duplicates | 14 | 261 | 12 | 274 | |
|
| |||||
| – Double duplicates (type I/II) | 2392 (2378/14) | 1242 (1022/220) | 3262 (3250/12) | 2022 (1772/250) | |
| – Triple duplicates (type I/II) | 3 (3/0) | 57 (24/33) | 9 (9/0) | 42 (18/24) | |
| – Quadruple duplicates (type I/II) | 4 (4/0) | 8 (0/8) | 4 (4/0) | 0 (0/0) | |
|
| |||||
| – PubMed-PubMed (type I/II) | 4 (0/4) | 28 (0/28) | 2 (0/2) | 24 (2/22) | |
| – PubMed-EMBASE (type I/II) | 2373 (2371/2) | 939 (856/83) | 3259 (3259/0) | 1794 (1701/93) | |
| – PubMed-Cochrane (type I/II) | 0 (0/0) | 113 (108/5) | 0 (0/0) | 42 (42/0) | |
| – EMBASE-EMBASE (type I/II) | 20 (12/8) | 180 (35/145) | 12 (2/10) | 161 (2/159) | |
| – EMBASE-Cochrane (type I/II) | 0 (0/0) | 32 (32/0) | 0 (0/0) | 28 (28/0) | |
| – Cochrane-Cochrane (type I/II) | 2 (2/0) | 0 (0/0) | 2 (2/0) | 0 (0/0) | |
| – PubMed-EMBASE-Cochrane (type I/II) | 0 (0/0) | 15 (15/0) | 0 (0/0) | 15 (15/0) | |
|
| |||||
| – Abstract-Abstract | 8 | 64 | 8 | 80 | |
| – Abstract-Full text | 0 | 137 | 0 | 127 | |
| – Full text-Full text | 6 | 60 | 4 | 67 | |
Notes: Type I duplicates represent duplicates among databases; type II duplicates represent duplicate publications in different journals/issues.
Type I duplicates – difference between index and redundant papers.
| Items | Portal vein thrombosis | Budd-Chiari syndrome | |||
| Auto-searched duplicates | Hand-searched duplicates | Auto-searched duplicates | Hand-searched duplicates | ||
| No. type I duplicates | 2385 | 1046 | 3263 | 1790 | |
|
| |||||
| – Same | 2371 | 491 | 3235 | 788 | |
| – Different (Acceptable/Unacceptable) | 14 (14/0) | 555 (269/286) | 28 (28/0) | 1002 (358/644) | |
|
| |||||
| – Same | 1641 | 500 | 2252 | 728 | |
| – Different (Acceptable/Unacceptable) | 744 (744/0) | 546 (517/29) | 1011 (1011/0) | 1062 (1025/37) | |
|
| |||||
| – Same | 270 | 137 | 416 | 267 | |
| – Different (Acceptable/Unacceptable) | 2115 (2115/0) | 909 (905/4) | 2847 (2847/0) | 1523 (1519/4) | |
|
| |||||
| – Same | 172 | 189 | 172 | 805 | |
| – Different (Acceptable/Unacceptable) | 2213 (2213/0) | 857 (841/16) | 3091 (3091/0) | 985 (977/8) | |
|
| |||||
| – Same | 2364 | 885 | 3243 | 1693 | |
| – Different (Acceptable/Unacceptable) | 21 (0/21) | 161 (0/161) | 20 (4/16) | 97 (5/92) | |
|
| |||||
| – Same | 2314 | 1012 | 3130 | 1718 | |
| – Different (Acceptable/Unacceptable) | 71 (48/23) | 34 (12/22) | 133 (115/18) | 72 (30/42) | |
|
| |||||
| – Same | 2246 | 815 | 3109 | 1630 | |
| – Different (Acceptable/Unacceptable) | 139 (96/43) | 231 (46/185) | 154 (124/30) | 160 (86/74) | |
Notes: Type I duplicates represent duplicates among databases.
Figure 2Proportion of wrong information of auto-searched (panel A) and hand-searched (panel B) type I duplicates from the literatures regarding portal vein thrombosis and that of auto-searched (panel C) and hand-searched (panel D) type I duplicates from literatures regarding Budd-Chiari syndrome.
Figure 3Simplified scheme to identify duplicates in systematic review.
The scheme includes the third main steps. First, all literatures retrieved from different databases are combined into one Endnote library. In this Endnote library, “Find Duplicates” preferences are defined on “Edit” menu. Thus, duplicates can be automatically searched by Endnote library. Subsequently, the review authors should check the accuracy and identify the type of duplicates. Finally, the redundant papers are excluded. Considering that a single strategy of auto-searching method was inadequate, additional search should be very necessary. Second, the remaining literatures are alphabetically ordered according to the first authors’ names in the Endnote library. If the first authors were the same between two or more articles, the review authors would further read the titles, journals’ names, volumes, issues, and pages. Subsequently, if these articles had the same titles, journals’ names, and issues, they would be attributed to the type I duplicates. Notably, the review authors should identify whether the difference between index and redundant papers was acceptable or not. On the other hand, if these had the same or similar titles but different journals or issues, the review authors would further read the abstracts and/or full-texts to judge whether or not they could be attributed to the type II duplicates. Third, the remaining literatures were also alphabetically ordered according to the titles in the Endnote library. If the titles were the same between two or more articles, the review authors would further read the journals’ names, volumes, issues, and pages. Subsequently, if these articles had the same journals’ names and issues, they would be attributed to type I duplicates. Notably, the review authors should identify whether the difference between index and redundant papers was acceptable or not. On the other hand, if these articles had the same or similar titles but different journals or issues, the review authors would further read the abstracts and/or full-texts to judge whether or not they could be attributed to the type II duplicates. Finally, review authors should check the accuracy.