Literature DB >> 18056062

Déjà vu--a study of duplicate citations in Medline.

Mounir Errami1, Justin M Hicks, Wayne Fisher, David Trusty, Jonathan D Wren, Tara C Long, Harold R Garner.   

Abstract

MOTIVATION: Duplicate publication impacts the quality of the scientific corpus, has been difficult to detect, and studies this far have been limited in scope and size. Using text similarity searches, we were able to identify signatures of duplicate citations among a body of abstracts.
RESULTS: A sample of 62,213 Medline citations was examined and a database of manually verified duplicate citations was created to study author publication behavior. We found that 0.04% of the citations with no shared authors were highly similar and are thus potential cases of plagiarism. 1.35% with shared authors were sufficiently similar to be considered a duplicate. Extrapolating, this would correspond to 3500 and 117,500 duplicate citations in total, respectively. AVAILABILITY: eTBLAST, an automated citation matching tool, and Déjà vu, the duplicate citation database, are freely available at http://invention.swmed.edu/ and http://spore.swmed.edu/dejavu

Mesh:

Year:  2007        PMID: 18056062     DOI: 10.1093/bioinformatics/btm574

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  20 in total

1.  Spotlight on plagiarism.

Authors:  Stephan Mertens
Journal:  Dtsch Arztebl Int       Date:  2010-12-10       Impact factor: 5.594

2.  Conflict of interest in science communication: more than a financial issue. Report from Esteve Foundation Discussion Group, April 2009.

Authors:  Harvey Marcovitch; Virginia Barbour; Carme Borrell; Felix Bosch; Esteve Fernández; Helen Macdonald; Ana Marusić; Magne Nylenna
Journal:  Croat Med J       Date:  2010-02       Impact factor: 1.351

3.  Comparing Medline citations using modified N-grams.

Authors:  Rao Muhammad Adeel Nawab; Mark Stevenson; Paul Clough
Journal:  J Am Med Inform Assoc       Date:  2013-05-28       Impact factor: 4.497

4.  Exploring why and how journal editors retract articles: findings from a qualitative study.

Authors:  Peter Williams; Elizabeth Wager
Journal:  Sci Eng Ethics       Date:  2011-07-15       Impact factor: 3.525

5.  Ethics in science: are we losing the moral high ground?

Authors:  Faisal M Sanai
Journal:  Saudi J Gastroenterol       Date:  2008-07       Impact factor: 2.485

6.  Identifying duplicate content using statistically improbable phrases.

Authors:  Mounir Errami; Zhaohui Sun; Angela C George; Tara C Long; Michael A Skinner; Jonathan D Wren; Harold R Garner
Journal:  Bioinformatics       Date:  2010-05-13       Impact factor: 6.937

7.  Author's misconduct inviting risk: duplicate publication.

Authors:  B K Nayak
Journal:  Indian J Ophthalmol       Date:  2009 Nov-Dec       Impact factor: 1.848

8.  Quantitative comparison of catalytic mechanisms and overall reactions in convergently evolved enzymes: implications for classification of enzyme function.

Authors:  Daniel E Almonacid; Emmanuel R Yera; John B O Mitchell; Patricia C Babbitt
Journal:  PLoS Comput Biol       Date:  2010-03-12       Impact factor: 4.475

9.  Deja vu: a database of highly similar citations in the scientific literature.

Authors:  Mounir Errami; Zhaohui Sun; Tara C Long; Angela C George; Harold R Garner
Journal:  Nucleic Acids Res       Date:  2008-08-30       Impact factor: 16.971

10.  Iron behaving badly: inappropriate iron chelation as a major contributor to the aetiology of vascular and other progressive inflammatory and degenerative diseases.

Authors:  Douglas B Kell
Journal:  BMC Med Genomics       Date:  2009-01-08       Impact factor: 3.063

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.