Literature DB >> 34850956

msRepDB: a comprehensive repetitive sequence database of over 80 000 species.

Xingyu Liao1,2, Kang Hu2, Adil Salhi1, You Zou2, Jianxin Wang2, Xin Gao1.   

Abstract

Repeats are prevalent in the genomes of all bacteria, plants and animals, and they cover nearly half of the Human genome, which play indispensable roles in the evolution, inheritance, variation and genomic instability, and serve as substrates for chromosomal rearrangements that include disease-causing deletions, inversions, and translocations. Comprehensive identification, classification and annotation of repeats in genomes can provide accurate and targeted solutions towards understanding and diagnosis of complex diseases, optimization of plant properties and development of new drugs. RepBase and Dfam are two most frequently used repeat databases, but they are not sufficiently complete. Due to the lack of a comprehensive repeat database of multiple species, the current research in this field is far from being satisfactory. LongRepMarker is a new framework developed recently by our group for comprehensive identification of genomic repeats. We here propose msRepDB based on LongRepMarker, which is currently the most comprehensive multi-species repeat database, covering >80 000 species. Comprehensive evaluations show that msRepDB contains more species, and more complete repeats and families than RepBase and Dfam databases. (https://msrepdb.cbrc.kaust.edu.sa/pages/msRepDB/index.html).
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 34850956      PMCID: PMC8728181          DOI: 10.1093/nar/gkab1089

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  42 in total

1.  The Ensembl genome database project.

Authors:  T Hubbard; D Barker; E Birney; G Cameron; Y Chen; L Clark; T Cox; J Cuff; V Curwen; T Down; R Durbin; E Eyras; J Gilbert; M Hammond; L Huminiecki; A Kasprzyk; H Lehvaslaiho; P Lijnzaad; C Melsopp; E Mongin; R Pettett; M Pocock; S Potter; A Rust; E Schmidt; S Searle; G Slater; J Smith; W Spooner; A Stabenau; J Stalker; E Stupka; A Ureta-Vidal; I Vastrik; M Clamp
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

2.  Automated de novo identification of repeat sequence families in sequenced genomes.

Authors:  Zhirong Bao; Sean R Eddy
Journal:  Genome Res       Date:  2002-08       Impact factor: 9.043

3.  Expanded GGGGCC hexanucleotide repeat in noncoding region of C9ORF72 causes chromosome 9p-linked FTD and ALS.

Authors:  Mariely DeJesus-Hernandez; Ian R Mackenzie; Bradley F Boeve; Adam L Boxer; Matt Baker; Nicola J Rutherford; Alexandra M Nicholson; NiCole A Finch; Heather Flynn; Jennifer Adamson; Naomi Kouri; Aleksandra Wojtas; Pheth Sengdy; Ging-Yuek R Hsiung; Anna Karydas; William W Seeley; Keith A Josephs; Giovanni Coppola; Daniel H Geschwind; Zbigniew K Wszolek; Howard Feldman; David S Knopman; Ronald C Petersen; Bruce L Miller; Dennis W Dickson; Kevin B Boylan; Neill R Graff-Radford; Rosa Rademakers
Journal:  Neuron       Date:  2011-09-21       Impact factor: 17.173

Review 4.  Why repetitive DNA is essential to genome function.

Authors:  James A Shapiro; Richard von Sternberg
Journal:  Biol Rev Camb Philos Soc       Date:  2005-05

5.  Characteristic enrichment of DNA repeats in different genomes.

Authors:  R Cox; S M Mirkin
Journal:  Proc Natl Acad Sci U S A       Date:  1997-05-13       Impact factor: 11.205

6.  RepeatModeler2 for automated genomic discovery of transposable element families.

Authors:  Jullien M Flynn; Robert Hubley; Clément Goubert; Jeb Rosen; Andrew G Clark; Cédric Feschotte; Arian F Smit
Journal:  Proc Natl Acad Sci U S A       Date:  2020-04-16       Impact factor: 11.205

7.  The effects of repeated whole genome duplication events on the evolution of cytokinin signaling pathway.

Authors:  Elisabeth Kaltenegger; Svetlana Leng; Alexander Heyl
Journal:  BMC Evol Biol       Date:  2018-05-29       Impact factor: 3.260

8.  Rapid and precise alignment of raw reads against redundant databases with KMA.

Authors:  Philip T L C Clausen; Frank M Aarestrup; Ole Lund
Journal:  BMC Bioinformatics       Date:  2018-08-29       Impact factor: 3.169

9.  A sensitive repeat identification framework based on short and long reads.

Authors:  Xingyu Liao; Min Li; Kang Hu; Fang-Xiang Wu; Xin Gao; Jianxin Wang
Journal:  Nucleic Acids Res       Date:  2021-09-27       Impact factor: 16.971

10.  Extensive somatic L1 retrotransposition in colorectal tumors.

Authors:  Szilvia Solyom; Adam D Ewing; Eric P Rahrmann; Tara Doucet; Heather H Nelson; Michael B Burns; Reuben S Harris; David F Sigmon; Alex Casella; Bracha Erlanger; Sarah Wheelan; Kyle R Upton; Ruchi Shukla; Geoffrey J Faulkner; David A Largaespada; Haig H Kazazian
Journal:  Genome Res       Date:  2012-09-11       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.