Literature DB >> 29121165

PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences.

Akshay Kumar Avvaru1, Divya Tej Sowpati1, Rakesh Kumar Mishra1.   

Abstract

Motivation: Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations.
Results: We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. Availability and implementation: PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. Contact: tej@ccmb.res.in. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29121165     DOI: 10.1093/bioinformatics/btx721

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

1.  Complete chloroplast genome of Myracrodruon urundeuva and its phylogenetics relationships in Anacardiaceae family.

Authors:  Bruno Cesar Rossini; Mario Luiz Teixeira de Moraes; Celso Luis Marino
Journal:  Physiol Mol Biol Plants       Date:  2021-04-11

2.  Developing an ultra-efficient microsatellite discoverer to find structural differences between SARS-CoV-1 and Covid-19.

Authors:  Mahmoud Naghibzadeh; Hossein Savari; Abdorreza Savadi; Nayyereh Saadati; Elahe Mehrazin
Journal:  Inform Med Unlocked       Date:  2020-05-21

3.  Long-read sequencing across the C9orf72 'GGGGCC' repeat expansion: implications for clinical use and genetic discovery efforts in human disease.

Authors:  Mark T W Ebbert; Stefan L Farrugia; Jonathon P Sens; Karen Jansen-West; Tania F Gendron; Mercedes Prudencio; Ian J McLaughlin; Brett Bowman; Matthew Seetin; Mariely DeJesus-Hernandez; Jazmyne Jackson; Patricia H Brown; Dennis W Dickson; Marka van Blitterswijk; Rosa Rademakers; Leonard Petrucelli; John D Fryer
Journal:  Mol Neurodegener       Date:  2018-08-21       Impact factor: 14.195

4.  Patterns of microsatellite distribution across eukaryotic genomes.

Authors:  Surabhi Srivastava; Akshay Kumar Avvaru; Divya Tej Sowpati; Rakesh K Mishra
Journal:  BMC Genomics       Date:  2019-02-22       Impact factor: 3.969

5.  MSDB: a comprehensive, annotated database of microsatellites.

Authors:  Akshay Kumar Avvaru; Deepak Sharma; Archana Verma; Rakesh K Mishra; Divya Tej Sowpati
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

6.  Microsatellite Variation in the Most Devastating Beetle Pests (Coleoptera: Curculionidae) of Agricultural and Forest Crops.

Authors:  Manee M Manee; Badr M Al-Shomrani; Musaad A Altammami; Hamadttu A F El-Shafie; Atheer A Alsayah; Fahad M Alhoshani; Fahad H Alqahtani
Journal:  Int J Mol Sci       Date:  2022-08-30       Impact factor: 6.208

7.  Long-Read Genome Sequencing and Assembly of Leptopilina boulardi: A Specialist Drosophila Parasitoid.

Authors:  Shagufta Khan; Divya Tej Sowpati; Arumugam Srinivasan; Mamilla Soujanya; Rakesh K Mishra
Journal:  G3 (Bethesda)       Date:  2020-05-04       Impact factor: 3.154

8.  Draft genome sequence data of maqui (Aristotelia chilensis) and identification of SSR markers.

Authors:  Adriana Bastías; Francisco Correa; Pamela Rojas; Constanza Martin; Jorge Pérez-Diaz; Cristian Yáñez; Mara Cuevas; Ricardo Verdugo; Boris Sagredo
Journal:  Data Brief       Date:  2019-09-20

9.  Genome-wide characterization of simple sequence repeats in Palmae genomes.

Authors:  Manee M Manee; Badr M Al-Shomrani; Mohamed B Al-Fageeh
Journal:  Genes Genomics       Date:  2020-04-03       Impact factor: 1.839

10.  BigFiRSt: A Software Program Using Big Data Technique for Mining Simple Sequence Repeats From Large-Scale Sequencing Data.

Authors:  Jinxiang Chen; Fuyi Li; Miao Wang; Junlong Li; Tatiana T Marquez-Lago; André Leier; Jerico Revote; Shuqin Li; Quanzhong Liu; Jiangning Song
Journal:  Front Big Data       Date:  2022-01-18
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.