
fcGENE: a versatile tool for processing and transforming SNP datasets.

Nab Raj Roshyara, Markus Scholz.

Abstract

BACKGROUND: Modern analysis of high-dimensional SNP data requires a number of biometrical and statistical methods such as pre-processing, analysis of population structure, association analysis and genotype imputation. Software used for these purposes often relies on specific and mutually incompatible input and output data formats. Therefore, extensive data management, including multiple format conversions, is necessary during analyses.
METHODS: To support fast and efficient management and bio-statistical quality control of high-dimensional SNP data, we developed the publicly available software fcGENE using the object-oriented programming language C++. This software simplifies and automates the use of different existing analysis packages, especially during the workflow of genotype imputation and the corresponding analyses.
RESULTS: fcGENE transforms SNP data and imputation results into the different formats required by a large variety of analysis packages such as PLINK, SNPTEST, HAPLOVIEW, EIGENSOFT and GenABEL, and by tools used for genotype imputation such as MaCH, IMPUTE, BEAGLE and others. Data management tasks such as merging, splitting and extracting SNP and pedigree information can be performed. fcGENE also supports a number of bio-statistical quality control processes and quality-based filtering processes at the SNP and sample level. The tool also generates templates of the commands required to run specific software packages, especially those required for genotype imputation. We demonstrate the functionality of fcGENE with example workflows of SNP data analyses and provide a comprehensive manual of commands, options and applications.
CONCLUSIONS: We have developed fcGENE, a user-friendly open-source software tool that comprehensively supports SNP data management, quality control and analysis workflows. Download statistics and corresponding feedback indicate that the software is highly recognised and extensively applied by the scientific community.

Year:  2014        PMID: 25050709      PMCID: PMC4106754          DOI: 10.1371/journal.pone.0097589

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  24 in total

1.  Population-specific genotype imputations using minimac or IMPUTE2.

Authors:  Elisabeth M van Leeuwen; Alexandros Kanterakis; Patrick Deelen; Mathijs V Kattenberg; P Eline Slagboom; Paul I W de Bakker; Cisca Wijmenga; Morris A Swertz; Dorret I Boomsma; Cornelia M van Duijn; Lennart C Karssen; Jouke Jan Hottenga
Journal:  Nat Protoc       Date:  2015-07-30       Impact factor: 13.491

2.  A genome-wide association study confirms PNPLA3 and identifies TM6SF2 and MBOAT7 as risk loci for alcohol-related cirrhosis.

Authors:  Stephan Buch; Felix Stickel; Eric Trépo; Michael Way; Alexander Herrmann; Hans Dieter Nischalke; Mario Brosch; Jonas Rosendahl; Thomas Berg; Monika Ridinger; Marcella Rietschel; Andrew McQuillin; Josef Frank; Falk Kiefer; Stefan Schreiber; Wolfgang Lieb; Michael Soyka; Nasser Semmo; Elmar Aigner; Christian Datz; Renate Schmelz; Stefan Brückner; Sebastian Zeissig; Anna-Magdalena Stephan; Norbert Wodarz; Jacques Devière; Nicolas Clumeck; Christoph Sarrazin; Frank Lammert; Thierry Gustot; Pierre Deltenre; Henry Völzke; Markus M Lerch; Julia Mayerle; Florian Eyer; Clemens Schafmayer; Sven Cichon; Markus M Nöthen; Michael Nothnagel; David Ellinghaus; Klaus Huse; Andre Franke; Steffen Zopf; Claus Hellerbrand; Christophe Moreno; Denis Franchimont; Marsha Y Morgan; Jochen Hampe
Journal:  Nat Genet       Date:  2015-10-19       Impact factor: 38.330

3.  Identification of shared and unique susceptibility pathways among cancers of the lung, breast, and prostate from genome-wide association studies and tissue-specific protein interactions.

Authors:  David C Qian; Jinyoung Byun; Younghun Han; Casey S Greene; John K Field; Rayjean J Hung; Yonathan Brhane; John R Mclaughlin; Gordon Fehringer; Maria Teresa Landi; Albert Rosenberger; Heike Bickeböller; Jyoti Malhotra; Angela Risch; Joachim Heinrich; David J Hunter; Brian E Henderson; Christopher A Haiman; Fredrick R Schumacher; Rosalind A Eeles; Douglas F Easton; Daniela Seminara; Christopher I Amos
Journal:  Hum Mol Genet       Date:  2015-10-19       Impact factor: 6.150

4.  Leveraging health systems data to characterize a large effect variant conferring risk for liver disease in Puerto Ricans.

Authors:  Gillian M Belbin; Stephanie Rutledge; Tetyana Dodatko; Sinead Cullina; Michael C Turchin; Sumita Kohli; Denis Torre; Muh-Ching Yee; Christopher R Gignoux; Noura S Abul-Husn; Sander M Houten; Eimear E Kenny
Journal:  Am J Hum Genet       Date:  2021-10-21       Impact factor: 11.025

5.  Selection Signatures in South African Nguni and Bonsmara Cattle Populations Reveal Genes Relating to Environmental Adaptation.

Authors:  Bhaveni B Kooverjee; Pranisha Soma; Magrieta A Van Der Nest; Michiel M Scholtz; Frederick W C Neser
Journal:  Front Genet       Date:  2022-06-17       Impact factor: 4.772

6.  Genome wide association mapping and candidate gene analysis for pod shatter resistance in Brassica juncea and its progenitor species.

Authors:  Jasmeet Kaur; Javed Akhatar; Anna Goyal; Navneet Kaur; Snehdeep Kaur; Meenakshi Mittal; Nitin Kumar; Heena Sharma; Shashi Banga; S S Banga
Journal:  Mol Biol Rep       Date:  2020-03-26       Impact factor: 2.316

7.  Collagenous Colitis Is Associated With HLA Signature and Shares Genetic Risks With Other Immune-Mediated Diseases.

Authors:  Eli Stahl; Giulia Roda; Amanda Dobbyn; Jianzhong Hu; Zhongyang Zhang; Helga Westerlind; Ferdinando Bonfiglio; Towfique Raj; Joana Torres; Anli Chen; Robert Petras; Darrell S Pardi; Alina C Iuga; Gabriel S Levi; Wenqing Cao; Prantesh Jain; Florian Rieder; Ilyssa O Gordon; Judy H Cho; Mauro D'Amato; Noam Harpaz; Ke Hao; Jean Frederic Colombel; Inga Peter
Journal:  Gastroenterology       Date:  2020-05-01       Impact factor: 22.682

8.  Impact of genetic similarity on imputation accuracy.

Authors:  Nab Raj Roshyara; Markus Scholz
Journal:  BMC Genet       Date:  2015-07-22       Impact factor: 2.797

9.  Impact of pre-imputation SNP-filtering on genotype imputation results.

Authors:  Nab Raj Roshyara; Holger Kirsten; Katrin Horn; Peter Ahnert; Markus Scholz
Journal:  BMC Genet       Date:  2014-08-12       Impact factor: 2.797

10.  Association genetics of the parameters related to nitrogen use efficiency in Brassica juncea L.

Authors:  Neha Gupta; Mehak Gupta; Javed Akhatar; Anna Goyal; Rimaljeet Kaur; Sanjula Sharma; Prinka Goyal; Archana Mukta; Navneet Kaur; Meenakshi Mittal; Mohini Prabha Singh; Baudh Bharti; V K Sardana; Surinder S Banga
Journal:  Plant Mol Biol       Date:  2020-09-30       Impact factor: 4.076
