| Literature DB >> 27272119 |
Ren-Hua Chung1, Wei-Yun Tsai1, Chen-Yu Kang1, Po-Ju Yao1, Hui-Ju Tsai1,2,3, Chia-Hsiang Chen4,5.
Abstract
In disease studies, family-based designs have become an attractive approach to analyzing next-generation sequencing (NGS) data for the identification of rare mutations enriched in families. Substantial research effort has been devoted to developing pipelines for automating sequence alignment, variant calling, and annotation. However, fewer pipelines have been designed specifically for disease studies. Most of the current analysis pipelines for family-based disease studies using NGS data focus on a specific function, such as identifying variants with Mendelian inheritance or identifying shared chromosomal regions among affected family members. Consequently, some other useful family-based analysis tools, such as imputation, linkage, and association tools, have yet to be integrated and automated. We developed FamPipe, a comprehensive analysis pipeline, which includes several family-specific analysis modules, including the identification of shared chromosomal regions among affected family members, prioritizing variants assuming a disease model, imputation of untyped variants, and linkage and association tests. We used simulation studies to compare properties of some modules implemented in FamPipe, and based on the results, we provided suggestions for the selection of modules to achieve an optimal analysis strategy. The pipeline is under the GNU GPL License and can be downloaded for free at http://fampipe.sourceforge.net.Entities:
Mesh:
Year: 2016 PMID: 27272119 PMCID: PMC4894624 DOI: 10.1371/journal.pcbi.1004980
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Flowchart of FamPipe.
Fig 2ROC curves for the Segregation score and weighted-sum statistic under Scen1 and Scen2 as described in S1 Text.
A data label in the Figure shows the rank cut-off value, specificity, and sensitivity for the data point.
Fig 3IQS for GIGI and Merlin across different intervals of MAFs for the medium families.
Fig 4IQS for GIGI and Merlin across different intervals of MAFs for the large families.
Comparisons among different family-based analysis pipelines for sequencing data.
| Disease model | IBD analysis | Linkage analysis | Imputation | Statistical association test for disease | |
|---|---|---|---|---|---|
| FamPipe | ○ | ○ | ○ | ○ | ○ |
| Merlin | ○ | ○ | ○ | ○ | |
| MORGAN | ○ | ○ | ○ | ||
| VAR-MD | ○ | ||||
| FamAnn | ○ | ||||
| VariantDB | ○ | ||||
| MendelScan | ○ | ○ | |||
| Olorin | ○ | ||||
| RVSharing | ○ | ||||
| pVAAST | ○ | ||||
| Weighted-sum statistic | ○ |
A ○ represents that the function is implemented in the tool.