| Literature DB >> 30779702 |
Xiaoqiang Wang1, Emilie Lebarbier2, Julie Aubert2, Stéphane Robin2.
Abstract
Hidden Markov models provide a natural statistical framework for the detection of the copy number variations (CNV) in genomics. In this context, we define a hidden Markov process that underlies all individuals jointly in order to detect and to classify genomics regions in different states (typically, deletion, normal or amplification). Structural variations from different individuals may be dependent. It is the case in agronomy where varietal selection program exists and species share a common phylogenetic past. We propose to take into account these dependencies inthe HMM model. When dealing with a large number of series, maximum likelihood inference (performed classically using the EM algorithm) becomes intractable. We thus propose an approximate inference algorithm based on a variational approach (VEM), implemented in the CHMM R package. A simulation study is performed to assess the performance of the proposed method and an application to the detection of structural variations in plant genomes is presented.Keywords: copy number variation; coupled Hidden Markov models; variational approximation
Mesh:
Year: 2019 PMID: 30779702 DOI: 10.1515/ijb-2018-0023
Source DB: PubMed Journal: Int J Biostat ISSN: 1557-4679 Impact factor: 0.968