| Literature DB >> 31705189 |
Marta Casanellas1, Jesús Fernández-Sánchez2, Jordi Roca-Lacostena2.
Abstract
Deciding whether a substitution matrix is embeddable (i.e. the corresponding Markov process has a continuous-time realization) is an open problem even for [Formula: see text] matrices. We study the embedding problem and rate identifiability for the K80 model of nucleotide substitution. For these [Formula: see text] matrices, we fully characterize the set of embeddable K80 Markov matrices and the set of embeddable matrices for which rates are identifiable. In particular, we describe an open subset of embeddable matrices with non-identifiable rates. This set contains matrices with positive eigenvalues and also diagonal largest in column matrices, which might lead to consequences in parameter estimation in phylogenetics. Finally, we compute the relative volumes of embeddable K80 matrices and of embeddable matrices with identifiable rates. This study concludes the embedding problem for the more general model K81 and its submodels, which had been initiated by the last two authors in a separate work.Entities:
Keywords: Embedding problem; Markov generator; Markov matrix; Matrix logarithm; Nucleotide substitution model; Rate identifiability
Mesh:
Substances:
Year: 2019 PMID: 31705189 DOI: 10.1007/s00285-019-01446-0
Source DB: PubMed Journal: J Math Biol ISSN: 0303-6812 Impact factor: 2.259