| Literature DB >> 21134474 |
Véronique Campbell1, François-Joseph Lapointe.
Abstract
Supermatrices are often characterized by a large amount of missing data. One possible approach to minimize such missing data is to create composite taxa. These taxa are formed by sampling sequences from different species in order to obtain a composite sequence that includes a maximum number of genes. Although this approach is increasingly used, its accuracy has rarely been tested and some authors prefer to analyze incomplete supermatrices by coding unavailable sequences as missing. To further validate the composite taxon approach, it was applied to complete mitochondrial matrices of 102 mammal species representing 93 families with varying amount of missing data. On average, missing data and composite matrices showed similar congruence to model trees obtained from the complete sequence matrix. As expected, the level of congruence to model trees decreased as missing data increased, with both approaches. We conclude that the composite taxon approach is worth considering in a phylogenomic context since it performs well and reduces computing time when compared to missing data matrices.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21134474 DOI: 10.1016/j.ympev.2010.11.017
Source DB: PubMed Journal: Mol Phylogenet Evol ISSN: 1055-7903 Impact factor: 4.286