| Literature DB >> 18282301 |
Indra Neil Sarkar1, Mary G Egan, Gloria Coruzzi, Ernest K Lee, Rob DeSalle.
Abstract
BACKGROUND: The availability of sequences from whole genomes to reconstruct the tree of life has the potential to enable the development of phylogenomic hypotheses in ways that have not been before possible. A significant bottleneck in the analysis of genomic-scale views of the tree of life is the time required for manual curation of genomic data into multi-gene phylogenetic matrices.Entities:
Mesh:
Year: 2008 PMID: 18282301 PMCID: PMC2270260 DOI: 10.1186/1471-2105-9-103
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Overview of ASAP Data Flow. Data originate in the form of either a collection of systematically annotated FASTA files (1), a NEXUS file with partitions delineated using the CharPars command (2), or a list of NCBI GI/Accession numbers that are organized according to partitions (3). An alignment tool, such as MUSCLE, is applied to unaligned data (A). At this stage, ASAP will assemble a NEXUS file that is a compilation of all the data partitions and performs a tree search to generate a simultaneous analysis tree (4) as well as the requisite tests to determine the hidden branch supports on this tree (5). The results of the hidden branch supports are saved to a tab-delimited text file that can be imported into popular spreadsheet applications like Microsoft Excel.