Jean-Pierre Flandrois, Guy Perrière, Manolo Gouy.
Abstract
BACKGROUND: Estimating the phylogenetic position of bacterial and archaeal organisms by genetic sequence comparison is considered the gold standard in taxonomy. It is also a way to identify the species from which a sequence originates. The quality of the reference database used in such analyses is crucial: the database must reflect up-to-date bacterial nomenclature and accurately indicate the species of origin of its sequences.
DESCRIPTION: leBIBI(QBPP) is a web tool that takes as input a series of nucleotide sequences belonging to one of a set of reference markers (e.g., SSU rRNA, rpoB, groEL2), automatically retrieves closely related sequences, aligns them, and performs phylogenetic reconstruction using an approximate maximum likelihood approach. The system returns a set of quality parameters and, if possible, a suggested taxonomic assignment for the input sequences. The reference databases are extracted from GenBank and offer four degrees of stringency, from the "superstringent" degree (one type strain per species) to the loosely parsed degree (the "lax" database). A set of one hundred to more than a thousand sequences may be analyzed at a time. The speed of the process has been optimized through careful hardware selection and database design.
Year: 2015 PMID: 26264559 PMCID: PMC4531848 DOI: 10.1186/s12859-015-0692-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
List of the genes included in leBIBI(QBPP)
| Database | Prokaryotes | RNA/Protein | Stringency | Number of sequences |
|---|---|---|---|---|
| SSU rDNA lax | Archaea+Bacteria | RNA | All sequences | 1,309,339 |
| SSU rDNA stringent | Archaea+Bacteria | RNA | Valid names | 234,263 |
| SSU rDNA TS-stringent | Archaea+Bacteria | RNA | TS sequences | 21,451 |
| SSU rDNA superstringent | Archaea+Bacteria | RNA | 1 TS/species | 11,289 |
| SSU rDNA genus-level | Archaea+Bacteria | RNA | 1 TS/genus | 2291 |
| LSU rDNA lax | Archaea+Bacteria | RNA | All sequences | 19,357 |
| LSU rDNA stringent | Archaea+Bacteria | RNA | Valid names | 9735 |
| LSU rDNA TS-stringent | Archaea+Bacteria | RNA | TS/species | 2031 |
| tmRNA lax | Bacteria | RNA | All sequences | 1273 |
| tmRNA stringent | Bacteria | RNA | Valid names | 1044 |
| rpoB lax | Bacteria | Protein | All sequences | 29,101 |
| rpoB stringent | Bacteria | Protein | Valid names | 20,062 |
| dnaJ+dnaK lax | Bacteria | Protein | All sequences | 12,780 |
| dnaJ+dnaK stringent | Bacteria | Protein | Valid names | 9606 |
| fusA lax | Bacteria | Protein | All sequences | 4009 |
| fusA stringent | Bacteria | Protein | Valid names | 3463 |
| groEL lax | Bacteria | Protein | All sequences | 24,344 |
| groEL stringent | Bacteria | Protein | Valid names | 11,845 |
| groES lax | Bacteria | Protein | All sequences | 335 |
| groES stringent | Bacteria | Protein | Valid names | 277 |
| glyA lax | Bacteria | Protein | All sequences | 3155 |
| glyA stringent | Bacteria | Protein | Valid names | 2732 |
| gyrB lax | Bacteria | Protein | All sequences | 30,537 |
| gyrB stringent | Bacteria | Protein | Valid names | 23,803 |
| recA lax | Bacteria | Protein | All sequences | 25,616 |
| recA stringent | Bacteria | Protein | Valid names | 16,526 |
| sodA lax | Bacteria | Protein | All sequences | 3975 |
| sodA stringent | Bacteria | Protein | Valid names | 3736 |
| tuf lax | Bacteria | Protein | All sequences | 7930 |
| tuf stringent | Bacteria | Protein | Valid names | 6756 |
| groEL2 lax | Actinobacteria | Protein | All sequences | 2942 |
| groEL2 stringent | Actinobacteria | Protein | Valid names | 2086 |
| groEL2 TS-stringent | Actinobacteria | Protein | TS sequences | 521 |
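The stringency labels in the table (All sequences, Valid names, TS sequences, 1 TS/species, 1 TS/genus) can be read as successive filters applied to one pool of reference sequences. The sketch below illustrates that reading on toy data. It is not the tool's actual GenBank parsing procedure, which this section does not describe. The `Record` fields, the helper names, and the rule of keeping the first record seen for each species or genus are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """Hypothetical reference sequence entry (all fields are illustrative)."""
    accession: str
    genus: str
    species: str
    valid_name: bool   # assumed flag: the species name is a valid name
    type_strain: bool  # assumed flag: the sequence comes from a type strain (TS)


def first_per_key(records, key):
    """Keep the first record seen for each key value, preserving input order."""
    seen = {}
    for rec in records:
        seen.setdefault(key(rec), rec)
    return list(seen.values())


def build_tiers(records):
    """Derive nested reference sets, from the loosest to the strictest."""
    lax = list(records)                                     # All sequences
    stringent = [r for r in lax if r.valid_name]            # Valid names
    ts_stringent = [r for r in stringent if r.type_strain]  # TS sequences
    superstringent = first_per_key(                         # 1 TS/species
        ts_stringent, lambda r: (r.genus, r.species))
    genus_level = first_per_key(                            # 1 TS/genus
        ts_stringent, lambda r: r.genus)
    return {
        "lax": lax,
        "stringent": stringent,
        "TS-stringent": ts_stringent,
        "superstringent": superstringent,
        "genus-level": genus_level,
    }


# Toy input: accessions and flags are made up for the example.
records = [
    Record("A1", "Escherichia", "coli", True, True),
    Record("A2", "Escherichia", "coli", True, True),
    Record("A3", "Escherichia", "coli", True, False),
    Record("A4", "Escherichia", "fergusonii", True, True),
    Record("A5", "Bacillus", "subtilis", True, True),
    Record("A6", "Bacillus", "sp.", False, False),
]

tiers = build_tiers(records)
sizes = {name: len(members) for name, members in tiers.items()}
print(sizes)
```

Each tier is a subset of the previous one, which matches the way the sequence counts in the table shrink from the lax to the superstringent and genus-level databases for a given marker.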
Fig. 1 The leBIBI(QBPP) report summarizes the analysis and gives additional information that may be useful for interpreting the phylogenetic tree