| Literature DB >> 23772051 |
Jérôme Ambroise1, Anne-Sophie Piette, Cathy Delcorps, Leen Rigouts, Bouke C de Jong, Leonid Irenge, Annie Robert, Jean-Luc Gala.
Abstract
MOTIVATION: Converting a pyrosequencing signal into a nucleotide sequence appears highly challenging when signal intensities are low (unitary peak heights ) or when complex signals are produced by several target amplicons. In these cases, the pyrosequencing software fails to provide correct nucleotide sequences. Accordingly, the objective was to develop the AdvISER-PYRO algorithm, performing an automated, fast and reliable analysis of pyrosequencing signals that circumvents those limitations.Entities:
Mesh:
Year: 2013 PMID: 23772051 PMCID: PMC3722527 DOI: 10.1093/bioinformatics/btt339
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Examples of pyrosequencing signal. (A) Pyrosequencing signal obtained with high DNA concentration in an SAS. The noise intensity is close to 105 while intensities of unitary peaks are close to 135. The unitary peak heights are therefore close to 30. (B) Pyrosequencing signal obtained with low DNA concentration in an SAS. The unitary peak heights are close to 2.5. (C): Pyrosequencing signal obtained with an MAS including two distinct amplicons
Correspondence between amplicons and mycobacterial species
| Amplicon | Amplicon | Amplicon | |||
|---|---|---|---|---|---|
| Amplicon1 | Amplicon12 | Amplicon24 | |||
| Amplicon13 | Amplicon25 | ||||
| Amplicon14 | Amplicon26 | ||||
| Amplicon2 | Amplicon15 | Amplicon27 | |||
| Amplicon3 | Amplicon16 | ||||
| Amplicon4 | Amplicon17 | Amplicon28 | |||
| Amplicon5 | Amplicon18 | Amplicon29 | |||
| Amplicon19 | Amplicon30 | ||||
| Amplicon6 | Amplicon31 | ||||
| Amplicon7 | Amplicon20 | ||||
| Amplicon8 | Amplicon32 | ||||
| Amplicon9 | Amplicon21 | ||||
| Amplicon10 | Amplicon22 | ||||
| Amplicon11 | Amplicon23 | Amplicon33 |
Percentage of correct SAS- and MAS-signal identification with AdvISER-PYRO according to and norm penalties and the Significant Contribution Threshold
| Significant contribution threshold | SAS ( | MAS ( | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.00 | 0.01 | 0.05 | 0.10 | 0.50 | 0.00 | 0.01 | 0.05 | 0.10 | 0.50 | ||
| 1 | 0.00 | / | 90.3 | 84.5 | 82.5 | 68.9 | / | 62.3 | 58.2 | 52.5 | 29.5 |
| 0.01 | 89.3 | 89.3 | 84.5 | 82.5 | 68.9 | 65.6 | 62.3 | 59.0 | 52.5 | 29.5 | |
| 0.05 | 89.3 | 90.3 | 84.5 | 82.5 | 68.9 | 67.2 | 62.3 | 59.0 | 52.5 | 29.5 | |
| 0.10 | 89.3 | 90.3 | 84.5 | 82.5 | 68.9 | 66.4 | 61.5 | 59.0 | 52.5 | 29.5 | |
| 0.50 | 89.3 | 90.3 | 84.5 | 82.5 | 68.9 | 65.6 | 63.1 | 59.0 | 51.6 | 29.5 | |
| 2 | 0.00 | / | 94.2 | 93.2 | 91.3 | 83.5 | / | 77.0 | 75.4 | 73.0 | 59.0 |
| 0.01 | 94.2 | 94.2 | 93.2 | 91.3 | 83.5 | 77.9 | 77.0 | 74.6 | 73.0 | 59.0 | |
| 0.05 | 94.2 | 93.2 | 91.3 | 83.5 | 77.0 | 73.8 | 73.0 | 59.0 | |||
| 0.10 | 94.2 | 94.2 | 93.2 | 91.3 | 83.5 | 77.9 | 77.0 | 74.6 | 73.0 | 59.0 | |
| 0.50 | 94.2 | 94.2 | 93.2 | 91.3 | 83.5 | 77.0 | 77.9 | 75.4 | 71.3 | 59.0 | |
| 3 | 0.00 | / | 95.1 | 95.1 | 94.2 | 90.3 | / | 66.4 | 66.4 | 65.6 | 59.0 |
| 0.01 | 95.1 | 95.1 | 95.1 | 94.2 | 90.3 | 66.4 | 66.4 | 65.6 | 65.6 | 59.0 | |
| 0.05 | 95.1 | 95.1 | 95.1 | 94.2 | 90.3 | 66.4 | 66.4 | 66.4 | 65.6 | 59.0 | |
| 0.10 | 95.1 | 95.1 | 95.1 | 94.2 | 90.3 | 66.4 | 66.4 | 66.4 | 65.6 | 59.0 | |
| 0.50 | 95.1 | 95.1 | 95.1 | 94.2 | 90.3 | 67.2 | 65.6 | 65.6 | 64.8 | 58.2 | |
Fig. 2.Comparison of the percentage of correct identification as a function of signal intensities (FUPH). The comparison was performed between AdvISER-PYRO and the Software V.2.1.1, using Local Polynomial Regression Models on identifications obtained with SAS signals. The symbols on the x-axis represent the distribution of the FUPH in the SAS dataset
Fig. 3.Four examples of signal identification with AdvISER-PYRO. The pyrosequencing signal is represented by vertical black lines. The contribution of each atom is represented with boxes stacked one on top of the other