| Literature DB >> 24773593 |
You Li, Hao Chi, Leihao Xia, Xiaowen Chu1.
Abstract
BACKGROUND: Tandem mass spectrometry-based database searching is currently the main method for protein identification in shotgun proteomics. The explosive growth of protein and peptide databases, which is a result of genome translations, enzymatic digestions, and post-translational modifications (PTMs), is making computational efficiency in database searching a serious challenge. Profile analysis shows that most search engines spend 50%-90% of their total time on the scoring module, and that the spectrum dot product (SDP) based scoring module is the most widely used. As a general purpose and high performance parallel hardware, graphics processing units (GPUs) are promising platforms for speeding up database searches in the protein identification process.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24773593 PMCID: PMC4049470 DOI: 10.1186/1471-2105-15-121
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1GPU cluster architecture. A GPU cluster has one master (mu01) and four computing nodes (Fermi.1-4). All of the nodes have a XeonE5620, and perform at 2.40 GHZ with two GeForce GTX580. The GTX580 has 512 cores, performs at 1.54 GHz, and has 1.54GB global memory with a peak bandwidth of 192.4 GB/sec. All of the nodes are connected by the GIGABIT line, and mu01 is connected to the Internet.
GPU cluster specifications
| GPU | N/A | 2 × GTX580 |
| CPU | 2 × XeonE5620(2.40GHz)/5.86GT/12 M/1066 | |
| Memory | 6 × 4G Registered ECC 1333 MHz DDR3 | |
| Others | 1 × 1000G 3.5inch SATA, 2 × 1000 M Ethernet | |
Database searching parameters
| Exp. 1 | Instrument | QSTAR |
| Spectra | 46195 DTA files | |
| Database | Yeast (13434 proteins, target + reversed) | |
| Enzyme | Trypsin (max missed cleavage sites = 2) | |
| Tolerance | Precursor: 0.2 Da; Fragment: 0.2 Da | |
| Modifications | Fixed: Carbamidomethylation (C) | |
| Variable: Oxidation (M), Phosphorylation (S, T, Y) | ||
| Exp. 2 | Instrument | LTQ |
| Spectra | 43493 DTA files | |
| Database | IPI.Human v3.49 ( 148034 protein, target + reversed) | |
| Enzyme | Trypsin (max missed cleavage sites = 2) | |
| Tolerance | Precursor: 3 Da; Fragment: 0.5 Da | |
| Modifications | Fixed: Carbamidomethylation (C) | |
| Variable: Oxidation (M), Phosphorylation (S, T, Y) | ||
| Exp.3 | Instrument | QSTAR |
| Spectra | 46195 DTA files | |
| Database | UniprotKB/Swiss-Prot (540171 proteins) | |
| Enzyme | Trypsin (max missed cleavage sites = 2) | |
| Tolerance | Precursor: 0.2 Da; Fragment: 0.2 Da | |
| Modifications | Fixed: Carbamidomethylation (C) | |
| Variable: Oxidation (M), Phosphorylation (S, T, Y) |
Time usage of database searching (minutes)
| X!Tandem | Total time | 45 | 1011 | 253 |
| | Scoring time | 24 | 566 | 138 |
| | Scoring time percentage | 54% | 56% | 55% |
| pFind | Total time | 22 | 601 | 132 |
| | Scoring time | 18 | 530 | 107 |
| Scoring time percentage | 82% | 89% | 81% |
Note: pFind and X!Tandem both use a one-step mode.
Speedup effect of SDP using a single GPU
| CPU | 968 s | 32587 s | 5529 s |
| GPU | 35 s | 502 s | 191 s |
| Speedup | 27 | 65 | 29 |
Note: in Exp.1, threshold is set to 1, whereas in Exp. 2, threshold is set to 2.
Speedup effect of SDP using the GPU cluster
| CPU-cluster | 273 s | 14 s | 8991 s | 242 s | 1568s | 94 s |
| GPU-cluster | 13 s | 3 s | 136 s | 8 s | 62 s | 5 s |
| speedup | 21 | 5 | 66 | 30 | 25 | 19 |
Note: we use one CPU and one GPU in each node of the cluster.
Speedup effect of the pre-calculation strategy in Exp.2
| one | 32587 s | 242 s | | 502 s | 8 s | |
| two | 17997 s | | 89.3% | 279 s | | 87.4% |
| three | 12153 s | | 87.6% | 178 s | | 89.9% |
| four | 8991 s | 88.2% | 136 s | 87.1% | ||
Note: speedup percentage equals to: one scoring time/ (N node scoring time + pre calculation time)/N.
Figure 2The number of operations in each 0.1 Da mass window, from 300 Da-4000 Da, in Exp. 1. The x-axis stands for the mass range; divide the mass range, from 300 Da to 4000 Da, into 36000 equal-sized 0.1 Da mass windows. The y-axis stands for the operation number between the experimental and theoretical spectrum in each mass window.
Figure 3The computing process in a dense mass window. The figure shows the calculation in one dense mass window. The result is a Scor[TH][TW × TDimY], which is equal to theo[TH][N] × expe[N][TW × TDimY]. Load the first tile (in blue) from the theo into the shared memory; score the blue tile in the theo with the blue tile in the expe, which is stored in the texture memory; accumulate the temporary results into TResult, whose initial value is zero; then repeat loading the next tile (in orange), scoring and accumulating, until theo[TH][N] and expe[N][TW × TDimY] have all been accessed.