| Literature DB >> 29746514 |
Lutz Fischer1, Juri Rappsilber1,2.
Abstract
False discovery rate (FDR) estimation is a cornerstone of proteomics that has recently been adapted to cross-linking/mass spectrometry. Here we demonstrate that heterobifunctional cross-linkers, while theoretically different from homobifunctional cross-linkers, need not be considered separately in practice. We develop and then evaluate the impact of applying a correct FDR formula for use of heterobifunctional cross-linkers and conclude that there are minimal practical advantages. Hence a single formula can be applied to data generated from the many different non-cleavable cross-linkers.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29746514 PMCID: PMC5944926 DOI: 10.1371/journal.pone.0196672
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Formula symbols.
| Symbol | Meaning |
|---|---|
| Ta | Target entries in the database linkable by side A of the cross-linker |
| Tb | Target entries in the database linkable by side B of the cross-linker |
| Tab | Target entries in the database linkable by both sides the cross-linker |
| Da | Decoy entries in the database linkable by side A of the cross-linker |
| Db | Decoy entries in the database linkable by side B of the cross-linker |
| Dab | Decoy entries in the database linkable by both sides the cross-linker |
| TT | Observed target target matches with |
| TD | Observed target decoy and decoy target matches |
| DD | Observed decoy-decoy matches |
Fig 1Random search spaces for false positive matches.
To model matches where one correct and one incorrect partner are combined requires considering a linear random match space (A). In contrast, when modelling matches with two incorrect partners it requires construction of a quadratic random match space depending on whether the cross-linker is homodimeric, non-directional (B), homodimeric, directional (C), heterodimeric, non-directional (D), or heterodimeric, directional (E).
Fig 2Maximal error from using formula 1.
Maximal expected error when using formula 1, exemplified for the extreme case of every possible combination of links being observed. X-axis is the size of the database and Y-axis is the maximal error. The green and blue line give the border cases of 0% overlap for both sides of the cross-linker and 100% overlap respectively. The gray area represents possible errors for all cross-linker with partial overlap. Residue-level for HSA cross-linked SDA (dark red dot) and HSA cross-linked with EDC (light red dot) are given as reference.
Examples of maximal expected error when using the simple formula for HSA, cross-linked with either EDC or SDA.
| Cross-Linker | Level | Ta | Tb | Tab | Maximal Error | Formula | Formula |
|---|---|---|---|---|---|---|---|
| 0 | 455 | 130 | 0.19% | 5.00% | 5.01% | ||
| 0 | 27 | 360 | 0.48% | 5.00% | 5.02% | ||
| 99 | 130 | 0 | 0.00% | 5.00% | 5.00% | ||
| 23 | 31 | 329 | 0.45% | 5.00% | 5.02% |