| Literature DB >> 21172049 |
Kang Ning1, Alexey I Nesvizhskii.
Abstract
BACKGROUND: Most mass spectrometry (MS) based proteomic studies depend on searching acquired tandem mass (MS/MS) spectra against databases of known protein sequences. In these experiments, however, a large number of high quality spectra remain unassigned. These spectra may correspond to novel peptides not present in the database, especially those corresponding to novel alternative splice (AS) forms. Recently, fast and comprehensive profiling of mammalian genomes using deep sequencing (i.e. RNA-Seq) has become possible. MS-based proteomics can potentially be used as an aid for protein-level validation of novel AS events observed in RNA-Seq data.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21172049 PMCID: PMC3024872 DOI: 10.1186/1471-2105-11-S11-S14
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Overview of the computational strategy.
Average number of distinct junctions (AS) per gene for three different tissues in mouse, and average number of distinct junctions per gene in three different tissues that could be identified by tryptic peptides.
| Tissue | Ave. unique junctions per gene (All) | Ave. unique junctions identifiable by tryptic peptides per gene (All) | Ave. unique junctions per gene (Mitochondrial) | Ave. unique junctions identifiable by tryptic peptides per gene (Mitochondrial) |
|---|---|---|---|---|
| Brainstem | 4.00 | 0.27 | 5.30 | 0.29 (0.45) |
| Liver | 2.44 | 0.16 | 4.91 | 0.27 (0.42) |
| Skeletal Muscle | 2.42 | 0.16 | 4.57 | 0.24 (0.39) |
Column 2 and column 3 were based on all junction reads identified by RNA-Seq for all NCBI Entrez genes. Column 4 and column 5 were based on junction reads by RNA-Seq for 1098 mitochondrial genes. Numbers in brackets were based on tryptic peptides allowing one missed cleavage.
Figure 2Concordance between mRNA abundance and the proportion of genes that could be identified by peptides from MS/MS spectra. (a) Histogram showing the normalized number (frequency) of all genes, and genes that are identified by peptides from MS/MS spectra, across the wide range of mRNA abundances (RPKM values, log scale). (b) Proportion of genes identified by peptides from MS/MS data. Data for brainstem tissue.