| Literature DB >> 28566488 |
Michael Buckley1,2, Stacey Warwood3, Bart van Dongen2, Andrew C Kitchener4,5, Phillip L Manning2,6.
Abstract
A decade ago, reports that organic-rich soft tissue survived from dinosaur fossils were apparently supported by proteomics-derived sequence information of exceptionally well-preserved bone. This initial claim to the sequencing of endogenous collagen peptides from an approximately 68 Myr Tyrannosaurus rex fossil was highly controversial, largely on the grounds of potential contamination from either bacterial biofilms or from laboratory practice. In a subsequent study, collagen peptide sequences from an approximately 78 Myr Brachylophosaurus canadensis fossil were reported that have remained largely unchallenged. However, the endogeneity of these sequences relies heavily on a single peptide sequence, apparently unique to both dinosaurs. Given the potential for cross-contamination from modern bone analysed by the same team, here we extract collagen from bone samples of three individuals of ostrich, Struthio camelus The resulting LC-MS/MS data were found to match all of the proposed sequences for both the original Tyrannosaurus and Brachylophosaurus studies. Regardless of the true nature of the dinosaur peptides, our finding highlights the difficulty of differentiating such sequences with confidence. Our results not only imply that cross-contamination cannot be ruled out, but that appropriate measures to test for endogeneity should be further evaluated.Entities:
Keywords: Brachylophosaurus; Tyrannosaurus; ancient collagen; dinosaur protein; ostrich; palaeoproteomics
Mesh:
Substances:
Year: 2017 PMID: 28566488 PMCID: PMC5454271 DOI: 10.1098/rspb.2017.0544
Source DB: PubMed Journal: Proc Biol Sci ISSN: 0962-8452 Impact factor: 5.349
Peptide matches to Brachylophosaurus collagen from Schweitzer et al. [14] and the taxa they can be found in as originally stated (with added observations by BLAST search in parentheses, with an emphasis on the potential contaminants). Underlined residues indicate post-translational modification (oxidation of P; deamidation of N).
| peptide sequence | protein | taxa |
|---|---|---|
| GLTGPIGP | COL1A1 | ostrich and mammals |
| GSAGP | COL1A1 | |
| GATGA | COL1A1 | |
| GETGPAGPAGP | COL1A1 | chicken (and other birds, including ostrich) |
| GVQGP | COL1A1 | |
| GPSGPQGPSGA | COL1A1 | chicken, alligator, rat and opossum (and others, including ostrich) |
| GS | COL1A2 | chicken and alligator (and other birds, including ostrich) |
| COL1A2 |
*Note that this amino acid sequence (bold) was ‘corrected’ from the T. rex sequence originally proposed by Asara et al. [5].
aThis part of the sequence is not currently complete for ostrich (Struthio) by BLAST search.
Peptide matches to Brachylophosaurus collagen from Schroeter et al. [22] and the taxa they can be found in by BLAST search in parentheses, with an emphasis on the potential contaminants. Underlined residues indicate post-translational modification (oxidation of P). The emboldened peptide is reported as having two hydroxylated prolines even though we routinely observe this peptide with only one (at P3), with a nearby A–S transition identified previously as being problematic to distinguish; they acknowledge in the electronic supplementary material that it could be either.
| peptide sequence | protein | taxa |
|---|---|---|
| *GSAGP | COL1A1 | |
| *GATGA | COL1A1 | |
| GF | COL1A1 | ostrich and others (*alligator) |
| GF | COL1A1 | alligator and ostrich (and others, ranging from fish to mammals) |
| GQAGVMGF | COL1A1 | alligator and ostrich (and others) |
| EGPVGF | COL1A2 | alligator and ostrich (and others, including reptiles, birds and mammals) |
| GATGL | COL1A2 | alligator (and rodents) |
| GE | COL1A2 | alligator and ostrich (and others, including birds and mammals) |
*Note that these were the two peptides observed in both analyses [5].
Mascot search result scores of digested proteome extracts from three different ostrich bone specimens analysed months apart and the similarity to the ostrich (Struthio) sequence as a percentage. Note that the first tryptic peptide of the emboldened sequence was also matched in every sample without the missed cleavage at the K residue. Underlined residues indicate post-translational modification (oxidation of P/K; deamidation of N); scores in parentheses represent higher scores with deamidated peptides or with one additional oxidation. m/z, mass-to-charge ratio.
| peptide sequence | CC1254 | CC507 | UM902 | similarity (%) | ||
|---|---|---|---|---|---|---|
| BrachyA1/TrexA1 | GVQGP | 1161.6 | 57 | 53 (60) | 59 (65) | 100 |
| BrachyA1 | GPSGPQGPSGA | 1305.6 | 71 | 74 | 53 (80) | 100 |
| BrachyA1/TrexA1 | GSAGP | 1458.7 | 94 | 120 | 123 | 100 |
| BrachyA1 | GETGPAGPAGP | 1531.7 | 84 | 85 | 66 | 100 |
| BrachyA1 | GLTGPIGP | 1589.8 | 58 | 75 | 45 | 100 |
| BrachyA1 | 2878.4 | 45 | 45 | 36 (44a) | 100 | |
| BrachyA2/TrexA2 | GLPGESGAVGPAGP | 1577.8 | 70 | 76 | 78 | 89 |
| BrachyA2 | GSNGE | 1608.7 | 80 (93) | 58 (79) | 71 | 100 |
| BrachyA1/TrexA1 | GATGA | 1571.8 | 74 | 73 | 58 | 100 |
| TrexA1 | GA | 1305.6 | 69 | 62 | 46 (71) | 100 |
| TrexA1 | GVVGL | 897.5 | 50 | 46 | 50 | 100 |
aScore with one additional oxidation.
bPeptide was considered unique to T. rex but see electronic supplementary material, S1–S6.
Figure 1.Tandem mass spectrum from high-resolution (HCD) fragmentation analysis of the peptide sequence (GPPGESGAVGPAGPIGSR) matched from our analysis of ostrich bone collagen that is homologous to the peptide proposed as unique to T. rex and B. canadensis.
The m/z values for detected fragment y ions for the peptide at 789.898 selected for HCD fragmentation. The observed value is given, along with the calculated values for each fragment according to the two sequences (T. rex and Struthio) under question, emphasizing the ability to distinguish between an oxidated proline and isoleucine/leucine residue in the carboxy-terminal region of the peptide.
| Frag ion | observed | P(Oxidation)- L | ppm | L-P(Oxidation) | ppm | ppm calc |
|---|---|---|---|---|---|---|
| y4 | — | 432.2565 | — | 432.2201 | — | 75 |
| y5 | 529.3055 | 529.3093 | 7.2 | 529.2729 | 61.6 | 69 |
| y6 | 586.3375 | 586.3307 | 11.6 | 586.2944 | 73.5 | 62 |
| y7 | 657.369 | 657.3678 | 1.8 | 657.3315 | 57 | 55 |
| y8 | 754.4185 | 754.4206 | 2.8 | 754.3842 | 45.5 | 48 |
| y9 | 811.4386 | 811.4421 | 4.3 | 811.4057 | 40.5 | 45 |
| y10 | 910.5069 | 910.5105 | 4 | 910.4741 | 36 | 40 |
| y11 | 981.5446 | 981.5476 | 3.1 | 981.5112 | 34 | 37 |
| y12 | 1038.5646 | 1038.5691 | 4.3 | 1038.5327 | 30.7 | 35 |
| y13 | 1125.5952 | 1125.6011 | 5.2 | 1125.5647 | 27.1 | 32 |
| y14 | 1254.6443 | 1254.6437 | 0.5 | 1254.6073 | 29.5 | 29 |
| y15 | 1311.6582 | 1311.6652 | 5.3 | 1311.6288 | 22.4 | 28 |
| y16 | 1424.7063 | 1424.7128 | 4.6 | 1408.6815 | — | — |
Figure 2.Tandem mass spectrum of the peptide sequence claimed as being endogenous to both dinosaurs with sequence (GLPGESGAVGPAGPPGSR) downloaded from the B. canadensis analysis by Schweitzer et al. [14].