| Literature DB >> 35434776 |
Annamaria Cucina1, Antonella Di Francesco1, Rosaria Saletti1, Maria Gaetana Giovanna Pittalà1, Gleb Zilberstein2, Svetlana Zilberstein2, Alexei Tikhonov3, Andrey G Bublichenko3, Pier Giorgio Righetti4, Salvatore Foti1, Vincenzo Cunsolo5.
Abstract
The recent paleoproteomic studies, including paleo-metaproteomic analyses, improved our understanding of the dietary of ancient populations, the characterization of past human diseases, the reconstruction of the habitat of ancient species, but also provided new insights into the phylogenetic relationships between extant and extinct species. In this respect, the present work reports the results of the metaproteomic analysis performed on the middle part of a trunk, and on the portion of a trunk tip tissue of two different woolly mammoths some 30,000 years old. In particular, proteins were extracted by applying EVA (Ethylene-Vinyl Acetate studded with hydrophilic and hydrophobic resins) films to the surface of these tissues belonging to two Mammuthus primigenus specimens, discovered in two regions located in the Russian Far East, and then investigated via a shotgun MS-based approach. This approach allowed to obtain two interesting results: (i) an indirect description of the habitat of these two mammoths, and (ii) an improved characterization of the collagen type I, alpha-1 and alpha-2 chains (col1a1 and col1a2). Sequence characterization of the col1a1 and col1a2 highlighted some differences between M. primigenius and other Proboscidea together with the identification of three (two for col1a1, and one for col1a2) potentially diagnostic amino acidic mutations that could be used to reliably distinguish the Mammuthus primigenius with respect to the other two genera of elephantids (i.e., Elephas and Loxodonta), and the extinct American mastodon (i.e., Mammut americanum). The results were validated through the level of deamidation and other diagenetic chemical modifications of the sample peptides, which were used to discriminate the "original" endogenous peptides from contaminant ones. The data have been deposited to the ProteomeXchange with identifier < PXD029558 > .Entities:
Keywords: Chemical modifications; Collagen type I, alpha-1 and alpha-2 sequence; Deamidation; Mammoth; Meta-paleoproteomics; Orbitrap fusion tribrid high-resolution mass spectrometer; Shotgun proteomics
Mesh:
Substances:
Year: 2022 PMID: 35434776 PMCID: PMC9213349 DOI: 10.1007/s00726-022-03160-6
Source DB: PubMed Journal: Amino Acids ISSN: 0939-4451 Impact factor: 3.789
Fig. 1Classification of Proboscidea. †Extinct species
Fig. 2a Mapping of fluorescence of phenylalanine, tyrosine and tryptophan under flash UV illumination of the trunk tip of the Woolly Mammoth discovered in the Kolyma district (on the left), and from the middle part of the trunk of a female mammoth discovered in Sanga-Yuryakhsky (on the right); b regions of sampling by EVA diskettes
Classification of the proteins identified by searching Proboscidea database and after peptide BLAST search in “trunk” and “trunk tip” samples. More details are reported in the Supplementary Material (Tables S1 and S2)
| Proteins | Taxonomya | Razor + unique peptides | Chemical Modificationsb | Trunk | Trunk tip |
|---|---|---|---|---|---|
| 60S ribosomal protein L40 | Not specific (including Homo sapiens) | 3 | – | x | x |
| Annexin | Not specific (including Homo sapiens) | 3 | pyro-E | x | |
| ATP synthase subunit alpha | Not specific (including Homo sapiens) | 2 | - | x | |
| CD109 molecule | Loxodonta africana | 2 | KAcetyl; De(NQ); Mox | x | |
| Collagen type I alpha 1 chain (col1a1) | Loxodonta africanac | 46 | De(NQ); pyro-E; Pox; Mox | x | |
| Collagen type I alpha 2 chain (col1a2) | Loxodonta africana | 32 | Pox; PCarbox; | x | |
| Collagen type IV alpha 2 chain (col4a2) | Loxodonta africana | 3 | Mox; Pox | x | |
| Collagen type V alpha 1 chain (col5a1) | Mammalia (including Homo sapiens) | 2 | De(NQ); Pox | x | |
| Collagen type XVII alpha 1 chain (col17a1) | Loxodonta africana | 2 | Pox; Mox | x | |
| Uncharacterized protein (Acc. No. G3T4X7, similar to col2a1) | Not specific (including Homo sapiens) | 2 | De(NQ); Pox | x | |
| Uncharacterized protein (Acc. No.G3TH25, similar to col3a1) | Loxodonta africana | 13 | De(NQ); Pox | x | x |
| CTD small phosphatase-like 2 | Not specific (not Homo sapiens) | 2 | De(NQ) | x | |
| Dedicator of cytokinesis 9 | Elephantidae | 2 | De(NQ); Y2ox; Yiodin; Ydi−iodin | x | |
| Desmin | Loxodonta africana | 2 | – | x | |
| Desmoglein 1 | Loxodonta africana | 3 | Mox; pyro-E | x | |
| Elastin microfibril interfacer 3 | Loxodonta africana | 2 | Mox; Trp → Kyn | x | |
| Formin-like 1 | Loxodonta africana | 2 | pyro-E; Pox | x | |
| Golgin B1 | Mammalia (not Homo sapiens) | 2 | KAcetyl; M2ox; De(NQ) | x | |
| GTF2I repeat domain containing 1 | Loxodonta africana | 2 | Tyr → DQ | x | |
| IF rod domain-containing protein | Eutheria (not Homo sapiens) | 6 | De(NQ); Yiodin | x | |
| IF rod domain-containing protein | Mammalia (not Homo sapiens) | 2 | – | x | |
| IF rod domain-containing protein (KRT74) | Not specific (not Homo sapiens) | 2 | Yox | x | |
| Inositol-polyphosphate 5-phosphatase | Mammalia (not Homo sapiens) | 3 | De(NQ); Yox | x | |
| Junction plakoglobin | Mammalia (including Homo sapiens) | 9 | De(NQ); Pox; Mox | x | x |
| Kelch-like family member 32 | Loxodonta africana | 2 | De(NQ); W2ox; Trp → Kyn | x | |
| Matrix remodeling associated 5 | Loxodonta africana | 2 | De(NQ); Woxolact; Mox | x | |
| Microtubule actin crosslinking factor 1 | Loxodonta africana | 2 | De(NQ); Mox | x | |
| Myosin heavy chain 7 | Mammalia (not Homo sapiens) | 6 | De(NQ); W2ox; Mox | x | |
| Nebulin | Loxodonta africana | 3 | Tyr → DQ, De(NQ); Yox; Y2ox | x | |
| PATJ crumbs cell polarity complex component | Elephantidae | 2 | De(NQ); Trp → Kyn | x | |
| PEAK1 related, kinase-activating pseudokinase 1 | Loxodonta africana | 2 | Pox; De(NQ) | x | |
| Plakophilin 1 | Not specific (including Homo sapiens) | 2 | – | x | |
| Regulatory factor X1 | Loxodonta africana | 2 | Mox; Pox; De(NQ) | x | |
| RING-type E3 ubiquitin transferase | Elephantidae | 2 | De(NQ); W2ox; Trp → Oxo | x | |
| Sacsin molecular chaperone | Loxodonta africana | 2 | pyro-E; De(NQ); KAcetyl | x | |
| Secreted frizzled related protein 4 | Loxodonta africana | 2 | De(NQ) | x | |
| SH3 domain-containing protein | Afrotheria | 7 | Mox; pyro-E | x | x |
| SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily d, member 1 | Loxodonta africana | 3 | De(NQ) | x | |
| Ubiquitin carboxyl-terminal hydrolase | Mammalia (including Homo sapiens) | 2 | pyro-E; De(NQ); Yox | x | |
| Uncharacterized protein (Acc. No. G3TGT5; similar to titin) | Loxodonta africana | 2 | Mox; Y2ox; De(NQ); Trp → Kyn; Pox | x | x |
| Uncharacterized protein (Acc. No. G3U5E2, similar to nuclear body protein SP140) | Loxodonta africana | 2 | Mox; pyro-E, De(NQ) | x | |
| Uncharacterized protein (Acc. No. G3U5V7, similar to RUN domain-containing protein) | Loxodonta africana | 2 | KAcetyl; pyro-E; De(NQ) | x | |
| VLIG-type G domain-containing protein | Mammalia (not Homo sapiens) | 2 | pyro-E; Y2ox | x | |
| Zinc finger and BTB domain containing 26 | Elephantidae | 2 | De(NQ);, Trp → Kyn; KAcetyl;Mox | x |
aTaxonomy classification after the BLASTp search
bChemical Modifications: pyro-E (pyro-glutamic form of glutamine or glutamic acid residues at the N-term of the peptide); Pox (oxidation of proline to hydroxiproline); Mox (oxidation of methionine to methionine sulfoxide); M2ox (oxidation of methionine to methionine sulfone); Yox (mono-oxidation of tyrosine); Y2ox (di-oxidation of tyrosine); Tyr → DQ (tyrosine oxidation to dopaquinone) De(NQ) (Deamidation of asparagine/glutamine residues); KAcetyl (acetylation of lysine residue); Trp → Kyn (oxidative modification of tryptophan to kynurenine); W2ox (di-oxidation of tryptophan); Trp → Oxo (tryptophan oxidation to oxolactone)
cThree of the peptides are razor + unique of Mammut americanum
Classification of the proteins identified by searching Viridiplantae and Bacteria/Nematoda database and after peptide BLAST search in “trunk” and “trunk tip” samples. More details are reported in the Supplementary Material (Tables S3, S4, S5 and S6)
| Proteins | Taxonomy | Razor + unique peptides | PTMs | Sample |
|---|---|---|---|---|
| 14–3-3 protein 1 | Mesangiospermae | 2 | Mox,De(NQ), Y2ox | Trunk |
| ATP synthase subunit alpha | Mesangiospermae | 2 | – | Trunk |
| DNA polymerase alpha catalytic subunit | Mesangiospermae | 2 | De(NQ), Trp- > Kyn, C3ox | Tip |
| DNA-directed RNA polymerase subunit beta | Mesangiospermae | 2 | De(NQ) | Trunk |
| Glyceraldehyde-3-phosphate dehydrogenase | Mesangiospermae | 3 | – | Tip |
| Histidine kinase CKI1 | Brassicaceae | 2 | Mox,De(NQ) | Trunk |
| Protein TIC 214 | Amborella trichopoda | 2 | De(NQ) | Tip |
| tRNA-specific adenosine deaminase TAD2 | Arabidopsis thaliana | 2 | Tyr → DQ, De(NQ), M2ox | Tip |
| | ||||
| Cytidylate kinase | Mesoplasma florum | 2 | De(NQ) | tip |
(a) Taxonomy classification after the BLASTp search
(b) Chemical Modifications: Mox (oxidation of methionine to methionine sulfoxide); M2ox (oxidation of methionine to methionine sulfone); Y2ox (di-oxidation of tyrosine); Tyr → DQ (tyrosine oxidation to dopaquinone) De(NQ) (Deamidation of asparagine/glutamine residues); Trp → Kyn (oxidative modification of tryptophan to kynurenine
Fig. 3TreeView of the peptides belonging to Proboscidea in (a) trunk and (b) trunk tip sample
Fig. 4Metaproteomic analysis: tree-view of the identified peptides belonging to Viridiplantae in a trunk and b trunk tip sample. The percentage of peptides is calculated considering the 100% as the total number of peptides of the previous node
Fig. 5Metaproteomic analysis: tree-view of the identified peptides belonging to Nematoda in a trunk and b trunk tip sample
Fig. 6Metaproteomic analysis: tree-view of the identified peptides belonging to Bacteria in a trunk and b trunk tip sample. The percentage of peptides is calculated considering the 100% as the total number of peptides of the previous node
Fig. 7Percentage of deamidation of asparagine (N) and glutamine (Q) amino acids in a trunk and b trunk tip samples. Error bars indicate a confidence interval around 1000 bootstrap replicates
Fig. 8Primary structure of the col1a1 of the woolly mammoth (Mammuthus primigenus) identified in the protein extracts of an EVA diskette (see text). Amino acid sequence was characterized by tryptic digestion and nLC-nESI MS/MS. The sequence characterized by MS/MS data is underlined. The amino acid differences with respect to the two col1a1 unreviewed sequences of L. africana (UniProt Acc. No. G3SSE0, and NCBI Acc. No. XP_010592644.1), the experimentally determined Elephantidae sequences by Buckley (Buckley et al. 2011) and the reviewed sequence of M. americanum (UniProt Acc. No. P0C2W8), are reported in bold. The experimentally determined Elephantidae sequences by Buckley (Buckley et al. 2011) include col1a1 of a North Sea M. primigenius, L. africana and E. maximus
Fig. 9MS/MS spectrum of the triply-charged molecular ion at m/z 850.4013 (molecular mass 2548.1802) of the Mammuthus primigenius col1a1 tryptic peptide. The proline residues identified as hydroxyproline are reported in bold. De novo deduced sequence corresponds to the tryptic fragment Gly162-Lys192 of the col1a1 from L. africana, but showing the differences commented in the manuscript (see the text for details)
Fig. 10MS/MS spectrum of the triply-charged molecular ion at m/z 738.6978 (molecular mass 2213.0697) of the col1a1 Mammuthus primigenius tryptic peptide. The proline residues identified as hydroxyproline are reported in bold. De novo deduced sequence corresponds to the tryptic fragment Val723-Lys747 of the col1a1 from L. africana, but showing the substitution of the glycine at position n. 746 with an alanine residue, that is underlined (see the text for details)
Fig. 11MS/MS spectrum of the triply-charged molecular ion at m/z 828.8729 (molecular mass 2483.595) of the col1a1 Mammuthus primigenius tryptic peptide (position 982–1008). The proline residues identified as hydroxyproline are reported in bold. The asparagine identified in deamidated form is reported in italics. De novo deduced sequence corresponds to the tryptic fragment Gly982-Arg1008 of the col1a1 from L. africana, but showing the substitution of the alanine at position n. 989 with a threonine residue, that is underlined (see the text for details)
Fig. 12Primary structure of the col1a2 of the woolly mammoth (Mammuthus primigenus) identified in the protein extracts of an EVA diskette (see text). Amino acid sequence was characterized by tryptic digestion and nLC-nESI MS/MS and using the known mature form of the unreviewed col1a2 from Loxodonta africana (UniProt Acc. No. G3TIC0) as reference. The sequence characterized by MS/MS data is underlined. The amino acid differences with respect to the col1a2 unreviewed sequence of L. africana, the experimentally determined Elephantidae sequences by Buckley et al. 2011 and the experimentally determined sequences of M. Americanum by Buckley et al. 2011 are reported in bold. The experimentally determined Elephantidae sequences by Buckley et al. (Buckley et al., 2011) include col1a2 of a North Sea M. primigenius, L. africana and E. maximus
Fig. 13MS/MS spectrum of the double charged molecular ion at m/z 784.8686 (molecular mass 1567.7214) of Mammuthus primigenius col1a2 tryptic peptide. De novo deduced sequence corresponds to the tryptic fragment Gly303- Arg320 of the unreviewed col1a2 from Loxodonta africana (UniProt Acc. No. G3TIC0) but carrying the substitution of the serine at position n. 305 with an aspartic acid residue, that is underlined (see the text for details)