| Literature DB >> 33529527 |
Noemi Procopio1,2, Rachel J A Hopkins3,4, Virginia L Harvey2,5, Michael Buckley2,5.
Abstract
Isotope analyses are some of the most common analytical methods applied to ancient bone, aiding the interpretation of past diets and chronology. For this, the evaluation of "collagen yield" (as defined in radiocarbon dating and stable isotope research) is a routine step that allows for the selection of specimens that are deemed adequate for subsequent analyses, with samples containing less than ∼1% "collagen yield" normally being used for isotopic analysis but discounted for radiocarbon dating. The aims of this study were to use proteomic methods of MALDI-TOF (matrix assisted laser desorption ionization time-of-fligh mass spectrometry) and LC-ESI-MS/MS (liquid chromatography electrospray ionization tandem mass spectrometry) to investigate the endogeneity of the dominant proteinaceous biomolecules within samples that are typically considered to contain poorly preserved protein. Taking 29 archaeological samples, we evaluated the proteome variability between different acid-soluble fractions removed prior to protein gelatinization and considered waste as part of the radiocarbon dating process. We then correlated these proteomes against the commonly used "collagen yield" proxy for preservation. We found that these waste fractions contained a significant amount of both collagenous and noncollagenous proteins (NCPs) but that the abundance of these was not correlated with the acquired "collagen yield". Rather than a depleted protein load as would be expected from a low "collagen yield", the variety of the extracted NCPs was comparable with that commonly obtained from ancient samples and included informative proteins useful for species identification, phylogenetic studies, and potentially even for isotopic analyses, given further method developments. Additionally, we did not observe any correlation between "collagen yield" and peptide mass fingerprint success or between the different fractions taken from the same sample but at different radiocarbon pretreatment stages. Overall, these findings highlight the value in retaining and analyzing sample fractions that are otherwise discarded as waste during the radiocarbon dating process but more importantly, that low "collagen yield" specimens that are often misinterpreted by archaeologists as being devoid of protein can still yield useful molecular sequence-based information.Entities:
Keywords: NCPs; ancient bone; collagen; proteomics; radiocarbon dating; stable isotopes
Mesh:
Substances:
Year: 2021 PMID: 33529527 PMCID: PMC7944572 DOI: 10.1021/acs.jproteome.0c01014
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 4.466
Bone Specimens Used in the Study
| sample name | cave | approx. age | trench | Sq. | layer | depth (cm) | solvent wash | fractions sampled | AG collagen yield (%) | AF collagen yield (%) | morphological identification | ZooMS identification | ZooMS + LC–ESI–MS/MS identification |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Man1 | Manastira | Late Holocene | n/a | H24 | n/a | n/a | A, B | 5.41 | 4.09 | human | human | ||
| MR1 | Máriaremete | Late Holocene | n/a | 6° | sárga agyag | 100 | A, B | 7.25 | 6.43 | human? | human | ||
| MR8 | Máriaremete | LMP-EUP | n/a | 6° | sárga agyag | 100 | A, B | 1.46 | 1.03 | suspected bovidae/cervidae | |||
| TD1 | Temnata | LMP-EUP | V | O2 | 5pg | 540–545 | Y | A, B | 0.88 | 1.08 | unidentified mammal | bovine | bovine |
| TD4 | Temnata | LMP-EUP | V | O2 | 4pg | 540–545 | Y | A, B | 3.10 | 2.24 | bovine | bovine | |
| TD5 | Temnata | LMP-EUP | V | Λ1 | 3jx/4 | n/a | Y | A, B | 2.82 | 2.22 | unidentified mammal | suspected bovidae/cervidae | |
| TD6 | Temnata | LMP-EUP | V | H1 | 3d/4 | 460–465 | Y | A, B | 3.62 | 3.43 | unidentified mammal | horse | horse |
| TD7 | Temnata | LMP-EUP | V | Λ2 | 3jx | 505–510 | Y | A, B | 3.03 | 3.68 | unidentified mammal | horse | horse |
| TD8 | Temnata | LMP-EUP | V | M2 | 3j/w | 480–485 | Y | A, B | 7.21 | 5.73 | unidentified mammal | suspected bovidae/cervidae | |
| TD14 | Temnata | LMP-EUP | V | Λ1 | 3dg | 495–500 | Y | A, B | 3.94 | 4.85 | unidentified mammal | horse | horse |
| TD15 | Temnata | LMP-EUP | V | Λ2 | 3dg | 480–485 | A, B | 5.55 | 6.19 | unidentified mammal | horse | horse | |
| TD16 | Temnata | LMP-EUP | I | δ2 | 4 | 450–452.5 | Y | A, B | 5.27 | 3.55 | unidentified mammal | horse | horse |
| TD17 | Temnata | LMP-EUP | I | Γ4 | 4 | 460–465 | Y | A, B | 2.57 | 3.14 | unidentified mammal | bovine | bovine |
| TD19 | Temnata | LMP-EUP | V | H1 | 3d/w | 460–465 | Y | A, B | 3.40 | 3.50 | unidentified mammal | horse | horse |
| TD20 | Temnata | LMP-EUP | V | H1 | 3d/w | 465–470 | Y | A, B | 7.20 | 5.28 | unidentified mammal | horse | horse |
| KZ-06 | Kozarnika | LMP-EUP | 1 | E9 | 4 = IVa | 390–393 | B | 2.73 | 1.60 | large ungulate | cervine | cervine | |
| KZ-10 | Kozarnika | LMP-EUP | 1 | E7 | 4 = IVa | 393–395 | B | 4.47 | 3.17 | large ungulate | horse | horse | |
| KZ-19 | Kozarnika | LMP-EUP | 1 | E9 | 4 = IVb | 410–415 | B | 7.57 | 5.16 | bovinae | bovine | bovine | |
| KZ-23 | Kozarnika | LMP-EUP | 1 | D10 | 4 = IVb | 405–410 | B | 6.17 | 4.47 | large artiodactyla | cervine | cervine | |
| KZ-24 | Kozarnika | LMP-EUP | 1 | F10 | 5a = V | 420–425 | B | 3.88 | 2.76 | large ungulate | cervine | cervine | |
| KZ-25 | Kozarnika | LMP-EUP | 1 | F10 | 5a = V | 425–430 | B | 5.85 | 4.17 | large ungulate | horse | horse | |
| KZ-43 | Kozarnika | LMP-EUP | 1 | G5 | 5c = VII | 440–443 | B | 5.83 | 3.24 | red deer | cervine | cervine | |
| KZ-44 | Kozarnika | LMP-EUP | 1 | F7 | 6/7 = VIII | 460–465 | B | 0.64 | large ungulate | bovine | bovine | ||
| KZ-47 | Kozarnika | LMP-EUP | 1 | F8 | 5c = VII | 445–450 | B | 4.69 | 4.59 | unidentified mammal | horse | ||
| KZ-49 | Kozarnika | LMP-EUP | 1 | G7 | 5c = VII | 450–455 | B | 3.83 | 2.68 | large ungulate | cervine | cervine | |
| KZ-52 | Kozarnika | LMP-EUP | 1 | F5 | 6/7 = VIII | 453–455 | B | 4.06 | 2.76 | large ungulate | bovine | bovine | |
| KZ-53 | Kozarnika | Late Holocene | 3 | n/a | n/a | n/a | B | 6.07 | 4.95 | human | human | human | |
| KZ-54 | Kozarnika | Late Holocene | 3 | n/a | n/a | n/a | B | 2.53 | human | human | human | ||
| KZ-58 | Kozarnika | LMP-EUP | 3 | F30 | 6/7? = VIII? | 504 | B | 0.00 | unidentified mammal | suspected bovidae/cervidae |
Samples that were pretreated with solvents are indicated with “Y” in the “Solvent wash” column. This step was only applied to samples that were suspected of containing exogenous carbon derived from conservation treatment, as is standard procedure for radiocarbon dating.[20] Samples from which the second prewash (HCl) fraction was sampled are indicated by “A”, and samples from which the incubation (HCl) fraction was sampled are indicated by “B” in the “Fractions sampled” column (for details see text). “AG” and “AF” collagen yields refer to measurements taken before and after ultrafiltration, respectively. Context information contains, where known, excavation trench (“Trench”), excavation square (“Sq.”), and stratigraphic unit (“Layer”). For the latter, at Kozarnika, Arabic numbers indicate geological units, and Roman numerals indicate archaeological units; at Temnata, Arabic numbers were used for geological units, and letters for subunits of varying character, and at Máriaremete, the levels were given descriptions (“sárga agyag” = yellow clay).
LMP-EUP = late middle palaeolithic to early upper palaeolithic.
Bovine = cattle and bison.
Cervine = red deer, fallow deer or elk.
Figure 1Schematic representation of the treatments and analyses to which samples have been subjected. “R/T” indicates room temperature, “O/N” indicates overnight.
List of all NCPs Identified in the 44 Ancient Samples (n = 36), Including Protein Function, Localization, and all Species Listed on Swiss-Prot That Have a 100% Amino Acid Sequence Match
| protein ID | name | NCP contained in samples: | role | localization | species |
|---|---|---|---|---|---|
| ALB | serum albumin | TD4.A–TD4.B–TD1.A–TD1.B–KZ-24–KZ-52–TD17.A–TD6.A–TD6.B–TD19.A–TD19.B–TD16.A–TD16.B–TD7.A–TD7.B–MAN1.A–MAN1.B–KZ-23–TD14.A–TD14.B–KZ-53–KZ-19–TD20.A–TD20.B–TD8.A–TD8.B–TD15.A–TD15.B | binding protein | serum | bovine, horse, human, cervine |
| APOA1 | apolipoprotein A-I | TD4.B–TD1.A–TD1.B–KZ-19 | transport of cholesterol | serum | bovine |
| APOA2 | apolipoprotein A-II | TD1.A | stabilize high density lipoprotein | serum | bovine |
| F10 | coagulation factor X | KZ-54–KZ-53 | blood coagulation | serum | human |
| F7 | coagulation factor VII | KZ-53 | blood coagulation | serum | human |
| F9 | coagulation factor IX | TD17.A | blood coagulation | serum | bovine |
| GC | vitamin D-binding protein | TD4.B–TD1.B | vitamin D transport | serum | bovine |
| PROS1 | vitamin K-dependent protein S (fragment) | TD19.A–TD14.B | blood coagulation | serum | horse |
| SERPINC1 | antithrombin-III | TD4.B–TD1.A–TD1.B | blood coagulation | serum | bovine |
| SERPIND1 | heparin cofactor 2 | TD19.A | blood coagulation | serum | horse |
| AHSG | α-2-HS-glycoprotein/fetuin-A | TD4.A–TD4.B–TD1.A–TD1.B–TD17.A–MAN1.B–KZ-53–KZ-19–TD8.A–TD8.B | skeletal mineralization | bone/serum | bovine, human |
| ALPL | alkaline phosphatase, tissue-nonspecific isozyme | TD4.B–TD19.A | skeletal mineralization | bone | bovine, horse |
| BGN | biglycan | TD4.A–TD4.B–TD1.A–TD1.B–MR8.B–TD5.A–TD5.B–KZ-54–KZ-24–KZ-52–TD17.A–TD17.B–TD6.A–TD6.B–TD19.A–TD19.B–TD16.A–TD16.B–TD7.A–TD7.B–MAN1.A–MAN1.B–KZ-23–TD14.A–TD14.B–KZ-53–KZ-19–TD20.A–TD20.B–TD8.A–TD8.B–TD15.A–TD15.B–MR1.A | collagen assembly | bone/ECM | bovine, horse, human, cervine |
| C3 | complement C3 | TD4.A–TD1.A–TD1.B–TD19.B–TD14.B | role in osteogenesis | bone/serum | bovine, horse |
| C9 | complement component C9 | TD19.A–TD19.B | role in osteogenesis | bone/serum | horse |
| CLEC11A | C-type lectin domain family 11 member A | TD19.A–TD19.B–TD7.A–TD14.B | promoting osteogenesis | bone | bovine, horse |
| CLEC3B | tetranectin | TD4.A–TD4.B–TD1.B–KZ-24–TD17.A–MAN1.B–KZ-23 | bone mineralization | bone/serum | bovine, human, cervine |
| DPT | dermatopontin | KZ-53 | collagen fibril formation | bone | human |
| F2 | prothrombin | TD4.A–TD4.B–TD1.B–KZ-54–TD17.A–TD17.B–TD19.A–TD19.B–MAN1.A–MAN1.B–TD14.A–TD14.B–KZ-53–KZ-19–MR1.A | blood coagulation | bone/serum | bovine, horse, human |
| HTRA1 | serine protease HTRA1B | TD1.A–TD1.B–TD19.A | osteogenesis regulation | bone | bovine, horse |
| IBSP | bone sialoprotein | TD6.B–TD16.A | binding mineral matrix | bone | horse |
| LUM | lumican | TD17.A–MAN1.A–MAN1.B–KZ-53 | collagen binding | bone/ECM | bovine, human |
| NUCB1 | nucleobindin-1 | TD4.A–TD4.B–TD1.A–TD1.B–KZ-24–KZ-52–TD19.A–TD19.B–TD16.A–KZ-23–TD14.A–TD14.B–KZ-19–TD8.A | bone matrix maturation | bone | bovine, horse, cervine |
| OMD | osteomodulin | MAN1.A–MAN1.B–KZ-53 | bone remodelling | bone | human |
| PANX3 | pannexin-3 | KZ-24 | bone growth regulation | bone | cervine |
| POSTN | periostin | TD19.A–MAN1.B–TD14.A–TD14.B–KZ-53 | bone remodelling | Bone | horse, human |
| SERPINF1 | pigment epithelium-derived factor | TD4.B–TD1.B–TD17.A–TD17.B–TD6.A–TD6.B–TD19.A–TD19.B–TD16.A–TD16.B–TD7.A–MAN1.A–MAN1.B–TD14.A–TD14.B–KZ-53–TD20.A–TD20.B–TD8.A–TD8.B- TD15.A–TD15.B | bone remodelling | bone/ECM | bovine, horse, human |
| SPARC | SPARC | TD4.A–TD4.B–TD1.B–TD6.A–TD6.B–TD19.A–TD19.B–TD16.A–TD16.B–TD7.A–TD14.A–TD14.B–TD20.A | calcium and collagen binding | bone | bovine, horse |
| SPP24 | secreted phosphoprotein 24 | TD17.A | bone turnover | bone/serum | bovine |
| THBS1 | thrombospondin-1 | TD4.A–TD4.B–TD1.A–TD1.B–KZ-24–KZ-52–TD19.A–TD19.B–TD7.A–MAN1.A–MAN1.B–KZ-23–TD14.A–TD14.B–KZ-53–KZ-19–TD8.A–TD8.B | bone homeostasis | bone | bovine, horse, human, cervine |
| VTN | vitronectin | KZ-54–TD19.B–KZ-53–MR1.A | bone mineralization | bone | horse, human |
| CHAD | chondroadherin | TD1.A–TD17.A–TD6.A–TD6.B–TD16.A–TD16.B–MAN1.A–MAN1.B–KZ-23–TD14.B–KZ-53–TD20.A–TD20.B–TD15.B | promoting chondrocyte growth | cartilage/extracellular | bovine, horse, human, cervine |
| DSP | desmoplakin | KZ-53 | desmosomes assembly | intracellular | human |
| IGFALS | insulin-like growth factor-binding protein complex acid labile subunit | KZ-53 | protein–protein interaction | extracellular | human |
| RDX | radixin | TD14.B | actin binding | intracellular | horse |
| S100A7 | protein S100-A7 | TD15.A | calcium binding | extracellular | horse |
Bovine = cattle and bison.
Cervine = red deer, fallow deer or elk.
Figure 2Scatterplot for the percentage of “AG Collagen yield” (X axis) and for the number of NCPs (Y axis) identified in the dataset. Different colors and symbols (legend) represent different species. Species identifications achieved using both ZooMS and LC–ESI–MS/MS proteomics data are indicated with full shapes, whereas species identifications achieved uniquely with LC–ESI–MS/MS proteomics (and where ZooMS analyses failed) are indicated with empty shapes.
“AG Collagen Yield” (See Experimental Section, under Collagen Yield), Number of Total Spectrum Count for Collagen α-1(I) and Collagen α-2(I) Chains, Number of NCPs, and Percentage of Coverage for Collagen α-1(I) and Collagen α-2(I), Biglycan (BGN) and Albumin (ALB)
| sample | AG collagen yield | COL1A1 | COL1A2 | NCPs | coverage COL1A1 (%) | coverage COL1A2 (%) | coverage BGN (%) | coverage ALB (%) |
|---|---|---|---|---|---|---|---|---|
| KZ-58 | 0 | 13 | 13 | 0 | 11 | 11 | ||
| TD4.A | 3.1 | 284 | 194 | 9 | 47 | 45 | 11 | 12 |
| TD4.B | 3.1 | 267 | 183 | 13 | 51 | 51 | 15 | 16 |
| KZ-44 | 0.64 | 357 | 277 | 0 | 50 | 54 | ||
| TD1.A | 0.88 | 296 | 234 | 11 | 51 | 55 | 18 | 21 |
| TD1.B | 0.88 | 307 | 223 | 14 | 49 | 50 | 15 | 21 |
| MR8.A | 1.46 | 241 | 201 | 0 | 47 | 48 | ||
| MR8.B | 1.46 | 306 | 216 | 1 | 45 | 47 | 7 | |
| KZ-06 | 1.6 | 355 | 227 | 0 | 51 | 46 | ||
| TD5.A | 2.22 | 259 | 149 | 1 | 50 | 46 | 7 | |
| TD5.B | 2.22 | 316 | 192 | 1 | 44 | 47 | 7 | |
| KZ-54 | 2.53 | 295 | 152 | 4 | 47 | 35 | ||
| KZ-49 | 2.68 | 376 | 211 | 0 | 50 | 39 | 11 | 5 |
| KZ-24 | 2.76 | 334 | 179 | 6 | 49 | 53 | 11 | 8 |
| KZ-52 | 2.76 | 347 | 261 | 4 | 51 | 53 | 16 | 9 |
| TD17.A | 3.14 | 340 | 261 | 10 | 51 | 52 | 13 | |
| TD17.B | 3.14 | 315 | 226 | 3 | 70 | 72 | ||
| KZ-10 | 3.17 | 294 | 227 | 0 | 47 | 43 | ||
| KZ-43 | 3.24 | 343 | 253 | 0 | 77 | 80 | 14 | 9 |
| TD6.A | 3.43 | 313 | 247 | 5 | 61 | 80 | 15 | 5 |
| TD6.B | 3.43 | 312 | 277 | 6 | 83 | 77 | 29 | 28 |
| TD19.A | 3.5 | 295 | 247 | 10 | 77 | 77 | 18 | 21 |
| TD19.B | 3.5 | 313 | 255 | 9 | 72 | 77 | 19 | 8 |
| TD16.A | 3.55 | 243 | 219 | 7 | 69 | 72 | 19 | 6 |
| TD16.B | 3.55 | 265 | 218 | 5 | 75 | 81 | 15 | 6 |
| TD7.A | 3.68 | 254 | 231 | 5 | 75 | 74 | 11 | 4 |
| TD7.B | 3.68 | 297 | 251 | 2 | 52 | 56 | 14 | 8 |
| MAN1.A | 4.09 | 359 | 256 | 9 | 49 | 58 | 18 | 8 |
| MAN1.B | 4.09 | 368 | 247 | 12 | 72 | 79 | ||
| KZ-25 | 4.17 | 349 | 258 | 0 | 51 | 37 | 14 | 5 |
| KZ-23 | 4.47 | 328 | 190 | 6 | 73 | 63 | ||
| KZ-47 | 4.59 | 234 | 176 | 0 | 76 | 80 | 27 | 27 |
| TD14.A | 4.85 | 308 | 260 | 7 | 75 | 78 | 29 | 27 |
| TD14.B | 4.85 | 274 | 266 | 10 | 51 | 56 | 23 | 8 |
| KZ-53 | 4.95 | 380 | 243 | 15 | 52 | 54 | 11 | 8 |
| KZ-19 | 5.16 | 355 | 233 | 7 | 75 | 76 | 14 | 6 |
| TD20.A | 5.28 | 312 | 230 | 5 | 79 | 69 | 14 | 6 |
| TD20.B | 5.28 | 295 | 201 | 4 | 49 | 58 | 7 | 5 |
| TD8.A | 5.73 | 263 | 218 | 6 | 47 | 51 | 11 | 5 |
| TD8.B | 5.73 | 324 | 222 | 5 | 47 | 42 | 7 | |
| TD15.A | 6.19 | 350 | 196 | 3 | 78 | 80 | 7 | 4 |
| TD15.B | 6.19 | 294 | 275 | 4 | 74 | 82 | 11 | 4 |
| MR1.A | 6.43 | 282 | 191 | 3 | 51 | 52 | 7 | |
| MR1.B | 6.43 | 134 | 93 | 0 | 41 | 41 |
Figure 3Scatterplot for the percentage of “AG collagen yield” (X axis) and for the total spectrum counts for (A) COL1A1 and B) COL1A2 (Y axis) identified in the dataset. Different colors and symbols (legend) represent different species. Species identifications achieved using both ZooMS and LC–ESI–MS/MS proteomics data are indicated with full shapes, whereas species identifications achieved uniquely with LC–ESI–MS/MS proteomics (and where ZooMS analyses failed) are indicated with empty shapes.
Figure 4Scatterplot for the percentage of “AG Collagen Yield” (X axis) and for the total spectrum counts for biglycan (Y axis) identified in the dataset. Different colors and symbols (legend) represent different species. Species identifications achieved using both ZooMS and LC–ESI–MS/MS proteomic data are indicated with full shapes, whereas species identifications achieved uniquely with LC–ESI–MS/MS proteomics (and where ZooMS analyses failed) are indicated with empty shapes.
Figure 5Bar plot representing (A) number of NCPs and (B) COL1A1 total spectrum count obtained from fraction A (red) and fraction B (blue) of samples subjected to a prewashing step with HCl. Sample names are indicated to the side of the two bar plots.
Percentage Coverage for Collagen α-1(I) Chain, Collagen α-2(I) Chain, Biglycan, Serum Albumin and Fetuin-A for Fraction A and B for Bovine Samples MR8, TD1, TD17, TD4, and TD5
| MR8.A (%) | MR8.B(%) | TD1.A(%) | TD1.B (%) | TD17.A (%) | TD17.B (%) | TD4.A (%) | TD4.B(%) | TD5.A (%) | TD5.B (%) | |
|---|---|---|---|---|---|---|---|---|---|---|
| COL1A1 | 47 | 45 | 51 | 49 | 51 | 51 | 47 | 51 | 47 | 50 |
| COL1A2 | 48 | 47 | 55 | 50 | 53 | 52 | 45 | 51 | 42 | 46 |
| BGN | 18 | 15 | 16 | 13 | 11 | 15 | 7 | 7 | ||
| ALB | 21 | 21 | 9 | 12 | 16 | |||||
| AHSG | 12 | 14 | 10 | 16 | 16 |
Figure 6STRING association network of the NCPs extracted from TD1.A and TD1.B. The line thickness indicates the strength of data support (edge confidence). Proteins marked with the star symbol were identified in both sample fractions A and B (second prewash and final overnight HCl incubation).