| Literature DB >> 33028848 |
Huiyun Rao1,2,3, Yimin Yang4, Jinyi Liu5,6, Michael V Westbury7, Chi Zhang8,9, Qingfeng Shao10.
Abstract
The spotted hyena (Crocuta crocuta) is the only extant species of the genus Crocuta, which once occupied a much wider range during the Pliocene and Pleistocene. However, its origin and evolutionary history is somewhat contentious due to discordances between morphological, nuclear, and mitochondrial data. Due to the limited molecular data from east Asian Crocuta, also known as cave hyena, and the difficulty of extracting ancient DNA from this area, here we present proteomic analysis of cave hyenas from three locations in northern China. This marks the first proteomic data generated from cave hyenas, adding new molecular data to the east Asian populations. Phylogenetic analysis based on these protein sequences reveals two different groups of cave hyenas in east Asia, one of which could not be distinguished from modern spotted hyenas from northern Africa, tentatively the result of previously suggested gene flow between these lineages. With developments of instrumentation and analytical methods, proteomics holds promising potential for molecular phylogenetic reconstructions of ancient fauna previously thought to be unreachable using ancient DNA.Entities:
Mesh:
Year: 2020 PMID: 33028848 PMCID: PMC7541484 DOI: 10.1038/s41598-020-73542-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The phylogenetic relationships of modern spotted hyenas and cave hyenas based on different aDNA datasets: (a) short mitochondrial DNA fragments (366 bp of cyt b sequences)[10,11], (b) complete mitochondrial genomes[12], and (c) nuclear genomes[12]. All the analyzed samples from Africa were modern spotted hyenas and all the analyzed samples from Eurasia were Late Pleistocene cave hyenas. The figure was generated using Adobe Illustrator CC 2015 (https://www.adobe.com/cn/products/illustrator.html).
Figure 2Geographical locations of cave hyenas with molecular data from Asia. The filled circles represent the three localities in this study that successfully provided proteomic data. The filled triangles represent the localities where genetic analyses have been successfully employed 10–12. Except Denisova Cave, located in northern Asia, all the other localities are in east Asia. Both Tonghe Bridge and Kunnigou are located in Zhaodong County, Heilongjiang Province. Both Lingxian Cave and Shanyangzhai Site are located in Qinhuangdao City, Hebei Province. The map was generated using ArcMap v10.2 (https://www.esri.com/software/arcgis/arcgis-for-desktop) on the basis of SRTM data courtesy of the U.S. Geological Survey (https://landsat.usgs.gov).
The diagenetic parameters obtained by FTIR, elemental and ZooMS analyses and the coverage of type I collagen.
| Sample code | Provenance | Age | Element | Am/P | IRSF | N wt% | C wt% | C:N ratio | α-value * | Coverage (%)# |
|---|---|---|---|---|---|---|---|---|---|---|
| HZD | Kunnigou, Zhaodong County, Heilongjiang Province | > 43.5 kya (AMS14C) | Skull | 0.14 | 3.59 | 3.37 | 10.53 | 3.64 | 0.75 | 95.7 |
| SYZ | Shanyangzhai site, Qinhuangdao City, Hebei Province | Middle and Late Pleistocene | Skull | 0.03 | 5.30 | 0.52 | 5.10 | 11.37 | 0.19 | 89.8 |
| LXD-1B | Lingxian Cave, Qinhuangdao City, Hebei Province | > 94.4 kya (U-series) | Mandible | 0.02 | 4.74 | 0.18 | 5.11 | 32.53 | 0.06 | 79.6 |
| LXD-1D | Lingxian Cave, Qinhuangdao City, Hebei Province | > 94.4 kya (U-series) | Tooth | 0.02 | 4.76 | 0.13 | 3.86 | 33.93 | n.d | |
| LXD-2B | Lingxian Cave, Qinhuangdao City, Hebei Province | > 99.0 kya (U-series) | Mandible | 0.02 | 4.50 | 0.13 | 3.67 | 32.09 | n.d | |
| LXD-2D | Lingxian Cave, Qinhuangdao City, Hebei Province | > 99.0 kya (U-series) | Tooth | 0.02 | 4.83 | 0.26 | 4.32 | 19.72 | n.d |
*The value (between 0.0 and 1.0) shows the level of undeamidated glutamine, with 0.0 being totally deamidated and 1.0 being not deamidated at all.
n.d. indicates that the α-value failed to be calculated due to poor signal of the peptide in the PMF spectrum.
#This value represents the coverage of type I collagen when the raw datafiles were searched against the custom carnivorous type I collagen database. It is higher than the coverages of COL1A1 and COL1A2 displayed in Table 2 because the signal peptides and propeptides were removed and these regions rarely survived in our fossil samples.
The detailed information of the 10 proteins used to construct the phylogenetic datasets.
| Proteins | Score | Coverage (%) | Peptides | Unique peptides | PSM | NQ count* | Deamidation |
|---|---|---|---|---|---|---|---|
| SYZ | |||||||
| COL1A1 | 380.1 | 67 | 537 | 507 | 1915 | 36 | 0.97 |
| COL1A2 | 322.88 | 60 | 347 | 343 | 991 | 31 | 0.94 |
| AHSG | 44.95 | 8 | 1 | 1 | 20 | 4 | 0.63 |
| SERPINF1 | 41.46 | 9 | 2 | 2 | 5 | 1 | 1 |
| HZD | |||||||
| COL1A1 | 403.2 | 73 | 657 | 618 | 3618 | 39 | 0.71 |
| COL1A2 | 384.74 | 71 | 478 | 471 | 1940 | 42 | 0.58 |
| AHSG | 255.03 | 62 | 89 | 15 | 388 | 17 | 0.72 |
| BGN | 193.34 | 49 | 40 | 40 | 178 | 22 | 0.72 |
| CHAD | 204.81 | 53 | 42 | 4 | 197 | 23 | 0.45 |
| CLEC3B | 109.38 | 45 | 7 | 7 | 13 | 7 | 0.77 |
| OMD | 191.16 | 33 | 32 | 14 | 166 | 14 | 0.72 |
| SERPINF1 | 252.16 | 53 | 68 | 11 | 232 | 11 | 0.59 |
| SPARC | 156.16 | 35 | 22 | 6 | 63 | 7 | 0.78 |
| VTN | 179.56 | 32 | 38 | 16 | 107 | 7 | 0.43 |
| LXD | |||||||
| COL1A1 | 301.29 | 58 | 415 | 388 | 2745 | 30 | 0.97 |
| COL1A2 | 280.94 | 53 | 257 | 255 | 1299 | 31 | 0.99 |
| AHSG | 106.94 | 20 | 11 | 11 | 107 | 7 | 0.68 |
| SERPINF1 | 67.6 | 19 | 5 | 5 | 11 | 2 | 1 |
| VTN | 28.49 | 2 | 1 | 1 | 2 | 2 | 0.78 |
*This value represents the number of observed glutamines (Q) and asparagines (N) which have sufficient ion intensities for the deamidation calculation.
Figure 3Deamidation frequencies of the identified Pleistocene proteins used in cluster analysis for the (a) SYZ, (b) HZD and (c) LXD sample. 0% indicates no deamidation and 100% indicates complete deamidation. Black triangles represent contaminating proteins, and circles represent endogenous proteins, with their gene names labelled beside. Note that some proteins have the same or close NQ observations and deamidation values, their circles or triangles are overlapped. The figure was generated using RStudio v1.2.1335 (https://rstudio.com/products/rstudio/).
Figure 4Time-scaled maximum clade credibility tree estimated using BEAST and a concatenated alignment of 10 proteins from 8 extant and extinct taxa. Posterior Bayesian probabilities are indicated at nodes with a probability of ≤ 1. Horizontal error bars at each node represent 95% highest posterior density intervals. The figure was generated using FigTree v1.4.3 (https://tree.bio.ed.ac.uk/software/figtree/).