| Literature DB >> 30657921 |
Nami Arakawa1, Daisuke Utsumi2, Kenzo Takahashi2, Akiko Matsumoto-Oda3, Atunga Nyachieo4, Daniel Chai4, Ngalla Jillani4, Hiroo Imai5, Yoko Satta1, Yohey Terai1.
Abstract
Human skin is morphologically and physiologically different from the skin of other primates. However, the genetic causes underlying human-specific skin characteristics remain unclear. Here, we quantitatively demonstrate that the epidermis and dermis of human skin are significantly thicker than those of three Old World monkey species. In addition, we indicate that the topography of the epidermal basement membrane zone shows a rete ridge in humans but is flat in the Old World monkey species examined. Subsequently, we comprehensively compared gene expression levels between human and nonhuman great ape skin using next-generation cDNA sequencing (RNA-Seq). We identified four structural protein genes associated with the epidermal basement membrane zone or elastic fibers in the dermis (COL18A1, LAMB2, CD151, and BGN) that were expressed significantly greater in humans than in nonhuman great apes, suggesting that these differences may be related to the rete ridge and rich elastic fibers present in human skin. The rete ridge may enhance the strength of adhesion between the epidermis and dermis in skin. This ridge, along with a thick epidermis and rich elastic fibers might contribute to the physical strength of human skin with a low amount of hair. To estimate transcriptional regulatory regions for COL18A1, LAMB2, CD151, and BGN, we examined conserved noncoding regions with histone modifications that can activate transcription in skin cells. Human-specific substitutions in these regions, especially those located in binding sites of transcription factors which function in skin, may alter the gene expression patterns and give rise to the human-specific adaptive skin characteristics.Entities:
Keywords: elastic fibers; epidermal basement membrane zone; primates; rete ridge; transcriptional regulatory region
Mesh:
Substances:
Year: 2019 PMID: 30657921 PMCID: PMC6402313 DOI: 10.1093/gbe/evz007
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Phylogenetic relationships between human, nonhuman great apes, and rhesus macaque. A phylogenetic tree was constructed using the Neighbor-Joining method and the pair-wise nucleotide divergence of whole genome sequences (Scally et al. 2012). The scale bar represents 0.005 substitutions per site. The distance on each branch was calculated by the Fitch–Margoliash algorithm (Fitch and Margoliash 1967) using the pair-wise nucleotide divergence.
. 2.—The thickness and epidermal BM zone topography of skin in humans and three Old World monkey species. The comparison of thickness of the epidermis (a) and dermis (b). Pa, Ca, Cp, and Hs indicate Papio anubis, Cercopithecus albogularis, Chlorocebus pygerythrus, and Homo sapiens, respectively. The numbers of individuals used for measurements are shown under each species name abbreviation. Photographs of hematoxylin–eosin stained histologic skin sections are from H. sapiens (c), P. anubis (d), C. albogularis (e), and Ch. pygerythrus (f). Scale bars are shown in each panel. ep. and d. indicate the epidermis and dermis, respectively. *P < 0.05, **P < 0.01, and ***P < 0.001, t-test with Bonferroni correction.
. 3.—Volcano plots for the gene expression differences between humans and nonhuman great apes. Each blue dot represents the gene expressed in the skin. The normalized RPKM values are based on the mapping to the reference genome of (a) human, (b) chimpanzee, (c) gorilla, and (d) orangutan. The log 2-fold changes of average normalized RPKM values of nonhuman great apes compared with those of humans and the −log 10 P values resulting from Baggerley’s test comparing average normalized RPKM values between humans and nonhuman great apes for each gene are plotted on the x and y axis, respectively. Differentially expressed genes between humans and nonhuman great apes selected in this study are shown by black and red dots. The gene names colored in red indicate structural protein genes.
Differentially Expressed Genes between Human and Nonhuman Great Ape Skin
| Average Normalized RPKM | FD | |||
|---|---|---|---|---|
| Humans | NH Great Apes | |||
| Higher expression in humans | ||||
| 137.0 | 32.1 | 4.3 | ||
| 20.9 | 0.7 | 30.8 | ||
| 29.3 | 1.9 | 15.5 | ||
| 547.6 | 112.8 | 4.9 | ||
| 986.0 | 130.0 | 7.6 | ||
| 34.3 | 12.5 | 2.7 | ||
| 7.0 | 2.8 | 2.5 | ||
| 42.8 | 19.1 | 2.2 | ||
| 75.2 | 2.0 | 36.9 | ||
| 56.2 | 6.4 | 8.8 | ||
| 30.8 | 7.0 | 4.4 | ||
| 64.8 | 10.4 | 6.2 | ||
| 20.5 | 9.3 | 2.2 | ||
| 40.1 | 2.4 | 17.0 | ||
| 361.7 | 103.5 | 3.5 | ||
| 80.0 | 42.1 | 1.9 | ||
| 12.0 | 5.0 | 2.4 | ||
| 14.3 | 3.2 | 4.4 | ||
| 14.0 | 6.4 | 2.2 | ||
| 47.2 | 23.3 | 2.0 | ||
| 118.6 | 45.5 | 2.6 | ||
| Lower expression in humans | ||||
| 14.7 | 39.3 | 2.7 | ||
| 4.8 | 22.9 | 4.7 | ||
| 19.6 | 45.6 | 2.3 | ||
| 41.1 | 74.2 | 1.8 | ||
| 19.7 | 47.0 | 2.4 | ||
| 0.4 | 7.7 | 18.6 | ||
Note.—Bold letters: structural protein genes.
In mapping to the human reference genome.
NH great apes: nonhuman great apes.
FD: fold difference.
P < 0.05,
P < 0.01, and
P < 0.001, Baggerley’s test with FDR P value correction.
Collagen Genes with Relatively High Expression in Skin
| Average Normalized RPKM | Fold Difference | |||
|---|---|---|---|---|
| Humans | NH Great Apes | |||
| (a) Genes encoding proteins that form collagens in the epidermal BM zone | ||||
| 54.6 | 21.5 | 2.5 | ||
| 77.5 | 32.9 | 2.4 | ||
| 32.9 | 16.8 | 2.0 | ||
| 181.7 | 76.4 | 2.4 | ||
| 41.7 | 18.3 | 2.3 | ||
| (b) Genes encoding proteins that form BGN-binding collagens | ||||
| 444.4 | 198.5 | 2.2 | ||
| 311.3 | 134.9 | 2.3 | ||
| 404.7 | 148.9 | 2.7 | ||
| 276.3 | 106.2 | 2.6 | ||
| 459.3 | 185.1 | 2.5 | ||
| 45.6 | 27.3 | 1.7 | ||
In mapping to the human reference genome.
NH great apes: nonhuman great apes.
P < 0.05,
P < 0.01, and
P < 0.001, t-test with Bonferroni correction.
. 4.—The positions of the putative transcriptional regulatory regions with the candidate substitutions. The exons of (a) COL18A1, (b) LAMB2, (c) CD151, and (d) BGN genes are shown in orange in each panel. The symbols “<” and “>” indicate the direction of the genes. All genes (except genes of interest) are shown by gray arrows. White arrowheads at the end of horizontal lines indicate that the genes are continuous beyond the schematic representation. Orange vertical lines indicate the position of the putative transcriptional regulatory regions numbered by Roman numerals. The zoomed-in view around these regions are shown in the rectangles of each panel. H3K4m1, H3K4m2, H3K4m3, H3K9ac, and H3K27ac indicate monomethylation of histone H3 lysine 4, dimethylation of histone H3 lysine 4, trimethylation of histone H3 lysine 4, acetylation of histone H3 lysine 9, and acetylation of histone H3 lysine 27, respectively. The positions and density of the gray scale bars indicate the positions and intensity of histone modifications shown at the side of the bars, respectively. The skin cell strains referred for the histone modifications were as below: a: NHEK, b: NHDF-Ad, c: NHEK, and d: NHDF-Ad. For LAMB2 and BGN, we also referred to the NHEK strain, in which the genes are also expressed (supplementary fig. S4b and d, Supplementary Material online). The red arrowheads indicate the numbers of the candidate substitutions located in each putative transcriptional regulatory region. A substitution in COL18A1 region I was later removed from the candidate substitutions because of the high ancestral allele frequency in the locus.
The Numbers of Substitutions in Conserved 120-bp Regions with Candidate Substitutions
| Region | Candidate Substitution | Substitutions | Substitutions |
|---|---|---|---|
| S1 | 3.79 | 1 | |
| S1 | |||
| S2 | |||
| S1 | 1 | ||
| S1 | 1 | ||
| S1 | 4.05 | 1 | |
| S1 | 4.05 | 1 | |
| S1 | 4.06 | ||
| S2 | |||
| S1 | 3.03 | 1 | |
| S1 | 3.80 | 1 | |
| S1 | 3.03 | ||
| S1 | 3.03 | ||
| S1 | 4.09 | 1 | |
| S2 | 4.06 | 1 | |
| S1 | 3.04 | 1 | |
| S1 | 3.04 | 1 | |
| S1 | 3.03 | ||
| S1 | 4.06 | 1 | |
| S1 | |||
| S1 | 1 | ||
| S1 | 4.25 | ||
| S2 | |||
Note.—Bold letters: significantly highly conserved regions in nonhuman lineages (P < 0.01, Poisson distribution) or conserved regions with the significantly larger numbers of substitutions in the human lineage (P < 0.05, Poisson distribution).
S1 and S2 represent substitutions 1 and 2 in one region, respectively.
The 120-bp regions with the smallest numbers of substitutions were selected for each candidate substitution.
The 120-bp regions with the largest numbers of substitutions were selected for each candidate substitution.
Two substitutions (S1 and S2) were located in the same 120-bp region.
P < 0.05 and **P < 0.01, Poisson distribution.
Candidate Substitutions in Binding Sites of TFs with a Function in Skin
| Region | Candidate Substitution | TF | Binding Site | Relative Score | |
|---|---|---|---|---|---|
| Ancestral Allele | Human-Specific Allele | ||||
| II | S2 | HOXB2 | 22–31 (f) | 0.819 | 0.713 |
| III | S1 | FLI1 | 22–31 (r) | 0.701 | 0.827 |
| VI | S1 | KLF4 | 20–29 (f) | 0.808 | 0.719 |
| VII | S1, S2 | SPDEF | 20–30 (f) | 0.595 | 0.803 |
| FLI1 | 21–30 (f) | 0.614 | 0.825 | ||
| II | S1 | HOXB2 | 23–32 (f) | 0.755 | 0.821 |
| MSX2 | 24–31 (f) | 0.686 | 0.810 | ||
| PRRX2 | 24–31 (f) | 0.757 | 0.858 | ||
| I | S1 | FOSL2 | 26–36 (f) | 0.859 | 0.841 |
| SNAI2 | 23–31 (f) | 0.763 | 0.902 | ||
| II | S1 | KLF4 | 21–30 (f) | 0.809 | 0.814 |
| I | S1 | SNAI2 | 20–28 (r) | 0.971 | 0.876 |
| II | S1 | KLF4 | 25–34 (f) | 0.818 | 0.727 |
| SPDEF | 19–29 (r) | 0.842 | 0.731 | ||
| III | S1 | FLI1 | 20–29 (r) | 0.736 | 0.866 |
| V | S1 | SNAI2 | 20–28 (f) | 0.779 | 0.929 |
| SNAI2 | 24–32 (f) | 0.937 | 0.842 | ||
| VI | S1 | SNAI2 | 22–30 (r) | 0.840 | 0.785 |
| MSX2 | 21–28 (f) | 0.713 | 0.810 | ||
| VII | S1 | HOXB2 | 21–30 (r) | 0.854 | 0.748 |
S1 and S2 represent substitutions 1 and 2 in one region, respectively.
The numbers represent the positions within the 51-bp sequences retrieved for the TF search. Each candidate substitution is located at position 26. “f” and “r” indicate TF-binding site on forward and reverse strands, respectively.
This score shows the similarity with the consensus sequence of TF-binding site in the JASPAR database. The score changes by the candidate substitutions from the ancestral alleles to the human-specific alleles are shown.
These two substitutions (S1 and S2) are located next to each other. The 25-bp sequences on both sides of these substitutions were retrieved for the TF search (candidate substitutions: positions 26 and 27).