| Literature DB >> 32330268 |
Gerrald A Lodewijk1, Diana P Fernandes1, Iraklis Vretzakis1, Jeanne E Savage2,3, Frank M J Jacobs1,3.
Abstract
Ever since the availability of genomes from Neanderthals, Denisovans, and ancient humans, the field of evolutionary genomics has been searching for protein-coding variants that may hold clues to how our species evolved over the last ∼600,000 years. In this study, we identify such variants in the human-specific NOTCH2NL gene family, which were recently identified as possible contributors to the evolutionary expansion of the human brain. We find evidence for the existence of unique protein-coding NOTCH2NL variants in Neanderthals and Denisovans which could affect their ability to activate Notch signaling. Furthermore, in the Neanderthal and Denisovan genomes, we find unusual NOTCH2NL configurations, not found in any of the modern human genomes analyzed. Finally, genetic analysis of archaic and modern humans reveals ongoing adaptive evolution of modern human NOTCH2NL genes, identifying three structural variants acting complementary to drive our genome to produce a lower dosage of NOTCH2NL protein. Because copy-number variations of the 1q21.1 locus, encompassing NOTCH2NL genes, are associated with severe neurological disorders, this seemingly contradicting drive toward low levels of NOTCH2NL protein indicates that the optimal dosage of NOTCH2NL may have not yet been settled in the human population.Entities:
Keywords: Neanderthal; archaic genomes; brain size; gene conversion; human evolutionary genomics; human-specific genes, segmental duplications
Mesh:
Substances:
Year: 2020 PMID: 32330268 PMCID: PMC7475042 DOI: 10.1093/molbev/msaa104
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1.NOTCH2NL copy-number analysis in modern human and archaic DNA samples. (A) Overview of NOTCH2 and NOTCH2NL loci in the human genome (hg38). Zoom-ins show sequence read depth at the different loci of data mapped on hg38 or masked hg38 reference genome. (B) Tracks showing NOTCH2NL duplicons from the segmental UCSC browser duplication track in the NOTCH2 locus. (C) Example showing NOTCH2- and NOTCH2NL-derived sequencing reads piled up on the NOTCH2 locus on the masked hg38 genome. (D) Quantification of NOTCH2 + NOTCH2NL alleles per individual using relative coverage of multicopy/single-copy regions. Modern human, n = 279. Ancient human: high (n = 27)/low (n = 53) coverage; Neanderthal high (n = 3)/low (n = 9) coverage; Denisova high (n = 1)/low (n = 1) coverage. (E, F) NOTCH2NL allele counts estimated from the average density of paralog-specific SUNs in modern human outliers (E) and Neanderthals (F) showing evidence for the presence of 11 alleles in total (two alleles NOTCH2 + nine alleles NOTCH2NL). (G) Comparison of allele count grouped by NOTCH2NLA + NOTCH2NLB (Kruskal–Wallis P = 1.8e-8), and NOTCH2NLR + NOTCH2NLC + NOTCH2 (Kruskal–Wallis P = 0.0055). Kruskal–Wallis test was followed up by Dunn’s test, significant comparisons are indicated in the plots. Modern human, N = 279; ancient human, N = 80; Neanderthal, N = 12; and Denisova, N = 2.
. 2.Characterization of archaic NOTCH2NL coding variants. (A) Overview of modern human, Neanderthal-specific, and Denisovan-specific coding variants. (B) Coculture NOTCH2 reporter assay testing Neanderthal and Denisovan variants reconstructed in the human NOTCH2NLA cDNA (n = 15 in three experiments, analysis of variance (ANOVA) P = 0.002, followed by Tukey’s test), or the human NOTCH2NLB cDNA (n = 20 in four experiments, ANOVA P = 0.07). (C) Western blot analysis of Neanderthal and Denisovan variants. Plasmids were transfected in equimolar amounts. (D) Quantification of protein level from three independent experiments for NOTCH2NLA (ANOVA P = 0.12) and NOTCH2NLB (ANOVA P = 0.006, followed by Tukey’s test). Asterisks indicate significant values from Tukey’s post hoc tests: *P < 0.05 and **P < 0.01.
. 3.NOTCH2NL Exon1 variants define protein expression level. (A) Overview of NOTCH2NL Exon1 variants in NOTCH2NLB (blue), NOTCH2NLA (orange), and NOTCH2NLC (green). The ORFs produced by each variant are indicated in dark green. (B) Western blot analysis of NOTCH2NL Exon1 coding variants. (C) Quantification of protein expression level from equimolar quantities of NOTCH2NLB, NOTCH2NLA, or NOTCH2NLC full-length cDNAs. Data from six independent experiments, analysis of variance (ANOVA) (Welch corrected) P = 2.7e-05, followed Games–Howell test: **P < 0.01 and ***P < 0.001. (D) Overview of NOTCH2NL loci, the configuration of the Exon1 variants, and the relative levels of NOTCH2NL protein they produce.
. 4.NOTCH2NLC configuration in Denisova3 compared with modern humans. (A) Plot showing the Exon1C-(X-low) allele count for modern humans, ancient humans, Neanderthals, and Denisovan. Note the unusual allele count for Denisovan. (B) Modern human’s median allele count plotted for each of the NOTCH2NLC-specific SUNs distributed along the NOTCH2NL locus. Vertical dashed lines indicate the region around Exon1. Zoom-in shows SUN count in Exon1, including the Exon1C-(X-low) variant indicated by green arrowhead. (C) NOTCH2NL allele counts in the Denisova3 genome, estimated from the average density of paralog-specific SUNs. (D) Denisova3 allele count plotted for each of the NOTCH2NLC-specific SUNs distributed along the NOTCH2NL locus. Zoom-in shows NOTCH2NLC SUN count in Exon1, including the Exon1C-(X-low) variant as indicated by green arrowhead.
. 5.Exon1 variant frequencies in modern human and ancient genomes. (A) Median allele count for each of the NOTCH2NLA- and NOTCH2NLB-specific SUNs along the NOTCH2NL locus in Simons diversity genomes (N = 279). (B) Zoomed in region of Exon1, orange arrowhead indicates Exon1B-(High) (ATG)/Exon1A-(Low) (ATA) variant positions. (C) Distribution of Exon1A-(Low) and Exon1B-(High) (inferred) variants in Simons diversity genomes. Expected distribution models equal frequency of both variants. Vertical dashed lines indicate medians. N = 279, Kolmogorov–Smirnov test: P < 2e-16. (D) Analysis of Exon1A-(Low) and Exon1B-(High) (inferred) variant frequency in modern humans and archaic genomes. Red lines indicate medians.
Fig.6.Additional deleterious NOTCH2NL variants are present specifically in humans. (A) Overview of the R113* and Exon2B-(Splice-mut) deleterious variants on NOTCH2NL protein structure. (B) R113* and Exon2B-(Splice-mut) allele count in modern human and archaic genomes. (C) UK Biobank data for SAS, AMR, and EUR ancestries showing association of Exon1A-(Low) frequency with R113* frequency, Exon2B-(Splice-mut) frequency, and their combined total grouped by ancestry. R113* Kruskal–Wallis: SAS P = 2.2e-16, AMR P = 7.8e-5, and EUR P = 2.2e-16. Exon2B-(Splice-mut) Kruskal–Wallis: SAS P = 4.6e-15, AMR P = 0.04, and EUR P = 1.1e-15. Combined Kruskal–Wallis: SAS P = 2.2e-16, AMR P = 9.9e-7, and EUR P = 2.2e-16. Significant groups were followed by Dunn’s test. (D, E) Dose–response curve using increasing amounts of NOTCH2NL in the coculture NOTCH2 reporter assay. (D) NOTCH2 expressing cells are cocultured with U2OS (− ligand, analysis of variance [ANOVA] P = 7.4e-7, followed by Tukey’s test: *P < 0.05) or (E) U2OS-JAG2 (+ ligand, ANOVA P = 4.7e-9, followed by Tukey’s test: **P < 0.01 and ***P < 0.001) cells. n = 5 per condition, displayed as mean ± SD. (F) General overview schematic showing the impact of variants in NOTCH2NL genes on the production of NOTCH2NL protein and the time/lineage where they were segregating. Asterisks indicate significant values from Dunn’s post hoc tests: *P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001. EAS N = 266, SAS N = 1,174, AMR N = 444, EUR N = 46,578, and AFR N = 1,087.
|
| ||
| PRJEB9586 ( | Simons diversity genomes | |
| NA ( | UK Biobank exomes | |
|
| ||
| PRJEB6622 ( | Ust’-Ishim | |
| PRJEB6272 ( | Loschbour, StuttgartLBK, Motala3, Motala12 | |
| PRJNA240906 ( | NE1, BR2, IR1, KO1, NE6, NE7, CO1, NE5, BR1 | |
| PRJEB4604 ( | 12880A, 12881A, 12883A, 12884A, 15594A-sc-20 | |
| PRJEB21878 ( | I9028, I9133, I9134 | |
| PRJEB11004 ( | 3DRIF-16, 3DRIF-26, 6DRIF-18, 6DRIF-21, 6DRIF-22, 6DRIF-23, 6DRIF-3, M1489, NO3423 | |
| PRJEB24629 ( | IPK12, IPY10 | |
| PRJEB27628 ( | chy002, kzb002, kzb005, kzb006, kzb007, kzb008, mur003, mur004, scy009, scy301, scy303 | |
| PRJEB13123 ( | Karelia | |
| PRJEB11364 ( | Bichon, Kotias, Satsurblia | |
| PRJEB21940 ( | Sf12, H22, Sf913, Stg001 | |
| PRJEB9783 ( | atp002, atp12-1240 | |
| PRJNA218466 ( | Mal’Ta | |
| PRJEB21037 ( | Kunila1, Ardu2 | |
| PRJEB18067 ( | Latvia_HG1, Latvia_HG2, Latvia_HG3, Latvia_MN2 | |
| PRJEB11995 ( | BA64, RM127, RSK1, RSK2 | |
| PRJEB29663 ( | MH8 | |
| PRJEB31045 ( | ans017, prs016, prs002, prs009 | |
| PRJNA338374 ( | Atacama | |
| PRJEB23467 ( | atp002, atp016 | |
| PRJEB7618 ( | Kostenki 14 | |
| PRJNA284124 ( | Kennewick | |
| PRJNA46213 ( | Saqqaq | |
| PRJNA229448 ( | Anzick-1 | |
| PRJEB6943 | Cr10-sc, PA38-sc, PA30-sc | |
| PRJEB2830 ( | Ötzi | |
| PRJNA230689 ( | La Brana | |
| PRJEB6090 ( | Gökhem2, Ajvide58 | |
|
| ||
| PRJEB1265 ( | Altai | |
| PRJEB21157 ( | Vindija | |
| PRJEB21195 ( | Mezmaiskaya1 | |
| NA ( | Chagyrskaya | |
| PRJEB21870 ( | Goyet Q56-1 | |
| PRJEB21875 ( | Les Cottes Z4-1514 | |
| PRJEB21881 ( | Mezmaiskaya2 | |
| PRJEB21882 ( | Vindija 87 | |
| PRJEB21883 ( | Spy 94a | |
| PRJEB2065 ( | Vi33.16, Vi33.25, Vi33.26 | |
|
| ||
| PRJEB3092 ( | Denisova3 | |
| PRJEB20653 ( | Denisova2 | |
|
| ||
| PRJEB24663 ( | Denisova11 | |
| Position | Locus | Orientation | Reference sequence hg38 |
|---|---|---|---|
| chr1:120069403–120069404 | NOTCH2 | − | AT |
| chr1:120724179–120724180 | NOTCH2NLR | + | AT |
| chr1:146228778–146228779 | NOTCH2NLA | − | AT |
| chr1:148679531–148679532 | NOTCH2NLB | − | AT |
| chr1:149390853–149390854 | NOTCH2NLC | + | AT |
| Position | Locus | Orientation | Reference sequence hg38 |
|---|---|---|---|
| chr1:120029988–120029989 | NOTCH2 | − | T |
| chr1:120763625–120763626 | NOTCH2NLR | + | A |
| chr1:146189382–146189383 | NOTCH2NLA | − | T |
| chr1:148640098–148640099 | NOTCH2NLB | − |
|
| chr1:149430931–149430932 | NOTCH2NLC | + | A |
| Position | Locus | Orientation | Reference sequence hg38 |
|---|---|---|---|
| chr1:119997052–119997053 | NOTCH2 | − | T |
| chr1:120793439–120793440 | NOTCH2NLR | + | A |
| chr1:146156535–146156536 | NOTCH2NLA | − | T |
| chr1:148607465–148607466 | NOTCH2NLB | − | T |
| chr1:149463769–149463770 | NOTCH2NLC | + | A |
| 6 | 16 | 48 | 140 | 420 | |
|---|---|---|---|---|---|
|
| 1,050 | 1,050 | 1,050 | 1,050 | 1,050 |
|
| 70 | 70 | 70 | 70 | 70 |
|
| 35 | 35 | 35 | 35 | 35 |
|
| 21 | 21 | 21 | 21 | 21 |
|
| 5/6 | 14/16 | 41/48 | 120/140 | 361/420 |
|
| 1,315/1,314 | 1,306/1,304 | 1,279/1,272 | 1,200/1,180 | 959/900 |
| Plasmids |
|---|
| pCAGEN-GFP (Addgene #11150) |
| pCAGN1- hCas9 (Addgene #51142) |
| pCAGN1- EV |
| pCAGN1-NOTCH2NL |
| pCAGN1-NOTCH2NL-T197I |
| pCAGN1-NOTCH2NL-M40I, T197I |
| pCAGN1-NOTCH2NL-N232S, T197I |
| pCAGN1-NOTCH2NL-E258A, T197I |
| pCAGN1-NOTCH2NL-M1I |
| pCAGN1-NOTCH2NL-M1I, T197I |
| pCAGN1-NOTCH2NL-M1I, M40I |
| pCAGN1-NOTCH2NL-M1I, N232S |
| pCAGN1-NOTCH2NL-M1I, E258A |
| pCAGN1-NOTCH2NL-HA |
| pCAGN1-NOTCH2NL-HA-T197I |
| pCAGN1-NOTCH2NL-HA-M40I, T197I |
| pCAGN1-NOTCH2NL-HA-N232S, T197I |
| pCAGN1-NOTCH2NL-HA-E258A, T197I |
| pCAGN1-NOTCH2NL-HA-M1I |
| pCAGN1-NOTCH2NL-HA-M1I, T197I |
| pCAGN1-NOTCH2NL-HA-M1I, M40I |
| pCAGN1-NOTCH2NL-HA-M1I, N232S |
| pCAGN1-NOTCH2NL-HA-M1I, E258A |
| pCAGN1-NOTCH2NL-HA-5′ M1 |
| pCAGN1-NOTCH2NL-HA-5′ M1 + kozak |
| pCAGN1-NOTCH2NL-HA-5′ M40 |
| pCAGN1-NOTCH2NL-HA-5′ M40 + kozak |
| pCAGN1-NOTCH2NL-HA-5′ M84 |
| pCAGN1-NOTCH2NL-HA-5′ M84 + kozak |
| pCAGN1-NOTCH2NL-HA-5′ M1I-I1 |
| pCAGN1-NOTCH2NL-HA-5′ P2 |
| pCAGN1-NOTCH2NL-HA-5′ L12 |
| pCAGN1-NOTCH2NL-HA-5′ P22 |
| pCAGN1-NOTCH2NL-HA-5′ C28 |
| pCAGN1-NOTCH2NL-HA-M1I-ΔI1 |
| pCAGN1-NOTCH2NL-HA-M1I-ΔL4 |
| pCAGN1-NOTCH2NL-HA, 5′ M1, CTG(1-5) > CTA(1-5) |
| pCAGN1-NOTCH2NL-M1I-HA, 5′ I1, CTG(1-5) > CTA(1-5) |
| pCAGN1-NOTCH2NL-M1I-HA, 5′ I1, Δata-CTG(1-5) > CTA(1-5) |
| pcDNA5-NOTCH2-GAL4-TAD-N1 |
| pRL-CMV (Promega E2261) |