Literature DB >> 26693966

The evolution of Homo sapiens denisova and Homo sapiens neanderthalensis miRNA targeting genes in the prenatal and postnatal brain.

Konstantin V Gunbin, Dmitry A Afonnikov, Nikolay A Kolchanov, Anatoly P Derevianko, Eugeny I Rogaev.   

Abstract

BACKGROUND: As the evolution of miRNA genes has been found to be one of the important factors in formation of the modern type of man, we performed a comparative analysis of the evolution of miRNA genes in two archaic hominines, Homo sapiens neanderthalensis and Homo sapiens denisova, and elucidated the expression of their target mRNAs in bain.
RESULTS: A comparative analysis of the genomes of primates, including species in the genus Homo, identified a group of miRNA genes having fixed substitutions with important implications for the evolution of Homo sapiens neanderthalensis and Homo sapiens denisova. The mRNAs targeted by miRNAs with mutations specific for Homo sapiens denisova exhibited enhanced expression during postnatal brain development in modern humans. By contrast, the expression of mRNAs targeted by miRNAs bearing variations specific for Homo sapiens neanderthalensis was shown to be enhanced in prenatal brain development.
CONCLUSIONS: Our results highlight the importance of changes in miRNA gene sequences in the course of Homo sapiens denisova and Homo sapiens neanderthalensis evolution. The genetic alterations of miRNAs regulating the spatiotemporal expression of multiple genes in the prenatal and postnatal brain may contribute to the progressive evolution of brain function, which is consistent with the observations of fine technical and typological properties of tools and decorative items reported from archaeological Denisovan sites. The data also suggest that differential spatial-temporal regulation of gene products promoted by the subspecies-specific mutations in the miRNA genes might have occurred in the brains of Homo sapiens denisova and Homo sapiens neanderthalensis, potentially contributing to the cultural differences between these two archaic hominines.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26693966      PMCID: PMC4686780          DOI: 10.1186/1471-2164-16-S13-S4

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Early in the 21st century, fossils of an ancient man who lived 50-45 thousand years (ky) ago were found in the Denisova cave in the Altai mountains [1]. The Homo sapiens denisova (H. s. d.) nuclear genome was first sequenced in 2010 [2] and then re-sequenced in 2012 [3]. The Homo sapiens neanderthalensis (H. s. n.) nuclear genome was first sequenced in 2010 [4] and for a second time in 2014 [5]. A comparative analysis of the archaic hominine (H. s. d., H. s. n.) and Homo sapiens sapiens (H. s. s.) genomes assessed the genetic contribution of H. s. d. and H. s. n to the genetic and biological profile of the modern H. s. s. populations [6]. It was shown that the H. s. n. lineage is a sister lineage to H. s. d. [2]. The population split of Neanderthals and Denisovans from modern humans was estimated to have occurred 550-765 ky ago and the split time of Neanderthals and Denisovans 380-473 ky ago [5]. Considering that the evolution of miRNA genes could be one of the key factors in the formation of the modern type of man [7-10], we analyzed the evolution of miRNA genes in H. s. n. and H. s. d. and the expression of their target mRNAs. Using improved versions of the H. s. d. and H. s. n. genomes, we performed computer-assisted comparisons of the H. s. n., H. s. d. and H. s. s. genomes to reveal the structural and functional organization of microRNAs (miRNAs) as well as the mRNAs targeted by these miRNAs. In both the Neanderthal and Denisovan genomes, we found miRNA genes with fixed substitutions in mature miRNAs and multiple substitutions in pre-miRNA regions involved in pre-miRNA processing. Our analysis of spatiotemporal gene expression in human tissues demonstrated that the miRNAs bearing new genetic variants fixed in the Denisovan genome regulated target mRNAs with the highest levels of expression in the postnatal human brain.

Results and discussion

To identify miRNA genes that evolved divergently in the genus Homo, we first used the map and pre-miRNA gene sequence data for 1595 experimentally confirmed H. s. s. miRNAs from the miRBase database rel. 19 [11]. We selected the H. s. d. and H. s. n. pre-miRNAs with highly confident sequences that are orthologous to H. s. s. pre-miRNAs. Using the selection procedure described in the Methods section (Selection of miRNA genes in H. s. d. and H. s. n. with no match in H. s. s.), we identified 1298 H. s. d. and 1329 H. s. n. pre-miRNAs perfectly matched to H. s. s. pre-miRNA gene sequences. In addition, we identified and selected for further study 106 H. s. d. and 102 H. s. n. diverged genes for pre-miRNA sequences, which were different from H. s. s. pre-miRNA genes by at least a single nucleotide. The other 191 H. s. d. and 164 H. s. n. pre-miRNAs did not pass quality sequence control in the selection procedure Further, we explored the alignment of the genomes of six primates available in Ensembl rel. 69 [12]. Each of the selected H. s. d. and H. s. n. pre-miRNA genes non-identical to H. s. s. orthologs had at least one ortholog in another primate. This result indicated that these pre-miRNA coding genes were present in the genome of the common ancestor of H. s. s., H. s. d. and H. s. n. long before the evolutionary split between these hominines. In following up this analysis, we confirmed the existence of orthologs for all selected H. s. d. and H. s. n. miRNA genes except one H. s. d. miRNA gene. Next, we analyzed all substitutions found in the archaic hominine miRNA genes selected as described above. First, we excluded a few doubtful nucleotide substitutions observed in the sequencing data for the pre-miRNA genes of H. s. d. and H. s. n. (see Methods section). Then, we mapped the remaining genetic variants found in the pre-miRNA genes of H. s. d. and H. s. n. onto the secondary structures of the corresponding pre-miRNAs in H. s. s. available at the miRBase rel. 19 [11]. We selected the H. s. d. and H. s. n. pre-miRNAs with (i) nucleotide substitutions (deletions, insertions) in the regions corresponding to the sequences of mature miRNAs (and/or miRNAs*) responsible for binding to target mRNAs or (ii) multiple (two or more) densely spaced substitutions within a pre-miRNA region involved in pre-miRNA processing. From these pre-miRNA groups, we excluded the pre-miRNAs that occur in the H. s. s. genome in more than one copy. The aim of this exclusion was to select pre-miRNAs with unique functions. The selection yielded H. s. d. and H. s. n. pre-miRNAs with nucleotide substitutions that might contribute to significant functional differences from the H. s. s. pre-miRNAs. The functional annotation of the diverged pre-miRNA genes in archaic hominines was performed as follows. First, we selected the miRNAs expressed in the central nervous system (CNS) using the miRGator 3.0 [13] and ChIP-seq data for H3K4me3 (histone H3 trimethyl K4) modified histones marking active promoters in primate cortical neurons [14]. Second, the potential target mRNAs of these central nervous system (CNS)-active miRNAs were identified using data from miRGator 3.0 [13]. miRGator 3.0 contains experimental miRNA/target-mRNAs co-expression data obtained under various conditions. Therefore, we extracted from miRGator 3.0 genes with co-expression correlation coefficient with miRNA r≤-0.9 [13]. We used the co-expression evidence to identify putative interactions between miRNAs and their target mRNAs instead of the CLIP-seq experiments results because the latter technology requires cell lysis, which facilitates the interaction of components that are usually segregated by cellular compartments [15], and many miRNA/mRNAs interactions identified by CLIP-seq are non-canonical, which in turn does not mediate repression [16]. However, co-expression data indicate pathways (not genes) targeted by miRNA but do not allow the identification of direct miRNA/mRNA interactions. The latter has a side advantage because, in this approach, we selected against target genes with limited effect on tissue functioning [17]. Third, to identify the human brain structures (topologically different brain regions) showing increased levels of the target mRNAs (henceforth referred to as the target brain structures), we performed a randomization test using the Human Allen Brain Atlas [18] and the BrainSpan Atlas [19]. We employed the randomization test for this purpose because the standard analytic annotation enrichment techniques are inapplicable for miRNA functional enrichment analysis [20]. The characteristics of the nucleotide substitutions in each of the CNS-active pre-miRNA genes diverged in Denisovan and Neanderthal are shown in Tables 1 and 2. These miRNA genes were stratified into four groups. The first group consists of 9 miRNAs with fixed mutations unique to the H. s. d. lineage. The second group includes 5 miRNAs with fixed mutations unique to the H. s. n. lineage. Two other miRNA gene groups were compiled of 14 H. s. d. and 18 H. s. n. miRNA genes bearing variants that match to one of the polymorphic alleles in H. s. s. Most polymorphisms in these miRNAs are SNPs (single nucleotide polymorphisms). Several substitutions are observed in the seed region (based on Table 1 and 2) in six out of 14 H. s. d. and five out of 18 H. s. n. pre-miRNAs common to human polymorphisms. Only one indel in the seed region was observed in hsa-mir-3161 in both H. s. n. and H. s. d. The minority of miRNAs with polymorphisms in the seed region is not sufficient to claim that the general capacity of these miRNA pools have altered binding specificity to target mRNAs. This result allowed us to use the third and fourth pools as a control to compare the target specificities with miRNA bearing fixed variants specific for H. s. n. and H. s. d.
Table 1

Homo sapiens denisova miRNAs that differ from Homo sapiens sapiens miRNAs.

Genome position (hg 19)

#miRNA*Ancestral state (h: human, d: Denisovan)Difference (human/ Denisovan)**ChromosomePositionStrandDB SNP IDAllele frequency(1000 genomes [28])African population frequency(1000 genomes [28]or dbSNP 141 [29])
Homo sapiens denisova miRNA mutations common to previously observed human miRNA polymorphisms

1{hsa-mir-1178}d {A/G} 12120151493-rs7311975C:0.13820.4191

2{hsa-mir-1252}d{A/G}1279813049+rs115256251G:0.00320.0098

3{hsa-mir-1269a}d{G/A}467142620+rs73239138A: 0.39420.7277

4{hsa-mir-146a}d {C/G} 5159912418+rs2910164C:0.38150.475(YRI)

5{hsa-mir-2682 (3p)}d {G/A} 198510847-rs74904371T:0.02080.0053

6{hsa-mir-3161}d {-/A} 1148118347+rs11382316A:0.7599+(BUSHMAN)

7{hsa-mir-4804 (5p)}d {C/G} 572174432+rs266435G: 0.80010.7269

8{hsa-mir-608}d{C/G}10102734778+rs4919510G:0.36380.4402

hA/G10102734813+---

9hsa-mir-3124 (3p)dC/A1249120631+rs115160731A:0.01280.0469

10hsa-mir-4514hT/C1581289798-rs116034786G:0.01360.0371

11hsa-mir-1269bh C/(G|C) 1712820646-rs7210937C:0.35200.5976

12hsa-mir-6085hA/-1562635322+rs372168584-:0.00060.0023

13hsa-mir-378edT/C5169455502+rs367764573C:0.00260.0008

h T/C 5169455551+rs376752141C:0.00260.0008

14hsa-mir-662hT/A16820215+rs74656628A:0.13740.3971

h G/A 16820249+rs9745376A:0.05430.1921

15hsa-mir-4463d-/AG676138146+rs5877455AG:0.70790.9198

16{hsa-mir-532(3p, 5p)}d{A/G}X49767832+rs456615G:10.7587

17d {A/G}X49767835+rs456617G:10.7587

18{hsa-mir-943}hA/G41988176-rs368905227T: 0.00040

d{-/AG}41988188-rs3034718CT: 0.27660.4629

d{A/G}41988193-rs1077020C: 0.25040.3865

Homo sapiens denisova-specific mutations

1hsa-mir-1321h G/T X85090839+--

2hsa-mir-1909hG/C191816168---

3hsa-mir-3143hT/C627115417+--

4hsa-mir-3152 (5p)hA/-918573332+--

5hsa-mir-3185hG/A1746801808---

6hsa-mir-4478h T/A 9124882437---

7hsa-mir-4700 (3p)hC/T12121161049+--

8hsa-mir-4710hT/C14105144064---

9hsa-mir-5687h C/T 554804702-rs545080149A:0.00020.0008

10hsa-mir-609h G/A 10105978625---

11{hsa-mir-671 (3p)}h{C/T}7150935593+--

* bold - expression in the central neural system (miRGator 3.0 data [13] and data from [14]), underline - common between H. s. d. and H. s. n.

** bold - location in mature miRNA, {} common between H. s. d. and H. s. n., italic - location in canonical seed (nucleotides 2-7) region, based on [40]

Table 2

Homo sapiens neanderthalensis miRNAs that differ from Homo sapiens sapiens miRNAs.

Genome position (hg 19)

#miRNA*Ancestral state (h: human, n: Neanderthal; c: chimpanzee)Mutation (human/ Neanderthal)**ChromosomePositionStrandDB SNP IDAllele frequency(1000 genomes [28])African population frequency(1000 genomes [28]or dbSNP 141 [29])
Homo sapiens neanderthalensis miRNA mutations common to previously observed human miRNA polymorphisms

1{hsa-mir-1178}n {A/G} 12120151493-rs7311975C:0.13820.4191

2{hsa-mir-1252}n{A/G}1279813049+rs115256251G:0.00320.0098

3{hsa-mir-1269a}n{G/A}467142620+rs73239138A:0.39420.7277

4{hsa-mir-146a}n {C/G} 5159912418+rs2910164C:0.38150.475(YRI)

5{hsa-mir-2682 (3p)}n {G/A} 198510847-rs74904371T:0.02080.0053

6{hsa-mir-3161}n {-/A} 1148118347+rs11382316A:0.7599+(BUSHMAN)

7{hsa-mir-4804 (5p)}n {C/G} 572174432+rs266435G:0.80010.7269

8{hsa-mir-608}n{C/G}10102734778+rs4919510G:0.36380.4402

9hsa-mir-1343nT/C1134963416+rs2986407C:0.76680.792

10hsa-mir-3129 (5p)hG/C2189997816-rs192364638C:11

11hsa-mir-4274n-/CAC47461827+rs35245133CAC:0.858+(BUSHMAN)

12hsa-mir-4293h C/G 1014425221-rs12220909G:0.96190.9992

13hsa-mir-5189nG/C1688535400+rs80296158C:0.05070.1831

14hsa-mir-149(3p, 5p)nA/G2241395500+rs71428439G:0.14400.1112

15nT/C2241395503+rs2292832C:0.38660.2716

16hsa-mir-1908nA/G1161582708-rs174561C:0.27960.0197

hG/A1161582709----

17hsa-mir-3938nTT/-355886573-rs10575780-:0.24260.4667

18hsa-mir-4463n-/AG676138146+rs398110299AG:0.70790.9198

19hsa-mir-4719nT/C1676902847+rs7500280C:0.32930,2171

nG/A1676902850+rs7499278A:0.75280.9455

20{hsa-mir-532}{(3p, 5p)}n{A/G}X49767832+rs456615G:10.7587

21n{A/G}X49767835+rs456617G:10.7587

22{hsa-mir-943}n{-/GA}41988188-rs3034718CT:0.27660.4629

n{A/G}41988193-rs1077020C:0.25040.3865

Homo sapiens neanderthalensis-specific mutations

1hsa-mir-1208h T/C 8129162377+--

2hsa-mir-4532h C/T 2056470458+--

3hsa-mir-4718hC/T1612814185+--

4hsa-mir-615hC/T1254427811+--

5hsa-mir-639h T/C 1914640435+rs561305115C:0.00640.004

hG/C1914640399+rs372602559C:0.00620.004

6{hsa-mir-671 (3p)}h{C/T}7150935593+--

7hsa-mir-6715b (3p)hG/T10114059393-rs182337914A:0.00120

8hsa-mir-4749 (3p)h C/(C|T) 1950357891+rs372882504-1(C)

9hsa-mir-3939h(A)/c(G)A/(G|A)6167411298-rs80032204-0.915(A)

h(A)/c(G)A/(G|A)6167411300-rs77840042-0.912(A)

h(C)/c(T)C/(C|T)6167411301-rs75692943-0.912(C)

h G/(G|A) 6167411334-rs75823810-0.954(G)

h C/(C|T) 6167411337-rs73024232-0.957(C)

hC/(C|T)6167411362-rs77072520-0.972(C)

* bold - expression in the central neural system (MiRGator 3.0 data [13] and data from [14])

** bold - location in mature miRNA, {} - common between H. s. d. and for H. s. n., italic - location in canonical seed (nucleotides 2-7) region, based on [40]

Homo sapiens denisova miRNAs that differ from Homo sapiens sapiens miRNAs. * bold - expression in the central neural system (miRGator 3.0 data [13] and data from [14]), underline - common between H. s. d. and H. s. n. ** bold - location in mature miRNA, {} common between H. s. d. and H. s. n., italic - location in canonical seed (nucleotides 2-7) region, based on [40] Homo sapiens neanderthalensis miRNAs that differ from Homo sapiens sapiens miRNAs. * bold - expression in the central neural system (MiRGator 3.0 data [13] and data from [14]) ** bold - location in mature miRNA, {} - common between H. s. d. and for H. s. n., italic - location in canonical seed (nucleotides 2-7) region, based on [40] Next, we searched for the brain regions and CNS development stage showing the most abundant expression of mRNAs targeted by the miRNAs that diverged in archaic hominines (see Methods section). For this purpose, we used data in (1) the BrainSpan Exon microarray (BrainSpan) [19] and (2) the Allen Human Brain Atlas as updated March 7, 2013 (AHBA) [18]. The results of the analysis of BrainSpan data [19] demonstrated that mRNAs targeted by miRNAs bearing fixed mutations specific for H. s. d. are enriched in the postnatal stage of brain development, whereas mRNAs targeted by miRNAs bearing fixed mutations specific for H. s. n. are most abundant in the prenatal brain (Figure 1). In contrast, no development stage-dependent expression was found for mRNAs targeted by miRNAs matched to polymorphic variants in H. s. s. (Figure 2). Thus, we can anticipate the differential effect of genetic variations found in archaic hominines on brain function and development in Denisovans and Neanderthals. The effects are not likely to be due to bias in the number of mRNA targets. The number of targeted genes is similar between miRNAs bearing fixed mutations specific for H. s. n. and H. s. d. (Ensembl gene IDs): 1260 target genes for H. s. n. and 1547 for H. s. d. (excluding targets of hsa-mir-671-3p miRNA that possess specific variants common for H. s. d. and H. s. n.). In addition, the number of targeted genes is similar between H. s. n. and H. s. d. miRNAs matched to polymorphic variants in H. s. s. (Ensembl gene IDs): 4072 target genes for H. s. n. and 3653 for H. s. d. It is interesting that the highest expression of transcripts targeted by miRNAs with polymorphisms unique for H. s. d. is confined to the broad thalamus regions and the prefrontal cortex (Figure 3). Nevertheless, there are multiple forebrain regions known to be important for speech that exhibit higher expression of miRNA targets from the set with polymorphisms unique for H. s. d. (Figure 3): the inferiolateral temporal cortex, medial and ventrolateral prefrontal cortex. It is of interest that the thalamus plays a significant role in the shaping of the human language-ready brain [21].
Figure 1

Target stages of human brain development for . Y-axis, comparisons of Cohen's d values of the regulatory potential for selected H. s. d. (positive values) and H. s. n. (negative values) miRNA pools; X- axis, age: pcw - post conception weeks, m - postnatal months, y - postnatal years.

Figure 2

Target stages of human brain development for . Y-axis, comparisons of Cohen's d values of the regulatory potential for selected H. s. d. (positive values) and H. s. n. (negative values) miRNA pools; X-axis, age: pcw - post conception weeks, m - postnatal months, y - postnatal years.

Figure 3

Target structures of human brain for tested and control . Y-axis, comparisons of Cohen's d values of the regulatory potential for selected H. s. d. (positive values) and H. s. n. (negative values) miRNA pools; X- axis, large brain structures.

Target stages of human brain development for . Y-axis, comparisons of Cohen's d values of the regulatory potential for selected H. s. d. (positive values) and H. s. n. (negative values) miRNA pools; X- axis, age: pcw - post conception weeks, m - postnatal months, y - postnatal years. Target stages of human brain development for . Y-axis, comparisons of Cohen's d values of the regulatory potential for selected H. s. d. (positive values) and H. s. n. (negative values) miRNA pools; X-axis, age: pcw - post conception weeks, m - postnatal months, y - postnatal years. Target structures of human brain for tested and control . Y-axis, comparisons of Cohen's d values of the regulatory potential for selected H. s. d. (positive values) and H. s. n. (negative values) miRNA pools; X- axis, large brain structures. In contrast, when we consider miRNAs of ancient humans expressed in the central nervous system and having variants shared with known human polymorphisms, the picture changes drastically (Figure 4): all brain structures are targeted more by polymorphic H. s. n. miRNAs than by polymorphic H. s. d. miRNAs. The larger signal in H. s. n., shown in Figures 2 and 4, is likely due to a higher number of miRNAs expressed in the central nervous system and sharing variation with the human population for H. s. n. (18, see Table 2) than for H. s. d. (14, see Table 1). Considering common miRNAs sharing human polymorphisms in these organisms (10 miRNAs), the specific numbers are 4 and 8, respectively (351 H. s. d.-specific targets versus 972 H. s. n.-specific targets (Ensembl gene IDs)). Given that the largest number of gene targets was found for the polymorphic hsa-mir-149 (5p) miRNA (Table 3) in H. s. n., this difference between the two hominine subspecies can be attributed to this particular miRNA.
Figure 4

Target structures of human brain for tested and control . Y-axis, comparisons of Cohen's d values of the regulatory potential for selected H. s. d. (positive values) and H. s. n. (negative values) miRNA pools; X- axis, large brain structures.

Table 3

Number of target genes for each miRNA shown in Tables 1 and 2, based on MiRGator 3.0 data [13].

miRNANumber of target genes (Ensembl gene IDs)*
hsa-mir-1178701

hsa-mir-1208878

hsa-mir-12521228

hsa-mir-1269a639

hsa-mir-1269b43

hsa-mir-13211159

hsa-mir-1343128

hsa-mir-146a140

hsa-mir-149 (3p)181

hsa-mir-149 (5p)495

hsa-mir-190825

hsa-mir-190933

hsa-mir-2682 (3p)15

hsa-mir-3124 (3p)151

hsa-mir-3129 (5p)101

hsa-mir-3143121

hsa-mir-3152 (5p)33

hsa-mir-316162

hsa-mir-3185102

hsa-mir-378e49

hsa-mir-393865

hsa-mir-39391

hsa-mir-427442

hsa-mir-42930

hsa-mir-446340

hsa-mir-447876

hsa-mir-4514134

hsa-mir-453210

hsa-mir-4700 (3p)76

hsa-mir-471052

hsa-mir-471842

hsa-mir-471973

hsa-mir-4749 (3p)32

hsa-mir-4804 (5p)1

hsa-mir-51890

hsa-mir-532 (3p)598

hsa-mir-532 (5p)454

hsa-mir-56870

hsa-mir-6081039

hsa-mir-6085-**

hsa-mir-609538

hsa-mir-615200

hsa-mir-639625

hsa-mir-662256

hsa-mir-671 (3p)168

hsa-mir-6715b (3p)-**

hsa-mir-943512

* genes with Pearson correlation coefficient of co-expression ≤-0.9.

** these miRNAs were not annotated in mirGator 3.0

Target structures of human brain for tested and control . Y-axis, comparisons of Cohen's d values of the regulatory potential for selected H. s. d. (positive values) and H. s. n. (negative values) miRNA pools; X- axis, large brain structures. Number of target genes for each miRNA shown in Tables 1 and 2, based on MiRGator 3.0 data [13]. * genes with Pearson correlation coefficient of co-expression ≤-0.9. ** these miRNAs were not annotated in mirGator 3.0 The fine target structures identified using AHBA data for gene expression in the left hemisphere of the brain [18] are presented in Tables 4 and 5. For identification of the fine target brain structures, we used the difference in regulatory potential between the H. s. d. and H. s. n. samples (Table 4, 5) as a marker (see Methods section). Importantly (Table 4), the difference between the target structures of unique H. s. d. and H. s. n. miRNAs expressed in the central nervous system tissues is not significant (effect size measured as the absolute value of difference between Cohen's d statistics<0.5) for all fine brain structures except two: the fusiform gyrus for H. s. d. miRNAs and the precentral gyrus for H. s. n. miRNAs. The picture obtained based on the target mRNAs for miRNAs expressed in the central nervous system tissues with known human polymorphisms (Table 4) is the same (compare Table 4 and 5): for the vast majority of fine brain structures, the difference between targeting by H. s. d. and H. s. n. is not significant, except for two thalamus structures and the lateral orbital gyrus, targeted by H. s. d. miRNAs. It is important that the fusiform and lateral orbital gyri are responsible for face perception and socialization, respectively [22,23]. Taken together, these findings indicate an important role of unique H. s. d. miRNAs in the development of specific brain structures in Denisovans, which likely enabled them to reach an unparalleled level of craftsmanship [1].
Table 4

Results of the identification of brain structures of the left hemisphere characterized by the difference in regulatory potential for H.s. d. and H. s. n. miRNAs expressed in neural tissues and having unique substitutions.

AHBAStruct. ID*Structure description. ID **p,H. s. d. p,H. s. n.Cohen's d,H. s. d.Cohen's d,H. s. n.(Cohen's d, H. s. d.)-(Cohen's d, H. s. n.)
4158fusiform g., b. of the its0.002260.006625.948825.265650.68317

4273short insular gyri0.00410.00625.652025.402790.24923

4045inferior frontal g., orbital pt.0.006760.008845.375875.165410.21046

4023superior frontal g., medial b.0.005520.006265.466495.309470.15702

4090postcentral g., b. of the posterior central sulcus0.007640.009885.212725.074020.1387

4120precuneus, superior lateral b.0.007020.007885.301755.169090.13266

4060lateral orbital g.0.008040.009725.199635.072620.12701

4143middle temporal g., inferior b.0.008160.009365.15095.040740.11016

4088postcentral g., superior lateral aspect0.008560.009785.151445.049570.10187

4039inferior frontal g., triangular pt.0.007860.009065.206875.111070.0958

4215superior occipital g., inferior b.0.00760.007965.238155.169560.06859

4251subiculum0.006440.00665.354155.289770.06438

4074paracentral lobule, anterior pt., inferior b.0.009120.00915.153645.097670.05597

4223cingulate g., frontal pt., superior b.0.004040.00425.626055.586320.03973

4288putamen0.007280.008345.234855.197660.03719

4160fusiform g., b. of cos0.007120.006925.307525.274440.03308

4245parahippocampal g., b. of cos0.007620.006925.243755.220060.02369

4014precentral g., inferior lateral aspect0.00850.009365.112665.094520.01814

4136superior temporal g., inferior b.0.007640.007285.202655.23345-0.0308

4051medial orbital g.0.00670.005545.355835.43806-0.08223

4224cingulate g., frontal pt., inferior b.0.006640.006085.269355.35976-0.09041

4087postcentral g., b. of the central sulcus0.00960.007565.125135.23082-0.10569

4098supraparietal lobule, superior b.0.007420.005725.235455.3476-0.11215

4151inferior temporal g., b. of mts0.008880.007325.113075.22869-0.11562

4030middle frontal g., superior b.0.009840.007665.081515.20719-0.12568

4201occipito-temporal g., inferior b.0.009480.007825.057885.20134-0.14346

4031middle frontal g., inferior b.0.005480.004265.399415.54788-0.14847

4024superior frontal g., lateral b.0.003080.003025.728275.89251-0.16424

4142middle temporal g., superior b.0.009480.007365.081045.25797-0.17693

4150inferior temporal g., lateral b.0.007040.00545.300945.50188-0.20094

4149inferior temporal g., b. of the its.0.00950.007125.096725.32253-0.22581

4159fusiform g., lateral b.0.009440.007425.030245.25885-0.22861

4214superior occipital g., superior b.0.006120.00475.311985.55676-0.24478

4258dentate g.0.00730.005065.293475.56439-0.27092

4193lingual g., peristriate0.009760.006165.089975.4184-0.32843

4013precentral g., superior lateral aspect0.009680.005965.050465.38028-0.32982

4015precentral g., b. of the central sulcus0.008980.003425.108955.70299-0.59404

* - For each target brain structure, the following data are provided: (1) the AHBA ID and (2) its description; (3) and (4) p, the probability of observing by random chance using unique H. s. d. and H. s. n. miRNAs expressed in central nervous system tissues; (5) and (6) the Cohen's d statistic of difference between Land value distributions for unique H. s. d. and H. s. n. miRNAs expressed in the central nervous system tissues; (7) difference between these Cohen's d statistics.

** - g., gyrus; b., bank; pt., part.

Table 5

Results of the identification of brain structures of the left hemisphere characterized by the difference between regulatory potential for H.s. d. and H. s. n. miRNAs expressed in neural tissues and having substitutions shared with known human polymorphisms.

AHBAStruct. ID*Structure description. ID **p,H. s. d. p,H. s. n.Cohen's d,H. s. d.Cohen's d,H. s. n.(Cohen's d, H. s. d.)-(Cohen's d, H. s. n.)
4060lateral orbital g.0.001940.008165.929435.186020.74341

4288putamen0.00180.006346.039815.296930.74288

4258dentate g.0.001560.00456.180385.556270.62411

4901inferior rostral g.0.00450.00845.593375.128140.46523

4193lingual g., peristriate0.002680.006185.785555.323470.46208

4187cuneus, striate0.004740.009565.505915.043950.46196

4151inferior temporal g., b. of mts0.00420.006325.604325.298360.30596

4194lingual g., striate0.007140.008785.285975.120210.16576

4160fusiform g., b. of cos0.006220.007945.322995.179160.14383

4417lateral group of nuclei, ventral division0.007080.005145.275285.39697-0.12169

4256CA3 field0.009580.007985.066215.22156-0.15535

4169transverse gyri0.009360.006865.127855.29476-0.16691

4282body of caudate nucleus0.006720.005345.251945.42113-0.16919

4012precentral g., b. of the precentral sulcus0.00640.004465.295175.51095-0.21578

4244parahippocampal g., lateral b.0.006880.004545.312645.53652-0.22388

4186cuneus, peristriate0.003860.002745.643655.88129-0.23764

4150inferior temporal g., lateral b.0.007760.005225.215015.45297-0.23796

4106supramarginal g., superior b.0.005180.003425.464355.70277-0.23842

4088postcentral g., superior lateral aspect0.00240.00155.936686.17562-0.23894

4121precuneus, inferior lateral b.0.0040.00235.656825.89645-0.23963

4166Heschl's g.0.008580.005165.198575.44275-0.24418

4013precentral g., superior lateral aspect0.004520.002985.555365.80945-0.25409

4015precentral g., b. of the central sulcus0.008140.005645.168055.42259-0.25454

4120precuneus, superior lateral b.0.002460.001825.899556.15638-0.25683

4245parahippocampal g., b. of the cos0.006720.004265.319895.58364-0.26375

4223cingulate g., frontal pt., superior b.0.00520.003525.419845.68604-0.2662

4048g. rectus0.009980.006465.018615.28996-0.27135

4074paracentral lobule, anterior pt., inferior b.0.00420.002425.617425.89182-0.2744

4113angular g., superior b.0.009120.005225.104175.38524-0.28107

4280head of caudate nucleus0.004820.003025.497175.78461-0.28744

4114angular g., inferior b.0.006860.003945.334235.6244-0.29017

4087postcentral g., b. of the central sulcus0.004140.002225.645235.93594-0.29071

4030middle frontal g., superior b.0.00320.001665.796576.08899-0.29242

4270long insular gyri0.00970.006185.070455.36942-0.29897

4014precentral g., inferior lateral aspect0.003180.001745.802036.10501-0.30298

4200occipito-temporal g., superior b.0.005420.003145.441275.74428-0.30301

4255CA2 field0.003380.00165.781786.08596-0.30418

4273short insular gyri0.005020.00295.479245.79333-0.31409

4099supraparietal lobule, inferior b.0.004820.002345.567155.88285-0.3157

4045inferior frontal g., orbital pt.0.002720.001385.814456.1365-0.32205

4039Inferior frontal g., triangular pt.0.008080.005285.163195.49488-0.33169

4230cingulate g., parietal pt., superior b.0.009180.005465.133385.46665-0.33327

4159fusiform g., lateral b.0.003360.002225.696326.03013-0.33381

4023superior frontal g., medial b.0.001460.00076.199566.53647-0.33691

4136superior temporal g., inferior b.0.004020.00185.668856.0067-0.33785

4031middle frontal g., inferior b.0.001060.000586.299666.63933-0.33967

4208inferior occipital g., inferior b.0.00170.000846.103496.44887-0.34538

4251subiculum0.004860.002485.512245.8579-0.34566

4073paracentral lobule, anterior pt., superior b.0.005220.00315.460175.81365-0.35348

4079frontal operculum0.005020.00285.490855.84766-0.35681

4149inferior temporal g., b. of the its0.003080.001765.822216.1795-0.35729

4257CA4 field0.006120.003225.375755.73712-0.36137

4215superior occipital g., inferior b.0.008760.004385.164365.5294-0.36504

4063subcallosal g.0.007620.004245.243115.61246-0.36935

4214superior occipital g., superior b.0.00350.002165.645226.01907-0.37385

4143middle temporal g., inferior b.0.004440.002285.575225.95362-0.3784

4207inferior occipital g., superior b.0.006320.003645.269285.65709-0.38781

4051medial orbital g.0.002140.000965.99036.39314-0.40284

4898superior rostral g.0.006120.002725.391095.80014-0.40905

4098supraparietal lobule, superior b.0.004420.001985.635256.04788-0.41263

4089postcentral g., inferior lateral aspect0.007620.003985.252685.67386-0.42118

4107supramarginal g., inferior b.0.006680.003425.343535.7746-0.43107

4178planum polare0.00650.002325.339035.77072-0.43169

4224cingulate g., frontal pt., inferior b.0.005540.002545.424895.86867-0.44378

4090postcentral g., b. of the posterior central sulcus0.004860.002065.538295.99888-0.46059

4024superior frontal g., lateral b.0.000740.000226.595817.06305-0.46724

4158fusiform g., b. of the its0.001420.000386.229826.69803-0.46821

4201occipito-temporal g., inferior b.0.002840.001245.769996.2488-0.47881

4142middle temporal g., superior b.0.006980.003365.266245.76073-0.49449

* - For each target brain structure, the following data are provided: (1) the AHBA ID and (2) its description; (3) and (4) p, the probability of observing by random chance using H. s. d. and H. s. n. miRNA with known human polymorphisms expressed in central nervous system tissues; (5) and (6) the Cohen's d statistic of the difference between the Land value distributions for H. s. d. and H. s. n. miRNAs with known human polymorphisms expressed in central nervous system tissues; (7) the difference between these Cohen's d statistics.

** - g., gyrus; b., bank; pt., part.

Results of the identification of brain structures of the left hemisphere characterized by the difference in regulatory potential for H.s. d. and H. s. n. miRNAs expressed in neural tissues and having unique substitutions. * - For each target brain structure, the following data are provided: (1) the AHBA ID and (2) its description; (3) and (4) p, the probability of observing by random chance using unique H. s. d. and H. s. n. miRNAs expressed in central nervous system tissues; (5) and (6) the Cohen's d statistic of difference between Land value distributions for unique H. s. d. and H. s. n. miRNAs expressed in the central nervous system tissues; (7) difference between these Cohen's d statistics. ** - g., gyrus; b., bank; pt., part. Results of the identification of brain structures of the left hemisphere characterized by the difference between regulatory potential for H.s. d. and H. s. n. miRNAs expressed in neural tissues and having substitutions shared with known human polymorphisms. * - For each target brain structure, the following data are provided: (1) the AHBA ID and (2) its description; (3) and (4) p, the probability of observing by random chance using H. s. d. and H. s. n. miRNA with known human polymorphisms expressed in central nervous system tissues; (5) and (6) the Cohen's d statistic of the difference between the Land value distributions for H. s. d. and H. s. n. miRNAs with known human polymorphisms expressed in central nervous system tissues; (7) the difference between these Cohen's d statistics. ** - g., gyrus; b., bank; pt., part. In addition to the identification of target brain structures for H. s. d. and H. s. n. miRNA activity, we found highly enriched functional categories of target gene annotations. We used the DAVID 6.7 functional annotation system [24] and the SP_PIR_KEYWORDS data class, with genes expressed in the human brain as background (16767 genes from BrainSpan). The results of the functional enrichment test are shown in Table 6. The most important difference between the target genes for unique H. s. d. and H. s. n. miRNAs expressed in the central nervous system tissues is the transport category enriched in target genes of H. s. d. miRNAs. Similar observations can be made considering miRNAs expressed in central nervous system tissues with substitutions shared with known human polymorphisms: "functional response" categories (such as activator, transcription regulation, synapse) enriched in target genes of H. s. d. and the developmental category enriched in target genes of such H. s. n. miRNAs. It would be reasonable to speculate that that the evolutionary changes in miRNA-transregulators might contribute to alterations in functional activities in specific brain regions in Denisovans, whereas in Neanderthals, the mutations in miRNAs could promote alterations in brain development and structure.
Table 6

Annotation enrichment of genes targeted by H. s. d. and H. s. n. miRNAs expressed in neural tissues, based on DAVID 6.7 [24]; BrainSpan gene list (16767 genes expressed in human brain) used as a background.

SP_PIR_KEYWORDS category *# of genesCorrected p-value

BonferroniBenjamini
Target genes of unique H. s. d. miRNAs

transport1760.040720.04072

Target genes of unique H. s. n. miRNAs

-

Target genes of H. s. d. miRNAs having substitutions shared with known human polymorphisms

phosphoprotein10181.56E-221.56E-22

alternative splicing10134.70E-192.35E-19

transcription2983.72E-061.24E-06

transcription regulation2923.86E-069.64E-07

metal-binding4032.56E-055.13E-06

chromosomal rearrangement637.14E-051.19E-05

zinc3028.79E-051.26E-05

activator982.27E-042.84E-05

cell junction760.001791.99E-04

zinc-finger2310.004554.56E-04

synapse450.011000.00100

DNA-binding2460.022870.00193

Target genes of H. s. n. miRNAs having substitutions shared with known human polymorphisms

phosphoprotein5653.62E-043.62E-04

alternative splicing5740.002260.00113

developmental protein840.006650.00222

cytoplasm2790.007150.00179

* bold - common between target genes of H. s. n. and H. s. d. miRNAs having substitutions shared with known human polymorphisms.

Annotation enrichment of genes targeted by H. s. d. and H. s. n. miRNAs expressed in neural tissues, based on DAVID 6.7 [24]; BrainSpan gene list (16767 genes expressed in human brain) used as a background. * bold - common between target genes of H. s. n. and H. s. d. miRNAs having substitutions shared with known human polymorphisms.

Conclusions

In this work, we identified miRNA genes of archaic humans bearing sequence variations in comparison with the modern human genome sequence: 29 genes for H. s. d. and 31 genes for H. s. n. Almost one-third of those genes contain variations specific for archaic humans (11 genes for H. s. d. and 9 for H. s. n.). The analysis of human gene expression data resulted in 9 H. s. d. and 5 H. s. n. genes with specific archaic variations and the expression of their human orthologs in the central nervous system. The detailed analysis of the human brain gene expression data demonstrated that the brain regions with the most abundant expression of mRNAs targeted by the H. s. d. miRNAs with fixed mutations are confined to the thalamus and the prefrontal cortex (especially the fusiform gyrus), no large brain regions are targeted by H. s. n. miRNAs with specific mutations. The only small brain regions targeted by H. s. n. miRNAs with such mutations are the superior lateral aspect and the central sulcus of the precentral gyrus. We identified differences in gene expression pattern during human brain development for the targets of human orthologs for the selected miRNAs. Targets for miRNA genes with mutations specific for H. s. d. were expressed predominantly in the later prenatal and early postnatal development stages. Targets for miRNAs with mutations specific for H. s. n. were expressed predominantly in early prenatal brain development stages. These results may reflect a potential association between the changes in the Denisovan miRNA genes reported in this study and the brain development of H. s. d., which likely allowed them to reach their high level of craftsmanship.

Methods

Selection of miRNA genes in H. s. d. and H. s. n. with no match in H. s. s.

We used data on the chromosomal localization of 1595 experimentally confirmed H. s. s. pre-miRNAs in the miRBase database rel. 19 [11] and selected the best-sequenced H. s. d. and H. s. n. pre-miRNAs orthologous to H. s. s. pre-miRNA genes. The H. s. d. and H. s. n. genomes appear as short nucleotide sequences (reads) mapped onto the human genome [3]. To ensure that only the best-sequenced H. s. d. and H. s. n. pre-miRNAs would remain, we used filtering methods. (I) The consensus sequences of H. s. d. and H. s. n. pre-miRNAs were combined from reads with a quality of mapping onto H. s. s. pre-miRNA genes not less than 15 [3]. (II) The consensus sequences of H. s. d. and H. s. n. pre-miRNAs included only those read nucleotides (a) for which the Phred sequence quality [25] was not less than 30 [2] and (b) that were located in the middle part of the reads: the first and last three read nucleotides were discarded according to [3]. (III) Any position in the consensus with coverage of less than 5 was assumed to be undetermined [3]. (IV) The consensus sequences of H. s. d. and H. s. n. pre-miRNAs with undetermined positions were discarded. (V) The consensus sequences of H. s. d. and H. s. n. pre-miRNAs that (a) had nucleotide substitutions that H. s. s. lacked and (b) were combined from reads mapped onto the human genome with a quality of less than 30 were discarded [2,3]. (VI) If there were two or more polymorphic states at a consensus position in H. s. d. and H. s. n. pre-miRNAs, only those alternative states that (a) had comparable frequencies (the difference between the frequencies would not exceed 1.3) and (b) passed filtering stages I-V were considered. (VII) Additionally, we manually analyzed all mapped reads of selected H. s. d. and H. s. n. pre-miRNAs for the presence of PCR artifacts. We did not find any PCR artifacts in the positions of selected H. s. d. and H. s. n. pre-miRNAs. To select pre-miRNA coding genes that were present in the genome of the common ancestor of H. s. s., H. s. d. and H. s. n. we analyzed the multiple genome alignment of six primates from Ensembl rel. 69 [12]. The minimum length of any ortholog (excluding unreadable and polymorphic nucleotides) was considered to be not less than 70% of the length of the corresponding H. s. s. pre-miRNA. We mapped all the nucleotide substitutions found by comparing the pre-miRNA orthologs in the H. s. s./H. s. d. and H. s. s./H. s. n. pairs onto the secondary structures of the corresponding pre-miRNAs in H. s. s. contained in the miRBase database rel. 19 [11]. We selected H. s. d. and H. s. n. pre-miRNA genes with (i) nucleotide substitutions (deletions, insertions) in the regions corresponding to the sequences of mature miRNAs (and/or miRNAs*) or (ii) multiple substitutions within a region not larger than 1/20 of the pre-miRNA length. In the first case (i), the altered mature miRNAs are of special interest because they change the pattern of complementary interactions between miRNAs and their target mRNAs in H. s. d. or H. s. n. and make it different from H. s. s. In the second case (ii), the probability of observing multiple changes within a small region of H. s. d. or H. s. n. pre-miRNA by random chance is extremely low. For instance, the estimate of the total number of differences between H. s. d. and H. s. s. is approximately 1,650,000 [3]. Under the assumption of a uniform distribution of mutation fixation events in a 6400 bp long sequence (1/20 of the length of 1600 pre-miRNA genes, each gene being approximately 80 bp in length), the estimate of the probability of two mutations being fixed simultaneously is less than 0.002. To ensure that the mutations in our selection of miRNAs were not DNA sequencing errors, we analyzed the frequencies of all polymorphic consensuses of H. s. d. and H. s. n. DNA (see steps V and VI of the filtering protocol), considering their tetranucleotide context. Thus, we used data in the UCSC Genome Browser phyloP46way to select evolutionarily conserved regions of the human genome (phyloP conservation score > 0.1; regions were ≥7 bp in length, which eventually totaled 454775413 positions or ~15% of the human genome). The occurrence of the polymorphic variants of the consensuses in the H. s. d. and H. s. n. genomes was then assessed considering their tetranucleotide context. We selected pre-miRNAs with mutations that either do not occur in the 454,775,413-strong selection of evolutionary conservative positions analyzed (p ≤ 2·10-9) or are single occurrences (p = 2·10-9). To ensure that the mutations in the set of selected miRNAs were not a result of PCR amplification in DNA sequencing (see step VII of the filtering protocol), we performed a manual analysis of H. s. d. and H. s. n. short reads alignment on a reference H. s. s. genome (hg19). We found no traces of the PCR amplification of short reads in miRNA genome positions: any single read in these positions started and stopped at unique genome locations. Using the blastn program in BLAST 2.2.26+ [26] on the pre-miRNAs containing substitutions/polymorphisms in mature miRNAs at the last filtering stage, we discarded pre-miRNAs that occurred in more than one copy in the H. s. s. genome (E-value cut-off: 1·10-6; blastn default settings).

Identification of genes targeted by H. s. s. miRNAs orthologous to H. s. d. and H. s. n. miRNAs

For each orthologous H. s. s. miRNA, we identified the genes encoding its target mRNAs. This identification was performed using miRGator 3.0 [13] (we used only those target mRNAs whose expression correlated with microRNA expression with r<-0.9) and the ChIP-seq data for H3K4me3 histones marking active promoters in primate cortical neurons [14]. We used the ID converter from bioDBnet [27] to convert the HUGO name list of target mRNA IDs (from miRGator 3.0 prediction) into a list of Ensembl gene IDs and a list of Refseq IDs. For functional annotation of the mRNAs targeted by the selected miRNAs, we chose two independent sources of experimental data that contained the most complete quantitative information on mRNA expression in human central nervous system tissues. One of these sources was BrainSpan [19] (Exon microarray, 16767 genes), which integrates normalized quantitative data on gene expression provided by expression time series experiments in broad brain regions. The other source of data was the Allen Human Brain Atlas, as updated March 2013 [18] (20791 genes), which contains normalized quantitative data on the expression of mRNAs in tiny brain regions. The expression of these mRNAs was analyzed in six brains: H0351.1009, H0351.1012, H0351.1015, H0351.1016, H0351.2001 and H0351.2002. If the microarray contained more than one probe for measuring the expression of the same mRNA, then only the probe producing the maximum value that significantly differed from background values was taken for analysis. In addition, to improve the robustness of our results, we used only data that were consistent for at least 4 out of 6 brains.

Identification of tissues with increased levels of mRNA expression regulated by the pool of miRNAs

In our work, we are interested in the tissues and brain structures most likely affected by the pool of miRNAs with structural changes between ancient and modern humans. Unfortunately, the data on miRNA expression in the vast majority of human tissues and brain structures are currently incomplete both because the number of functional miRNAs is incomplete [30] and because of the number of functional miRNA/target-mRNA interactions under ongoing debates regarding the usage of various factors in miRNA/target-mRNA binding [31,32]. Therefore, we used an indirect approach to identify tissues subject to dynamical changes in miRNA expression. Because miRNAs are negative post-transcriptional regulators of gene expression, we suppose that changes in their expression levels will largely affect the tissues with high abundance of their mRNA targets. Based on this assumption, likely candidate tissues are the ones with high expression of the target mRNAs for the set of miRNAs under consideration. To express quantitatively the degree of post-transcriptional regulation by specific miRNAs in different tissues, we introduced the miRNAs' regulatory potential. Let W(k) denote the set of human mRNAs expressed in the k-th tissue and targeted by all known human miRNAs. It should be noted that the overwhelming majority of genes in the human genome (~20000) [33] can be targeted by various miRNAs [13,30,34]. Therefore, the set of all genes in the genome is a good approximation of the miRNA targetome. Thus, we approximated W(k) as the set of all human mRNAs expressed in the k-th tissue. W(k) denotes the subset of target mRNAs regulated in the k-th tissue by the miRNA pool, Q, and J(k) denotes the size of this subset. The regulatory potential, , exerted by the miRNA pool, Q, on mRNAs with expression in the k-th tissue was estimated using the following formula: Here, is the level of expression of the j-th target mRNA in the subset W(k) in the k-th tissue, and values were obtained using data from BrainSpan [18] or AHBA[19]. The value obtained in this manner corresponds to the number of mRNA molecules that can be regulated in the k-th tissue (or brain structure) by mature miRNAs in the pool Q and characterizes its post-transcriptional regulatory potential in that tissue. In search of the tissues targeted by the miRNAs from the pool Q, we selected the ones with the highest regulatory potential in comparison with the background level of expression of all miRNA targets expressed in the k-th tissue, W(k). To estimate the significance of the difference between the pool's Q regulatory potential and the background, we used a resampling test (see Figure 5).
Figure 5

A toy example of the resampling test to identify tissues with increased levels of mRNAs expression regulated by the pool . A). Calculation of the regulatory potential for the miRNA pool Q, . B). Calculation of the regulatory potential for the resampled miRNA pool Q in W(k) subset. C). Calculation of the regulatory potential Lin the W(k) subset. D). Distribution of (blue) and L(brown) values in 105 resampling experiments and their comparison by Cohen's d statistics.

A toy example of the resampling test to identify tissues with increased levels of mRNAs expression regulated by the pool . A). Calculation of the regulatory potential for the miRNA pool Q, . B). Calculation of the regulatory potential for the resampled miRNA pool Q in W(k) subset. C). Calculation of the regulatory potential Lin the W(k) subset. D). Distribution of (blue) and L(brown) values in 105 resampling experiments and their comparison by Cohen's d statistics. First, we performed random sampling without replacement of J(k)/2 mRNAs from the set W(k) of human mRNAs targeted by miRNAs in the pool Q (the W(k) subset). For the W(k) subset, we calculated using Formula (1). The sampling procedure was repeated 105 times. We estimated the mean and standard deviation of the values. The selection of half the number of mRNAs in the W(k) subset allowed us to account for uncertainties in miRNA annotation (for example, the possibility that the miRNA dataset under consideration is incomplete and has incomplete annotation of target mRNAs). Second, we performed sampling of mRNAs from the background distribution, using the W(k) set of all miRNA targets expressed in the k-th tissue. For this purpose, we selected J(k)/2 mRNAs from the W(k) randomly without replacement. We termed this subset W(k). We calculated as the estimate of the background miRNA regulatory potential; here, is the expression level of the j-th target mRNA in the subset W(k). This procedure was repeated 105 times and allowed us to estimate the mean and standard deviation of L. We calculated the difference between the Land value distributions using Cohen's d statistic (effect size value) [35,36]. The p-value of was calculated using Student's t-statistic. The difference between the Land value distributions was considered significant at p-value less or equal to 0.01, implying that the k-th tissue was targeted by the miRNAs from the pool Q. It is important to note that the data resampling approach described above does not rely on distributional assumptions and simultaneously allowed us to control Type I error, and corrects for any hidden data correlation [37-39]. Therefore, we did not need to perform multiple testing corrections (e.g., Bonferroni correction) of the significance level [36-39] and therefore could use the selected p-value threshold directly.

Competing interests

The authors have declared that no competing interests exist.

Authors' contributions

K.V.G., A.D.A. and E.I.R. conceived the project. K.V.G. performed all data analysis. N.A.K., E.I.R. and A.P.D. coordinated the project. All authors contributed to the final manuscript preparation, discussed the results and their implications, and have read and approved the final manuscript.
  35 in total

1.  A note on the calculation of empirical P values from Monte Carlo procedures.

Authors:  B V North; D Curtis; P C Sham
Journal:  Am J Hum Genet       Date:  2002-08       Impact factor: 11.025

2.  bioDBnet: the biological database network.

Authors:  Uma Mudunuri; Anney Che; Ming Yi; Robert M Stephens
Journal:  Bioinformatics       Date:  2009-01-07       Impact factor: 6.937

3.  Using Effect Size-or Why the P Value Is Not Enough.

Authors:  Gail M Sullivan; Richard Feinn
Journal:  J Grad Med Educ       Date:  2012-09

4.  Functional dissociation of the left and right fusiform gyrus in self-face recognition.

Authors:  Yina Ma; Shihui Han
Journal:  Hum Brain Mapp       Date:  2011-07-14       Impact factor: 5.038

5.  Denisova admixture and the first modern human dispersals into Southeast Asia and Oceania.

Authors:  David Reich; Nick Patterson; Martin Kircher; Frederick Delfin; Madhusudan R Nandineni; Irina Pugach; Albert Min-Shan Ko; Ying-Chin Ko; Timothy A Jinam; Maude E Phipps; Naruya Saitou; Andreas Wollstein; Manfred Kayser; Svante Pääbo; Mark Stoneking
Journal:  Am J Hum Genet       Date:  2011-09-22       Impact factor: 11.025

6.  In search of the functional neuroanatomy of sociality: MRI subdivisions of orbital frontal cortex and social cognition.

Authors:  Paul G Nestor; Motoaki Nakamura; Margaret Niznikiewicz; Elizabeth Thompson; James J Levitt; Victoria Choate; Martha E Shenton; Robert W McCarley
Journal:  Soc Cogn Affect Neurosci       Date:  2012-02-15       Impact factor: 3.436

7.  Mammalian microRNAs predominantly act to decrease target mRNA levels.

Authors:  Huili Guo; Nicholas T Ingolia; Jonathan S Weissman; David P Bartel
Journal:  Nature       Date:  2010-08-12       Impact factor: 49.962

8.  Predicting effective microRNA target sites in mammalian mRNAs.

Authors:  Vikram Agarwal; George W Bell; Jin-Wu Nam; David P Bartel
Journal:  Elife       Date:  2015-08-12       Impact factor: 8.140

9.  The complete genome sequence of a Neanderthal from the Altai Mountains.

Authors:  Kay Prüfer; Fernando Racimo; Nick Patterson; Flora Jay; Sriram Sankararaman; Susanna Sawyer; Anja Heinze; Gabriel Renaud; Peter H Sudmant; Cesare de Filippo; Heng Li; Swapan Mallick; Michael Dannemann; Qiaomei Fu; Martin Kircher; Martin Kuhlwilm; Michael Lachmann; Matthias Meyer; Matthias Ongyerth; Michael Siebauer; Christoph Theunert; Arti Tandon; Priya Moorjani; Joseph Pickrell; James C Mullikin; Samuel H Vohr; Richard E Green; Ines Hellmann; Philip L F Johnson; Hélène Blanche; Howard Cann; Jacob O Kitzman; Jay Shendure; Evan E Eichler; Ed S Lein; Trygve E Bakken; Liubov V Golovanova; Vladimir B Doronichev; Michael V Shunkov; Anatoli P Derevianko; Bence Viola; Montgomery Slatkin; David Reich; Janet Kelso; Svante Pääbo
Journal:  Nature       Date:  2013-12-18       Impact factor: 49.962

10.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

View more
  2 in total

1.  Impact and Evolutionary Determinants of Neanderthal Introgression on Transcriptional and Post-Transcriptional Regulation.

Authors:  Martin Silvert; Lluis Quintana-Murci; Maxime Rotival
Journal:  Am J Hum Genet       Date:  2019-05-30       Impact factor: 11.025

2.  The papers presented at 7th Young Scientists School "Systems Biology and Bioinformatics" (SBB'15): Introductory Note. Introduction.

Authors:  Ancha V Baranova; Yuriy L Orlov
Journal:  BMC Genet       Date:  2016-01-27       Impact factor: 2.797

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.