| Literature DB >> 24667286 |
Maurizio Marrale1, Nadia Ninfa Albanese1, Francesco Calì2, Valentino Romano3.
Abstract
Autism Spectrum Disorders (ASDs) are childhood neurodevelopmental disorders with complex genetic origins. Previous studies have investigated the role of de novo Copy Number Variants (CNVs) and microRNAs as important but distinct etiological factors in ASD. We developed a novel computational procedure to assess the potential pathogenic role of microRNA genes overlapping de novo CNVs in ASD patients. Here we show that for chromosomes # 1, 2 and 22 the actual number of miRNA loci affected by de novo CNVs in patients was found significantly higher than that estimated by Monte Carlo simulation of random CNV events. Out of 24 miRNA genes over-represented in CNVs from these three chromosomes only hsa-mir-4436b-1 and hsa-mir-4436b-2 have not been detected in CNVs from non-autistic subjects as reported in the Database of Genomic Variants. Altogether the results reported in this study represent a first step towards a full understanding of how a dysregulated expression of the 24 miRNAs genes affect neurodevelopment in autism. We also propose that the procedure used in this study can be effectively applied to CNVs/miRNA genes association data in other genomic disorders beyond autism.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24667286 PMCID: PMC3965395 DOI: 10.1371/journal.pone.0090947
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Overview of this study and source data.
In this study we have used previously published data from 192 autistic patients (the “APL datasets” of Table S1) bearing overall 178 de novo CNVs (118 CNV_Losses and 60 CNV_Gains) with unique start and end positions.
Fractional lengths of miRNA genes and CNVs in relation to chromosome's size.
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
| 1 | 249250621 | 127 | 4.05×10−5 | 0.0241 | 5 | 0.0050 | 5 |
| 2 | 243199373 | 98 | 3.10×10−5 | 0.0009 | 0 | 0.0314 | 17 |
| 3 | 198022430 | 76 | 3.21×10−5 | 0.0041 | 1 | 0.0923 | 12 |
| 4 | 191154276 | 56 | 2.31×10−5 |
|
| 0.0886 | 4 |
| 5 | 180915260 | 67 | 3.03×10−5 |
|
| 0.0763 | 7 |
| 6 | 171115067 | 54 | 2.56×10−5 | 0.0019 | 0 | 0.0187 | 2 |
| 7 | 159138663 | 67 | 3.41×10−5 | 0.0103 | 4 | 0.1425 | 7 |
| 8 | 146364022 | 70 | 3.54×10−5 | 0.0068 | 0 | 0.0088 | 1 |
| 9 | 141213431 | 71 | 3.95×10−5 | 0.0371 | 9 | 0.0297 | 1 |
| 10 | 135534747 | 61 | 3.66×10−5 | 0.0822 | 4 | 0.0005 | 0 |
| 11 | 135006516 | 69 | 4.03×10−5 | 0.0002 | 0 | 0.0410 | 0 |
| 12 | 133851895 | 57 | 3.48×10−5 | 0.1691 | 13 | 0.1013 | 3 |
| 13 | 115169878 | 37 | 2.62×10−5 | 0.0009 | 0 | 0.0169 | 0 |
| 14 | 107349540 | 88 | 6.58×10−5 | 0.0006 | 0 |
|
|
| 15 | 102531392 | 57 | 4.68×10−5 | 0.1254 | 39 | 0.0742 | 7 |
| 16 | 90354753 | 48 | 4.40×10−5 | 0.0320 | 2 | 0.1032 | 9 |
| 17 | 81195210 | 83 | 7.89×10−5 | 0.0317 | 8 | 0.0256 | 1 |
| 18 | 78077248 | 30 | 2.74×10−5 | 0.0006 | 0 | 0.2618 | 2 |
| 19 | 59128983 | 108 | 1.47×10−4 |
|
| 0.0145 | 2 |
| 20 | 63025520 | 40 | 5.16×10−5 |
|
| 0.0301 | 2 |
| 21 | 48129895 | 15 | 2.65×10−5 | 0.0074 | 1 | 0.1183 | 3 |
| 22 | 51304566 | 36 | 5.59×10−5 | 0.0893 | 18 | 0.1473 | 24 |
| X | 155270560 | 108 | 5.77×10−5 | 0.0304 | 8 | 0.0406 | 1 |
Ratio #1: The ratio between the sum of lengths of all miRNA genes in a chromosome and the total length of the chromosome; Ratio #2: The ratio between the sum of lengths of all de novo CNVs in a chromosome and the total length of the chromosome; # hits: For each chromosome, the total no. of identical and/or different miRNA genes included in all de novo CNVs detected in patients.
Figure 2Correlation graph between the no. of miRNA genes in de novo CNVs and the CNV/Chr.
lengths ratio (Ratio #2) For each chromosome, the number of miRNA genes associated to CNVs is plotted as a function of the fractional length of CNV over the chromosome's size for Gains (a) and Losses (b), respectively. The graphs show that whereas the majority of data points lay very close to the best-fit line, indicating that the two variables are positively correlated, few chromosomes instead behave as outliers in which certain CNVs appear to affect a no. of miRNA genes higher than expected (data used for the graphs were taken from Table 1).
Figure 3Schematic representation of the counting process of miRNA genes included in de novo CNVs of autistic patients.
The small black rectangles close to chromosome are the miRNA genes, whereas the various segments above represent the various CNVs within the chromosome. An “hit” is an overlap between a CNV and a miRNA gene. In a) four “hits” are shown. b) Four examples of random distributions of simulated CNVs within the chromosome, keeping fixed the lenght of each CNV and changing its start/end positions. Clockwise from top left, the numbers of “hits” are 2, 0, 6 and 1, respectively. For each chromosome we carried out 106 simulations. c) Finally, the histograms displaying the relative frequency of miRNA genes included in randomly located CNVs (“hits”) are obtained and the comparison between experimental data and computed Monte Carlo distribution is performed. Red lines correspond to the no. of miRNA genes detected in de novo CNVs from patients. p-values reported in Table 2 are the areas of the histogram to the right side of the red line. (See also Figure 4).
Figure 4Histograms displaying the relative frequency of miRNA genes included in randomly located CNVs.
For each chromosome, the SIMCNVMIR program computes the no. of miRNA genes affected by each randomly distributed CNV realizations and plots the frequency distribution corresponding to 106 realizations. The analyses were performed separately for CNV_Gains (a) and CNV_Losses (b).
de novo CNVs from autistic patients with an overrepresented no. of miRNA genes.
| chr | GAIN | LOSS | ||||
| Unique | Hits | FDR-adjusted p-value | Unique | Hits | FDR-adjusted p-value | |
| 1 | 5 | 5 | 0.45204 | 5 | 5 |
|
| 2 | 0 | 0 | 0.84211 | 10 | 17 |
|
| 3 | 1 | 1 | 0.45642 | 10 | 12 | 0.64896 |
| 4 | N/A | N/A | N/A | 4 | 4 | 0.98516 |
| 5 | N/A | N/A | N/A | 7 | 7 | 0.73942 |
| 6 | 0 | 0 | 0.84211 | 2 | 2 | 0.70911 |
| 7 | 2 | 4 | 0.17818 | 6 | 7 | 1 |
| 8 | 0 | 0 | 0.84211 | 1 | 1 | 0.70911 |
| 9 | 9 | 9 | 0.17818 | 1 | 1 | 0.98516 |
| 10 | 3 | 4 | 0.84211 | 0 | 0 | 1 |
| 11 | 0 | 0 | 0.84211 | 0 | 0 | 1 |
| 12 | 13 | 13 | 0.46568 | 3 | 3 | 1 |
| 13 | 0 | 0 | 0.84211 | 0 | 0 | 1 |
| 14 | 0 | 0 | 0.84211 | N/A | N/A | N/A |
| 15 | 11 | 39 | 0.17818 | 6 | 7 | 0.70911 |
| 16 | 2 | 2 | 0.84211 | 8 | 9 | 0.70911 |
| 17 | 8 | 8 | 0.13920 | 1 | 1 | 1 |
| 18 | 0 | 0 | 0.84211 | 2 | 2 | 1 |
| 19 | N/A | N/A | N/A | 2 | 2 | 0.64896 |
| 20 | N/A | N/A | N/A | 2 | 2 | 0.64896 |
| 21 | 1 | 1 | 0.17818 | 3 | 3 | 0.64896 |
| 22 | 9 | 18 |
| 9 | 24 | 0.07785 |
| X | 8 | 8 | 0.19599 | 1 | 1 | 1 |
Unique: the no. of distinct miRNA genes overlapping de novo CNVs in patients. Hits: the no. of identical or distinct miRNA genes overlapping de novo CNVs in patients. FDR-adjusted p-value: probability of obtaining a number of miRNA genes overlapping randomly-distributed CNVs larger than hits after correction for multiple testing. In bold, chromosomes displaying significant FDR-adjusted p-values (<0.05). N/A: no de novo CNVs are present in the APL dataset. For several chromosomes, the no. of unique is smaller than the no. of hits indicating that the same microRNA genes are affected by different de novo CNVs.
List of 24 microRNA genes overrepresented in de novo CNVs.
| miRNA gene name | Chr | Cytoband | Coordinates (GRCh37) | strand +/− | Size (bp) | Clustered miRNA genes | DGV | no. of Patients | Gain/Loss in Patients | Gain/Loss in positive chr. | Intra/Intergenic miRNA loci | |
| start | end | |||||||||||
|
| 1 | 1p21.3 | 98510798 | 98510907 | − | 110 | hsa-mir-137; hsa-mir-2682. | + | 1 | Loss | L | none (miRNA gene) none |
|
| 1 | 1p21.3 | 98511626 | 98511727 | − | 102 | hsa-mir-137; hsa-mir-2682. | + | 1 | Loss | L | (miRNA gene) |
|
| 1 | 1p36.33 | 1102484 | 1102578 | + | 95 | hsa-mir-200b; hsa-mir-200a; hsa-mir-429. | + | 1 | Loss | L | Intergenic |
|
| 1 | 1p36.33 | 1103243 | 1103332 | + | 90 | hsa-mir-200b; hsa-mir-200a; hsa-mir-429. | + | 1 | Loss | L | Intergenic |
|
| 1 | 1p36.33 | 1104385 | 1104467 | + | 83 | hsa-mir-200b; hsa-mir-200a; hsa-mir-429. | + | 1 | Loss | L | Intergenic |
|
| 2 | 2q37.3 | 240273419 | 240273499 | − | 81 | - | + | 2 | Loss | L | HDAC4 intron 2 |
|
| 2 | 2q37.3 | 242417320 | 242417397 | + | 78 | - | + | 2 | Loss | L | FARP2 intron 2 - 19 |
|
| 2 | 2q13 | 111042430 | 111042520 | + | 91 | - | − | 1 | Loss | L | Intergenic |
|
| 2 | 2q37.3 | 239990513 | 239990610 | − | 98 | - | + | 2 | Loss | L | HDAC4 intron 1 - 16 - 22 |
|
| 2 | 2q37.3 | 240007523 | 240007622 | − | 100 | - | + | 2 | Loss | L | HDAC4 intron 5 - 13 - 19 |
|
| 2 | 2q37.3 | 240882432 | 240882511 | − | 80 | - | + | 2 | Loss | L | NDUFA10 intron 4 |
|
| 2 | 2q13 | 110827538 | 110827619 | − | 82 | - | + | 1 | Loss | L | Intergenic |
|
| 2 | 2q13 | 110844010 | 110844100 | − | 91 | - | − | 1 | Loss | L | MALL intron 1 - 3–4 (antisense) |
|
| 2 | 2q37.3 | 240227157 | 240227240 | + | 84 | - | + | 2 | Loss | L | HDAC4 intron 1 - 2 |
|
| 2 | 2q37.3 | 241395418 | 241395506 | + | 89 | - | + | 2 | Loss | L | GPC1 intron 1 |
|
| 22 | 22q11.21 | 20073269 | 20073356 | + | 88 | hsa-mir-3618; hsa-mir-1306. | + | 2/3 | Gain/Loss | L, G | DGCR8 3′UTR exon 1 - 2 |
|
| 22 | 22q11.21 | 20073581 | 20073665 | + | 85 | hsa-mir-3618; hsa-mir-1306. | + | 2/3 | Gain/Loss | L, G | DGCR8 3′UTR exon 1 - 2 |
|
| 22 | 22q11.21 | 22007270 | 22007347 | + | 78 | hsa-mir-301b; hsa-mir-130b | + | 2 | Gain | L, G | PPIL2 intron 1 |
|
| 22 | 22q11.21 | 22007593 | 22007674 | + | 82 | hsa-mir-301b; hsa-mir-130b | + | 2 | Gain | L, G | PPIL2 exon 2 |
|
| 22 | 22q11.21 | 19951276 | 19951357 | + | 82 | - | + | 2/3 | Gain/Loss | L, G | COMT 3′UTR exon 1 - 2 - 4 - 5 |
|
| 22 | 22q11.21 | 20020662 | 20020743 | + | 82 | - | + | 2/3 | Gain/Loss | L, G | C22orf25 intron 1 - 2 |
|
| 22 | 22q11.21 | 20236657 | 20236734 | − | 78 | - | + | 2/3 | Gain/Loss | L, G | RTN4R intron 1 - 2 |
|
| 22 | 22q11.21 | 21388465 | 21388561 | − | 97 | - | + | 2/3 | Gain/Loss | L, G | Intergenic |
|
| 22 | 22q11.22 | 23165270 | 23165365 | + | 96 | - | + | 2 | Gain | L, G | IGLV2-8 3′UTR exon 1 |
Information on miRNA gene coordinates, size, gene clustering, intergenic/intragenic loci are as reported in miRBase; no. of patients: number of patients bearing a CNV (including a given miRNA gene), CNVs among patients may have identical or different start/end; Gain/Loss in patients: CNV type overlapping miRNA genes detected in patients (from Table S1); Gain/Loss in positive chr.: CNV type overlapping miRNA genes detected in simulation.
KEGG pathways enriched for targets of miRNAs hsa-mir-4436b-3p and -5p identified by mirPath1.
| KEGG PATHWAY | p-value | # genes | Target Genes | |
|
|
| 3.30×10−8 | 5 | NSD1, KMT2D, PIPOX, KMT2A, KMT2E. |
|
| 6.73×10−6 | 1 | CYP3A43. | |
| Other glycan degradation | 0.000197 | 1 | NEU1. | |
| Glycerophospholipid metabolism | 0.00029 | 4 | PNPLA7, PGS1, LPCAT3, MBOAT2. | |
|
| 0.000942 | 4 | CTBP1, APH1A, NOTCH2, MAML1. | |
|
| 0.006618 | 5 | PGF, EGLN3, PLCG1, ENO3, SLC2A1. | |
|
| 0.00847 | 1 | AQP2. | |
| Glycine, serine and threonine metabolism | 0.01361 | 3 | SHMT2, | |
| Renal cell carcinoma | 0.016804 | 3 | PGF, EGLN3, SLC2A1. | |
| One carbon pool by folate | 0.01935 | 2 | SHMT2, | |
| Pathogenic Escherichia coli infection | 0.04271 | 3 | TUBA1A, TUBA8, TUBA1C. | |
| Graft-versus-host disease | 0.04271 | 3 | KIR3DL2, KIR2DL3, KIR3DL1. | |
| Antigen processing and presentation | 0.044381 | 5 | NFYA, NFYB, KIR3DL2, KIR2DL3, KIR3DL1. | |
|
| 0.048633 | 5 | KIR3DL2, KIR2DL3, TNFRSF10B, PLCG1, KIR3DL1. | |
|
| Prion diseases | 2.16×10−73 | 1 | PRNP. |
| Sulfur relay system | 2.63×10−9 | 1 | NFS1. | |
| Thiamine metabolism | 0.002145 | 1 | NFS1. | |
|
| 0.008932 | 1 | WHSC1. | |
| Transcriptional misregulation in cancer | 0.023567 | 2 | NCOR1, WHSC1. |
In bold pathways or genes previouly implicated in autism, see main text.