| Literature DB >> 35893067 |
Anthony Wong1, Anbo Zhou1, Xiaolong Cao1, Vaidhyanathan Mahaganapathy1, Marco Azaro1, Christine Gwin1, Sherri Wilson1, Steven Buyske2, Christopher W Bartlett3,4, Judy F Flax1, Linda M Brzustowicz1,5, Jinchuan Xing1,5.
Abstract
Autism spectrum disorder (ASD) is a childhood neurodevelopmental disorder with a complex and heterogeneous genetic etiology. MicroRNA (miRNA), a class of small non-coding RNAs, could regulate ASD risk genes post-transcriptionally and affect broad molecular pathways related to ASD and associated disorders. Using whole-genome sequencing, we analyzed 272 samples in 73 families in the New Jersey Language and Autism Genetics Study (NJLAGS) cohort. Families with at least one ASD patient were recruited and were further assessed for language impairment, reading impairment, and other associated phenotypes. A total of 5104 miRNA variants and 1,181,148 3' untranslated region (3' UTR) variants were identified in the dataset. After applying several filtering criteria, including population allele frequency, brain expression, miRNA functional regions, and inheritance patterns, we identified high-confidence variants in five brain-expressed miRNAs (targeting 326 genes) and 3' UTR miRNA target regions of 152 genes. Some genes, such as SCP2 and UCGC, were identified in multiple families. Using Gene Ontology overrepresentation analysis and protein-protein interaction network analysis, we identified clusters of genes and pathways that are important for neurodevelopment. The miRNAs and miRNA target genes identified in this study are potentially involved in neurodevelopmental disorders and should be considered for further functional studies.Entities:
Keywords: 3′ UTR; autism spectrum disorder; family cohort; miRNA; neurodevelopmental disorder; whole-genome sequencing
Mesh:
Substances:
Year: 2022 PMID: 35893067 PMCID: PMC9329941 DOI: 10.3390/genes13081329
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Summary of Samples and Families.
| Patients | Male | Female | Families | Dominant | Recessive/ | |
|---|---|---|---|---|---|---|
|
| 83 | 65 | 18 | 73 | 0 | 67 |
|
| 117 | 86 | 31 | 73 | 9 | 58 |
|
| 134 | 96 | 38 | 73 | 9 | 42 |
|
| 83 | 60 | 23 | 59 | 25 | 43 |
|
| 63 | 43 | 20 | 47 | 10 | 24 |
|
| 272 | 166 | 106 | 73 | 35 | 67 |
The first three columns are the total number of affected individuals, male, and female affected individuals, respectively. Families indicate the total number of families that contain at least one affected individual with the respective phenotype. Dominant and Recessive/de novo are the number of families that meet the criteria for the specific mode of inheritance.
Figure 1miRNA variant filtering steps. After selecting all variants within miRNAs, we filtered variants based on their inheritance pattern, miRNA region annotation, brain expression pattern, and population allele frequency to select final candidate variants.
miRNA candidate variants.
| miRNA | Region | Target Genes | chr | pos | rsID | Ref | Alt | AF_2_exome | AF_2_genome | AF_3 | Family | Inheritance | Phenotype |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| hsa-miR-6780a-3p | seed | 179 | chr17 | 40860121 | rs200279579 | A | G | 1.23 × 10−4 | 9.42 × 10−5 | 1.26 × 10−4 | FAM5 | dominant | RI |
| hsa-miR-1225-5p | mature | 43 | chr16 | 2140269 | rs371749301 | C | A | 1.48 × 10−5 | 4.72 × 10−5 | 2.00 × 10−4 | FAM58 | dominant | RI |
| hsa-miR-2277-3p | mature | 37 | chr5 | 92956416 | rs550720421 | G | T | 4.62 × 10−4 | 4.24 × 10−4 | 8.83 × 10−4 | FAM58 | dominant | RI |
| hsa-miR-548j-5p | mature | 42 | chr22 | 26951249 | rs565141718 | C | T | 3.95 × 10−4 | 4.70 × 10−5 | 1.41 × 10−4 | FAM13 | dominant | LI |
| hsa-miR-100-5p | mature | 26 | chr11 | 122022992 | rs761222509 | G | A | 1.93 × 10−5 | N/A | 7.42 × 10−6 | FAM66 | dominant | RI |
miRNA: HGNC Symbol. Region: miRNA region. Target Genes: the number of miRNA target genes predicted by TargetScanHuman. chr: Chromosome. pos: Variant position. rsID: DBSNP150 reference SNV number. Ref: Reference allele. Alt: Alternate Allele. AF_2_exome: Population AF from gnomAD v2 non-neuro exome database. AF_2_genome: Population AF from gnomAD v2 non-neuro genome database. AF_3: Population AF from gnomAD v3_non_neuro database. Family: Affected Family ID. Inheritance: Inheritance mode for the affected family. Phenotype: Phenotype of the affected individual.
Figure 23′ UTR variant filtering steps. After selecting all variants within 3′ UTR regions, we filtered variants based on their inheritance pattern, miRNA target region annotation, brain expression pattern, and population allele frequency to select final candidate variants.
Summary of 3′UTR Candidate Genes. Total number of genes found in each phenotype and inheritance pattern.
| ASD | ADHD | LI | RI | SRS | Total Unique | |
|---|---|---|---|---|---|---|
| Dominant (AF < 1%) | 0 | 26 | 30 | 79 | 33 | 138 |
| Recessive (AF < 1%) | 3 | 2 | 3 | 3 | 2 | 5 |
| 9 | 4 | 6 | 5 | 4 | 10 | |
| Total Unique | 12 | 32 | 39 | 87 | 39 | 152 |
Figure 3Number of families with 3′ UTR variants. Venn Diagrams of families with 3′ UTR variants for each inheritance pattern: (A) Dominant; (B) Recessive; (C) de novo. The value indicates the number of families with candidate variants for the phenotype(s).
Top Candidate Genes.
| Gene | miRNA Variant | 3′ UTR Variant | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Max Brain Expression | Other NDD | pLI | Previous Studies | miRNA | Inheritance | Family | Phenotype | chr | pos | rsID | AF_gnomAD | Inheritance | Family | Phenotype |
|
| 66 | NA | 0.91 | [ | hsa-miR-6780a-3p | dominant | FAM5 | RI | chr18 | 55155836 | rs147208471 | 7.27 × 10−4 | dominant | FAM56 | SRS |
|
| 61.03 | NA | 1.00 | [ | hsa-miR-6780a-3p | dominant | FAM5 | RI | chr11 | 59343007 | rs149325846 | 3.61 × 10−3 | dominant | FAM14; FAM66 | RI |
|
| 65.43 | ASD_Low | 0.00 | [ | hsa-miR-6780a-3p | dominant | FAM5 | RI | chr1 | 53516762 | rs182947399 | 1.33 × 10−3 | dominant | FAM2 | ADHD |
|
| 187 | NA | 0.00 | [ | hsa-miR-6780a-3p | dominant | FAM5 | RI | chr11 | 8123523 | rs1379616749 | 3.12 × 10−4 | dominant | FAM70 | RI |
|
| 49.40 | NA | 0.09 | [ | chr6 | 17292448 | rs914886490 | 2.60 × 10−4 |
| FAM5; FAM59 | ADHD,LI | ||||
|
| 86 | NA | 0.94 | [ | chr9 | 114695431 | rs201977317 | 4.54 × 10−3 | dominant | FAM5; FAM36; FAM37 | RI,SRS |
Other NDD: genes implicated in NDDs from previous studies (Table S2). pLI: the probability the gene is loss-of-function intolerant. AF_gnomAD: maximum variant AF in the three gnomAD databases. Other headers are the same as in Table 2.
Figure 4Candidate gene PPI network. The network includes both miRNA target genes and 3′ UTR genes (Table S9). PPI edges are included if at least two databases show evidence of interaction. The highlighted region included genes enriched in regulation of neuron death (GO:1901214).
Figure 5Candidate gene interaction network with known NDD genes. Known genes implicated in ASD, ADHD, and other NDDs (Table S2) were included in the PPI network with candidate genes in this study. (A) ASD candidates PPI. (B) ADHD candidates PPI. (C) LI candidates PPI. (D) RI candidates PPI. Nodes with degree greater than 13 were hidden from the network for clarity. (E) SRS candidates PPI. Isolated nodes were removed from the networks. Connections between non-candidate genes were not shown. Edges with only one protein–protein database evidence were not shown in (C,E). Highlighted regions included genes in GO terms.