| Literature DB >> 31801603 |
Sjors Middelkamp1, Judith M Vlaar1, Jacques Giltay2, Jerome Korzelius1,3, Nicolle Besselink1, Sander Boymans1, Roel Janssen1, Lisanne de la Fonteijne1, Ellen van Binsbergen2, Markus J van Roosmalen1, Ron Hochstenbach2, Daniela Giachino4, Michael E Talkowski5,6,7, Wigard P Kloosterman2, Edwin Cuppen8.
Abstract
BACKGROUND: Genomic structural variants (SVs) can affect many genes and regulatory elements. Therefore, the molecular mechanisms driving the phenotypes of patients carrying de novo SVs are frequently unknown.Entities:
Keywords: Copy number variants; Driver genes; Intellectual disability; Multiple congenital anomalies; Neurodevelopmental disorders; Position effects; Structural variation; Topologically associating domains; Transcriptome sequencing; Whole-genome sequencing
Mesh:
Year: 2019 PMID: 31801603 PMCID: PMC6894143 DOI: 10.1186/s13073-019-0692-0
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Cutoffs used to classify affected genes as T1, T2, or T3 candidate driver genes
| Disease association score (0–5) | pLI > 0.9 RVIS < 10 HI < 10 DDG2P OMIM | > 0 | > 0 | > 2 | |
| Total phenomatch score | > 0 | > 4 | > 10 | ||
| Phenomatches (% of HPO terms with phenomatch score > 5) | > 0 | > 10% | > 25% | ||
| Mode of inheritance | AD/XD/XR+XY | AD/XD/XR+XY | |||
| Gene location | Adjacent | Dup | Adjacent | DEL/TRUNC | |
| Support score (0–6) | TAD disrupted V4C disrupted PCHiC disrupted DHS disrupted RNA expression | > 1 | NA | > 3 | NA |
| Classification | |||||
| Phenotype association + effect of SV on gene | Weak + weak | Strong + weak | Medium + strong | Strong + strong | |
pLI probability of being loss-of-function intolerant, RVIS Residual Variation Intolerance Score, HI haploinsufficiency, DDG2P Developmental Disorders Genotype-Phenotype Database, OMIM Online Mendelian Inheritance in Man, AD autosomal dominant, XD X-linked dominant, XR X-linked recessive, XY male, TAD topologically associating domain, V4C virtual 4C, PCHiC promoter capture Hi-C, DHS DNase hypersensitivity site
Fig. 2Prediction of candidate driver genes directly and indirectly affected by the SVs. a Schematic overview of the computational workflow developed to detect candidate driver genes. Classification of genes at (direct) or surrounding (indirect) the de novo SVs is based on the association of the gene with the phenotype and the predicted direct or indirect effect on the gene (Table 1). b Total number of identified tier 1, 2, and 3 candidate driver genes predicted to be directly or indirectly affected by an SV. c Genome browser overview showing the predicted disruption of regulatory landscape of the HOXD locus in individual P22. A 107-kb fragment (red shading) upstream of the HOXD locus (green shading) is translocated to a different chromosome, and a 106-kb fragment (yellow shading) is inverted. The SVs affect the TAD centromeric of the HOXD locus which is involved in the regulation of gene expression in developing digits. The translocated and inverted fragments contain multiple mouse [43] and human (day E41) [44] embryonic limb enhancers, including the global control region (GCR). Disruptions of these developmental enhancers likely contributed to the limb phenotype of the patient. The virtual V4C track shows the Hi-C interactions per 10 kb bin in germinal zone (GZ) cells using the HOXD13 gene as viewpoint [35]. The bottom track displays the PCHiC interactions of the HOXD13 gene in neuroectodermal cells [40]. UCSC Liftover was used to convert mm10 coordinates to hg19. d RNA expression levels of genes at or adjacent to de novo SVs. Log2 fold RNA expression changes compared to controls (see the “Methods” section) determined by RNA sequencing for expressed genes (RPKM > 0.5) that are located within 2 Mb of SV breakpoint junctions (FLANK) or that are inverted (INV), duplicated (DUP), deleted (DEL), or truncated (TRUNC). Differentially expressed genes (p < 0.05, calculated by DESeq2) are displayed in red
Fig. 1Characterization of de novo SVs in a cohort of individuals with neurodevelopmental disorders. a Frequencies of clinical phenotypic categories described for the 39 included individuals based on the categories defined by HPO. Nervous system abnormalities are divided into 4 subcategories. b Number of de novo breakpoint junctions per SV type identified by WGS of 39 included patients. Most detected de novo SVs are part of complex genomic rearrangements, which are defined by the involvement of more than 3 breakpoint junctions (SVs with 1 or 2 breakpoint junctions are considered simple rearrangements). c Number of cases in which WGS analysis identified new, additional, or similar SVs compared to microarray-based copy number profiling. d Schematic representation of additional genomic rearrangements that were observed by WGS in 5 individuals. For each patient, the top panel shows the de novo SVs identified by arrays or karyotyping and bottom panel shows the structures of the SVs detected by WGS. The WGS data of individual P8 revealed complex chromoanasynthesis rearrangements involving multiple duplications and an insertion of a fragment from chr14 into chr3. Individual P11 has an insertion of a fragment of chr9 into chrX that was detected as a copy number gain by array-based analysis (Additional file 2: Figure S2). The detected copy number gains in individuals P12 and P21 show an interspersed orientation instead of a tandem orientation. The translocation in patient P20 appeared to be more complex than previously anticipated based on karyotyping results, showing 11 breakpoint junctions on 3 chromosomes
Fig. 3SVs can affect multiple candidate drivers which jointly contribute to a phenotype. a Number of patients whose phenotype can be partially or largely explained by the predicted T1/T2 candidate drivers (based on the percentage of the patient’s HPO terms that have a phenomatch score > 4). These molecular diagnoses are based on the fraction of HPO terms assigned to the patients that have a phenomatch score of more than 5 with at least one T1/T2 driver gene. b Scatterplot showing the number of predicted T1/T2 candidate drivers compared to the total number of genes at or adjacent (< 2 Mb) to the de novo SVs per patient. c Heatmap showing the association of the four predicted T1/T2 candidate drivers with the phenotypic features (described by HPO terms) of individual P25. The numbers correspond to the score determined by phenomatch. The four genes are associated with different parts of the complex phenotype of the patient. d Ideogram of the derivative (der) chromosomes 6, 12, and 20 in individual P25 reconstructed from the WGS data. WGS detected complex rearrangements with six breakpoint junctions and two deletions on chr6 and chr20 respectively of ~ 10 Mb and ~ 0.6 Mb. e Circos plot showing the genomic regions and candidate drivers affected by the complex rearrangements in individual P25. Gene symbols of T1/T2 and T3 candidate drivers are shown respectively in red and black. The breakpoint junctions are visualized by the lines in the inner region of the plot (red lines and highlights indicate the deletions). The middle ring shows the log2 fold change RNA expression changes in lymphoblastoid cells derived from the patient compared to controls measured by RNA sequencing. Genes differentially expressed (p < 0.05) are indicated by red (log2 fold change < − 0.5) and blue (log2 fold change > 0.5) bars. The inner ring shows the organization of the TADs and their boundaries (indicated by vertical black lines) in germinal zone (GZ) brain cells [35]. TADs overlapping with the de novo SVs are highlighted in red. f Genomic distance (in base pairs) between the indirectly affected candidate driver genes and the closest breakpoint junction. Most candidate drivers are located within 1 Mb of a breakpoint junction (median distance of 185 kb)
Fig. 4In silico prediction of candidate drivers in larger cohorts of patients with de novo SVs. a Comparison between previous SV classifications with the strongest candidate driver (located at or adjacent (< 1 Mb) to these SVs) predicted by our approach. Two different patient cohorts, one containing mostly balanced SVs [21] and one containing copy number variants, were screened for candidate drivers. Our method identified T1/T2 candidate drivers for most SVs previously classified as pathogenic or likely pathogenic. Additionally, the method detected T1/T2 candidate drivers for some SVs previously classified as VUS, which may lead to a new molecular diagnosis. b Quantification of the predicted effects of the SVs on proposed T1/T2 candidate driver genes per cohort. Individuals with multiple directly and indirectly affected candidate drivers are grouped in the category described as “Both.” Indirect position effects of SVs on genes contributing to phenotypes appear to be more common in patients with balanced SVs compared to patients with copy number variants