Literature DB >> 29077808

TITINdb-a computational tool to assess titin's role as a disease gene.

Anna Laddach1, Mathias Gautel1, Franca Fraternali1.   

Abstract

SUMMARY: Large numbers of rare and unique titin missense variants have been discovered in both healthy and disease cohorts, thus the correct classification of variants as pathogenic or non-pathogenic has become imperative. Due to titin's large size (363 coding exons), current web applications are unable to map titin variants to domain structures. Here, we present a web application, TITINdb, which integrates titin structure, variant, sequence and isoform information, along with pre-computed predictions of the impact of non-synonymous single nucleotide variants, to facilitate the correct classification of titin variants.
AVAILABILITY AND IMPLEMENTATION: TITINdb can be freely accessed at http://fraternalilab.kcl.ac.uk/TITINdb. CONTACT: franca.fraternali@kcl.ac.uk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2017. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 29077808      PMCID: PMC5860166          DOI: 10.1093/bioinformatics/btx424

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

The giant protein titin, encoded by the gene TTN, is 35 991 amino acids in length [inferred complete (IC) isoform], weighs over 4000 kDa and spans half a sarcomere. Since the advent of next generation sequencing (NGS) technology, a number of titin missense variants, both recessive and dominant, have been associated with disease (both skeletal and cardiac forms of myopathy) (Chauveau ; Hastings ; Helle ; Savarese ), including those which can lead to sudden cardiac death [e.g. hypertrophic cardiomyopathy (HCM)]. Unfortunately, due to titin’s large size, even the majority of healthy individuals possess one or more rare titin missense variants (Lopes ). This results in the paradox that rare titin variants are commonly found; therefore, pathogenicity cannot be inferred from frequency alone. To complicate matters further, it has recently been shown that certain titin variants can be pathogenic in particular constellations or act as phenotype modifiers (Evilä ). One such scenario is the inheritance of a truncating variant along with a rare or unique missense variant in compound heterozygosity [as has been observed in childhood core myopathy with heart disease, with rare recessive mutations also found in the general population (Chauveau )]. In light of this information, we believe the assessment of the impact of non-synonymous single nucleotide variants (nsSNVs) at the molecular level to be essential, and propose that in silico analyses can be used to prioritise variants for further experimental investigation. We have created TITINdb to facilitate such prioritization.

2 Implementation and features

TITINdb includes disease-associated nsSNVs reported in the literature as well as population nsSNVs from the gnomAD database (Lek ) and 1000 genomes project (Auton ). Additionally, in silico saturation mutagenesis has been performed to allow users to access predictions for the impact of any possible single amino acid variants (SAVs). As experimental structures were only available for 23 of titin’s 302 globular domains (132 Fn3, 169 Ig, 1 Kinase), an automated pipeline based on the Modeller software (Webb and Sali, 2016) was set up to model all 279 domains without structure (see Supplementary Figure S3 for more details). As major bioinformatics resources did not agree on titin domain numbers and boundaries, we found it necessary to define these prior to modelling. As illustrated in Supplementary Figure S6, the TITINdb pipeline has greatly increased the structural coverage of titin domains and the quality of the coverage. Sequence-based prediction of the impact of all nsSNVs was performed using the Condel software (González-Pérez and López-Bigas, 2011). The in silico assessment of the impact of nsSNVs using structural information was performed using the DUET software (Pires ) for all known nsSNVs which map to domain structures, additionally the mCSM software (Pires ) was used to predict the impact of all possible SAVs. Where experimental structures of titin domains in complex with binary interaction partners exist, mCSM was also used to predict the impact of SAVs on protein–protein binding affinity. Other structural analysis provided by the application includes computation of the quotient solvent accessible surface area [Q(SASA)] of all residues which map to structure [calculated using POPS (Cavallo )] and predictions of which residues are involved in protein–protein interactions [calculated using SPIDDER (Porollo )]. Of note, however, is the absence of experimental, molecularly resolved protein–protein interaction data for most of titin’s domains, precluding detailed impact analysis on protein–protein interactions. Additionally, nsSNVs are annotated with functional site information from UniProt (The UniProt Consortium, 2017), including residue modifications. Representative structures for each domain were used in the computation of all structural analyses, apart from the calculation of Q(SASA). This was calculated separately for each structure. A list of structure representatives can be found in the Supplementary Tables S1 and S2. The application enables users to perform a number of visualizations, which include viewing population nsSNVs as distributions on structures, colour-coded by minor allele frequencies. Additionally, users are able to confidentially upload their own structure for nsSNV visualization (this may be useful if a group has an unpublished crystal structure or believe their own model to be of better quality). All structures and in silico analyses can be freely accessed and downloaded. Additionally, we provide quality assessment of the models (in the form of zDOPE scores and per-residue DOPE plots) (Shen and Sali, 2006) along with the alignments used for homology modelling. Video tutorials showing the use of TITINdb can be found at http://fraternalilab.kcl.ac.uk/TITINdb/tutorials/

3 Applications

3.1 Investigating disease associated nsSNVs

A potential application of TITINdb that involves investigating SNVs associated with specific diseases is shown in Figure 1 and further explored in section S2.1 of the Supplementary Materials. The facility to search by disease enables the detection of patterns or hotspots characteristic of variants associated with particular diseases. Two known nsSNV hotspots exist: one in domain Fn3-119 associated with hereditary myopathy with early respiratory failure (HMERF) (Pfeffer ) and one in Ig-169 associated with tibial muscular dystrophy, limb-girdle muscular dystrophy 2J (TMD/LGMD2J) (Chauveau ; Hackman ; Savarese ).
Fig. 1

TITINdb user interface overview. The HMERF and TMD associated nsSNV hotspots are shown. Users can pin disease-associated nsSNVs from the SNV table onto domain structure and visualize these against the distribution of population nsSNVs (Gnomad or 1000 genomes). Pre-computed in silico analyses are shown in the SNV table (more information can be accessed by scrolling horizontally and vertically)

TITINdb user interface overview. The HMERF and TMD associated nsSNV hotspots are shown. Users can pin disease-associated nsSNVs from the SNV table onto domain structure and visualize these against the distribution of population nsSNVs (Gnomad or 1000 genomes). Pre-computed in silico analyses are shown in the SNV table (more information can be accessed by scrolling horizontally and vertically) TITINdb facilitates the visualization of nsSNVs associated with these diseases on structure (see Fig. 1); for both conditions nsSNVs can be observed to cluster in 3D space. In each case it can also be clearly seen that the distribution of disease associated nsSNVs on 3D structure is distinct from the distribution of population nsSNVs from the gnomAD database. Furthermore it becomes clear that all disease-associated nsSNVs discussed here are fairly buried (as indicated by a burgundy colour); therefore, it appears likely that they may disrupt protein stability. From the pre-calculated in silico analysis, it can be seen that all these disease associated nsSNVs are predicted to be destabilizing by DUET (Pires ). TMD associated nsSNVs are also predicted, by mCSM (Pires ), to disrupt the interaction between titin and obscurin, albeit by varying magnitudes; this has been validated experimentally (Fukuzawa ; Rudloff ). Interestingly the I35947N variant is predicted to have the least impact on the titin-obscurin interaction affinity (mCSM score -0.17 kcal/mol) out of all the TMD associated variants; this correlates with in vitro experimental observations where negligible differences have been found between this variant and wild-type titin (Fukuzawa ; Rudloff ). Additionally, the majority of HMERF associated nsSNVs are predicted to be deleterious by Condel (González-Pérez and López-Bigas, 2011), whereas only half the TMD associated nsSNVs are predicted to be deleterious. This highlights the need to take into consideration multiple sources of information, as provided by TITINdb, when predicting the potential impact of nsSNVs, and does not exclude experimental validation on a case-by-case basis. Despite being a hotspot for HMERF associated nsSNVs, no experimental PDB structures or models are currently publicly available for the domain Fn3-119. Therefore, TITINdb has made possible the visualization of HMERF nsSNVs on structure and the in silico prediction of their impact at the molecular level. Multiple PDB structures exist for the domain Ig-169 (commonly referred to as M-10), to which TMD associated nsSNVs localize. Here, users can select which structure they wish to use to perform nsSNV visualization.

3.2 Investigating NGS nsSNV data

An application of TITINdb we believe to be particularly useful is the analysis of variants from NGS data. Specifically, the tool can be used in the prioritization of rare variants observed in disease cohorts for further experimental investigation. An example of such a variant is the P13979S titin N2B (isoform) nsSNV, which is published in the Supplementary Information associated with the article from Lopes , and further described in Section S2.2 of the Supplementary Materials. The nsSNV is found in 3/143 patients with HCM, leading to a cohort minor allele frequency (MAF) of between 0.01 and 0.02 (details on zygosity are not available). A notable feature of TITINdb is the ability to search by different isoform positions. Tools such as ANNOVAR (Wang ) enable researchers to map variants from genomic to protein coordinates, however, depending on the protocol followed, variants may be mapped to different isoforms. For our nsSNV of interest, the N2B isoform coordinate is reported, thus the ‘search by position’ facility allows it to be mapped to both other major isoforms and the position within the affected domain. Additionally, it can be seen that the nsSNV localizes to residue position 5 of domain Fn3-55 and is present in all isoforms apart from the novex-3 isoform; therefore it is expressed in both cardiac and skeletal muscle. TITINdb allows easy access to information concerning the nsSNV’s potential impact. Structurally, it can be seen that the affected residue has a Q(SASA) of 0.1 (this information is provided in the table on the nsSNV page), indicating that it is buried and that the nsSNV could potentially cause disease through destabilization of the underlying domain. It can also be seen, from predictions by the software SPPIDER (Porollo ), that the affected residue is not predicted to be involved in protein-protein interactions and thus is unlikely to cause disease through the disruption of these. On comparison to known nsSNVs it can be seen that the nsSNV is present in both the 1000 genomes data and the gnomAD database with MAFs of 3.7036E-03 and 9.98403E-04. This indicates the variant is rare but present in a small proportion of nominally healthy individuals. From the MAF values it can be deduced that the variant is enriched in the HCM cohort (which we know has a MAF between 0.01 and 0.02). This suggests that the variant is either neutral, disease-causing with incomplete penetrance, recessive, or that a small number of nominally healthy individuals have undiagnosed HCM. Structure (DUET)- and sequence (Condel)-based predictions of the impact of the nsSNV can be observed. The DUET score of −2.703 kcal/mol suggests the variant is highly destabilizing and supports the hypothesis derived earlier from the Q(SASA) that the variant could potentially lead to disease by disrupting the domain structure. Furthermore, it can be seen that the variant is also predicted to be deleterious by Condel. As no experimental structures exist for the domain Fn3-55, 3D visualization and access to pre-computed structural analyses are made possible by the homology model provided as part of TITINdb. One salient feature is that, if nsSNVs are pinned on structure from the sequence, any related/identical nsSNVs rise to the top of the nsSNV table and become highlighted in either yellow (surface) or red (buried) according to their Q(SASA) (see Supplementary Figure S2); the pinned nsSNVs also follow this colour scheme. The results indicate that, although the analysed nsSNV is highly likely to affect the domain structure, it is unclear whether this will contribute to the disease phenotype (primarily as the mutant titin may not be expressed in vivo in heterozygous cases). Further information concerning applications of TITINdb can be found in the Supplementary Materials. In particular, it is hoped that the tool will enable clinicians to perform the information-based assessment of variants from patient data, and assist biologists in the prioritization of domain structures for biophysical characterization.

Funding

This work has been supported by the British Heart Foundation [RE/13/2/30182, RG/15/8/31480 and CH/08/001] and the Biotechnology and Biological Sciences Research Council [BB/H018409/1 to FF]. Conflict of Interest: none declared. Click here for additional data file.
  21 in total

1.  POPS: A fast algorithm for solvent accessible surface areas at atomic and residue level.

Authors:  Luigi Cavallo; Jens Kleinjung; Franca Fraternali
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Atypical phenotypes in titinopathies explained by second titin mutations.

Authors:  Anni Evilä; Anna Vihola; Jaakko Sarparanta; Olayinka Raheem; Johanna Palmio; Satu Sandell; Bruno Eymard; Isabel Illa; Ricard Rojas-Garcia; Karolina Hankiewicz; Luis Negrão; Tuija Löppönen; Pekka Nokelainen; Mikko Kärppä; Sini Penttilä; Mark Screen; Tiina Suominen; Isabelle Richard; Peter Hackman; Bjarne Udd
Journal:  Ann Neurol       Date:  2014-02-24       Impact factor: 10.422

3.  Biophysical characterization of naturally occurring titin M10 mutations.

Authors:  Michael W Rudloff; Alec N Woosley; Nathan T Wright
Journal:  Protein Sci       Date:  2015-04-02       Impact factor: 6.725

4.  Tibial muscular dystrophy is a titinopathy caused by mutations in TTN, the gene encoding the giant skeletal-muscle protein titin.

Authors:  Peter Hackman; Anna Vihola; Henna Haravuori; Sylvie Marchand; Jaakko Sarparanta; Jerome De Seze; Siegfried Labeit; Christian Witt; Leena Peltonen; Isabelle Richard; Bjarne Udd
Journal:  Am J Hum Genet       Date:  2002-07-26       Impact factor: 11.025

5.  Interactions with titin and myomesin target obscurin and obscurin-like 1 to the M-band: implications for hereditary myopathies.

Authors:  Atsushi Fukuzawa; Stephan Lange; Mark Holt; Anna Vihola; Virginie Carmignac; Ana Ferreiro; Bjarne Udd; Mathias Gautel
Journal:  J Cell Sci       Date:  2008-05-13       Impact factor: 5.285

Review 6.  Diagnosis of muscle diseases presenting with early respiratory failure.

Authors:  Gerald Pfeffer; Marcus Povitz; G John Gibson; Patrick F Chinnery
Journal:  J Neurol       Date:  2014-11-07       Impact factor: 4.849

7.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

8.  UniProt: the universal protein knowledgebase.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

9.  mCSM: predicting the effects of mutations in proteins using graph-based signatures.

Authors:  Douglas E V Pires; David B Ascher; Tom L Blundell
Journal:  Bioinformatics       Date:  2013-11-26       Impact factor: 6.937

10.  Combination of Whole Genome Sequencing, Linkage, and Functional Studies Implicates a Missense Mutation in Titin as a Cause of Autosomal Dominant Cardiomyopathy With Features of Left Ventricular Noncompaction.

Authors:  Robert Hastings; Carin P de Villiers; Charlotte Hooper; Liz Ormondroyd; Alistair Pagnamenta; Stefano Lise; Silvia Salatino; Samantha J L Knight; Jenny C Taylor; Kate L Thomson; Linda Arnold; Spyros D Chatziefthimiou; Petr V Konarev; Matthias Wilmanns; Elisabeth Ehler; Andrea Ghisleni; Mathias Gautel; Edward Blair; Hugh Watkins; Katja Gehmlich
Journal:  Circ Cardiovasc Genet       Date:  2016-09-13
View more
  15 in total

1.  Fine mapping titin's C-zone: Matching cardiac myosin-binding protein C stripes with titin's super-repeats.

Authors:  Paola Tonino; Balazs Kiss; Jochen Gohlke; John E Smith; Henk Granzier
Journal:  J Mol Cell Cardiol       Date:  2019-05-31       Impact factor: 5.000

2.  A missense variant in the titin gene in Doberman pinscher dogs with familial dilated cardiomyopathy and sudden cardiac death.

Authors:  Kathryn M Meurs; Steven G Friedenberg; Justin Kolb; Chandra Saripalli; Paola Tonino; Kathleen Woodruff; Natasha J Olby; Bruce W Keene; Darcy B Adin; Oriana L Yost; Teresa C DeFrancesco; Sunshine Lahmers; Sandra Tou; G Diane Shelton; Henk Granzier
Journal:  Hum Genet       Date:  2019-02-04       Impact factor: 4.132

Review 3.  Titin mutations and muscle disease.

Authors:  Dalma Kellermayer; John E Smith; Henk Granzier
Journal:  Pflugers Arch       Date:  2019-03-27       Impact factor: 3.657

Review 4.  How Functional Genomics Can Keep Pace With VUS Identification.

Authors:  Corey L Anderson; Saba Munawar; Louise Reilly; Timothy J Kamp; Craig T January; Brian P Delisle; Lee L Eckhardt
Journal:  Front Cardiovasc Med       Date:  2022-07-04

5.  Solution NMR Structure of Titin N2A Region Ig Domain I83 and Its Interaction with Metal Ions.

Authors:  Colleen Kelly; Nicola Pace; Matthew Gage; Mark Pfuhl
Journal:  J Mol Biol       Date:  2021-03-31       Impact factor: 6.151

6.  A long-read RNA-seq approach to identify novel transcripts of very large genes.

Authors:  Prech Uapinyoying; Jeremy Goecks; Susan M Knoblach; Karuna Panchapakesan; Carsten G Bonnemann; Terence A Partridge; Jyoti K Jaiswal; Eric P Hoffman
Journal:  Genome Res       Date:  2020-07-06       Impact factor: 9.043

7.  A new congenital multicore titinopathy associated with fast myosin heavy chain deficiency.

Authors:  Aurélien Perrin; Corinne Metay; Marcello Villanova; Robert-Yves Carlier; Elena Pegoraro; Raul Juntas Morales; Tanya Stojkovic; Isabelle Richard; Pascale Richard; Norma B Romero; Henk Granzier; Michel Koenig; Edoardo Malfatti; Mireille Cossée
Journal:  Ann Clin Transl Neurol       Date:  2020-04-19       Impact factor: 4.511

8.  Novel TTN mutations and muscle imaging characteristics in congenital titinopathy.

Authors:  Meng Yu; Ying Zhu; Zhiying Xie; Yiming Zheng; Jiangxi Xiao; Wei Zhang; Ichizo Nishino; Yun Yuan; Zhaoxia Wang
Journal:  Ann Clin Transl Neurol       Date:  2019-07-01       Impact factor: 4.511

Review 9.  When signalling goes wrong: pathogenic variants in structural and signalling proteins causing cardiomyopathies.

Authors:  Mehroz Ehsan; He Jiang; Kate L Thomson; Katja Gehmlich
Journal:  J Muscle Res Cell Motil       Date:  2017-11-08       Impact factor: 2.698

Review 10.  Is Gene-Size an Issue for the Diagnosis of Skeletal Muscle Disorders?

Authors:  Marco Savarese; Salla Välipakka; Mridul Johari; Peter Hackman; Bjarne Udd
Journal:  J Neuromuscul Dis       Date:  2020
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.