| Literature DB >> 33543012 |
Martin N Davis1, Sally Horne-Badovinac2, Alexandra Naba1.
Abstract
The extracellular matrix (ECM) is an assembly of hundreds of proteins that structurally supports the cells it surrounds and biochemically regulates their functions. Drosophila melanogaster has emerged as a powerful model organism to study fundamental mechanisms underlying ECM protein secretion, ECM assembly, and ECM roles in pathophysiological processes. However, as of today, we do not possess a well-defined list of the components forming the ECM of this organism. We previously reported the development of computational pipelines to define the matrisome - the ensemble of genes encoding ECM and ECM-associated proteins - of humans, mice, zebrafish and C. elegans. Using a similar approach, we report here that our pipeline has identified 641 genes constituting the Drosophila matrisome. We further classify these genes into different structural and functional categories, including an expanded way to classify genes encoding proteins forming apical ECMs. We illustrate how having a comprehensive list of Drosophila matrisome proteins can be used to annotate large proteomic datasets and identify unsuspected roles for the ECM in pathophysiological processes. Last, to aid the dissemination and usage of the proposed definition and categorization of the Drosophila matrisome by the scientific community, our list has been made available through three public portals: The Matrisome Project (http://matrisome.org), The FlyBase (https://flybase.org/), and GLAD (https://www.flyrnai.org/tools/glad/web/).Entities:
Keywords: Apical ECM; Basement membrane; Cuticle; Eggshell; Extracellular matrix; Glue; Protein domains
Year: 2019 PMID: 33543012 PMCID: PMC7852309 DOI: 10.1016/j.mbplus.2019.100015
Source DB: PubMed Journal: Matrix Biol Plus ISSN: 2590-0285
Fig. 1Bioinformatic workflow to define the in-silico matrisome of Drosophila melanogaster.
The databases FlyBase, DIOPT, and Ensembl were interrogated with the full list of human and mouse matrisome and matrisome-associated gene symbols. Selected InterPro domains, including domains characteristic of collagens, proteoglycans, ECM-affiliated proteins, and cuticle-binding proteins (see Supplementary Table 4) were used to identify ECM-domain-containing proteins in the reference proteome. The Gene Ontology annotations related to the ECM were then used to identify previously-annotated ECM components. Finally, selected published literature using proteomic and/or bioinformatic methods, as well as reviews on the subject, were searched to identify ECM proteins not identified by the orthology-based or protein-sequencing methods. These data were combined and manually curated to generate the first complete Drosophila matrisome. Hs, Homo sapiens; Mm, Mus musculus; Dm Drosophila melanogaster.
The databases FlyBase, DIOPT, and Ensembl were interrogated with the full list of human and mouse matrisome and matrisome-associated gene symbols. Selected InterPro domains, including domains characteristic of collagens, proteoglycans, ECM-affiliated proteins, and cuticle-binding proteins (see Supplementary Table 4) were used to identify ECM-domain-containing proteins in the reference proteome. The Gene Ontology annotations related to the ECM were then used to identify previously-annotated ECM components. Finally, selected published literature using proteomic and/or bioinformatic methods, as well as reviews on the subject, were searched to identify ECM proteins not identified by the orthology-based or protein-sequencing methods. These data were combined and manually curated to generate the first complete Drosophila matrisome. Hs, Homo sapiens; Mm, Mus musculus; Dm Drosophila melanogaster.
Fig. 2The Drosophila matrisome.
(A) The Drosophila matrisome comprises 641 genes. These genes are then divided into either categories which we have previously defined, or the newly proposed apical matrix category.
(B) The genes that encode proteins that make up the apical matrix of Drosophila were further divided into classes and sub-classes.