| Literature DB >> 35267146 |
Thomas H Gillespie1, Shreejoy J Tripathy2,3,4, Mohameth François Sy5, Maryann E Martone1, Sean L Hill6,7,8,9.
Abstract
The challenge of defining and cataloging the building blocks of the brain requires a standardized approach to naming neurons and organizing knowledge about their properties. The US Brain Initiative Cell Census Network, Human Cell Atlas, Blue Brain Project, and others are generating vast amounts of data and characterizing large numbers of neurons throughout the nervous system. The neuroscientific literature contains many neuron names (e.g. parvalbumin-positive interneuron or layer 5 pyramidal cell) that are commonly used and generally accepted. However, it is often unclear how such common usage types relate to many evidence-based types that are proposed based on the results of new techniques. Further, comparing different types across labs remains a significant challenge. Here, we propose an interoperable knowledge representation, the Neuron Phenotype Ontology (NPO), that provides a standardized and automatable approach for naming cell types and normalizing their constituent phenotypes using identifiers from community ontologies as a common language. The NPO provides a framework for systematically organizing knowledge about cellular properties and enables interoperability with existing neuron naming schemes. We evaluate the NPO by populating a knowledge base with three independent cortical neuron classifications derived from published data sets that describe neurons according to molecular, morphological, electrophysiological, and synaptic properties. Competency queries to this knowledge base demonstrate that the NPO knowledge model enables interoperability between the three test cases and neuron names commonly used in the literature.Entities:
Keywords: Cell types; FAIR principles; Interoperability; Knowledge base; Knowledge integration; Neurons; Ontology
Mesh:
Substances:
Year: 2022 PMID: 35267146 PMCID: PMC9547803 DOI: 10.1007/s12021-022-09566-7
Source DB: PubMed Journal: Neuroinformatics ISSN: 1539-2791
The current Phenotypic Dimensions of the NPO and the associated ontologies/vocabularies used to populate the data model. When NIFSTD appears in this table the terms were nearly always added to support the NPO. Examples are drawn from Fig. 2
Taxonomic Example: Species | The species or taxon rank in which the phenotype inheres | NCBI taxonomy1 |
Anatomical Example: Brain Region | The regions of the nervous system containing parts of the neuron. Primary location is indicated by the location of the cell soma, but anatomical location may be assigned to any cell part through a series of predicates | UBERON; various brain atlases via NIFSTD parcellation2 |
| Morphological | Distinguishing morphological characteristics | NIFSTD3 |
Molecular Example: Expression | Distinguishing molecular constituents | NCBI Gene4, CHEBI5, Protein Ontology6 |
| Physiological | Expresses a relationship between a neuron type and an electrophysiological phenotype concept. This should be used when a neuron type is described using a high level electrophysiological concept class, e.g., bursting | NIFSTD Petilla Conventions (Petilla Interneuron Nomenclature Group, |
| Connection | Indicates a synaptic relationship between cell types. Further elaborated into connectivity determined by different techniques, e.g., physiology, electron microscopy | Gene Ontology7 |
Circuit role Example: Projection | Indicates whether the neuron is an Intrinsic neuron (local circuit neuron), projection neuron, or sensory neuron | NIFSTD (Bug et al., |
Projection targets Example: Projection | Expresses a relationship between a neuron type and a brain region to which it sends axons. Synaptic relationships are represented through the connection relationship | UBERON (Mungall et al., |
1https://www.ncbi.nlm.nih.gov/taxonomy
2https://github.com/SciCrunch/NIF-Ontology/blob/master/docs/brain-regions.org
3https://github.com/SciCrunch/NIF-Ontology
4https://www.ncbi.nlm.nih.gov/gene
5https://www.ebi.ac.uk/chebi/
6https://proconsortium.org/
7http://geneontology.org/
Fig. 2High level data model for neuron phenotypes. The Neuron Phenotype Ontology characterizes neuron types as bundles of normalized phenotypic properties. Dimensions that have not been used in the current version of the NPO or are planned for the future are grayed out
Fig. 1Evolution of neuron knowledge A Common usage types (CUTs) emerge in the literature as evidence accumulated for generally accepted neuron types with implicitly known properties. Data-driven studies generate evidence-based types (EBTs) based on explicitly measured standardized properties B The Neuron Phenotype Ontology (NPO) provides interoperability between the CUTs from the literature, the EBTs from data-driven studies, and new experimental observations from individual laboratories
Fig. 3The set of predicates employed to define molecular phenotypes. Relationships that have not been used in the current version of the NPO or are planned for the future are grayed out
Fig. 4Process used to translate local terminology into ontology-based representations and machine-generated names. Using neurondm, phenotypes are first mapped by a user into ontology identifiers (top panel). Neuron types are constructed and neurondm automatically translates these mappings into OWL equivalence statements (middle panel). From the same internal representation of these restrictions neurondm generates a set of human readable labels (bottom panel)
Examples of EBT and CUT neurons returned from Competency query CQ1: Find all examples of parvalbumin containing neurons. The form of the parvalbumin indicator is highlighted in red. Only one example is provided from the Allen EBT (total 59). Full results are available in Gillespie et al. (2020). The compact identifier for each class is prefixed (in bold) to the localLabel for ease of reference. The local label preserves the form in which the molecule was measured. The Common/original name represents the common name from the superclass for all of the physiological subtypes for the Markram cells. However, for the local label we provide a subtype as the superclass does not include the full molecular profile in the name
| 6 | |||
| 16 | |||
| 2 | |||
| 59 | none |
Results for CQ2: Find all cortical neurons containing somatostatin. Full results are available in Gillespie et al. (2020). The compact identifier for each class is prefixed (in bold) to the local label for ease of reference. The local label preserves the form in which the molecule was measured. The Common/original name represents the common name from the superclass for all of the physiological subtypes for the Markram cells. However, for the local label we provide a subtype as the superclass does not include the full molecular profile in the name. Similar entities across cell types are color coded. Brain region = blue; somatostatin indicator = red
| CUT | 1 | ||
| EBT Markram | 31 | ||
| EBT Huang | 4 | ||
| EBT Allen | 64 | none |
Neurons that have a basket phenotype. Similar entities across the cell are color coded to aid in comparison. The full results list is available in Gillespie et al (2020). Similar entities are color coded across cell types: blue = brain region; green = morphology; purple = neurotransmitter; dark red = parvalbumin indicator; red = somatostatin indicator
| PVBC Neuron (Huang2017) | npokb:43 | Mus musculus |
| CCKC Neuron (Huang2017) | npokb:40 | Mus musculus |
| Large basket cell (Markram2015): subtype | npokb:59 | 'Rattus norvegicus |
| Nest basket cell (Markram2015): subtype | npokb:65 | ‘Rattus norvegicus |
| Small basket cell (Markram2015): subtype | npokb:73 | ‘Rattus norvegicus |
Fig. 5Inferred hierarchy after reasoning over the ontology for the Martinotti cell. Panel A shows the hierarchy generated under the NeuronCUT class. The position of the Marinotti CUT is indicated by the lower red arrow. An enlargement of the Martinotti classification is shown in panel B. Panel C shows the OWL representation of the Martinotti CUT
This rubric (Hodson et al., 2018) organizes the 15 FAIR principles (Applicable principles) into a hierarchical table according to how easy they are to achieve, starting from a basic core (Summary) and rates data according to level of compliance, from 1 to 4 * (Rating). We provide an evaluation of the NPO/NPOKB against these principles in column 4
| * | The basic core: metadata, PID & access | F2. data are described with rich metadata F1. (meta)data are assigned a globally unique and persistent identifier A1. (meta)data are retrievable by their identifier using a standardized communications protocol | • F2: Full descriptive metadata for the ontology are included in the.ttl file. Metadata for the datasets and code are included in Pypi from setup.py, Zenodo, MIRO; The NPOKB includes complete authoring metadata • F1: All datasets referenced in this paper have been assigned DOIs • F1. The NPOKB is assigned a unique identifier (RRID) RRID:SCR_017403 • A1. RRIDs are resolvable through identifiers.org: |
| ** | Enhanced access: catalogues for discovery, standard (controlled) access & licences | F4:. (meta)data are registered or indexed in a searchable resource A1.1. the protocol is free, open and universally implementable A1.2. the protocol allows for an authentication and authorization procedure, where necessary R1.1. (meta)data are released with a clear and accessible data usage license | • F4: All python code is available via pypi. ebuilds for Gentoo are available from tgbugs-overlay • F4. The NPO is registered in BioPortal and in the SciCrunch Registry (RRID:SCR_017403) • A1.2 API access is provided via Bioportal and also via SciGraph maintained by the Neuroscience Information Framework and dkNET • R1.1 The NPO is covered under a CC-BY 4.0 license |
| *** | Use of standards: for metadata and data | I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation R1.3. (meta)data meet domain relevant community standards F3: metadata clearly and explicitly include the identifier of the data it describes | • I1: The ontology is built in OWL2, a recognized standard for ontologies • R1.3: The phenotype bags are built out of terms from community standard ontologies • F3: All terms are defined by a URI as well as a compact identifier |
| **** | Rich, FAIR metadata | R1. (meta)data are richly described with a plurality of accurate and relevant attributes I2. (meta)data uses vocabularies that follow FAIR principles | • R1: The ontology has complete metadata associated with it • I2: The NPO has been designed in accordance with the FAIR principles. Documentation • I2: The NPO/NPOKB imports relevant community vocabularies (see Table |
| ***** | Provenance and additional context | R1.2 (meta)data are associated with data provenance I3. (meta)data include qualified references to other (meta)data A2. metadata are accessible, even when the data are no longer available | • R1.2: References that support assertions are included in the annotations although unfortunately OWL does not provide an easy way to annotate specific triples •!3: I2: The NPO/NPO-KB imports relevant community vocabularies (see Table • A2: The NPO and associated tools have been registered with the SciCrunch Registry, which maintains metadata pages for similar resources. They ensure that their metadata is accessible even if the resource is no longer available |