Kan Yan Chloe Li, Andrew C Cook, Ruth C Lovering.
Abstract
The cardiac conduction system (CCS) comprises critical components responsible for the initiation, propagation, and coordination of the action potential. Aberrant CCS development can cause conduction abnormalities, including sick sinus syndrome, accessory pathways, and atrioventricular and bundle branch blocks. Gene Ontology (GO; http://geneontology.org/) is an invaluable global bioinformatics resource which provides structured, computable knowledge describing the functions of gene products. Many gene products are known to be involved in CCS development; however, this information is not comprehensively captured by GO. To address the needs of the heart development research community, this study aimed to describe the specific roles of proteins reported in the literature to be involved with CCS development and/or function. Fourteen proteins were prioritized for GO annotation, which led to the curation of 15 peer-reviewed primary experimental articles using carefully selected GO terms. A total of 152 descriptive GO annotations, including those describing sinoatrial node and atrioventricular node development, were created and submitted to the GO Consortium database. A functional enrichment analysis of 35 key CCS development proteins confirmed that this work has improved the in silico interpretation of this CCS dataset. This work may improve future investigations of the CCS that apply high-throughput methods such as genome-wide association study analysis, proteomics, and transcriptomics.
Keywords: annotation; biocuration; cardiac conduction; gene ontology (GO); heart development
Year: 2022 PMID: 35309148 PMCID: PMC8924464 DOI: 10.3389/fgene.2022.802393
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1. Ontology relevant to cardiac conduction system development. QuickGO graph of part of the heart development ontology (www.ebi.ac.uk/QuickGO). Six GO terms used as GO slims are highlighted in yellow: GO:0003161 cardiac conduction system development, GO:0003162 atrioventricular node development, GO:0003163 sinoatrial node development, GO:0003164 His-Purkinje system development, GO:0036302 atrioventricular canal development and GO:0007507 heart development. The is_a relations between the GO terms are indicated by black arrows, the part_of relations by blue arrows (Ashburner et al., 2000; The Gene Ontology Consortium et al., 2021).
Impact of cardiac conduction-focused annotation project.
| GO term identifier | GO term | Annotations March 2021 (All) | Annotations March 2021 (Priority) | Annotations October 2020 (All) | Annotations October 2020 (Priority) |
|---|---|---|---|---|---|
| GO:0003161 | Cardiac conduction system development | 17 ( | 17 ( | 3 ( | 3 ( |
| GO:0003162 | Atrioventricular node development | 9 ( | 7 ( | 2 ( | 0 |
| GO:0003163 | Sinoatrial node development | 13 ( | 12 ( | 1 ( | 0 |
| GO:0003164 | His-Purkinje system development | 10 ( | 9 ( | 1 ( | 0 |
| | Total number of CCS development annotations | 49 ( | 45 ( | 7 ( | 3 ( |
| GO:0007507 | Heart development | 1067 (356) | 200 ( | 1023 (343) | 177 ( |
| GO:0036302 | Atrioventricular canal development | 15 ( | 10 ( | 8 ( | 3 (3) |
| | Total number of heart development GO annotations | 1131 (373) | 255 ( | 1038 (345) | 183 ( |
Comparison of the number of CCS and heart development manual GO annotations associated with all human gene products (All) and with the 35 key CCS development proteins (Priority) in March 2021 (end of project) versus October 2020 (start of project). The six listed GO terms were used as a GO slim to filter the annotations downloaded from QuickGO. Full details of the annotations can be found in Supplementary Table S2 and at https://tinyurl.com/3259w2vc and https://tinyurl.com/3525rt4u. The number of manual annotations associated with the human gene products is listed, with the number of gene products associated with these terms in brackets. IEA (Inferred from Electronic Annotation) annotations have not been included, as these data are updated with each release.
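The totals in this table can be checked by simple column sums. The following is a minimal sketch, with the counts transcribed by hand from the table above (annotation counts only, not the bracketed gene product counts); the variable and function names are illustrative and not part of the original study.

```python
# Manual annotation counts transcribed from the table above.
# Each tuple: (March 2021 All, March 2021 Priority,
#              October 2020 All, October 2020 Priority)
ccs_terms = {
    "GO:0003161": (17, 17, 3, 3),   # cardiac conduction system development
    "GO:0003162": (9, 7, 2, 0),     # atrioventricular node development
    "GO:0003163": (13, 12, 1, 0),   # sinoatrial node development
    "GO:0003164": (10, 9, 1, 0),    # His-Purkinje system development
}
other_terms = {
    "GO:0007507": (1067, 200, 1023, 177),  # heart development
    "GO:0036302": (15, 10, 8, 3),          # atrioventricular canal development
}


def column_sums(rows):
    """Sum each of the four count columns across the given rows."""
    return tuple(sum(col) for col in zip(*rows))


ccs_total = column_sums(ccs_terms.values())
heart_total = column_sums(list(other_terms.values()) + [ccs_total])

# Annotations gained between October 2020 and March 2021 (All, Priority).
ccs_gain = (ccs_total[0] - ccs_total[2], ccs_total[1] - ccs_total[3])

print("CCS development totals:", ccs_total)
print("Heart development totals:", heart_total)
print("CCS annotations gained (All, Priority):", ccs_gain)
```

The sums reproduce the two "Total" rows of the table, which suggests that the heart development total is the sum of the heart development, atrioventricular canal development and CCS development rows.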
A selection of GO annotations describing the role of murine Shox2 and Nkx2-5 in heart development.
| Protein | Qualifier | GO term | GO term identifier | Evidence code | Annotation extension relation | Annotation extension identifier |
|---|---|---|---|---|---|---|
| Nkx2-5 | Involved in | Cardiac muscle cell development | GO:0055013 | IMP | — | — |
| Nkx2-5 | Involved in | Cardiac muscle tissue morphogenesis | GO:0055008 | IMP | — | — |
| Shox2 | Acts upstream of | Regulation of heart rate | GO:0002027 | IMP | — | — |
| Shox2 | Involved in | Sinoatrial node development | GO:0003163 | IMP | — | — |
| Shox2 | Involved in | Sinoatrial node cell development | GO:0060931 | IMP | — | — |
| Shox2 | Involved in | Cardiac right atrium morphogenesis | GO:0003213 | IMP | — | — |
| Shox2 | Involved in | Cardiac pacemaker cell differentiation | GO:0060920 | IMP | Occurs_in | UBERON:0002351 |
| | | | | | Part_of | GO:0003163 |
| Shox2 | Involved in | Negative regulation of transcription by RNA polymerase II | GO:0000122 | IMP | Has_input | UniProtKB:P42582 |
| | | | | | Occurs_in | UBERON:0002351 |
A selection of the GO terms and annotation extension statements used to capture the role of Shox2 and Nkx2-5, as described by Espinoza-Lewis et al. (2009, 2011). These GO terms had not previously been associated with these proteins. Terms associated with the listed annotation extension identifiers: UBERON:0002351, sinoatrial node; GO:0003163, sinoatrial node development; UniProtKB:P42582, Nkx2-5.
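To make the structure of an annotation with extensions concrete, the sketch below models one row of the table as a small record. The class and field names are hypothetical, chosen for illustration only, and the `relation(identifier)` serialisation follows the comma-separated convention of the GAF annotation extension column as an assumption about how such statements are typically exchanged; it is not taken from the article.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Extension:
    """One annotation extension statement, e.g. occurs_in(UBERON:0002351)."""
    relation: str
    identifier: str


@dataclass
class Annotation:
    """A single GO annotation as laid out in the table (hypothetical model)."""
    protein: str
    qualifier: str
    go_id: str
    evidence: str
    extensions: List[Extension] = field(default_factory=list)

    def extension_string(self) -> str:
        # Comma-separated statements are read as a conjunction (AND).
        return ",".join(f"{e.relation}({e.identifier})" for e in self.extensions)


# Shox2, cardiac pacemaker cell differentiation, as listed in the table.
shox2_pacemaker = Annotation(
    protein="Shox2",
    qualifier="involved_in",
    go_id="GO:0060920",
    evidence="IMP",
    extensions=[
        Extension("occurs_in", "UBERON:0002351"),  # sinoatrial node
        Extension("part_of", "GO:0003163"),        # sinoatrial node development
    ],
)

print(shox2_pacemaker.extension_string())
```

Read this way, the two extension statements together say that the annotated process occurs in the sinoatrial node and is part of sinoatrial node development.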
A selection of GO annotations describing the role of human TBX2 and TBX3 proteins in heart development.
| Protein | Qualifier | GO term | GO term identifier | Evidence code | With | Annotation extension relation | Annotation extension identifier |
|---|---|---|---|---|---|---|---|
| Biological process | | | | | | | |
| TBX2 | Involved in | Endocardial cushion formation | GO:0003272 | IMP | — | — | — |
| TBX2 | Involved in | Atrioventricular canal development | GO:0036302 | ISS | Q60707 murine Tbx2 | — | — |
| TBX2 | Involved in | Atrioventricular canal morphogenesis | GO:1905222 | ISS | Q60707 murine Tbx2 | — | — |
| TBX2 | Involved in | Cardiac jelly development | GO:1905072 | IMP | — | — | — |
| TBX3 | Involved in | Sinoatrial node cell development | GO:0060931 | IDA | — | — | — |
| TBX3 | Involved in | Endocardial cushion formation | GO:0003272 | IMP | — | — | — |
| TBX3 | Involved in | Atrioventricular canal development | GO:0036302 | ISS | P70324 murine Tbx3 | — | — |
| TBX3 | Involved in | Atrioventricular canal morphogenesis | GO:1905222 | ISS | P70324 murine Tbx3 | — | — |
| TBX3 | Involved in | Cardiac jelly development | GO:1905072 | IMP | — | — | — |
| TBX3 | Involved in | Cardiac epithelial to mesenchymal transition | GO:0060317 | IMP | — | — | — |
| TBX3 | Involved in | Negative regulation of cell proliferation involved in heart morphogenesis | GO:2000137 | IMP | — | — | — |
| TBX3 | Involved in | Negative regulation of transcription by RNA polymerase II | GO:0000122 | IMP | — | Occurs in | UBERON:0000948 |
| | | | | | | Part of | GO:0060317 |
| TBX3 | Involved in | Positive regulation of transcription by RNA polymerase II | GO:0045944 | IMP | — | Occurs in | UBERON:0000948 |
| | | | | | | Part of | GO:0060317 |
| Molecular function | | | | | | | |
| TBX3 | Enables | RNA polymerase II cis-regulatory region sequence-specific DNA binding | GO:0000978 | IDA | — | Occurs in | UBERON:0000948 |
| TBX3 | Enables | DNA-binding transcription activator activity, RNA polymerase II-specific | GO:0001228 | IDA | — | Occurs in | UBERON:0000948 |
| | | | | | | Part of | GO:0060317 |
| TBX3 | Enables | DNA-binding transcription repressor activity, RNA polymerase II-specific | GO:0001227 | IDA | — | Occurs in | UBERON:0000948 |
| | | | | | | Part of | GO:0060317 |
Twenty GO annotations were created following the review of Singh et al. (2012). Sixteen of these annotations describe the role of human TBX2 and TBX3 in heart development. Terms associated with the listed annotation extension identifiers: UBERON:0000948, heart; GO:0060317, cardiac epithelial to mesenchymal transition.
Impact of this focused annotation project on functional enrichment analysis.
| Dataset | GO term identifier | GO term name | P | k | K | k/n (%) | k/K (%) | Proteins associated with enriched term (HGNC symbols) |
|---|---|---|---|---|---|---|---|---|
| October 2020 ( | GO:0003161 | cardiac conduction system development | 6.75E-14 | 6 | 13 | 17.14 | 46.15 | BMPR1A, GJA5, ID2, NKX2-5, TBX3, TBX5 |
| | GO:0003164 | His-Purkinje system development | 6.58E-11 | 4 | 5 | 11.43 | 80.00 | ID2, NKX2-5, TBX3, TBX5 |
| | GO:0003162 | atrioventricular node development | 5.96E-03 | 1 | 3 | 2.86 | 33.33 | NKX2-5 |
| | GO:0003163 | sinoatrial node development | 5.96E-03 | 1 | 3 | 2.86 | 33.33 | TBX3 |
| | GO:0036302 | atrioventricular canal development | 6.03E-07 | 3 | 9 | 8.57 | 33.33 | BMP2, SMAD4, TBX2 |
| | GO:0007507 | heart development | 3.55E-31 | 25 | 514 | 71.43 | 4.86 | BMP2, BMP4, BMPR1A, GATA4, GATA6, GJA1, GJA5, HEY1, HEY2, ID2, ISL1, MSX1, MSX2, NKX2-5, NOTCH2, NPPA, PITX2, SCN5A, SHOX2, SMAD1, SMAD4, TBX2, TBX3, TBX5, TBX20 |
| March 2021 ( | GO:0003161 | cardiac conduction system development | 1.69E-82 | 30 | 38 | 85.71 | 78.95 | BMP4, BMPR1A, CACNA1G, GATA4, GATA6, GJA1, GJA5, GJB6, HCN4, HEY1, HEY2, HOPX, ID2, IRX3, ISL1, MSC, MSX1, MSX2, NKX2-5, NOTCH2, NPPA, NPPB, SCN5A, SHOX2, SMAD1, SMAD4, SMAD5, TBX3, TBX5, TBX18 |
| | GO:0003164 | His-Purkinje system development | 2.27E-14 | 5 | 5 | 14.29 | 100.00 | HOPX, ID2, IRX3, NKX2-5, TBX3, TBX5 |
| | GO:0003162 | atrioventricular node development | 4.76E-13 | 5 | 7 | 14.29 | 71.43 | BMPR1A, GATA4, GATA6, NKX2-5, NOTCH2, TBX5 |
| | GO:0003163 | sinoatrial node development | 4.52E-21 | 8 | 10 | 22.86 | 80.00 | BMP4, CACNA1G, GJB6, HCN4, ISL1, SHOX2, TBX3, TBX18 |
| | GO:0036302 | atrioventricular canal development | 1.08E-16 | 7 | 13 | 20.00 | 53.85 | BMP2, GATA4, GATA6, SMAD4, TBX2, TBX3, TBX20 |
| | GO:0007507 | heart development | 7.96E-52 | 34 | 516 | 97.14 | 6.59 | BMP2, BMP4, BMPR1A, CACNA1G, GATA4, GATA6, GJA1, GJA5, GJB6, HCN4, HEY1, HEY2, HOPX, ID2, IRX3, ISL1, MSC, MSX1, MSX2, NKX2-5, NOTCH2, NPPA, NPPB, PITX2, SCN5A, SHOX2, SMAD1, SMAD4, SMAD5, TBX18, TBX2, TBX3, TBX5, TBX20 |
VisuaL Annotation Display (VLAD) statistical analysis for the two datasets (October 2020 and March 2021) based on the six GO slim terms used in the graphical output shown in Figure 1. The total number of annotated human proteins (N) is 17572 and 17655 for October 2020 and March 2021, respectively, with 35 key CCS development proteins in the Query Set (n). K is the number of human proteins associated with the listed GO term or its descendants and k is the number of proteins in the Query Set associated with the GO term or its descendants.
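The quantities N, n, K and k defined above are the inputs to a standard over-representation test. The sketch below computes the two percentage columns and a one-sided hypergeometric tail probability for one row of the table. It assumes an uncorrected hypergeometric test; the text here does not state the exact procedure VLAD applies, so the P value it prints should be read as an approximation rather than a reproduction of the published figure. The function name is illustrative.

```python
from math import comb


def enrichment(N: int, K: int, n: int, k: int):
    """Return (p, k/n %, k/K %) for one GO term.

    N: annotated proteins in the background
    K: background proteins annotated to the term or its descendants
    n: proteins in the query set
    k: query-set proteins annotated to the term or its descendants

    p is P(X >= k) for X ~ Hypergeometric(N, K, n), computed exactly
    with integer binomial coefficients (assumed test, no correction).
    """
    upper = min(K, n)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    p = tail / comb(N, n)
    return p, 100 * k / n, 100 * k / K


# October 2020, GO:0003161 cardiac conduction system development.
p, pct_query, pct_term = enrichment(N=17572, K=13, n=35, k=6)
print(f"P ~ {p:.2e}, k/n = {pct_query:.2f}%, k/K = {pct_term:.2f}%")
```

The two percentages answer different questions: k/n is the share of the 35 query proteins annotated to the term, while k/K is the share of all human proteins annotated to the term that fall within the query set.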