| Literature DB >> 35815201 |
Tadaaki Katsuda1, Noriko Sato1, Kaoru Mogushi2,3, Takeshi Hase2,3,4,5, Masaaki Muramatsu1.
Abstract
Clonal mosaicism (a detectable post-zygotic mutational event in cellular subpopulations) is common in cancer patients. Detected segments of clonal mosaicism are usually bundled into large-locus regions for statistical analysis. However, low-frequency genes are overlooked and are not sufficient to elucidate qualitative differences between cancer patients and non-patients. Therefore, it is of interest to develop and describe a tool named Sub-GOFA for Sub-Gene Ontology function analysis in clonal mosaicism using semantic similarity. Sub-GOFA measures the semantic (logical) similarity among patients using the sub-GO network structures of various sizes segmented from the gene ontology (GO) for clustering analysis. The sub-GO's root-terms with significant differences are extracted as disease-associated genetic functions. Sub-GOFA selected a high ratio of cancer-associated genes under validation with acceptable threshold.Entities:
Keywords: Sub-GOFA; clonal mosaicism; function; logical; semantic; similarity; sub-gene ontology; tool
Year: 2022 PMID: 35815201 PMCID: PMC9200605 DOI: 10.6026/97320630018053
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Sub-GOFA’s genetic functional analysis overview and performance for clonal mosaicism. A: The analysis flow of Sub-GOFA. Step 1 - Generating Sub-GO networks (43,364) by segmenting the huge GO network. Step 2 - Generating analysis dataset annotating multivariate GO terms in abnormal regions for case (cancer-patients) and ctrl (non-patients). Step 3 - Executing genetic functional analysis consists of semantic similarity analysis with Sub-GO (43,364), clustering analysis from the similarity results and statistical evaluation with adjusted p-values. B: Comparison of lung cancer associated gene contents ratio of Sub-GOFA and Fisher's exact test for gene symbols and for GO terms with varying FDR values. C: Plot of the frequency and density of GO terms calculated by Sub-GOFA and Fisher's exact test for GO in lung cancer analysis dataset at FDR value of 0.3.
Figure 2Comparison of case (cancer-patients) and ctrl (non-patients) for the frequency of clonal mosaicism at 1000 Mb in each chromosome.
Figure 3Comparison of detection ratio of cancer-associated genes among Sub-GOFA, Fisher's exact test for GO, and Fisher's exact test for gene symbols.