| Literature DB >> 33138786 |
Alexander J Carrillo1, Ilva E Cabrera1, Marko J Spasojevic2, Patrick Schacht1, Jason E Stajich1, Katherine A Borkovich3.
Abstract
BACKGROUND: With 9730 protein-coding genes and a nearly complete gene knockout strain collection, Neurospora crassa is a major model organism for filamentous fungi. Despite this abundance of information, the phenotypes of these gene knockout mutants have not been categorized to determine whether there are broad correlations between phenotype and any genetic features.Entities:
Keywords: Functional genomics; Fungal genetics; Hierarchical clustering; Neurospora; Phenotypic analysis
Mesh:
Substances:
Year: 2020 PMID: 33138786 PMCID: PMC7607824 DOI: 10.1186/s12864-020-07131-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Summary of results from phenotypic analysis of 1286 N. crassa gene replacement mutants. a. Flow chart for phenotypic analysis. The number and features of mutants analyzed at each step of the process are noted. The percentage of “No phenotype” and “With phenotype(s)” mutants refers to strains with complete data. b. Major phenotypic classes of mutants. The distribution of mutants with phenotypes in at least one of the three major categories (growth rate, asexual development and sexual development) is shown in the lobes of the Venn diagram. The intersection of lobes indicates mutants with defects in more than one major category
Fig. 2Distribution of genes by linkage group and functional catalogue assignments. a. Genes on different linkage groups. The distribution of all N. crassa genes on different linkage groups and the genes in each linkage group deleted in knockout mutants with complete phenotypic data are shown. Raw data are available in Additional file 1. b. Genes by functional catalogue (FunCat) assignment. The distribution of FunCat assignments for all genes in the N. crassa genome and the genes deleted in knockout mutants with complete phenotypic data is shown. Functional category data are available at https://elbe.hki-jena.de/fungifun/fungifun.php
Comparison of results using different iterations of Partitioning Around Medoids (PAM)
| Weight | Number of Clusters | Average % consensus | Average relative standard deviation (RST) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Conidia number | Conidia morphology | Protoperithecia number | Protoperithecia morphology | Perithecia number | Perithecia morphology | Ascospore number | Ascospore morphology | Average % consensus overall | Basal hyphae growth rate | Aerial hyphae height | Average RST overall | ||
| 0 | 18 | 91.72 | 94.20 | 93.74 | 94.23 | 95.27 | 93.34 | 96.35 | 98.32 | 95.06 | 26.76 | 34.12 | 13.20 |
| 0 | 19 | 92.16 | 94.50 | 94.05 | 94.51 | 95.45 | 93.66 | 96.48 | 98.39 | 95.29 | 25.33 | 32.54 | 12.76 |
| 0 | 20 | 92.50 | 94.78 | 94.35 | 94.79 | 95.68 | 93.92 | 96.66 | 98.47 | 95.52 | 24.62 | 30.43 | 12.36 |
| 0 | 21 | 93.02 | 95.19 | 94.61 | 95.04 | 95.88 | 94.21 | 96.81 | 97.03 | 95.54 | 28.30 | 31.30 | 13.09 |
| 1 | 31 | 95.92 | 96.57 | 93.46 | 94.99 | 95.70 | 95.39 | 97.32 | 98.76 | 96.03 | 19.80 | 24.13 | 11.02 |
| 1 | 32 | 95.22 | 96.66 | 94.40 | 95.98 | 95.84 | 95.53 | 97.40 | 98.80 | 96.37 | 20.18 | 24.55 | 10.44 |
| 1 | 33 | 95.67 | 97.05 | 94.57 | 96.11 | 95.97 | 95.67 | 97.47 | 97.95 | 96.40 | 22.57 | 25.24 | 10.92 |
| 2 | 32 | 96.23 | 94.89 | 94.02 | 95.04 | 94.22 | 95.51 | 98.76 | 98.82 | 95.90 | 17.52 | 26.50 | 10.88 |
| 2 | 33 | 96.35 | 95.05 | 94.20 | 95.19 | 94.31 | 95.64 | 98.79 | 98.86 | 96.01 | 16.91 | 25.83 | 10.67 |
| 2 | 34 | 95.45 | 95.24 | 94.99 | 96.82 | 94.47 | 95.77 | 98.82 | 98.89 | 96.43 | 17.55 | 26.30 | 10.22 |
| 2 | 35 | 95.15 | 94.62 | 95.00 | 96.91 | 93.95 | 94.98 | 98.03 | 98.09 | 95.94 | 18.98 | 25.30 | 11.19 |
| 3 | 29 | 96.20 | 96.09 | 93.25 | 94.20 | 92.63 | 95.26 | 95.53 | 97.62 | 94.94 | 13.77 | 27.67 | 10.82 |
| 3 | 30 | 96.37 | 96.22 | 93.47 | 94.39 | 92.88 | 95.34 | 95.67 | 97.70 | 95.10 | 13.43 | 26.61 | 10.60 |
| 3 | 31 | 97.45 | 97.64 | 91.71 | 94.52 | 94.39 | 94.05 | 95.14 | 96.63 | 94.87 | 15.60 | 27.08 | 10.98 |
| 3 | 32 | 97.45 | 97.29 | 92.26 | 94.54 | 93.62 | 93.38 | 95.46 | 96.91 | 94.78 | 15.46 | 27.69 | 11.41 |
| 3 | 33 | 96.92 | 96.53 | 92.41 | 94.65 | 93.28 | 92.81 | 95.29 | 96.69 | 94.52 | 17.58 | 26.11 | 12.14 |
| 4 | 32 | 90.89 | 94.99 | 93.99 | 95.11 | 94.39 | 94.23 | 94.57 | 97.20 | 94.93 | 14.23 | 18.81 | 11.42 |
| 4 | 33 | 90.50 | 94.98 | 93.99 | 94.80 | 94.13 | 93.80 | 94.40 | 97.37 | 94.78 | 14.21 | 19.08 | 11.96 |
| 4 | 34 | 90.78 | 94.78 | 94.10 | 94.83 | 94.76 | 94.13 | 94.97 | 97.46 | 95.01 | 14.10 | 20.18 | 11.75 |
| 4 | 35 | 90.82 | 94.93 | 94.27 | 94.98 | 94.91 | 94.23 | 95.11 | 97.53 | 95.14 | 13.78 | 19.68 | 11.57 |
| 5 | 36 | 85.84 | 95.66 | 94.14 | 95.40 | 95.38 | 93.87 | 94.33 | 97.42 | 95.17 | 14.09 | 13.44 | 11.26 |
| 5 | 37 | 85.28 | 95.45 | 94.18 | 95.41 | 95.51 | 94.04 | 94.48 | 97.49 | 95.22 | 13.99 | 12.45 | 11.22 |
| 5 | 38 | 85.18 | 95.55 | 94.88 | 95.63 | 94.53 | 94.15 | 94.03 | 97.55 | 95.19 | 13.90 | 12.05 | 11.20 |
| 5 | 39 | 84.56 | 95.62 | 95.24 | 96.70 | 94.67 | 94.30 | 94.18 | 97.61 | 95.48 | 13.93 | 12.46 | 11.00 |
| 5 | 40 | 84.36 | 95.37 | 96.40 | 96.26 | 94.37 | 94.01 | 94.33 | 97.67 | 95.49 | 13.80 | 12.71 | 10.96 |
| 6 | 39 | 89.07 | 95.73 | 94.33 | 94.58 | 93.57 | 93.70 | 93.89 | 97.39 | 94.03 | 14.11 | 11.64 | 11.03 |
| 6 | 40 | 89.64 | 95.83 | 94.22 | 94.71 | 93.73 | 93.86 | 94.04 | 97.45 | 94.19 | 13.94 | 11.86 | 10.98 |
aAverage percent consensus - For each cluster the category that was most prevalent was determined and represented as a percent. Then each cluster’s most prevalent category was used to create a trait average
bAverage relative standard deviation - For continuous variables, the standard deviation for each cluster was calculated and then divided by the cluster mean to determine the relative standard deviation
cWeight - Each phenotype was assigned a specific weight relative to the other traits when creating the Gower’s distance matrix. The phenotypes are in the order as shown in Table 1
No weight = (1,1,1,1,1,1,1,1,1,1); Weight 1 = (1,1,1,1,1,1,1,1,2,2); Weight 2 = (1,0.5,1,0.5,1,0.5,1,0.5,2,2); Weight 3 = (2,2,1,1,1,1,1,1,6,2); Weight 4 = (0.5,0.5,1,1,1,1,1,1,6,5); Weight 5 = (1,1,1,1,1,1,1,1,6,4); Weight 6 = (1,1,1,1,1,1,1,1,6,6)
Phenotype summary for 40 PAM clusters
| Cluster # | # Genes in cluster | % Mutants with same phenotype | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Basal hyphae growth rate | Aerial hyphae height | Conidial # | Conidial morph-ology | Protoperi-thecial # | Protoperi-thecial morph-ology | Peri-thecial # | Peri-thecial morph-ology | Asco-spore | Asco-spore morph-ology | ||
| 28 | 75% Normal Average | 57% Normal High | 93% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | |
| 10 | 90% Reduced | 80% Reduced | 100% Reduced | 100% Normal | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | |
| 171 | 94% Normal Low | 95% Normal Average | 99% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 99% Normal | 100% Normal | 100% Normal | |
| 82 | 88% Normal Average | 85% Normal Average | 98% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | |
| 73 | 100% Slightly Reduced | 93% Reduced | 100% Normal | 100% Normal | 99% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | |
| 23 | 100% Reduced | 96% Reduced | 100% Normal | 100% Normal | 96% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | |
| 130 | 100% Slightly Reduced | 96% Normal Average | 99% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 99% Normal | 99% Normal | 100% Normal | |
| 38 | 100% Reduced | 100% Reduced | 97% Normal | 97% Normal | 97% Normal | 100% Normal | 97% Normal | 100% Normal | 100% Normal | 100% Normal | |
| 82 | 80% Normal Low | 95% Reduced | 100% Normal | 100% Normal | 100% Normal | 99% Normal | 99% Normal | 100% Normal | 95% Normal | 100% Normal | |
| 8 | 62% Severely Reduced | 75% Normal Average | 75% Normal | 100% Normal | 88% Normal | 100% Normal | 62% Normal | 88% Normal | 75% Normal | 100% Normal | |
| 8 | 50% Slightly Reduced | 88% Reduced | 100% Normal | 100% Normal | 100% Normal | 75% Normal | 88% Not Formed | 88% Not Formed | 100% Not Formed | 100% Not Formed | |
| 8 | 88% Reduced | 88% Reduced | 100% Reduced | 88% Normal | 62% Normal | 50% Normal | 88% Not Formed | 88% Not Formed | 100% Not Formed | 100% Not Formed | |
| 12 | 50% Reduced | 58% Reduced | 75% Reduced | 92% Normal | 92% Reduced | 100% Normal | 92% Reduced | 92% Normal | 83% Reduced | 83% Normal | |
| 9 | Varied | 89% Normal Average | 89% Normal | 100% Normal | 89% Normal | 100% Normal | 67% Normal | 78% Normal | 100% Not Formed | 100% Not Formed | |
| 77 | 100% Slightly Reduced | 96% Normal Average | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 97% Normal | 100% Normal | 99% Normal | 100% Normal | |
| 21 | Varied | 67% Normal Average | 95% Normal | 95% Normal | 86% Normal | 90% Normal | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | |
| 12 | 100% Severely Reduced | 92% Reduced | 92% Normal | 83% Normal | 92% Normal | 100% Normal | 100% Normal | 92% Normal | 92% Normal | 92% Normal | |
| 8 | 100% Severely Reduced | 88% Reduced | 75% Normal | 88% Normal | 88% Not Formed | 88% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | |
| 25 | 52% Reduced | 100% Increased | 96% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 96% Normal | 100% Normal | |
| 19 | 74% Severely Reduced | 100% Reduced | 84% Normal | 95% Normal | 95% Normal | 95% Normal | 100% Normal | 100% Normal | 89% Normal | 100% Normal | |
| 42 | 100% Slightly Reduced | 100% Reduced | 100% Normal | 100% Normal | 100% Normal | 98% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | |
| 32 | 100% Reduced | 84% Normal Average | 97% Normal | 100% Normal | 97% Normal | 100% Normal | 97% Normal | 100% Normal | 97% Normal | 97% Normal | |
| 11 | 82% Reduced | 91% Reduced | 91% Normal | 100% Normal | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | |
| 13 | 85% Reduced | 85% Reduced | 92% Normal | 92% Normal | 92% Normal | 62% Normal | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | |
| 11 | 91% Reduced | 100% Severely Reduced | 55% Reduced | 100% Normal | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | |
| 15 | 67% Slightly Reduced | 100% Severely Reduced | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 97% Normal | 100% Normal | |
| 15 | Varied | 60% Normal Average | 100% Normal | 100% Normal | 80% Normal | 100% Normal | 87% Normal | 87% Normal | 93% Reduced | 100% Normal | |
| 15 | 67% Reduced | 100% Increased | 80% Normal | 100% Normal | 93% Normal | 93% Normal | 100% Normal | 93% Normal | 87% Normal | 100% Normal | |
| 9 | 67% Slightly Reduced | 78% Reduced | 89% Normal | 100% Normal | 89% Normal | 89% Normal | 78% Reduced | 89% Normal | 100% Not Formed | 100% Not Formed | |
| 7 | 100% Severely Reduced | 100% Severely Reduced | Varied | 71% Normal | 100% Normal | 100% Normal | 57% Normal | 86% Normal | Varied | Varied | |
| 36 | 100% Reduced | 92% Normal Average | 97% Normal | 100% Normal | 100% Normal | 97% Normal | 94% Normal | 100% Normal | 94% Normal | 100% Normal | |
| 9 | 56% Slightly Reduced | 89% Reduced | 100% Reduced | 100% Normal | 89% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | |
| 6 | 50% Reduced, 50% Severely Reduced | 67% Normal Average | 67% Normal | 67% Normal | 83% Reduced | 100% Normal | 83% Reduced | 67% Abnorm-al Beaks | 83% Not Formed | 83% Not Formed | |
| 12 | 67% Reduced | 83% Normal Average | 83% Normal | 75% Normal | 92% Not Formed | 92% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | |
| 10 | 60% Slightly Reduced | 50% Normal Average | 100% Normal | 100% Normal | 100% Reduced | 90% Normal | 90% Reduced | 100% Normal | 60% Normal | 100% Normal | |
| 33 | Varied | 100% Increased | 88% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | 94% Normal | 100% Normal | 100% Normal | |
| 13 | 69% Reduced | 85% Reduced | 92% Normal | 100% Normal | 92% Normal | 92% Normal | 85% Normal | 77% Normal | 100% Reduced | 100% Normal | |
| 9 | 67% Reduced | 100% Severely Reduced | 89% Normal | 100% Normal | 89% Normal | 89% Normal | 89% Normal | 100% Normal | 89% Normal | 100% Normal | |
| 5 | 80% Reduced | 60% Reduced | 80% Normal | 100% Normal | 100% Normal | 100% Normal | 100% Normal | Varied | 100% Not Formed | 100% Not Formed | |
| 21 | 100% Severely Reduced | 100% Severely Reduced | 76% Reduced | 90% Normal | 90% Not Formed | 90% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | 100% Not Formed | |
aThe value in the cell is the percentage of mutants with the indicated majority phenotype. Varied indicates that no one phenotype was present in more than 50% of the mutants. Normal range for growth rate is 75–85 mm/day. 75–77.5 = Normal Low; 77.6–82.5 = Normal Average; 82.6–85 = Normal High; 65–74.9 = Slightly Reduced; 40–64.9 = Reduced; < 40 = Severely Reduced; > 85 = Increased. Normal range for aerial hyphae height is 30-45 mm. 35.1–39.9 = Normal Average; 30–35 = Normal Low; 40–45 = Normal High; 25–29 = Slightly Reduced; 15–24.9 = Reduced; < 15 = Severely Reduced; > 45 = Increased
Fig. 3Grouping of mutants in clusters. a. Number of genes per cluster. The number of mutants in each cluster is shown. Data are taken from Additional file 1. The clustering algorithm was a weighted PAM. See Methods for details. The numbers above the bars indicate the number of genes per cluster. b. Number of clusters in different size ranges. Clusters were sorted into bins according to the number of genes in the cluster. The number of clusters corresponding to each size range (bin) is shown. The numbers above the bars indicates the number of clusters in each size range. Raw data are available in Additional file 1
Fig. 4Placement of genes with yeast orthologs in clusters. The genes deleted in the mutants were analyzed for S. cerevisiae orthologs using the “Transform by Orthology” tool at the FungiDB database. The percentage of genes with orthologs in each cluster (blue bars), and in the entire dataset (orange bar), are shown. The numbers above the bars indicate the percent of genes in a cluster with a yeast ortholog. Raw data are available in Additional file 6
Fig. 5Genes encoding proteins with predicted transmembrane domain(s) and/or G protein coupled receptors (GPCRs). a. Transmembrane domains and GPCRs. The percentage of genes in each cluster encoding a protein with at least one predicted transmembrane domain is shown. The orange portion of each bar denotes the percent of genes in each cluster encoding transmembrane proteins that are also predicted G protein coupled receptors (GPCRs). Genes were retrieved using the “Protein targeting and localization - #TM Domains” tool at FungiDB. The numbers on the orange portion of the bars indicate the percent of proteins in a cluster that are GPCRs. The numbers on the blue portion of the bars indicates the percent of proteins in a cluster that have a predicted transmembrane domain. Raw data are available in Additional file 1. b. Functional categories of proteins with transmembrane domains in Cluster 19. Each “slice” of the pie represents the percentage of genes encoding proteins with transmembrane domains in the indicated functional category. Functional category data are available at https://elbe.hki-jena.de/fungifun/fungifun.php
Fig. 6Protein phosphorylation-related genes. a. Assortment of phosphorylated proteins into clusters. Proteins that are phosphorylated during growth in submerged cultures with glucose as the carbon source were obtained from Ref. [37]. The number of phosphorylated proteins in each cluster (blue bars; actual number on top of bars), along with the % representation in each cluster (orange bars; % noted on top of bars) is shown. Data are taken from Additional file 1. The number above the blue bars represents the number of phosphorylated proteins in each cluster. The number above the orange bars indicates the percent of phosphorylated protein that make up each cluster. b. Distribution of serine-threonine protein kinases and serine-threonine or tyrosine protein phosphatases. The percentage of genes in each cluster encoding a serine-threonine protein kinase (STKinase) or serine-threonine or tyrosine protein kinase (Protein Pase) is shown. Data are taken from Additional file 1. The numbers above the blue bars indicates the percent of genes in each cluster that are serine/threonine protein kinases. The numbers above the orange bars represents the percent of genes in each cluster that are protein phosphatases
Fig. 7Cluster genes dependent on the three MAPKs: MAK-1, MAK-2 and OS-2. Data for identification of genes with down-regulated expression in a N. crassa Δmak-1 mutant were obtained from Ref. [44]. For mining cluster genes encoding transcriptional or phosphorylation targets of MAK-2 in N. crassa, targets were compiled from four datasets, including microarray analysis of genes down-regulated in a Δmak-2 mutant [41] or down-regulated in a strain expressing the mak-2Q100G strain in the presence of inhibitor [45], and MAK-2 phosphorylation targets identified during phosphoproteomics studies with the mak-2Q100G strain with inhibitor [46, 47]. Cluster genes encoding N. crassa orthologs of transcriptional targets of the p38 MAPK (os-2) in Cryptococcus neoformans and Aspergillus fumigatus were obtained using data from Ref. [48, 49]. N. crassa orthologs in the combined dataset were retrieved using the “Transform by Orthology” tool at FungiDB. The numbers above the blue, orange and green bars indicate the percent of genes in each cluster that are either regulated by MAK-1, MAK-2 and the p38 MAPK (OS-2), respectively. Raw data are available in Additional file 7
Fig. 8K-means clustering of transcriptomic data from wild type for genes in the phenotype dataset. The expression values for each gene in the phenotype dataset were standardized to have a mean of 0 and a standard deviation of 2. Genes were partitioned into the indicated number of expression profiles using K-means clustering. a. RNA-seq data from a sexual cycle time course. RNA-seq data for eight time points were obtained from Ref. [52]. Genes were partitioned into seven expression profiles. b. RNA-seq data from a time course of conidial germination. RNA-seq data for four time points were obtained from Ref. [53]. Genes were partitioned into eight expression clusters. c. Microarray data from a time course of conidiation. Microarray data for 10 time points were obtained from Ref. [54]. Genes were partitioned into seven expression clusters. d. Microarray data during a colony development time course. Microarray data for six time points were obtained from Ref. [55]. Genes were partitioned into six expression clusters
Fig. 9Model showing predicted pathway(s) for genes that co-cluster with the ERK MAPK genes in Cluster 40. The MAK-1 and MAK-2 ERK MAPK cascade genes are represented by green and red shading, respectively. S. cerevisiae interaction data for genes in Cluster 40 were obtained from www.yeastgenome.org. Physical interactions, genetic interactions, or physical and genetic interactions between the indicated genes/gene products in yeast are shown by yellow, blue, or green lines, respectively. Genes with known interactions in N. crassa are connected by black lines. Genes that interact with the MAK-1 cascade kinases are shaded in yellow, while those that interact with components of the MAK-2 pathway are indicated with blue shading. Genes with no ortholog in yeast and no known interaction in N. crassa are represented by the white rectangles
Fig. 10Model for predicted pathway(s) including genes that co-cluster with the p38 MAPK genes in Cluster 34. The OS-2 MAPK cascade genes are represented by red rectangles. S. cerevisiae interaction data for genes in Cluster 34 were obtained from www.yeastgenome.org. Physical interactions, genetic interactions, or physical and genetic interactions between the indicated genes/gene products in yeast are indicated by yellow, blue, or green lines, respectively. Genes with known interactions in N. crassa are connected by black lines. Genes that interact with the OS-2 cascade kinases are represented by blue rectangles. Genes with no known interaction in N. crassa or S. cerevisiae are represented by the white rectangles