| Literature DB >> 26933475 |
Janneke Aylward1, Emma T Steenkamp2, Léanne L Dreyer1, Francois Roets3, Brenda D Wingfield4, Michael J Wingfield2.
Abstract
Two closely related ophiostomatoid fungi, Knoxdaviesia capensis and K. proteae, inhabit the fruiting structures of certain Protea species indigenous to southern Africa. Although K. capensis occurs in several Protea hosts, K. proteae is confined to P. repens. In this study, the genomes of K. capensis CBS139037 and K. proteae CBS140089 are determined. The genome of K. capensis consists of 35,537,816 bp assembled into 29 scaffolds and 7940 predicted protein-coding genes of which 6192 (77.98 %) could be functionally classified. K. proteae has a similar genome size of 35,489,142 bp that is comprised of 133 scaffolds. A total of 8173 protein-coding genes were predicted for K. proteae and 6093 (74.55 %) of these have functional annotations. The GC-content of both genomes is 52.8 %.Entities:
Keywords: Gondwanamycetaceae; Knoxdaviesia; Microascales; Ophiostomatoid fungi; Protea
Year: 2016 PMID: 26933475 PMCID: PMC4772463 DOI: 10.1186/s40793-016-0139-9
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Maximum Likelihood tree illustrating the phylogenetic position of K. capensis and K. proteae in the Gondwanamycetaceae (grey block). The Protea-associated species are shaded red and the two isolates for which genome sequences were determined are indicated with a box. The sequences of the Internal Transcribed Spacer (ITS) region (available from GenBank®, accession numbers in brackets following isolate numbers) were aligned in MAFFT 7 [55]. The phylogeny was calculated in MEGA6 [56] using the Tamura-Nei substitution model [57], 1000 bootstrap replicates and Ceratocystis fimbriata (Ceratocystidaceae) as an outgroup
Fig. 2Sexual sporing structures of the two Knoxdaviesia species sequenced in this study. K. capensis (a) and K. proteae (b) were sampled from Protea longifolia and P. repens flowers, respectively. Scale bars = 1 mm
Classification and general features of K. capensis and K. proteae [29]
| MIGS ID | Property |
|
| Evidence codea |
|---|---|---|---|---|
| Classification | Domain Fungi | Domain Fungi | TAS [ | |
| Phylum Ascomycota | Phylum Ascomycota | TAS [ | ||
| Class Sordariomycetes | Class Sordariomycetes | TAS [ | ||
| Order Microascales | Order Microascales | TAS [ | ||
| Family | Family | TAS [ | ||
| Genus | Genus | TAS [ | ||
| Species | Species | TAS [ | ||
| Strain: CMW40890 = CBS139037 | Strain: CMW40880 = CBS140089 | |||
| Cell shape | septate, smooth-walled hyphae | septate, smooth-walled hyphae | TAS [ | |
| Motility | Non-motile | Non-motile | NAS | |
| Sporulation | Unsheathed allantoid ascospores | Falcate ascospores | TAS [ | |
| Temperature range | 15–30 °C | 15–30 °C | TAS [ | |
| Optimum temperature | 25 °C | 25 °C | TAS [ | |
| pH range; Optimum | Unknown | Unknown | ||
| Carbon source | Unknown | Unknown | ||
| MIGS-6 | Habitat | Seed cones (infructescences) of | Seed cones (infructescences) of | TAS [ |
| MIGS-6.3 | Salinity | Unknown | Unknown | |
| MIGS-22 | Oxygen requirement | Aerobic; requirement/tolerance unknown | Aerobic; requirement/tolerance unknown | |
| MIGS-15 | Biotic relationship | Plant-associated | Plant-associated | TAS [ |
| MIGS-14 | Pathogenicity | None known | None known | |
| MIGS-4 | Geographic location | Hermanus, South Africa | Stellenbosch, South Africa | |
| MIGS-5 | Sample collection | February 2014 | January 2014 | |
| MIGS-4.1 | Latitude | -34.4093 | -33.9430 | |
| MIGS-4.2 | Longitude | 19.2150 | 18.8802 | |
| MIGS-4.4 | Altitude | 20 m | 140 m |
aEvidence codes - IDA inferred from direct assay, TAS traceable author statement (i.e., a direct report exists in the literature), NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from http://www.geneontology.org/GO.evidence.shtml of the Gene Ontology project [58]
Project information
| MIGS ID | Property |
|
|
|---|---|---|---|
| MIGS 31 | Finishing quality | High quality draft | High quality draft |
| MIGS-28 | Libraries used | 2x paired-end (PE) (350 and 550 bp) and 1x mate-pair (MP) (3 kbp) | 2x paired-end (PE) (350 and 550 bp) and 1x mate-pair (MP) (3 kbp) |
| MIGS 29 | Sequencing platforms | Illumina Hiseq 2500 | Illumina Hiseq 2500 |
| MIGS 31.2 | Fold coverage | PE library 1: 91.6 x | PE library 1: 142 x |
| PE library 2: 80 x | PE library 2: 79.3 x | ||
| MP library: 17 x | MP library: 50.2 x | ||
| MIGS 30 | Assemblers | ABySS 1.5.2; SSPACE 3.0 | ABySS 1.5.2; SSPACE 3.0 |
| MIGS 32 | Gene calling method | MAKER 2.31.8 | MAKER 2.31.8 |
| Genbank ID | LNGK00000000 | LNGL00000000 | |
| GenBank Date of Release | 11th January 2016 | 11th January 2016 | |
| GOLD ID | Gp0093999 | Gp0110284 | |
| BIOPROJECT | PRJNA246171 | PRJNA275563 | |
| MIGS 13 | Source Material Identifier | CMW40890/CBS139037 | CMW40880/CBS 140089 |
| Project relevance | Biodiversity, evolution | Biodiversity, evolution |
Genome statistics
| Species |
|
| ||
|---|---|---|---|---|
| Attribute | Value | % of Totala | Value | % of Totala |
| Genome size (bp) | 35,537,816 | 100.00 | 35,489,142 | 100.00 |
| DNA coding (bp) | 12,640,368 | 35.57 | 12,542,580 | 35.34 |
| DNA G + C (bp) | 18,774,628 | 52.83 | 18,745,365 | 52.82 |
| DNA scaffolds | 29 | 133 | ||
| Total genes | 8107 | 100.00 | 8316 | 100.00 |
| Protein coding genes | 7940 | 97.94 | 8173 | 98.28 |
| RNA genesb | 167 | 2.06 | 143 | 1.72 |
| Pseudo genes | unknown | unknown | ||
| Genes in internal clusters | unknown | unknown | ||
| Genes with function prediction | 6192 | 77.98 | 6093 | 74.55 |
| Genes assigned to KOGs | 6059 | 76.31 | 6015 | 73.60 |
| Genes with Pfam domains | 5455 | 68.70 | 5335 | 65.28 |
| Genes with signal peptides | 354 | 4.46 | 335 | 4.10 |
| Genes with transmembrane helices | 1510 | 19.02 | 1527 | 18.68 |
| CRISPR repeats | N/A | N/A | ||
aThe total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome
bBased on tRNA and rRNA genes only
Number of genes associated with the 25 general KOG functional categories
| Species |
|
| |||
|---|---|---|---|---|---|
| Code | Value | % of totala | Value | % of totala | Description |
| J | 359 | 4.52 | 371 | 4.54 | Translation, ribosomal structure and biogenesis |
| A | 280 | 3.53 | 273 | 3.34 | RNA processing and modification |
| K | 475 | 5.98 | 484 | 5.92 | Transcription |
| L | 196 | 2.47 | 198 | 2.42 | Replication, recombination and repair |
| B | 109 | 1.37 | 99 | 1.21 | Chromatin structure and dynamics |
| D | 209 | 2.63 | 227 | 2.78 | Cell cycle control, cell division, chromosome partitioning |
| Y | 34 | 0.43 | 32 | 0.39 | Nuclear structure |
| V | 32 | 0.40 | 32 | 0.39 | Defence mechanisms |
| T | 505 | 6.36 | 586 | 5.95 | Signal transduction mechanisms |
| M | 69 | 0.87 | 76 | 0.93 | Cell wall/membrane/envelope biogenesis |
| N | 6 | 0.08 | 6 | 0.07 | Cell motility |
| Z | 279 | 3.51 | 289 | 3.54 | Cytoskeleton |
| W | 10 | 0.13 | 12 | 0.15 | Extracellular structures |
| U | 539 | 6.79 | 543 | 6.64 | Intracellular trafficking, secretion, and vesicular transport |
| O | 502 | 6.32 | 495 | 6.06 | Post-translational modification, protein turnover, chaperones |
| C | 265 | 3.34 | 256 | 3.13 | Energy production and conversion |
| G | 202 | 2.54 | 202 | 2.47 | Carbohydrate transport and metabolism |
| E | 227 | 2.86 | 228 | 2.79 | Amino acid transport and metabolism |
| F | 76 | 0.96 | 74 | 0.91 | Nucleotide transport and metabolism |
| H | 87 | 1.10 | 85 | 1.04 | Coenzyme transport and metabolism |
| I | 234 | 2.95 | 234 | 2.86 | Lipid transport and metabolism |
| P | 144 | 1.81 | 151 | 1.85 | Inorganic ion transport and metabolism |
| Q | 139 | 1.75 | 137 | 1.68 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 735 | 9.26 | 694 | 8.49 | General function prediction only |
| S | 344 | 4.33 | 330 | 4.04 | Function unknown |
| X | 2 | 0.03 | 1 | 0.01 | Multiple functions |
| - | 1881 | 23.69 | 2159 | 26.41 | Not in KOGs |
aThe total is based on the total number of protein coding genes in the genome