| Literature DB >> 35137016 |
Marcus de Melo Teixeira1,2, Jason E Stajich3,4, Jason W Sahl2, George R Thompson5, Rachel B Brem6, Claire A Dubin6, Austin V Blackmon2, Heather L Mead2, Paul Keim2, Bridget M Barker2.
Abstract
Coccidioidomycosis is a common fungal disease that is endemic to arid and semi-arid regions of both American continents. Coccidioides immitis and Coccidioides posadasii are the etiological agents of the disease, also known as Valley Fever. For several decades, the C. posadasii strain Silveira has been used widely in vaccine studies, is the source strain for production of diagnostic antigens, and is a widely used experimental strain for functional studies. In 2009, the genome was sequenced using Sanger sequencing technology, and a draft assembly and annotation were made available. In this study, the genome of the Silveira strain was sequenced using single molecule real-time sequencing PacBio technology, assembled into chromosomal-level contigs, genotyped, and the genome was reannotated using sophisticated and curated in silico tools. This high-quality genome sequencing effort has improved our understanding of chromosomal structure, gene set annotation, and lays the groundwork for identification of structural variants (e.g. transversions, translocations, and copy number variants), assessment of gene gain and loss, and comparison of transposable elements in future phylogenetic and population genomics studies.Entities:
Keywords: coccidioidomycosis; funannotate; fungal genomes; human fungal pathogen; long-read sequencing; reference genome; valley fever
Mesh:
Year: 2022 PMID: 35137016 PMCID: PMC8982387 DOI: 10.1093/g3journal/jkac031
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Chromosome sizes.
| Scaffold | COntig | Size (bp) |
|---|---|---|
| 1 | 1 | 8,340,845 |
| 2 | 2 | 8,079,863 |
| 3 | 3 | 6,494,557 |
| 4 | 4 | 3,437,093 |
| 5 | 5 | 1,489,275 |
| 6 | Mitochondrial genome | 74,407 |
| 7 | 6 | 119,732 |
| 8 | 7 | 80,756 |
| 9 | 8 | 76,740 |
Comparison of Silveira genomes.
| Silveira Sanger 2007 | Silveira CANU 2020 | |
|---|---|---|
| Contigs | 57 | 9 |
| GC% | 46.34 | 46.44 |
| Predicted protein-coding genes | 10,228 | 8,491 |
| BUSCO completeness | 95.3% | 97.1% |
| Median gene length (bp) | 2,109 | 1,810.5 |
| Total number of introns | 20,607 | 17,004 |
| Total number of exons | 32,511 | 26,348 |
| Total number of CDS | 30,840 | 26,216 |
| tRNAs | 119 | 111 |
Fig. 1.The gene density and repeats for Canu assemblies. Blue histograms on X/Y axes represent gene densities. Forward and reverse similarity blocks are indicated by red and blue dots, respectively. Central highly repeated sequence regions are the putative centromeres. Canonical telomeric repeats are indicated at both 5′ and 3′ ends. a) Dot Plot of Chromosome I. b) Dot Plot of Chromosome II. c) Dot Plot of Chromosome III. d) Dot Plot of Chromosome IV. e) Dot Plot of Chromosome V. f) Classes of repeats.
Fig. 2.Silveira (C. posadasii) Chromosome III Dot Plot Similarity to the RS (C. immitis) Genome. The homologous regions of the Silveira chromosome III (6.5 mbp) are distributed across 2 super contigs of the RS genome assembly and not completely collinear. a) RS super contig 3.1 contains about 30% the 5′ Silveira chromosome III homologous sequences, with the remainder not homologous to chromosome III. Within the homologous region there a large gap of nonhomology. b) RS super contig 3.2 contains about 70% of Silveira chromosome III homologous sequences including a highly repetitive region at the Silveira 3′ end. The arrangement of the cross species homologous regions is not completely collinear with an apparent inversion of about half.
Fig. 3.Final circular mitochondrial genome assembly of the C. posadasii Silveira strain. The circular plots show the core genes as part of the ubiquinone oxidoreductase, cytochrome b, cytochrome oxidase and ATP synthase protein complexes. Small and large ribosomal RNAs (rns and rnl), the RNAseP subunit (rnpB), as well as tRNAs are also displayed. Additional ORFs and introns are showed along the circular map.