| Literature DB >> 35663011 |
Xuan Liu1, Xin Huang1, Chen Chu1, Hui Xu1, Long Wang1, Yarong Xue1, Zain Ul Arifeen Muhammad1, Fumio Inagaki2,3, Changhong Liu1.
Abstract
To understand the genomic evolution and adaptation strategies of fungi to subseafloor sedimentary environments, we de novo assembled the genome of Schizophyllum commune strain 20R-7-F01 isolated from ∼2.0 km-deep, ∼20-millionyearsago (Mya) coal-bearing sediments. Phylogenomics study revealed a differentiation time of 28-73 Mya between this strain and the terrestrial type-strain H4-8, in line with sediment age records. Comparative genome analyses showed that FunK1 protein kinase, NmrA family, and transposons in this strain are significantly expanded, possibly linking to the environmental adaptation and persistence in sediment for over millions of years. Re-sequencing study of 14 S. commune strains sampled from different habitats revealed that subseafloor strains have much lower nucleotide diversity, substitution rate, and homologous recombination rate than other strains, reflecting that the growth and/or reproduction of subseafloor strains are extremely slow. Our data provide new insights into the adaptation and long-term survival of the fungi in the subseafloor sedimentary biosphere.Entities:
Keywords: Geology; Mycology
Year: 2022 PMID: 35663011 PMCID: PMC9156946 DOI: 10.1016/j.isci.2022.104417
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Genome assembly and annotation summary of S. commune strains
| Assembly feature | ||||
|---|---|---|---|---|
| Genome size (Mbp) | 40.79 | 35.88 | 36.46 | 38.48 |
| Coverage (X) | 113.1X | 112.3X | 109.4X | 8.29X |
| Number of Scaffold | 162 | 1,774 | 1,707 | 36 |
| N50 (bp) of contigs | 1,826,793 | 54,148 | 54,683 | 2,548,518 |
| GC content (%) | 57.32 | 57.50 | 57.45 | 56.67 |
| Gene number | 10,765 | 13,827 | 15,199 | 13,210 |
| Average gene length (bp) | 1,725 | 1,708 | 1,692 | 1,795 |
| Average exon length (bp) | 213 | 247 | 264 | 249 |
| Average intron length (bp) | 91 | 76 | 72 | 79 |
| Average number of exons per genes | 5.9 | 5.55 | 5.27 | 5.7 |
Figure 1The circos diagram of S. commune 20R-7-F01 genome
The outermost layer is the chromosome and its size. The second and third layers are CDS on the positive and negative chains, and the different colors indicate the functional classification of different COGs of the CDS. The fourth and fifth layers are gene density on the positive and negative chains. The sixth and seventh layers are GC content, the green part indicates that the GC content in this area is higher than the whole genome average GC content, and the blue part indicates that the GC content in this area is lower than the whole genome average GC content. Links between all genes represents inparalogs and the bold line indicates the top five genes with the most copies.
Figure 2Phylogenetic tree and divergence time of S. commune 20R-7-F01
The branch lengths of the phylogenetic tree are scaled to estimated divergence time. The blue bars on the nodes indicate the 95% credibility intervals of the estimated posterior distributions of the divergence times. The overall timeline is shown below the phylogenetic tree.
Figure 3Phylogenetic, population stratification and principal component analyses of S. commune
(A) Maximum likelihood phylogenetic tree of subseafloor, marine, and terrestrial S. commune population.
(B) PCA analysis for 30 S. commune samples.
The contribution of recombination and mutation to nucleotide diversity of subseafloor and terrestrial S. commune populations
| Group | R/θ | δ | v | r/m |
|---|---|---|---|---|
| Subseafloor | 0.00238761 | 2084.436 | 0.0105791 | 0.0527 |
| Terrestrial | 0.122791 | 11831.35 | 0.183557 | 266.67 |
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Total DNA and RNA of | This study | NA |
| original code | This study | |
| Resequencing data | This study | GeneBank: PRJNA738972 |
| RNA sequence data | This study | GeneBank: PRJNA543698 |
| Genome assembly data | This study | GeneBank: PRJNA544166 |
| Other genome data | GeneBank: PRJNA236351 | |
| Other resequencing data | GeneBank: PRJNA234274 | |
| BUSCO | ||
| HGAP3 | ||
| Quast | ||
| TopHat | ||
| Cufflinks | ||
| RNAmmer | ||
| tRNAscan-SE | ||
| Fgenesh | ||
| EVidenceModeler | ||
| OrthoFinder | ||
| Orthomcl | ||
| MAFFT | ||
| Gblocks | ||
| RAxML | ||
| PAML | ||
| CAFE | ||
| MUMmer | ||
| BWA-MEM | ||
| GATK | ||
| Annovar | ||
| PLINK | ||
| SNPhylo | ||
| GCTA | ||
| VCFtools | ||
| ClonalFrameML | ||
| MCScanX | ||
| bcftools | ||
| Poppr | ||