| Literature DB >> 24415955 |
Kui Lin1, Erik Limpens2, Zhonghua Zhang3, Sergey Ivanov2, Diane G O Saunders4, Desheng Mu5, Erli Pang1, Huifen Cao1, Hwangho Cha1, Tao Lin3, Qian Zhou3, Yi Shang3, Ying Li3, Trupti Sharma2, Robin van Velzen2, Norbert de Ruijter6, Duur K Aanen7, Joe Win4, Sophien Kamoun4, Ton Bisseling8, René Geurts2, Sanwen Huang9.
Abstract
Nuclei of arbuscular endomycorrhizal fungi have been described as highly diverse due to their asexual nature and absence of a single cell stage with only one nucleus. This has raised fundamental questions concerning speciation, selection and transmission of the genetic make-up to next generations. Although this concept has become textbook knowledge, it is only based on studying a few loci, including 45S rDNA. To provide a more comprehensive insight into the genetic makeup of arbuscular endomycorrhizal fungi, we applied de novo genome sequencing of individual nuclei of Rhizophagus irregularis. This revealed a surprisingly low level of polymorphism between nuclei. In contrast, within a nucleus, the 45S rDNA repeat unit turned out to be highly diverged. This finding demystifies a long-lasting hypothesis on the complex genetic makeup of arbuscular endomycorrhizal fungi. Subsequent genome assembly resulted in the first draft reference genome sequence of an arbuscular endomycorrhizal fungus. Its length is 141 Mbps, representing over 27,000 protein-coding gene models. We used the genomic sequence to reinvestigate the phylogenetic relationships of Rhizophagus irregularis with other fungal phyla. This unambiguously demonstrated that Glomeromycota are more closely related to Mucoromycotina than to its postulated sister Dikarya.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24415955 PMCID: PMC3886924 DOI: 10.1371/journal.pgen.1004078
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Genome sequence of single R. irregularis DAOM197198w nuclei.
(A) Sytox Green stained spore containing numerous nuclei. (B) Single Sytox-stained nucleus trapped with a micropipette. (C) Level of homology between four individual nuclei (N6, N31, N33 and N36) and 2 mycelium DNA samples (DNA1 and DNA2). Presented are the 10 largest contigs of the reference genome (representing ∼1,278 kb). The occurrence of SNPs (marked in blue) and INDELs (marked in red), and gene distributions, in the different assemblies are indicated.
Characteristics of the seven genome assemblies from R. irregularis DAOM197198w.
| Reference | DNA1 | DNA2 | N6 | N31 | N33 | N36 | ||
| Contigs | N20 (bp) | 35,093 | 41,795 | 43,394 | 46,162 | 23,481 | 18,221 | 25,221 |
| N50 (bp) | 16,014 | 18,598 | 19,912 | 19,648 | 10,530 | 8,434 | 11,121 | |
| N80 (bp) | 4,077 | 6,103 | 6,420 | 6,341 | 3,678 | 3,065 | 3,795 | |
| Av. length (bp) | 2,366 | 1,684 | 1,620 | 1,544 | 2,309 | 2,330 | 2,269 | |
| Max. length (bp) | 189,408 | 208,614 | 214,169 | 171,410 | 79,274 | 61,908 | 107,227 | |
| Total number | 31,773 | 14,093 | 13,797 | 13,787 | 16,493 | 15,073 | 16,893 | |
| Total (Mb) | 140.5 | 115.8 | 117.1 | 115 | 90.4 | 71.6 | 95.5 | |
| Scaffolds | N20 (bp) | 35,747 | 43,804 | 45,734 | 48,447 | 24,016 | 18,834 | 26,344 |
| N50 (bp) | 16,360 | 19,381 | 20,888 | 20,759 | 10,982 | 8,624 | 11,688 | |
| N80 (bp) | 4,293 | 6,331 | 6,752 | 6,713 | 3,813 | 3,144 | 3,935 | |
| Av. length (bp) | 2,322 | 1,599 | 1,532 | 1,470 | 2,223 | 2,265 | 2,175 | |
| Max. length (bp) | 198,933 | 208,614 | 214,169 | 171,626 | 79,274 | 69,834 | 107,227 | |
| Total number | 30,638 | 13,333 | 12,871 | 12,603 | 15,672 | 14,550 | 15,949 | |
| Total (Mb) | 140.9 | 115.9 | 117.1 | 115 | 90.4 | 71.6 | 95.5 |
N50: the length for which the contigs (scaffolds) of that length or longer contains at least half of the total lengths of the contigs (scaffolds).
Figure 2Overview of polymorphisms in the R. irregularis 45S rDNA repeat unit in four individual nuclei.
The top part indicates the various regions within the R. irregularis DAOM197198 45S rDNA reference sequence (Genbank JF439109). Position means the position of each polymorphic site on the reference. G1: genotype identical to reference; G2: polymorphic nucleotide. The six histograms show the numbers of sequenced reads supporting the two genotypes for N6, N31, N33, N36 and mycelium DNA samples DNA1 and DNA2. The dashed lines indicate the average sequencing depth for each sample.
Figure 3ML tree derived from the concatenation of 35 widespread, single-copy genes.
The amino acid alignment was trimmed as explained in the Materials and methods section to remove non-informative positions, resulting in 26,604 positions. The tree was estimated using the rtREV evolutionary model implemented in RAxML. Bootstrap analysis was performed based on 100 replicates, and the three nodes with support below 100 are indicated. Scale bar indicates average number of amino acid substitutions per site.
Figure 4Comparison of secretomes of R. irregularis and other 43 fungi.
Percentage of predicted proteome representing putative effectors, using stringent (lacking transmembrane domains; yellow bars) or relaxed criteria (including proteins with predicted single transmembrane domain that overlapped with the signal peptide; blue bars).
Figure 5Top 100 ranked protein tribes containing putative effector candidates.
Clusters were determined using hierarchical clustering of the top 100 ranked tribes containing putative effector candidates. A. Rank associated with each tribe based on their content of effector features. B. Score for number of members containing a nuclear localization signals (NLS). C. Score for number of members classified as repeat containing (RCPs). D. Score reflecting number of members classified as small and cysteine rich (SCRs). E. Score for number of members not annotated by searches against swissprot. F. Average protein sequence length for tribe members (ranging from 55 to 856 amino acids). Stars indicate tribes that contain members with similarity to the characterised effector SP7.