| Literature DB >> 26945779 |
Angela H Williams1,2, Mamta Sharma3, Louise F Thatcher4, Sarwar Azam5, James K Hane6,7, Jana Sperschneider8, Brendan N Kidd9, Jonathan P Anderson10,11, Raju Ghosh12, Gagan Garg13, Judith Lichtenzveig14, H Corby Kistler15, Terrance Shea16, Sarah Young17, Sally-Anne G Buck18, Lars G Kamphuis19,20, Rachit Saxena21, Suresh Pande22, Li-Jun Ma23, Rajeev K Varshney24,25, Karam B Singh26,27.
Abstract
BACKGROUND: Soil-borne fungi of the Fusarium oxysporum species complex cause devastating wilt disease on many crops including legumes that supply human dietary protein needs across many parts of the globe. We present and compare draft genome assemblies for three legume-infecting formae speciales (ff. spp.): F. oxysporum f. sp. ciceris (Foc-38-1) and f. sp. pisi (Fop-37622), significant pathogens of chickpea and pea respectively, the world's second and third most important grain legumes, and lastly f. sp. medicaginis (Fom-5190a) for which we developed a model legume pathosystem utilising Medicago truncatula.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26945779 PMCID: PMC4779268 DOI: 10.1186/s12864-016-2486-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Genome assembly characteristics and comparisons across Fusarium species
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| Genome Abbreviation |
|
|
|
|
|
|
|
|
| Isolate | 5190a | 38-1 | NRRL37622 | 4287 | 5176 | MPVI 77-13-4 | 7600 | PH-1 |
| Chromosome number | Unknown | Unknown | 16 | 15 | Unknown | 17 | 11 | 4 |
| Host |
| Chickpea |
| Tomato |
| Pea | Maize and cereals | Wheat and barley |
| Total assembly length (bp) | 51,139,932 | 54,813,009 | 55,188,000 | 57,720,560 | 54,767,602 | 47,328,059 | 41,104,290 | 36,346,967 |
| Average length (bp) | 12,674 | 36,961 | 116,923 | 3,607,535 | 6969 | 2,629,337 | 3,425,358 | 7,269,393 |
| Maximum length (bp) | 4,225,797 | 2,918,844 | 5,895,957 | 6,854,980 | 415,898 | 6,369,736 | 6,219,215 | 11,694,295 |
| Minimum length (bp) | 200 | 1000 | 2000 | 1,646,460 | 200 | 215,166 | 2,040,847 | 7,711,129 |
| L50 (bp) | 1,632,076 | 182,371 | 2,840,000 | 4,589,937 | 60,340 | 3,621,839 | 4,236,349 | 8,911,601 |
| N50 | 10 | 58 | 54 | 6 | 222 | 6 | 5 | 2 |
| Total sequences | 4034 | 1482 | 472 | 15 | 7858 | 17 | 11 | 4 |
| Total unknown bases (N) | 1,929,171 | 2,262,803 | 12,221 | 1,238,430 | 33 | 1915 | 87,573 | 212,843 |
| GC%c | 48.2 | 48.1 | 47.6 | 48.5 | 47.8 | 51.5 | 48.7 | 48.4 |
| Percentage of | 1.5 | 5.4 | 8.2 | 21.9 | 9.4 | 3.1 | 1.3 | 1.5 |
| Percentage of predicted repetitive sequences based on comparison to known fungal repeats in Repbase | 1.0 | 2.4 | 2.8 | 4.4 | 2.4 | 1.4 | 0.8 | 0.9 |
| CEGMA Analysisa | ||||||||
| CEGMA partial % | 98 | 98 | 98 | 97 | 98 | 98 | 96 | 97 |
| CEGMA partial | 243 | 244 | 242 | 241 | 243 | 243 | 239 | 241 |
| CEGMA complete | 239 | 240 | 237 | 237 | 238 | 238 | 235 | 234 |
aNumber of complete Conserved Eukaryotic Genes found in the assembly of the core set of 248 defined by Parra et al. [46]
bReconstructed chromosome mapped regions only
cGC content and de novo repeat percentage based on RepeatMasker analysis of scaffolds versus de novo repeats
Protein set comparisons across Fusarium species
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| Total number of proteins | 16,124 | 16,858 | 19,623 | 17,817 | 17,701 | 15,707 | 13,321 | 20,553 |
| SSPsa | 537 | 580 | 620 | 588 | 597 | 426 | 483 | 621 |
| Number of species specific proteins (including paralogs) | 1090 | 863 | 832 | 1606 | 1825 | 2083 | 1632 | 1852 |
| Number of unique species specific proteinsb | 1058 | 808 | 785 | 1540 | 1645 | 1919 | 1622 | 1687 |
aSmall secreted proteins
bNo orthologs or paralogs in any of the other 43 spp. tested including all publicly available Fusarium spp.
Isolate details are as detailed in Additional file 5
Foc F. oxysporum f. sp. ciceris, Fom F. oxysporum f. sp. medicaginis, Fop F. oxysporum f. sp. pisi, Fob F. oxysporum 5176, Fol F. oxysporum f. sp. lycopersici, Fsol F. solani, Fg F. graminearum, Fv F. verticilliodes
Fig. 1F. oxysporum f. sp. lycopersici chromosomes highlighting sequence conservation and other features of CDCs in comparison to core chromosomes. The circos plot illustrates the gene-sparse, repeat-rich nature of Fol CDCs and their lower sequence conservation across related species in comparison to core chromosomes. Outer ring-Fol chromosomes highlighting CDCs (red) and chromosomes that are partially dispensable (yellow). Inner rings: (a) gene density in 100 kb windows, (b) repeat density in 100 kb windows, (c) GC content in 50 k bp windows, range 45–55 %, (d) Region of Fol chromosomes overlapped by Fom-5190a sequences, (e) Foc-38-1, (f) Fop-37622, (g) F. oxysporum f. sp. brassica Fo5176, (h) F. oxysporum f. sp. melonis, (i) F. solani, (j) F. fujikuori, (k) F. verticilliodes, (l) F. virguliforme, and (m) F. graminearum
Properties of scaffolds predicted to form part of either core or dispensable chromosomes in legume-infecting formae speciales
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Number of scaffold sequences | 446 | 3529 | 413 | 1069 | 116 | 356 |
| Total length (bp) | 42,167,838 | 8,766,042 | 41,518,444 | 13,294,565 | 43,587,736 | 11,600,480 |
| Average length (bp) | 94,547 | 2484 | 100,529 | 12,436 | 375,756 | 64,988 |
| Length gene coding (bp) | 23,149,588 | 2,802,952 | 21,935,481 | 4,117,674 | 23,633,062 | 4,042,338 |
| Length repetitive (bp) | 277,400 | 234,489 | 707,976 | 580,264 | 1,354,046 | 3,172,535 |
| % Gene coding | 54.90 % | 31.98 % | 52.83 % | 30.97 % | 54.2 % | 34.8 % |
| % Repetitive | 0.66 % | 2.67 % | 1.71 % | 4.36 % | 3.1 % | 27.3 % |
| Number of predicted genes | 14,427 | 2424 | 12,985 | 3139 | 16,405 | 3218 |
| Percentage of genes with Pfam annotaion | 71 % | 48 % | 72 % | 47 % | 66 % | 51 % |
| Gene density (per 10 Kb) | 3.4 | 2.8 | 3.1 | 2.4 | 3.7 | 2.7 |
| Average protein length (aa) | 479 | 330 | 486 | 361 | 443 | 374 |
Summary of sequence conservation between Fom-5190a and Foc-38-1 predicted dispensable scaffolds and other Fusarium species
|
| (Masked)a |
| (Masked)a |
| (Masked)a | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total length of predicted dispensable scaffolds (bp) | 13,294,565 | (12,714,301) | 8,766,042 | (8,531,553) | 11,600,480 | (8,427,945) | ||||||
| Foc length conserved | Foc sequence conserved | Foc length conserved maskeda | Foc sequence conserved masked |
|
|
|
|
|
|
|
| |
| (bp) | (%) | (bp) | (%) | (bp) | (%) | (bp) | (%) | (bp) | (%) | (bp) | (%) | |
|
| - | - | - | - | 3,504,836 | 40 | 3,051,034 | 34.8 | 5,820,869 | 50.2 | 3,747,704 | 32.3 |
|
| 3,618,892 | 27.2 | 3,134,089 | 23.6 | - | - | - | - | 5,200,782 | 44.8 | 3,852,673 | 33.2 |
|
| 5,645,201 | 42.5 | 4,105,088 | 30.9 | 4,413,435 | 50.3 | 3,484,062 | 39.7 | - | - | - | - |
|
| 1,417,711 | 10.7 | 1,099,653 | 8.3 | 1,215,395 | 13.9 | 1,037,655 | 11.8 | 1,458,181 | 12.6 | 954,748 | 8.2 |
|
| 4,580,988 | 34.5 | 2,041,608 | 15.4 | 3,174,649 | 36.2 | 1,543,921 | 17.6 | 5,107,489 | 44.0 | 1,636,670 | 14.1 |
|
| 5,635,252 | 42.4 | 4,242,674 | 31.9 | 4,684,280 | 53.4 | 3,907,121 | 44.6 | 6,686,394 | 57.6 | 3,999,585 | 34.5 |
|
| 6,429,341 | 48.4 | 4,618,139 | 34.7 | 5,013,128 | 57.2 | 3,930,676 | 44.8 | 7,604,002 | 65.5 | 4,446,346 | 38.3 |
aMasked sequences exclude repetitive regions
Properties of Fom-5190a effector candidates
| Gene |
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| Protein length (aa) | 199 | 264 | 91 | 144 | 131 | 111 | 279 | 141 | 122 | 292 |
| Mature protein length (aa) | 186 | 247 | 72 | 128 | 112 | 95 | 263 | 126 | 105 | 276 |
| Cysteine count in mature protein | 7 | 8 | 8 | 6 | 10 | 9 | 8 | 2 | 6 | 6 |
| % Cysteine (of aa length) | 3.5 | 3 | 8.8 | 4.2 | 8.9 | 8.1 | 2.8 | 1.4 | 5.7 | 2.1 |
| Predicted to be secreted by SignalP | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y |
| WoLFPsort predicted location | extr | extr | extr | extr | extr | extr | extr | extr | extr | extr, mito |
| Phobius predicted signal peptide | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y |
| TMHMM predicted transmembrane count | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Pfam domains | - | - | - | LysM | - | - | - | - | - | - |
| Number of aligned RNAseq reads overlapping gene model from combined 2d | 24 | 1039 | 89 | 378 | 10 | 369 | 2546 | 828 | 246 | 440 |
| Average depth of RNAseq coverage per base of gene | 4 | 123 | 27 | 88 | 3 | 76 | 354 | 141 | 33 | 61 |
| Gene length (bp) | 655 | 844 | 324 | 432 | 458 | 488 | 720 | 587 | 728 | 720 |
| Proportion of gene model supported by RNAseq | 0.96 | 1 | 1 | 1 | 0.69 | 1 | 0.99 | 1 | 1 | 1 |
| Scaffold | 122 | 322 | 364 | 461 | 485 | 531 | 3157, 696, 1167 | 420 | 270 | 306, 3812 |
| Scaffold length (bp) | 28,293 | 2356 | 1489 | 971 | 914 | 815 | 219, 647, 416 | 1129 | 4106 | 3,471, 204 |
| Predicted scaffold type | Dispensable | Dispensable | Dispensable | Dispensable | Dispensable | Dispensable | Dispensable | Dispensable | Dispensable | Dispensable |
| Orthology | Has orthologs in other ff.spp. | Has orthologs in other ff.spp. | Single ortholog unique to | Has orthologs in other ff.spp. | Has orthologs in other ff.spp. | Has orthologs in other ff.spp. | Has orthologs in other ff.spp. | Has orthologs in other ff.spp. | Has orthologs in other ff.spp. | Has orthologs in other ff.spp. |
SIX gene presence on chromosomes/scaffolds in published Fusarium speciesa
|
|
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| six1 | 14 (2nd half dup. on chr 14) | split over 696 and 1167 (partial dup. 241) | - | 1.281, 1.315 | contig00620 | 1.92, 1.101 | 1.500 | - | - | - | - | - |
| six2 | 14 | - | - | - | - | 1.95 | - | - | - | - | 5 | 06 |
| six3 | 14 | - | - | - | - | - | - | - | - | - | - | - |
| six4 | - | - | - | - | contig01435 | 1.102 | - | - | - | - | - | - |
| six4b | _ | - | - | - | contig04359 | _ | - | - | - | - | - | - |
| six5 | 14 (partial match Chr 5) | - | 1215 | partial match 1.4 | - | - | 1.4 | - | - | - | - | - |
| six6 | 14 | - | - | - | - | 1.102 | 1.226 | - | - | - | - | - |
| six7 | 14 | - | - | - | - | - | - | - | - | - | - | - |
| six8 | 2, 3 (3 copies), 6 (2 copies), 7, 14 (2 copies) | 420 | 0138 | - | contig03501 | 1.127 | - | - | - | - | - | - |
| six9 | 14 | 270 | - | 1.43 | contig02779 | 1.86, 1.70 | - | - | - | - | - | - |
| six10 | 14 | - | - | - | - | - | - | - | - | - | - | - |
| six11 | 14 | - | 1126 | - | - | - | 1.532 | - | - | - | - | - |
| six12 | 14 | - | - | - | - | - | - | - | - | - | - | - |
| six13 | 6 (2 copies) | 306 (split), 3812, 306 | 1143 | 1.90 | - | 1.216 | 1.222 | - | - | - | - | - |
| six14 | 14 (1st half dup.) | - | 0905 | 1.191 | - | - | - | - | - | - | 2 | - |
aID of scaffold or contig showing TBLASTN matches with an Expect (E) value score below 10−5, dup duplicate. b Fop-37622 contained annotated homologs of SIX1 (FOVG_19815, FOVG_19730 and SIX9 (FOVG_17008) and matches by TBLASTN to unannotated versions of SIX13 and SIX14 which were manually annotated for this study (Additional file 23) and SIX5 where the coding sequence was interrupted by stop codons
Fig. 2Expression of candidate pathogenicity genes in vitro versus in planta. a Expression of SIX genes and b candidate pathogenicity genes as determined by qRT-PCR in in vitro samples and M. truncatula DZA315 root samples harvested at 1, 2, 4 and 7 days post inoculation (dpi) with Fom-5190a. In vitro samples are averages ± SE of 3 biological replicates. In planta samples are averages ± SE of 4 biological replicates each consisting of pools of 10 seedlings. Gene expression levels are relative to the fungal actin gene (FOXM-5190a_13365). Note: ^ No detectable expression in 1 or 2 out of the 3 in vitro replicates