| Literature DB >> 28870170 |
Shuxian Li1, Omar Darwish2, Nadim W Alkharouf2, Bryan Musungu3,4, Benjamin F Matthews5.
Abstract
BACKGROUND: Phomopsis longicolla T. W. Hobbs (syn. Diaporthe longicolla) is a seed-borne fungus causing Phomopsis seed decay in soybean. This disease is one of the most devastating diseases reducing soybean seed quality worldwide. To facilitate investigation of the genomic basis of pathogenicity and to understand the mechanism of the disease development, the genome of an isolate, MSPL10-6, from Mississippi, USA was sequenced, de novo assembled, and analyzed.Entities:
Keywords: Genome; Phomopsis longicolla; Phomopsis seed decay; Soybean
Mesh:
Substances:
Year: 2017 PMID: 28870170 PMCID: PMC5584002 DOI: 10.1186/s12864-017-4075-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Statistics of genome sequencing and assembly of Phomopsis longicolla isolate MSPL 10–6
| Sequencing Statistics Library | Paired End (0.5 Kb inserts) | Mate Pair (3.9 Kb inserts) | Total |
|---|---|---|---|
| Raw data | |||
| Size | 6.9 Gb | 16.2 Gb | 23.1 Gb |
| Coverage | 108 X | 253 X | 361 X |
| Processed data | |||
| Size | 6.2 Gb | 8.2 Gb | 14.4 Gb |
| Coverage | 97 X | 128 X | 225 X |
| Assembly Statistics | Contigs | Scaffolds | |
| Total assembly size | 62 Mb | 66.7 Mb | |
| Total assembled sequences | 12,329 | 108 | |
| Longest sequence length | 215 Kb | 6.2 Mb | |
| Average sequence length | 5054 bp | 618 Kb | |
| N90 index | 2900 | 62 | |
| N90 length | 3.21 Kb | 299 Kb | |
| N50 index | 662 | 17 | |
| N50 length | 26.3 Kb | 1.04 Mb | |
Statistics of genome annotation of Phomopsis longicolla isolate MSPL 10–6
| Genes | |||
| Single Exon Genes | All Genes | ||
| Count | 4334 | Count | 16,597 |
| Average Length | 1219.1 | Average Length | 1704.37 |
| Median Length | 1026 | Median Length | 1411 |
| Total Length | 5,283,582 | Total Length | 28,287,360 |
| Average Coding Length | 1216.07 | Average Coding Length | 1496.72 |
| Median Coding Length | 1023 | Median Coding Length | 1236 |
| Total Coding Length | 5,270,457 | Total Coding Length | 24,840,981 |
| Average Score | 0.82 | Average Score | 2.25 |
| Total Score | 3560.51 | Total Score | 37,365.46 |
| Ave Exons Per | 1 | Ave Exons Per | 2.84 |
| Med Exons Per | 1 | Med Exons Per | 2 |
| Total Exons | 4334 | Total Exons | 47,213 |
| Exons | |||
| Initial | Terminal | ||
| Count | 12,262 | Count | 12,262 |
| Average Length | 361.76 | Average Length | 636.18 |
| Median Length | 214 | Median Length | 434 |
| Total Length | 4,435,952 | Total Length | 7,800,818 |
| Average Score | 0.8 | Average Score | 0.79 |
| Total Score | 9765.44 | Total Score | 9696.51 |
| Internal | Single | ||
| Count | 18,355 | Count | 4334 |
| Average Length | 399.55 | Average Length | 1216.07 |
| Median Length | 233 | Median Length | 1023 |
| Total Length | 7,333,754 | Total Length | 5,270,457 |
| Average Score | 0.78 | Average Score | 0.82 |
| Total Score | 14,343 | Total Score | 3560.51 |
Fig. 1Functional categorization and distribution of potential genes in the genome of Phomopsis longicolla isolate MSPL 10–6
Fig. 2Enzyme code (EC) class distributions in the genome of Phomopsis longicolla isolate MSPL 10–6 based on the gene ontology classification
A list of the carbohydrate-activated enzymes (CAZy) identified in the genome of Phomopsis longicolla isolate MSPL 10–6
| CAZy familya | Substrate | Description | EC | Copy number |
|---|---|---|---|---|
| CE1 | Hemicellulose (xylan) | Acetyl xylan esterase | 3.1.1.72 | 33 |
| Feruloyl esterase | 3.1.1.73 | |||
| CE2 | Hemicellulose (xylan) | Acetyl xylan esterase | 3.1.1.72 | 2 |
| CE3 | Hemicellulose (xylan) | Acetyl xylan esterase | 3.1.1.72 | 10 |
| CE4 | Hemicellulose (xylan) | Acetyl xylan esterase | 3.1.1.72 | 8 |
| CE5 | Hemicellulose (xylan) | Acetyl xylan esterase | 3.1.1.72 | 18 |
| Cutin | Cutinase | 3.1.1.74 | ||
| CE7 | Hemicellulose (xylan) | Acetyl xylan esterase | 3.1.1.72 | 1 |
| CE8 | Pectin (homogalacturonan) | Pectin methylesterase | 3.1.1.11 | 6 |
| CE9 | Polysaccharides | N-acetylglucosamine 6-phosphate | 3.5.1.25 | 1 |
| Deacetylase | 3.5.1.80 | |||
| CE10 | Polysaccharides | Arylesterase | 3.1.1.- | 105 |
| CE12 | Pectin(homogalacturonan, rhamnogalacturonan I) | Pectin acetylesterase | 3.1.1.- | 5 |
| Hemicellulose | Acetyl pectin esterase | 3.1.1.72 | ||
| CE14 | Polysaccharides | N-acetylglucosaminylphosphatidy-linositol deacetylase | 3.5.1.89 | 1 |
| CE15 | Polysaccharides | 4-O-methyl-glucuronoyl methylesterase | 3.1.1.- | 2 |
| CE16 | Polysaccharides | Acetylesterase | 3.1.1.6 | 7 |
| GH1 | Cellulose | β-glucosidase | 3.2.1.21 | 6 |
| Hemicellulose (xylan, xyloglucan) | β-xylosidase | 3.2.1.37 | ||
| Pectin (rhamnogalacturonan I) | β-galactosidase | 3.2.1.23 | ||
| GH2 | Hemicellulose (xylan, xyloglucan, galactomannan) | β-mannosidase | 3.2.1.25 | 10 |
| Pectin (rhamnogalacturonan I) | β-glucuronidase | 3.2.1.31 | ||
| GH3 | Cellulose | β-glucosidase | 3.2.1.21 | 23 |
| Hemicellulose | β-xylosidase | 3.2.1.37 | ||
| (xylan, xyloglucan) | 3.2.1.74 | |||
| Pectin | exo-β-1,4-glucanase | |||
| GH5 | Cellulose | endo-β-1,4-glucanase | 3.2.1.4 | 27 |
| Hemicellulose (galactomannan) | endo-β-1,4-xylanase | 3.2.1.8 | ||
| Pectin (rhamnogalacturonan I) | exo-β-1,4-glucanase | 3.2.1.74 | ||
| GH6 | Cellulose | Cellobiohydrolase | 3.2.1.91 | 4 |
| endo-β-1,4-glucanase | 3.2.1.4 | |||
| GH7 | Cellulose | endo-β-1,4-glucanase | 3.2.1.4 | 9 |
| Cellobiohydrolase | 3.2.1.176 | |||
| GH9 | Cellulose | Cellusae | - | 1 |
| GH10 | Hemicellulose (xylan) | endo-β-1,4-xylanase | 3.2.1.8 | 7 |
| GH11 | Hemicellulose (xylan) | endo-β-1,4-xylanase | 3.2.1.8 | 4 |
| GH12 | Cellulose | endo-β-1,4-glucanase | 3.2.1.4 | 6 |
| Hemicellulose (xyloglucan) | Xyloglucanase | 3.2.1.151 | ||
| GH13 | Polysaccharides | α-amylase | 3.2.1.1 | 17 |
| GH15 | Polysaccharides | Glucoamylase | 3.2.1.3 | 1 |
| GH16 | Hemicellulose | Xyloglucanase | 3.2.1.151 | 21 |
| GH17 | Polysaccharides | endo-1,3-β-glucosidase | 3.2.1.39 | 7 |
| GH18 | Polysaccharides | Chitinase | 3.2.1.14 | 27 |
| endo-β-N-acetylglucosaminidase | 3.2.1.96 | |||
| GH20 | Polysaccharides | β-hexosaminidase | 3.2.1.52 | 3 |
| GH26 | Polysaccharides | beta-mannanase | 3.2.1.78 | 1 |
| GH27 | Hemicellulose (xylan, xyloglucan, galactomannan) | α-galactosidase | 3.2.1.22 | 2 |
| α-N-acetylgalactosaminidase | 3.2.1.49 | |||
| GH28 | Pectin (homogalacturonan, rhamnogalacturonan I) | Polygalacturonase | 3.2.1.15 | 21 |
| GH29 | Oligosaccharides | alpha-L-fucosidase | 3.2.1.51 | 4 |
| GH30 | Polysaccharides | Glucosylceramidase | 3.2.1.45 | 4 |
| GH31 | Hemicellulose (xyloglucan) | α-xylosidase | 3.2.1.177 | 7 |
| GH32 | Sucrose | Invertase | 3.2.1.26 | 6 |
| GH33 | Oligosaccharides | exo-α-sialidase | 3.2.1.18 | 1 |
| GH35 | Hemicellulose (xylan, xyloglucan, galactomannan) | β-galactosidase | 3.2.1.23 | 7 |
| Pectin (rhamnogalacturonan I) | exo-β-1,4-galactanase | 3.2.1.- | ||
| GH36 | Hemicellulose (xylan, xyloglucan, galactomannan) | α-galactosidase | 3.2.1.22 | 1 |
| α-N-acetylgalactosaminidase | 3.2.1.49 | |||
| GH37 | Trehalose | α,α-trehalase | 3.2.1.28 | 2 |
| GH38 | Oligosaccharides | α-mannosidase | 3.2.1.24 | 1 |
| GH39 | Oligosaccharides | alpha-L-iduronidase | 3.2.1.76 | 2 |
| GH42 | Oligosaccharides | beta-galactosidase | 3.2.1.23 | 1 |
| GH43 | Hemicellulose (xylan) | β-xylosidase | 3.2.1.37 | 40 |
| Pectin (rhamnogalacturonan I) | α-L-arabinofuranosidase | 3.2.1.55 | ||
| GH45 | Cellulose | endo-β-1,4-glucanase | 3.2.1.4 | 2 |
| GH47 | Oligosaccharides | α-mannosidase | 3.2.1.113 | 12 |
| GH51 | Cellulose | endo-β-1,4-glucanase | 3.2.1.4 | 4 |
| Hemicellulose (xylan,xyloglucan) | β-xylosidase | 3.2.1.37 | ||
| GH53 | Pectin (rhamnogalacturonan I) | endo-β-1,4-galactanase | 3.2.1.89 | 4 |
| GH54 | Pectin | alpha-L-arabinofuranosidase | 3.2.1.55 | 1 |
| GH55 | Polysaccharides | endo-1,3-β-glucosidase | 3.2.1.39 | 6 |
| GH62 | Polysaccharides | alpha-L-arabinofuranosidase | 3.2.1.55 | 1 |
| GH63 | Oligosaccharides | α-glucosidase | 3.2.1.106 | 4 |
| GH64 | Polysaccharides | endo-1,3-β-glucosidase | 3.2.1.39 | 4 |
| GH65 | Polysaccharides | alpha,alpha-trehalase | 3.2.1.28 | 2 |
| GH67 | Polysaccharides | alpha-glucuronidase | 3.2.1.139 | 1 |
| GH71 | Polysaccharides | α-1,3-glucanase | 3.2.1.59 | 9 |
| GH72 | Polysaccharides | β-1,3-glucanosyltransglycosylase | 2.4.1.- | 10 |
| GH74 | Cellulose | endo-β-1,4-glucanase | 3.2.1.4 | 11 |
| Hemicellulose (xyloglucan) | Xyloglucanase | 3.2.1.151 | ||
| GH76 | Oligosaccharides | α-1,6-mannanase | 3.2.1.101 | 14 |
| GH78 | Pectin | α-L-rhamnosidase | 3.2.1.40 | 14 |
| GH79 | Pectin (rhamnogalacturonan I) | β-glucuronidase | 3.2.1.31 | 9 |
| GH81 | Polysaccharides | endo-1,3-β-glucosidase | 3.2.1.39 | 2 |
| GH88 | Polysaccharides | β-glucuronyl hydrolase | 3.2.1.- | 1 |
| GH92 | Oligosaccharides | Mannosyl-oligosaccharide alpha-1,2-mannosidase | 3.2.1.113 | 8 |
| GH93 | Pectin (rhamnogalacturonan I) | exo-α-L-1,5-arabinanase | 3.2.1.- | 7 |
| GH94 | Cellulose | cellobiose phosphorylase | 2.4.1.20 | 1 |
| GH95 | Hemicellulose (xyloglucan) | α-1,2-L-fucosidase | 3.2.1.63 | 3 |
| GH105 | Pectin | Rhamnogalacturonyl hydrolase | 3.2.1.172 | 8 |
| GH106 | Polysaccharides | alpha-L-rhamnosidase | 3.2.1.40 | 4 |
| GH109 | Polysaccharides | α-N-acetylgalactosaminidase | 3.2.1.49 | 20 |
| GH114 | Polysaccharides | endo-α-1,4-polygalactosaminidase | 3.2.1.109 | 3 |
| GH115 | Hemicellulose (xylan) | Xylan α-1,2-glucuronidase | 3.2.1.131 | 4 |
| GH125 | Oligosaccharides | exo-α-1,6-mannosidase | 3.2.1.- | 4 |
| GH127 | Oligosaccharides | β-L-arabinofuranosidase | 3.2.1.185 | 3 |
| GH128 | Polysaccharides | endo-1,3-β-glucosidase | 3.2.1.39 | 6 |
| GH131 | Cellulose | exo-β-1,3/1,4/1,6-glucanase | 3.2.1.- | 8 |
| Hemicellulose | ||||
| GH132 | Polysaccharides | Activity on β-1,3glucan | – | 2 |
| GH133 | Polysaccharides | amylo-α-1,6-glucosidase | 3.2.1.33 | 1 |
| GH134 | Polysaccharides | endo-β-1,4-mannanase | 3.2.1.78 | 1 |
| GH135 | Polysaccharides | α-1,4-galactosaminogalactan hydrolase | 3.2.1.- | 4 |
| PL1 | Pectin (homogalacturonan) | Pectate lyase | 4.2.2.2 | 21 |
| PL3 | Pectin | Pectate lyase | 4.2.2.2 | 10 |
| PL4 | Pectin (rhamnogalacturonan I) | Rhamnogalacturonan lyase | 4.2.2.- | 8 |
| PL9 | Pectin | Pectate lyase | 4.2.2.2 | 2 |
| Exopolygalacturonate lyase | 4.2.2.9 | |||
| PL11 | Pectin | Rhamnogalacturonan endolyase | 4.2.2.23 | 2 |
| PL22 | Pectin | Oligogalacturonate lyase | 4.2.2.6 | 1 |
| PL24 | Pectin | Ulvan lyase | 4.2.2.- | 1 |
a CE Carbohydrate esterases, GH Glycoside hydrolases, PL polysaccharide lyases
Classes of auxiliary activity (AA), carbohydrate-binding module (CBM), and glycosyl-transferase (GT) enzymes in the genome of Phomopsis longicolla isolate MSPL 10–6
| CAZya family | Description | Copy Number |
|---|---|---|
| AA1 | Multicopper oxidases | 6 |
| AA2 | Lignin peroxidase | 16 |
| AA3 | Glucose-methanol-choline (GMC) oxidoreductases | 75 |
| AA4 | vanillyl-alcohol oxidase | 7 |
| AA5 | radical-copper oxidases | 3 |
| AA6 | 1,4-benzoquinone reductases | 1 |
| AA7 | Glucooligosaccharide oxidase | 96 |
| AA8 | Iron reductase | 7 |
| AA9 | Copper-dependent lytic polysaccharide monooxygenases | 35 |
| AA11 | monooxygenase | 10 |
| AA12 | The pyrroloquinoline quinone-dependent oxidoreductase activity was demonstrated for the CC1G_09525 protein of | 2 |
| AA13 | Monooxygenase | 1 |
| CBM1 | Cellulose-binding | 21 |
| CBM6 | Amylase | 1 |
| CBM13 | Cellulose-binding | 2 |
| CBM18 | Chitin-binding | 12 |
| CBM20 | Starch-binding | 6 |
| CBM21 | Starch-binding | 1 |
| CBM23 | Mannan-binding | 1 |
| CBM24 | Alpha-1,3-glucan (mutan)-binding | 12 |
| CBM32 | Binding to LacNAc (beta-D-galactosyl-1,4-beta-D-N-acetylglucosamine) | 2 |
| CBM35 | Xylan-binding | 4 |
| CBM37 | Xylanase | 1 |
| CBM42 | Binding to arabinofuranose | 1 |
| CBM43 | Beta-1,3-glucan binding | 2 |
| CBM48 | Amylase | 1 |
| CBM50 | Peptidoglycan-binding (LysM domain) | 37 |
| CBM63 | Cellulose-binding | 1 |
| CBM66 | β-fructosidase | 2 |
| CBM67 | L-rhamnose-binding | 6 |
| GT1 | UDP-glucuronosyl-transferase | 12 |
| GT2 | Cellulose/chitin synthase | 16 |
| GT3 | Glycogen synthase | 1 |
| GT4 | Sucrose synthase | 7 |
| GT5 | Glycogen glucosyltransferase | 4 |
| GT8 | Lipopolysaccharide glucosyl-transferase | 6 |
| GT15 | α-1,2-mannosyl-transferase | 5 |
| GT17 | β-1,4-N-acetyl-glucosaminyl-transferase | 1 |
| GT20 | α,α-trehalose-phosphate synthase | 3 |
| GT21 | Ceramide β-glucosyl-transferase | 3 |
| GT22 | Man6GlcNAc2-PP-Dol α-1,2-mannosyl-transferase | 4 |
| GT24 | Glycoprotein α-glucosyl-transferase | 1 |
| GT25 | Lipopolysaccharide beta-1,4-galactosyltransferase | 6 |
| GT28 | Digalactosyl-diacyl-glycerol- synthase | 1 |
| GT31 | Fucose-specific β-1,3-N-acetylglucosaminyl-transferase | 4 |
| GT32 | α-1,6-mannosyl-transferase | 13 |
| GT33 | Chitobiosyl-diphosphodolichol β-mannosyl-transferase | 1 |
| GT34 | α-1,2-galactosyl-transferase | 2 |
| GT35 | Starch phosphorylase | 1 |
| GT39 | Protein α-mannosylt-ransferase | 3 |
| GT41 | Beta-N-acetylglucosaminyltransferase | 1 |
| GT48 | 1,3-β-glucan synthase | 1 |
| GT50 | α-1,4-mannosyl-transferase | 1 |
| GT55 | GDP-Man: mannosyl-3-phosphoglycerate synthase | 2 |
| GT57 | α-1,3-glucosyl-transferase | 3 |
| GT58 | Man5GlcNAc2-PP-Dol α-1,3-mannosyl-transferase | 1 |
| GT59 | Glc2Man9GlcNAc2-PP-Dol α-1,2-glucosyl-transferase | 1 |
| GT61 | Xylanase | 1 |
| GT62 | α-1,2-mannosyl-transferase | 3 |
| GT66 | dolichyl-diphospho-oligosaccharide-protein Glycotransferase | 1 |
| GT68 | O-alpha-fucosyltransferase | 1 |
| GT69 | α-1,3-mannosyl-transferase | 4 |
| GT71 | α-mannosyl-transferase | 8 |
| GT76 | α-1,6-mannosyl-transferase | 1 |
| GT77 | Xylanase | 1 |
| GT90 | Xylanase | 9 |
| GT92 | Glycanase | 1 |
a Carbohydrate-activated enzymes
Fig. 3The number of different classes of the carbohydrate-activated enzymes (CAZy) in Phomopsis longicolla and other ascomycete fungi (Aspergillus niger, Hypocrea jecorina, Fusarium oxysporum, F. virguliforme, F. verticilloides, Magnaporthe oryzae, Neurospora crassa, and Podospora anserine)
Classification of repetitive elements identified in the genome of Phomopsis longicolla isolate MS 10–6
| Total length (bp) | Repetitive content (%) | Genome content (%) | |
|---|---|---|---|
| Class I (Retrotransposon) | |||
| LTR retrotransposona | |||
| LTR/Copia | 2,408,959 | 30.71 | 3.89 |
| LTR/Gypsy | 1,914,239 | 24.40 | 3.09 |
| LTR/Other | 80,954 | 1.03 | 0.13 |
| Subtotal | 4,404,152 | 56.14 | 7.11 |
| Non-LTR retrotransposon | |||
| LINEb | 74,280 | 0.95 | 0.12 |
| Total Class I | 4,478,432 | 57.09 | 7.23 |
| Class II (DNA transoposon) | |||
| DNA/TcMar | 3,277,211 | 41.77 | 5.29 |
| DNA/Other | 65,709 | 0.84 | 0.11 |
| Total Class II | 3,342,920 | 42.61 | 5.4 |
| Other | |||
| Satellites, rRNA, Unknown repeats | 23,808 | 0.30 | 0.04 |
| Total | 7,845,160 | 12.67 | |
along terminal repeat retrotransposon
blong interspersed nuclear elements
The donor, taxonomy, and number of proteins encoded by the horizontal transferred genes in in the genome of Phomopsis longicolla isolate MSPL 10–6
| Donor | Taxonomy | Number of Proteins |
|---|---|---|
|
| Bacteria | 1 |
|
| Fungus | 1 |
|
| Fungus | 4 |
|
| Bacteria | 5 |
|
| Fungus | 2 |
|
| Fungus | 24 |
|
| Bacteria | 8 |
|
| Fungus | 28 |
|
| Fungus | 89 |
|
| Bacteria | 5 |
|
| Bacteria | 2 |
|
| Bacteria | 1 |
|
| Fungus | 1 |
|
| Fungus | 12 |
|
| Bacteria | 2 |
|
| Fungus | 73 |
|
| Parasites | 1 |
|
| Fungus | 66 |
|
| Fungus | 1 |
|
| Bacteria | 1 |
|
| Bacteria | 4 |
|
| Bacteria | 1 |
|
| Bacteria | 3 |
|
| Bacteria | 1 |
|
| Bacteria | 1 |
|
| Fungus | 8 |
|
| Fungus | 5 |
|
| Bacteria | 1 |
|
| Oligohymenophorea | 1 |
|
| Bacteria | 1 |
|
| Fungus | 78 |
|
| Fungus | 3 |
|
| Bacteria | 1 |
|
| Bacteria | 4 |
|
| Bacteria | 6 |
|
| Bacteria | 1 |
|
| Bacteria | 2 |
|
| Fungus | 2 |
|
| Fungus | 1 |
|
| Bacteria | 2 |
|
| Bacteria | 5 |
|
| Fungus | 1 |
|
| Bacteria | 7 |
|
| Bacteria | 1 |
|
| Crenarchaeota | 1 |
|
| Crenarchaeota | 1 |
|
| Fungus | 4 |
|
| Parabasalia | 3 |
|
| Fungus | 9 |
|
| Fungus | 8 |
|
| Bacteria | 1 |
|
| Bacteria | 1 |
|
| Fungus | 15 |
| Total | 510 |
Fig. 4Distribution of annotated horizontal transferred genes in the genome of Phomopsis longicolla isolate MSPL 10–6 based on gene ontology analysis (a) cellular component, (b) biological process, and (c) molecular function