Literature DB >> 27222801

Genome-wide functional annotation of Phomopsis longicolla isolate MSPL 10-6.

Omar Darwish1, Shuxian Li2, Benjamin Matthews3, Nadim Alkharouf1.   

Abstract

Phomopsis seed decay of soybean is caused primarily by the seed-borne fungal pathogen Phomopsis longicolla (syn. Diaporthe longicolla). This disease severely decreases soybean seed quality, reduces seedling vigor and stand establishment, and suppresses yield. It is one of the most economically important soybean diseases. In this study we annotated the entire genome of P. longicolla isolate MSPL 10-6, which was isolated from field-grown soybean seed in Mississippi, USA. This study represents the first reported genome-wide functional annotation of a seed borne fungal pathogen in the Diaporthe-Phomopsis complex. The P. longicolla genome annotation will enable research into the genetic basis of fungal infection of soybean seed and provide information for the study of soybean-fungal interactions. The genome annotation will also be a valuable resource for the research and agricultural communities. It will aid in the development of new control strategies for this pathogen. The annotations can be found from: http://bioinformatics.towson.edu/phomopsis_longicolla/download.html. NCBI accession number is: AYRD00000000.

Entities:  

Keywords:  Functional annotation; Phomopsis longicolla; Phomopsis seed decay; Soybean pathogen

Year:  2016        PMID: 27222801      PMCID: PMC4856852          DOI: 10.1016/j.gdata.2016.03.006

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Direct link to deposited data

Annotations can be found here: http://bioinformatics.towson.edu/phomopsis_longicolla/download.html. NCBI deposited data can be found here: http://www.ncbi.nlm.nih.gov/nuccore/AYRD00000000.

Experimental design, materials and methods

DNA extraction, library construction, and sequencing

P. longicolla is the primary cause of Phomopsis seed decay in soybean [1], [2]. An isolate of P. longicolla, MSPL10-6, was isolated from field-grown soybean seed in Stoneville, Mississippi, USA in 2010 using the standard seed plating procedure [3]. DNA extraction, library construction, and sequencing of P. longicolla MSPL10-6 was previously described in [4].

Gene prediction

The genome sequence of P. longicolla MSPL10-6, was previously de-novo assembled into 108 scaffolds of 500 bases or larger [4]. Gene prediction analysis was done on these scaffolds, using a combination of homology searching and de novo prediction using Augustus web server [5] with complete gene option enabled and default for the rest of the parameters. Fusarium graminearum was selected as the reference species due to the relatively close phylogenetic relationship to P. longicolla MSPL10-6. The analysis yielded a total of 16,597 genes (average length = 1704.37 bp, total length = 28,287,360 bp, total coding length = 24,840,981 bp). Out of which 4334 genes were found to consist of a single exon (average length = 1219.1 bp). The total number of exons in all predicted genes was found to be 47,213 exons (average length = 361.76 bp, total length = 4,435,952 bp). The gene prediction statistics are summarized in Supplemental Table 1.

Gene functional annotation

Predicted genes were functionally annotated using Blast2GO [6]. The gene models were blasted (blastx) [7] against the NCBI non-redundant protein database. Then domain finding searches were done using InterProScan [8]. Enzyme codes and GO ontologies were then assigned to the gene models [9]. From a total of 16,596 genes, 9.64% failed to obtain significant hits with Blast. 18.01% of them returned significant sequence alignments, but cannot be linked to any Gene Ontology entries. Overall functional labels were assigned to 59.45% of the predicted genes. Enzyme codes were assigned to 15.45% of the genes. The enzyme code (EC) distributions (level 3) are summarized in Fig. 1. The GO distributions (level 3) are summarized in Fig. 2 (biological process) and Fig. 3 (molecular function).
Fig. 1

Enzyme code level 3 distribution.

Fig. 2

GO distribution in level 3 (top 50) for biological process (BP).

Fig. 3

GO distribution in level 3 (top 50) for molecular function (MF).

The following are the supplementary data related to this article.

Supplementary Table 1

Gene model numbers and exon types identified from the genome annotation of Phomopsis longicolla isolate MSPL 10-6.
Specifications
Organism/cell line/tissuePhomopsis longicolla
StrainMSPL 10-6
SequencerIllumina HiSeq 2500 sequencer
Data formatAnalyzed, gff file
Experimental factorsn/a
Experimental featuresGenome annotation
Consentn/a
Sample source locationn/a
  6 in total

1.  InterProScan--an integration platform for the signature-recognition methods in InterPro.

Authors:  E M Zdobnov; R Apweiler
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  Draft genome sequence of Phomopsis longicolla isolate MSPL 10-6.

Authors:  Shuxian Li; Omar Darwish; Nadim Alkharouf; Benjamin Matthews; Pingsheng Ji; Leslie L Domier; Ning Zhang; Burton H Bluhm
Journal:  Genom Data       Date:  2014-11-20

4.  WebAUGUSTUS--a web service for training AUGUSTUS and predicting genes in eukaryotes.

Authors:  Katharina J Hoff; Mario Stanke
Journal:  Nucleic Acids Res       Date:  2013-05-21       Impact factor: 16.971

5.  High-throughput functional annotation and data mining with the Blast2GO suite.

Authors:  Stefan Götz; Juan Miguel García-Gómez; Javier Terol; Tim D Williams; Shivashankar H Nagaraj; María José Nueda; Montserrat Robles; Manuel Talón; Joaquín Dopazo; Ana Conesa
Journal:  Nucleic Acids Res       Date:  2008-04-29       Impact factor: 16.971

6.  Gene Ontology Consortium: going forward.

Authors: 
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 19.160

  6 in total
  1 in total

1.  Identification of fungi associated with soybeans and effective seed disinfection treatments.

Authors:  Diana Escamilla; Maria Luciana Rosso; Bo Zhang
Journal:  Food Sci Nutr       Date:  2019-08-22       Impact factor: 2.863

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.