
SNUGB: a versatile genome browser supporting comparative and functional fungal genomics.

Kyongyong Jung, Jongsun Park, Jaeyoung Choi, Bongsoo Park, Seungill Kim, Kyohun Ahn, Jaehyuk Choi, Doil Choi, Seogchan Kang, Yong-Hwan Lee.

Abstract

BACKGROUND: Since the full genome sequences of Saccharomyces cerevisiae were released in 1996, genome sequences of over 90 fungal species have become publicly available. The heterogeneous formats of genome sequences archived in different sequencing centers hampered the integration of the data for efficient and comprehensive comparative analyses. The Comparative Fungal Genomics Platform (CFGP) was developed to archive these data via a single standardized format that can support multifaceted and integrated analyses of the data. To facilitate efficient data visualization and utilization within and across species based on the architecture of CFGP and associated databases, a new genome browser was needed.
RESULTS: The Seoul National University Genome Browser (SNUGB) integrates various types of genomic information derived from 98 fungal/oomycete (137 datasets) and 34 plant and animal (38 datasets) species, graphically presents germane features and properties of each genome, and supports comparison between genomes. The SNUGB provides three different forms of the data presentation interface, including diagram, table, and text, and six different display options to support visualization and utilization of the stored information. Information for individual species can be quickly accessed via a new tool named the taxonomy browser. In addition, SNUGB offers four useful data annotation/analysis functions, including 'BLAST annotation.' The modular design of SNUGB makes its adoption to support other comparative genomic platforms easy and facilitates continuous expansion.
CONCLUSION: The SNUGB serves as a powerful platform supporting comparative and functional genomics within the fungal kingdom and also across other kingdoms. All data and functions are available at the web site http://genomebrowser.snu.ac.kr/.

Year:  2008        PMID: 19055845      PMCID: PMC2649115          DOI: 10.1186/1471-2164-9-586

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

As the number of sequenced genomes rapidly increases, search and comparison of sequence features within and between species has become an integral part of most biological inquiries. To facilitate the use of sequenced genomes, numerous bioinformatics tools have been developed; among these, genome browsers play an essential role by providing various means for viewing genome sequences and annotated features (e.g., chromosomal position and context of individual genes, protein/nucleotide sequences, exon/intron structures, and promoters) via graphical and text interfaces. Widely utilized genome browsers include: (i) Ensembl, which is specialized for mammalian genomics and comparative genomics [1], (ii) UCSC Genome Browser, which archives genome sequences of 30 vertebrate and 24 non-vertebrate species [2], (iii) GBrowse, a widely-used component-based genome browser [3], and (iv) Map Viewer at the National Center for Biotechnology Information (NCBI), which covers a large number of organisms [4]. A new genome browser based on the Google map engine, called the X::Map Genome Browser [5], contains genomes of three mammalian species and is specialized for supporting microarray analyses based on the Affymetrix platform [6]. Since the complete S. cerevisiae genome sequence was released in 1996, more than 90 fungal/oomycete species have been sequenced, with many additional species currently being sequenced [7]. A few sequencing centers, such as the Broad Institute and the JGI, have sequenced most of the fungal genomes and provide their own genome browsers to support data visualization and utilization. Although they use standardized formats, such as FASTA and GFF3, for data presentation and distribution, each center uses its own data formats for sequences, annotation data, and other chromosomal information. In addition, some of the sequenced fungal genomes lack certain data, such as exon positions. 
These problems have hampered the integration and visualization of all available genome sequences via a single genome browser. As a solution to this problem, a group at Duke University installed the open-source browser GBrowse [3] after reannotating genome sequences of 42 fungal species from multiple sequencing centers using their own annotation pipeline, which consists of several gene prediction programs; large-scale evolutionary analyses were conducted based on the archived genomes, demonstrating the usefulness of unified and standardized data formats [8]. The large number of sequenced fungal genomes has provided opportunities to compare genome sequences and features at multiple taxon levels, revealing potential mechanisms underpinning fungal evolution and biology [8-18]; however, due to the complexity and vast scale of the resulting data, presenting these data in an easily accessible format is challenging. To overcome this limitation, both the database construction and the pipeline/tools for comparative analyses should be carefully designed. One good example is the e-Fungi project [19], which archives genome sequences of 34 fungal and 2 oomycete species and supports various queries via the web interface. Comparative fungal genomics studies have been conducted using e-Fungi [9,11]. The Yeast Gene Order Browser (YGOB) [20] archives genome sequences of the species belonging to the subphylum Saccharomycotina and provides a graphical gene order browser, which helps dissect the evolutionary history of genome changes during yeast speciation [21]. Although these platforms provide useful tools and data, only certain fungal genomes are covered, and user-friendly access to sequence information and graphical presentation of data are limited. 
The Comparative Fungal Genomics Platform (CFGP) was established to archive all publicly available fungal and oomycete genome sequences using a unified data format and to support multifaceted analyses of the stored data via a newly developed user interface named the Data-driven User Interface [7]. Currently, CFGP archives genome sequences of 92 fungal and 6 oomycete species (137 different datasets) and also carries genome sequences of 55 plant, animal and bacterial species (56 datasets). Taking advantage of the data warehouse and functionalities in CFGP, several databases specialized for certain gene families or functional groups have been constructed, one of which is the Fungal Transcription Factor Database (FTFD) [22]. This database identified and classified all fungal transcription factors and provides a phylogenomic platform supporting analyses of individual transcription factor families [23]. In addition, the Fungal Cytochrome P450 Database (FCPD) [24], the Fungal Secretome Database (FSD; Choi et al., unpublished), and the Fungal Expression Database (FED; Park et al., unpublished) have been constructed or are currently being constructed. The CFGP was also used to manage high-throughput experimental data and link them to corresponding genes [25,26] and to maintain the Phytophthora database [27]. To support comparative genomics analyses using CFGP and offer tools for versatile data visualization, we developed a new genome browser named the Seoul National University Genome Browser (SNUGB). We chose to develop a new genome browser instead of adopting one of the existing browsers in part because adoption would have required converting the data archived in CFGP into new formats, and because the existing browsers do not support the integration of additional databases, such as the InterPro and customized homologous gene databases available through SNUGB. 
We also wanted to have a browser based on the architecture of CFGP and associated databases so that we would be able to quickly present updated contents in these resources and seamlessly integrate new tools for data processing, visualization, and/or utilization. The SNUGB currently covers genome sequences and associated information for 92 fungal and 6 oomycete species (137 datasets), which is the largest collection among the fungal genome browser services available on the web. These 92 fungal species cover four phyla and one subphylum based on a recently revised fungal taxonomy framework [28] (Tables 1, 2, and 3). It also houses genome sequences of 12 plant, 18 insect, and 3 nematode species and human genome sequences (38 datasets), to support comparison of fungal genomes with those in other kingdoms (Table 4). The taxonomy browser implemented in the SNUGB provides two easy ways to access genome sequences of specific species. The SNUGB provides lists of putative orthologous genes of all fungal ORFs and a tool for comparing the genomic contexts of any orthologous genes among chosen species. In addition, SNUGB displays the InterPro terms assigned to each ORF as well as the genomic regions to which expressed sequence tags (ESTs) are matched. With these functionalities, SNUGB will serve as a powerful platform supporting comprehensive fungal comparative genomics.
Table 1

List and characteristics of the fungal genomes belonging to the subphylum Pezizomycotina archived in SNUGB.

Species^a | Size (Mb) | # of ORFs | # of Exons | C^b | I^b | E^b | Source | Refs
Fungi (Kingdom)^e
Ascomycota (Phylum)
Pezizomycotina (Subphylum)
A: Botrytis cinerea T: Botryotinia fuckeliana | 42.7 | 16,448 | 43,358 | N | Y | N | BI | N
Sclerotinia sclerotiorum | 38.3 | 14,522 | 40,623 | N | Y | N | BI | N
Aspergillus clavatus | 27.9 | 9,121 | 27,959 | N | Y | N | BI | [17,44]
Aspergillus flavus | 36.8 | 12,604 | 40,971 | N | Y | N | BI | [16]
Aspergillus fumigatus AF293 | 29.4 | 9,887 | 28,164 | 8 | Y | N | TIGR | [45]
Aspergillus fumigatus A1163 | 29.2 | 9,929 | 29,094 | N | Y | N | TIGR | [44]
A: Aspergillus nidulans T: Emericella nidulans | 30.1 | 10,701 | 35,525 | 8 | Y | N | BI | [14]
Aspergillus niger ATCC1015 | 37.2 | 11,200 | 34,971 | N | Y | N | JGI | N
Aspergillus niger CBS513.88 | 34.0 | 14,086 | 50,371 | 8 | Y | N | NCBI | [38]
A: Aspergillus oryzae T: Eurotium oryzae | 37.1 | 12,063 | 35,319 | N | Y | N | DOGAN | [46]
Aspergillus terreus | 29.3 | 10,406 | 33,116 | N | Y | N | BI | [17]
A: Aspergillus fischerianus T: Neosartorya fischeri^d | 32.6 | 10,403 | N | N | N | N | BI | [44]
Penicillium chrysogenum | 32.2 | 12,791 | 40,441 | N | N | N | NCBI | [47]
Penicillium marneffei | 28.5 | 10,638 | 34,306 | N | N | N | TIGR | N
Coccidioides immitis RS | 28.9 | 10,457 | 36,137 | N | Y | N | BI | N
Coccidioides immitis H538.4 | 27.7 | 10,663 | 34,503 | N | Y | N | BI | N
Coccidioides immitis RMSCC 2394 | 28.8 | 10,408 | 34,807 | N | Y | N | BI | N
Coccidioides immitis RMSCC 3703 | 27.6 | 10,465 | 33,931 | N | Y | N | BI | N
Coccidioides posadasii Silveria | 27.5 | 10,125 | 33,520 | N | Y | N | BI | N
Coccidioides posadasii C735 | 26.7 | N | N | N | N | N | BI | N
Coccidioides posadasii CPA0001 | 28.7 | N | N | N | N | N | BI | N
Coccidioides posadasii CPA0020 | 27.3 | N | N | N | N | N | BI | N
Coccidioides posadasii CPA0066 | 27.7 | N | N | N | N | N | BI | N
Coccidioides posadasii RMSCC 1037 | 26.7 | N | N | N | N | N | BI | N
Coccidioides posadasii RMSCC 1038 | 26.2 | N | N | N | N | N | BI | N
Coccidioides posadasii RMSCC 1040 | 26.5 | N | N | N | N | N | BI | N
Coccidioides posadasii RMSCC 2133 | 27.9 | N | N | N | N | N | BI | N
Coccidioides posadasii RMSCC 3488 | 28.1 | 9,964 | 33,484 | N | Y | N | BI | N
Coccidioides posadasii RMSCC 3700 | 25.5 | N | N | N | N | N | BI | N
Paracoccidioides brasiliensis Pb01 | 33.0 | 9,136 | 37,310 | N | Y | N | BI | N
Paracoccidioides brasiliensis Pb03 | 29.1 | 9,264 | 31,468 | N | Y | N | BI | N
Paracoccidioides brasiliensis Pb18 | 30.0 | 8,741 | 33,239 | N | Y | N | BI | N
Blastomyces dermatitidis | 61.8 | N | N | N | N | N | WGSC | N
A: Histoplasma capsulatum G217B T: Ajellomyces capsulatus G217B | 41.3 | 8,038 | 26,711^f | N | Y | N | WGSC | N
A: Histoplasma capsulatum G186AR T: Ajellomyces capsulatus G186AR | 29.9 | 7,454 | 24,562^f | N | Y | N | WGSC | N
A: Histoplasma capsulatum NAm1 T: Ajellomyces capsulatus NAm1 | 33.0 | 9,349 | 32,844 | N | Y | N | BI | N
A: Histoplasma capsulatum H143 T: Ajellomyces capsulatus H143 | 39.0 | 7,365 | 25,164^f | N | Y | N | BI | N
A: Histoplasma capsulatum H88 T: Ajellomyces capsulatus H88 | 37.9 | 7,428 | 25,356^f | N | Y | N | BI | N
A: Arthroderma gypseum T: Microsporum gypseum | 23.3 | 8,876 | 28,624 | N | Y | N | BI | N
Microsporum canis | 23.3 | N | N | N | N | N | BI | N
Trichophyton equinum | 24.2 | N | N | N | N | N | BI | N
Ascosphaera apis | 21.6 | N | N | N | N | N | BGM | [48]
Uncinocarpus reesii | 22.3 | 7,798 | 24,094 | N | Y | N | BI | N
Chaetomium globosum^d | 34.9 | 11,124 | N | N | N | N | BI | N
Epichloe festucae | 27.0 | N | N | N | N | N | OU | N
A: Fusarium graminearum PH-1 T: Gibberella zeae PH-1 | 36.6 | 13,321 | 37,549 | N | Y | N | BI | [37]
A: Fusarium graminearum GZ3639 T: Gibberella zeae GZ3639^c | 15.1 | 6,694 | 11,692^f | N | Y | N | BI | [37]
Fusarium oxysporum f. sp. lycopersici 4286 | 61.4 | 17,608 | 47,051 | 15 | Y | N | BI | N
A: Fusarium verticillioides 7600 T: Gibberella moniliformis 7600 | 41.9 | 14,199 | 39,058 | N | Y | N | BI | N
A: Fusarium solani MPVI T: Nectria haematococca MPVI | 51.3 | 15,707 | 48,387 | N | Y | N | JGI | N
A: Pyricularia oryzae 70–15 T: Magnaporthe oryzae 70–15 | 41.6 | 12,841 | 34,189 | 7 | Y | Y | BI | [49]
A: Pyricularia oryzae 70–15 chromosome 7 T: Magnaporthe oryzae 70–15 chromosome 7 | 4.0 | 1,122 | 3,289 | 1 | Y | N | | [50]
Cryphonectria parasitica | 43.9 | 11,184 | 33,090 | N | N | N | JGI | N
Neurospora crassa OR74A | 39.2 | 9,842 | 27,188 | 8 | Y | N | BI | [51]
Podospora anserina DSM980 | 35.7 | 10,596 | 24,437 | 9 | Y | N | IGM | [52]
Trichoderma atroviride IMI206040 | 36.1 | 11,100 | 32,563 | N | Y | N | JGI | N
A: Trichoderma reesei QM6a T: Hypocrea jecorina QM6a | 33.5 | 9,129 | 27,891 | N | Y | N | JGI | [53]
A: Trichoderma virens Gv29-8 T: Hypocrea virens Gv29-8 | 38.8 | 11,643 | 34,673 | N | Y | N | JGI | N
Talaromyces stipitatus ATCC 10500 | 35.6 | N | N | N | N | N | TIGR | N
Verticillium dahliae VaLs. 17 | 33.9 | 10,575 | 29,736 | N | N | N | BI | N
Verticillium albo-atrum VaMs. 102 | 32.9 | 10,239 | 28,842 | N | N | N | BI | N
Alternaria brassicicola | 32.0 | N | N | N | N | N | WGSC | N
A: Bipolaris maydis T: Cochliobolus heterostrophus C5 | 34.9 | 9,633 | 28,007 | N | N | N | JGI | N
Pyrenophora tritici-repentis | 38.0 | 12,169 | 32,717 | N | Y | N | BI | N
A: Septoria tritici T: Mycosphaerella graminicola | 41.9 | 11,395 | 30,629 | N | Y | N | JGI | N
A: Paracercospora fijiensis T: Mycosphaerella fijiensis | 73.4 | 10,327 | 25,289 | N | Y | N | JGI | N
A: Stagonospora nodorum T: Phaeosphaeria nodorum | 37.2 | 16,597 | 44,017 | N | Y | N | BI | [54]

Total | 2,844.0 | 637,006 | 1,755,655 | 8 | 43 | 1

a: A indicates the anamorph name and T the teleomorph name of fungi.

b: C means chromosomes, I indicates InterPro, and E indicates EST.

c: Incomplete coverage of genome information.

d: Insufficient exon/intron information.

e: Taxonomy based on [28].

f: ORFs and exons were predicted by AUGUSTUS 2.0.3 with species-specific training datasets [55].

'Y' indicates the existence of information in each field, and 'N' indicates the lack of information.

Table 2

List and characteristics of the fungal genomes belonging to the subphyla Saccharomycotina and Taphrinomycotina archived in SNUGB.

Species^a | Size (Mb) | # of ORFs | # of Exons | C^b | I^b | E^b | Source | Refs
Fungi (Kingdom)^e
Ascomycota (Phylum)
Saccharomycotina (Subphylum)
Candida albicans SC5314 | 14.3 | 6,090 | 6,624 | N | Y | N | SGTC | [56,57]
Candida albicans WO-1 | 14.4 | 6,160 | 6,395 | N | Y | N | BI | N
Candida dubliniensis^d | 14.5 | 6,027 | N | N | N | N | SI | N
Candida glabrata CBS138 | 12.3 | 5,165 | 5,249 | N | Y | N | CBS | [58]
A: Candida guilliermondii T: Pichia guilliermondii | 10.6 | 5,920 | 5,935 | N | Y | N | BI | N
Candida lusitaniae | 12.1 | 5,941 | 5,956 | N | Y | N | BI | N
Candida parapsilosis | 13.1 | 5,733 | 5,733 | N | Y | N | BI | N
Candida tropicalis | 14.7 | 6,258 | 6,292 | N | Y | N | BI | N
Candida tropicalis^f | 2.1 | N | N | N | N | N | GS | [59]
Ashbya gossypii | 8.8 | 4,717 | 4,943 | 7 | Y | N | NCBI | [60]
Debaryomyces hansenii | 12.2 | 6,354 | 6,710 | 7 | Y | N | CBS | [58]
Debaryomyces hansenii^f | 2.3 | N | N | N | N | N | GS | [61]
A: Candida sphaerica T: Kluyveromyces lactis | 10.7 | 5,327 | 5,457 | N | Y | N | GS | [58]
A: Candida sphaerica T: Kluyveromyces lactis^f | 5.1 | N | N | N | N | N | GS | [62]
A: Candida kefyr T: Kluyveromyces marxianus^f | 2.0 | N | N | N | N | N | GS | [63]
Kluyveromyces polysporus DSM70294 | 14.7 | 5,367 | 5,524 | N | Y | N | SIG | [64]
Kluyveromyces thermotolerans^f | 2.2 | N | N | N | N | N | GS | [65]
Kluyveromyces waltii | 10.9 | 4,935 | 5,395 | N | Y | N | BI | [66]
Lodderomyces elongisporus | 15.5 | 5,802 | 5,856 | N | Y | N | BI | N
Saccharomyces bayanus MCYC 623 | 11.5 | 9,385 | 9,385 | N | Y | N | BI | [13]
Saccharomyces bayanus 623-6C YM4911 | 11.9 | 4,966 | 4,966 | N | Y | N | WGSC | [12]
Saccharomyces bayanus var. uvarum^f | 4.5 | N | N | N | N | N | GS | [67]
Saccharomyces castellii | 11.4 | 4,677 | 4,677 | N | Y | N | WGSC | [12]
A: Candida robusta S288C T: Saccharomyces cerevisiae S288C | 12.2 | 6,692 | 7,042 | 16 | Y | N | SGD | [68]
A: Candida robusta RM11-1a T: Saccharomyces cerevisiae RM11-1a | 11.7 | 5,696 | 5,988 | N | Y | N | BI | N
A: Candida robusta YJM789 T: Saccharomyces cerevisiae YJM789 | 12.0 | 5,903 | 6,153 | N | Y | N | SI | [69]
Saccharomyces exiguus^f | 2.0 | N | N | N | N | N | GS | [70]
Saccharomyces kluyveri | 11.0 | 2,968 | 2,968 | N | Y | N | WGSC | [12]
Saccharomyces kluyveri^f | 2.2 | N | N | N | N | N | GS | [71]
Saccharomyces kudriavzevii | 11.2 | 3,768 | 3,768 | N | Y | N | WGSC | [12]
Saccharomyces mikatae | 11.5 | 9,016 | 9,016 | N | Y | N | BI | [13]
Saccharomyces mikatae | 10.8 | 3,100 | 3,100 | N | Y | N | WGSC | [12]
Saccharomyces paradoxus | 11.9 | 8,939 | 8,939 | N | Y | N | BI | [13]
Saccharomyces servazzii^f | 2.0 | N | N | N | N | N | GS | [72]
Pichia angusta^f | 4.5 | N | N | N | N | N | GS | [73]
Pichia stipitis | 15.4 | 5,839 | 8,428 | N | Y | N | JGI | [74]
Pichia sorbitophila^f | 3.8 | N | N | N | N | N | GS | [75]
A: Candida lipolytica T: Yarrowia lipolytica | 20.5 | 6,524 | 7,264 | 6 | Y | N | CBS | [58]
A: Candida lipolytica T: Yarrowia lipolytica^f | 4.6 | N | N | N | N | N | GS | [76]
Zygosaccharomyces rouxii^f | 4.1 | N | N | N | N | N | GS | [77]
Taphrinomycotina (Subphylum)
Pneumocystis carinii^c,d | 6.3 | 4,020 | N | N | N | N | SI | N
Schizosaccharomyces japonicus | 11.3 | 5,172 | 10,321 | N | Y | N | BI | N
Schizosaccharomyces pombe | 12.6 | 5,058 | 9,869 | 3 | Y | N | GDB | [78]
Schizosaccharomyces octosporus | 11.2 | 4,925 | 10,168 | N | N | N | BI | N

Total | 424.6 | 176,444 | 188,121 | 5 | 28 | 0

a: A indicates the anamorph name and T the teleomorph name of fungi.

b: C means chromosomes, I indicates InterPro, and E indicates EST.

c: Incomplete coverage of genome information.

d: Insufficient exon/intron information.

e: Taxonomy based on [28].

f: Sequences from Random Sequence Tag (RST).

'Y' indicates the existence of information in each field, and 'N' indicates the lack of information.

Table 3

List and characteristics of the genomes belonging to the phyla Basidiomycota, Chytridiomycota, and Microsporidia, the subphylum Mucoromycotina, and the phylum Peronosporomycota (oomycetes) archived in SNUGB.

Species^a | Size (Mb) | # of ORFs | # of Exons | C^b | I^b | E^b | Source | Refs
Fungi (Kingdom)^e
Basidiomycota (Phylum)
Agaricomycotina (Subphylum)
Postia placenta | 90.9 | 17,173 | 116,596 | N | Y | N | JGI | N
T: Phanerochaete chrysosporium A: Sporotrichum pruinosum | 35.1 | 10,048 | 58,746 | N | Y | N | JGI | [79]
Coprinus cinereus | 36.3 | 13,544 | 72,887 | N | Y | N | BI | N
Laccaria bicolor | 64.9 | 20,614 | 111,290 | N | Y | N | JGI | [80]
A: Cryptococcus neoformans Serotype A T: Filobasidiella neoformans Serotype A | 19.5 | 7,302 | 43,325 | 20 | Y | N | BI | N
A: Cryptococcus neoformans Serotype B T: Filobasidiella neoformans Serotype B | 19.0 | 6,870 | 40,589 | N | Y | N | NCBI | N
A: Cryptococcus neoformans Serotype D B-3501A T: Filobasidiella neoformans Serotype D B-3501A | 18.5 | 6,431 | 40,942 | N | Y | N | SGTC | [41]
A: Cryptococcus neoformans Serotype D JEC21 T: Filobasidiella neoformans Serotype D JEC21 | 19.1 | 6,475 | 40,811 | N | Y | N | SGTC | [41]
Pucciniomycotina (Subphylum)
Sporobolomyces roseus | 21.2 | 5,536 | 39,911 | N | Y | N | JGI | N
Puccinia graminis | 88.7 | 20,567 | 95,838 | N | Y | N | BI | N
Ustilaginomycotina (Subphylum)
Malassezia globosa CBS7966 | 9.0 | 4,286 | 4,286 | N | N | N | PGC | [15]
Malassezia restricta CBS7877^c | 4.6 | N | N | N | N | N | PGC | [15]
Ustilago maydis 521 | 19.7 | 6,689 | 11,589 | N | Y | N | BI | [81]
Ustilago maydis FB1 | 19.3 | 6,950 | 10,310^f | N | Y | N | BI | [81]
Chytridiomycota (Phylum)
Batrachochytrium dendrobatidis JEL423 | 23.9 | 8,818 | 38,551 | N | Y | N | BI | N
Batrachochytrium dendrobatidis JAM81 | 24.3 | 8,732 | 37,423 | N | Y | N | JGI | N
Mucoromycotina (Subphylum incertae sedis)
Rhizopus oryzae | 46.1 | 17,467 | 57,981 | N | Y | N | BI | N
Phycomyces blakesleeanus | 55.9 | 14,792 | 71,502 | N | Y | N | JGI | N
Microsporidia (Phylum)
Encephalitozoon cuniculi | 2.5 | 1,996 | 2,002 | N | Y | N | GS | [82]
Antonospora locustae^d | 6.1 | 2,606 | N | N | N | N | JBPC | N
Stramenopila (Kingdom)^e
Peronosporomycota (Phylum)
Phytophthora capsici | 107.8 | 17,414 | 45,661 | N | N | N | JGI | N
Phytophthora infestans^d | 228.5 | 22,658 | N | N | N | N | BI | N
Phytophthora ramorum | 66.7 | 15,743 | 40,639 | N | Y | N | JGI | [83]
Phytophthora sojae | 86.0 | 19,027 | 53,552 | N | Y | N | JGI | [83]
Hyaloperonospora parasitica | 83.6 | 14,789 | 24,907 | N | Y | N | VBI | N
Pythium ultimum | 44.3 | N | N | N | N | N | | N

Total | 1,241.5 | 276,527 | 1,058,878 | 1 | 20 | 0

a: A indicates the anamorph name and T the teleomorph name of fungi.

b: C means chromosomes, I indicates InterPro, and E indicates EST.

c: Incomplete coverage of genome information.

d: Insufficient exon/intron information.

e: Taxonomy based on [28].

f: ORFs and exons were predicted by AUGUSTUS 2.0.3 with species-specific training datasets [55].

'Y' indicates the existence of information in each field, and 'N' indicates the lack of information.

Table 4

List and characteristics of the non-fungal genomes archived in SNUGB.

Species^a | Size (Mb) | # of ORFs | # of Exons | C^b | I^b | E^b | Source | Refs
Chloroplastida (Kingdom)^e
Streptophyta (Phylum)
Arabidopsis thaliana | 119.2 | 28,581 | 150,369 | 5 | Y | N | TAIR | [33]
Carica papaya | 271.7 | N | N | N | N | N | PGSC | [84]
Glycine max | 996.9 | 62,199 | 281,102 | N | N | N | JGI | N
Lycopersicon esculentum^c | 39.9 | 8,725 | 29,707 | N | Y | N | SOL | N
Medicago truncatula | 278.7 | 38,334 | 122,889 | 8 | Y | N | MTGSP | [85-87]
Oryza sativa var. Indica^d | 426.3 | 49,710 | N | N | N | N | BGI | [88,89]
Oryza sativa var. Japonica | 372.1 | 66,710 | 319,140 | 12 | Y | N | IRGSP | [89,90]
Populus trichocarpa | 485.5 | 45,555 | 193,687 | N | Y | N | JGI | [91]
Ricinus communis^d | 362.5 | 38,613 | N | N | N | N | TIGR | N
Selaginella moellendorffii | 212.8 | 22,285 | 124,645 | N | Y | N | JGI | N
Sorghum bicolor | 738.5 | 36,338 | 165,149 | 11 | Y | N | JGI | N
Vitis vinifera | 497.5 | 30,434 | 149,351 | 19 | Y | N | GS | [92]
Zea mays^d | 2,314.7 | 420,732 | N | N | N | N | MGSP | N
Metazoa (Kingdom)
Arthropoda (Phylum)
Apis mellifera | 235.2 | 11,062 | 71,496 | N | N | N | HBGP | [93]
Acyrthosiphon pisum | 446.6 | N | N | N | N | N | BCM | N
Bombyx mori | 397.7 | 21,302 | 82,381 | N | N | N | BGI | [94]
Drosophila ananassae | 231.0 | 15,276 | 56,595 | N | N | N | FB | [95]
Drosophila erecta | 152.7 | 15,324 | 56,924 | N | N | N | FB | [95]
Drosophila grimshawi | 200.5 | 15,270 | 56,647 | N | N | N | FB | [95]
Drosophila melanogaster | 168.7 | 20,923 | 96,745 | N | N | N | FB | [96]
Drosophila mojavensis | 193.8 | 14,849 | 55,013 | N | N | N | FB | [95]
Drosophila persimilis | 188.4 | 17,235 | 59,116 | N | N | N | FB | [95]
Drosophila pseudoobscura | 152.7 | 16,363 | 57,864 | N | N | N | FB | [97]
Drosophila sechellia | 166.6 | 16,884 | 58,584 | N | N | N | FB | [95]
Drosophila simulans | 137.8 | 15,983 | 54,294 | N | N | N | FB | [95]
Drosophila virilis^e | 206.0 | 14,680 | 55,005 | N | N | N | FB | [95]
Drosophila willistoni | 235.5 | 15,816 | 56,641 | N | N | N | FB | [95]
Drosophila yakuba | 165.7 | 15,423 | 59,098 | N | N | N | FB | [95]
Glossina morsitans | 205.7 | N | N | N | N | N | TIGR | N
Nasonia vitripennis | 239.6 | 27,957 | 98,570^f | N | N | N | BCM | N
Tribolium castaneum | 152.1 | 14,274 | 58,381^f | N | N | N | BCM | N
Nematoda (Phylum)
Caenorhabditis elegans | 100.3 | 26,902 | 175,232 | 7 | N | N | WB | [34]
Caenorhabditis briggsae^d | 108.5 | 20,669 | N | N | N | N | WB | [98]
Caenorhabditis remanei | 145.4 | N | N | N | N | N | WB | N
Vertebrata (Phylum)
Homo sapiens Celera assembly | 2,828.4 | 28,057 | 273,999 | N | N | N | NCBI | [99]
Homo sapiens HuRef assembly | 2,843.9 | 27,937 | 273,135 | N | N | N | NCBI | [100]
Homo sapiens NCBI Reference | 2,870.8 | 29,319 | 284,553 | N | N | N | NCBI | [100]
Homo sapiens | 3,665.5 | 43,570 | 452,099 | 29 | N | N | EM | [100]

Total | 21,241.0 | 1,294,281 | 4,142,169 | 7 | 8 | 0

a: A indicates the anamorph name and T the teleomorph name of fungi.

b: C means chromosomes, I indicates InterPro, and E indicates EST.

c: Incomplete coverage of genome information.

d: Insufficient exon/intron information.

e: Taxonomy based on [101].

f: ORFs and exons were predicted by AUGUSTUS 2.0.3 with species-specific training datasets [55].

'Y' indicates the existence of information in each field, and 'N' indicates the lack of information.


Construction, content, and applications

Data processing via an automated pipeline and the function of Positional Database

Positional information of functional/structural units present on individual contigs/chromosomes, such as the start and stop sites of ORFs and exons/introns, was collected from the data warehouse of CFGP and stored in the Positional Database of SNUGB. New types of data, such as Simple Sequence Repeats (SSRs) on the genome, can be easily added to the Positional Database for visualization via SNUGB. Along with the positional information, the data type (e.g., ORFs), primary key, and any additional information for each datum were saved into partitioned tables, which were designed to speed up data retrieval. Through the primary key, SNUGB can display detailed information on each datum (e.g., sequences) stored at external sources. Considering the large number of fungal genome sequences that are already available or currently being sequenced, a standardized pipeline for data extraction and management is needed, in addition to this data standardization scheme, to organize the data and to ensure orderly expansion of SNUGB. The pipeline developed for SNUGB processes each genome dataset via the following steps. First, once whole genome sequences are deposited in the data warehouse of CFGP, the integrity of genome information, such as the position information of functional/structural units, is inspected. Several properties of the whole genome, such as the length and the GC content, are calculated. Second, the GC content, AT-skew, and CG-skew are calculated via 50-bp sliding windows with 20-bp steps. Third, for each gene, three types of sequence information are generated based on the genome annotation information: coding sequences (sequences from the start to the stop codon without introns), gene sequences (sequences from the start to the stop codon with introns), and, if transcript information is available, transcript sequences (sequences from the transcription start site to the end site without intron sequences). 
Fourth, all data generated in the previous steps are transferred into the Positional Database to support graphical representation of these features. Fifth, if the genome has chromosomal map information, including a genetic map or an optical map, this information is converted into a standardized format and stored in SNUGB for graphical representation via Chromosome Viewer. Lastly, after all ORFs in the genome are run through InterPro Scan [29], the genomic position of each domain predicted by InterPro Scan is calculated and stored in the Positional Database.
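
The sliding-window step of the pipeline can be sketched as follows. This is a minimal Python illustration, not the SNUGB implementation: the function and class names are hypothetical, the skew formulas, (A - T)/(A + T) and (C - G)/(C + G), are common conventions that the text does not spell out, and GC content is taken relative to the full window length. Only the 50-bp window and 20-bp step come from the description above.

```python
from typing import Iterator, NamedTuple


class WindowStats(NamedTuple):
    start: int         # 0-based start of the window on the sequence
    gc_content: float  # (G + C) / window length
    at_skew: float     # (A - T) / (A + T); 0.0 if the window has no A or T
    cg_skew: float     # (C - G) / (C + G); 0.0 if the window has no C or G


def _skew(x: int, y: int) -> float:
    """Return (x - y) / (x + y), or 0.0 when both counts are zero."""
    total = x + y
    return (x - y) / total if total else 0.0


def sliding_window_stats(seq: str, window: int = 50, step: int = 20) -> Iterator[WindowStats]:
    """Yield composition statistics for each full window along seq."""
    seq = seq.upper()
    # Only full-length windows are reported; a trailing partial window is skipped.
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        a, c, g, t = (chunk.count(base) for base in "ACGT")
        yield WindowStats(start, (g + c) / window, _skew(a, t), _skew(c, g))


if __name__ == "__main__":
    # Toy sequence: 50 A's followed by 20 G's gives two windows (starts 0 and 20).
    for stats in sliding_window_stats("A" * 50 + "G" * 20):
        print(stats)
```

Each yielded record carries its start coordinate, so it could be stored alongside other positional data and drawn as a track.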

Modular design of SNUGB facilitates its application

To facilitate the efficient implementation of SNUGB in diverse genomics platforms, a modular design was used for its application programming interface (API). Through the API, a diagram showing genome features in a selected region can be created using only their chromosomal positions and display options. Four recent publications illustrate the utility of this design: the T-DNA Analysis Platform (TAP) provides the GC content and AT skew around T-DNA insertion sites on the chromosomes of Magnaporthe oryzae via a mini genome browser supported by SNUGB [25]. The chromosomal distribution pattern of T-DNA insertion sites in M. oryzae was also displayed using SNUGB [26]. The Fungal Cytochrome P450 Database (FCPD) [24] employs SNUGB to present the chromosomal distribution pattern and contexts of cytochrome P450 genes in fungal genomes. Two databases, FED and FSD, utilize SNUGB for presenting the genomic context of the regions matched to ESTs and secreted proteins, respectively. Moreover, the Systematical Platform for Identifying Mutated Proteins (SysPIMP) [30] and the Insect Mitochondrial Genome Database (IMGD; Lee et al., under revision) have also adopted SNUGB for data presentation.
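
To illustrate the idea of drawing a region from nothing more than feature positions and display options, here is a minimal Python sketch. It does not reproduce the SNUGB API: the `Feature` record, `features_in_region`, and `render_track` are hypothetical names, and the text-based track merely stands in for the graphical diagram.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class Feature:
    key: str     # primary key pointing at the full record stored elsewhere
    kind: str    # data type, e.g. "ORF", "exon", "InterPro"
    contig: str
    start: int   # 1-based, inclusive
    end: int     # 1-based, inclusive


def features_in_region(features: Iterable[Feature], contig: str, start: int,
                       end: int, kinds: Optional[Set[str]] = None) -> List[Feature]:
    """Return features overlapping [start, end] on contig, sorted by position."""
    hits = [f for f in features
            if f.contig == contig and f.start <= end and f.end >= start
            and (kinds is None or f.kind in kinds)]
    return sorted(hits, key=lambda f: (f.start, f.end))


def render_track(hits: Iterable[Feature], start: int, end: int, width: int = 60) -> str:
    """Draw one text track: '#' where a feature lies, '-' elsewhere."""
    scale = (end - start + 1) / width  # bases per character
    row = ["-"] * width
    for f in hits:
        lo = max(0, int((max(f.start, start) - start) / scale))
        hi = min(width - 1, int((min(f.end, end) - start) / scale))
        for i in range(lo, hi + 1):
            row[i] = "#"
    return "".join(row)


if __name__ == "__main__":
    demo = [
        Feature("g1", "ORF", "chr1", 100, 200),
        Feature("g2", "ORF", "chr1", 500, 600),
        Feature("d1", "InterPro", "chr1", 150, 180),
    ]
    orfs = features_in_region(demo, "chr1", 150, 550, kinds={"ORF"})
    print(render_track(orfs, 150, 550, width=10))
```

Because the drawing function needs only coordinates and a few options, a host database could call it with its own feature list, which is the kind of reuse the modular design is meant to allow.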

Properties of the fungal/oomycete genomes archived in SNUGB

Among the 98 fungal/oomycete species (137 genome datasets) covered by SNUGB, 77 species (111 genome datasets; 81%) belong to the phylum Ascomycota (Tables 1 and 2), and 10 species (14 genome datasets; 10%) belong to the phylum Basidiomycota (Table 3). In contrast, the phylum Chytridiomycota is represented by only one species (2 datasets), and the phylum Microsporidia and the subphylum Mucoromycotina by two species each (Table 3). Six oomycete genomes, derived from Phytophthora, Hyaloperonospora, and Pythium species, are available for comparison with fungal genomes (Table 3). Although oomycetes belong to the kingdom Stramenopila and are phylogenetically closer to algae and diatoms than to fungi [31], they have traditionally been grouped with fungi because of their morphological similarities. The datasets that cover the whole genome (121 out of the 137 datasets) were analyzed to investigate genome properties. The average size of the genomes, measured by adding the lengths of all scaffolds together, is 31.42 Mb, which is about one-seventeenth of the average plant genome (547.41 Mb in the phylum Streptophyta) and one-seventh of the average insect genome (215.36 Mb in the phylum Arthropoda) (Figure 1A). The fungal/oomycete genome sizes ranged from 2.5 Mb (Encephalitozoon cuniculi) to 228.5 Mb (Phytophthora infestans); the genome of E. cuniculi is smaller than that of Escherichia coli (4.6 Mb) [32], while the genome of P. infestans is much larger than the genomes of Arabidopsis thaliana (119.2 Mb) [33] and Caenorhabditis elegans (100.5 Mb) [34], indicating no clear relationship between genome size and organismal complexity [35]. With regard to the average genome sizes in different taxon groups, the phylum Microsporidia, known as ancestral fungi, shows the smallest average size (4.28 Mb), while oomycetes show the largest at 102.83 Mb (Figure 1A). 
In the phylum Basidiomycota, which is large and very diverse, the variation in genome size within each of the represented subphyla is the highest in the fungal kingdom: the ratios of the standard deviation to the average length in the three subphyla Agaricomycotina, Pucciniomycotina, and Ustilaginomycotina are 71.95%, 86.93%, and 57.46%, respectively (Figure 1B). The subphylum Pucciniomycotina displays the largest size with large variation (Figure 1A and 1B), while the two subphyla Saccharomycotina and Taphrinomycotina, which belong to the phylum Ascomycota, exhibit relatively low variation (Figure 1B), probably because only closely related species have been sequenced. Although the average genome sizes varied from group to group, ANOVA and Tukey's HSD tests (P < 0.05) showed that only the difference between fungi and oomycetes was significant (Figure 1A). The GC content of fungal genomes ranges from 32.523% (Pneumocystis carinii in the subphylum Taphrinomycotina) to 56.968% (Phanerochaete chrysosporium in the subphylum Agaricomycotina), while the GC content of plant and insect genomes ranges from 29.638% to 46.850% (Figure 1C). Although the coding regions exhibit higher GC contents than the rest of the genome, there is no relationship between the proportion of ORFs on the genome and the GC content of the whole genome (linear regression; R² = 0.04; Figure 1C and 1D).
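
The ratio of standard deviation to average length is the coefficient of variation. The short Python sketch below recomputes it for two subphyla from the rounded genome sizes listed in Table 3. It assumes the sample standard deviation (n - 1 denominator), which the text does not state, so the results only approximate the reported 86.93% and 57.46%.

```python
from statistics import mean, stdev

# Genome sizes in Mb, copied from Table 3 (rounded to one decimal place there).
genome_sizes_mb = {
    "Pucciniomycotina": [21.2, 88.7],
    "Ustilaginomycotina": [9.0, 4.6, 19.7, 19.3],
}


def coefficient_of_variation(values):
    """Return 100 * sample standard deviation / mean (assumed definition)."""
    return 100.0 * stdev(values) / mean(values)


cv = {group: coefficient_of_variation(sizes)
      for group, sizes in genome_sizes_mb.items()}

for group, value in cv.items():
    print(f"{group}: {value:.1f}%")
```

With the rounded table values this gives roughly 87% and 57%, which is consistent with the figures quoted above and suggests that all listed datasets of each subphylum entered the calculation.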
Figure 1

Characteristics of the 137 fungal and oomycete genomes archived in SNUGB. In all graphs, the first six groups correspond to subphyla and the rest are phyla. Error bars indicate variation of data within each taxonomic group. The last two phyla were used as outgroups. In graphs A, E, F, and G, each bar color indicates a distinct group supported by the Tukey HSD test. (A) Average genome size. (B) The ratio of the variation in genome size to the average genome size. (C) Average GC ratio of each subphylum/phylum. (D) The percentage of coding regions relative to the genome length. (E) Average number of total ORFs. (F) The total number of ORFs per Mb (= ORF density). (G) The average number of exons per ORF.

The number of total proteins encoded by each organism was once considered to reflect the organism's characteristics [36]. Based on the size of the total proteome, all sequenced fungal and oomycete species were divided into three groups: the medium group contains the subphylum Pezizomycotina in Ascomycota and the subphyla Agricomycotina and Pucciniomycotina in Basidiomycota; the small group includes the three subphyla Saccharomycotina, Taphrinomycotina, and Ustilaginomycotina and the phylum Microsporidia; and the large group has the subphylum Mucoromycotina and the phylum Oomycota (ANOVA and TukeyHSD; P < 0.05; Figure 1E). This grouping shows that the number of total ORFs does not correlate with taxonomic position at the phylum level; at the subphylum level, however, the correlation was high. For example, the subphyla Saccharomycotina and Taphrinomycotina can be distinguished from Pezizomycotina based on this character. The ORF density classified the sequenced species into three distinct groups, Oomycetes, Microsporidia, and the rest (ANOVA and TukeyHSD test; P < 0.05; Figure 1F). Taken together, these three indicators can be used to divide fungal subphyla/phyla. For example, the subphylum Pezizomycotina shows a medium level of ORF number and ORF density, while the subphylum Saccharomycotina displays a low ORF number but an ORF density comparable to that of the subphylum Pezizomycotina. Both the number of ORFs and the ORF density are high for oomycetes, exhibiting a pattern different from that of fungi.
The number of exons per ORF was also investigated, resulting in four groups (ANOVA and TukeyHSD test; P < 0.05; Figure 1G). With the exception of the subphylum Ustilaginomycotina, the phylum Basidiomycota exhibits the highest number (~6). The subphyla Saccharomycotina and Mucoromycotina show the lowest value (nearly 1), indicating that almost none of their genes have introns.
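The two per-genome indicators used in this comparison, ORF density (ORFs per Mb, as defined in the legend of Figure 1F) and the average number of exons per ORF, can be sketched as follows. The function names and the input numbers are hypothetical and serve only as an illustration.

```python
def orf_density(orf_count, genome_size_bp):
    """Number of ORFs per megabase of genome sequence."""
    if genome_size_bp <= 0:
        raise ValueError("genome size must be positive")
    return orf_count / (genome_size_bp / 1_000_000)


def mean_exons_per_orf(exon_counts):
    """Average exon count over all ORFs; a value near 1 means few introns."""
    if not exon_counts:
        raise ValueError("no ORFs given")
    return sum(exon_counts) / len(exon_counts)


# Hypothetical genome: 10,000 ORFs on a 40 Mb assembly.
print(orf_density(10_000, 40_000_000))
# Hypothetical exon counts for four ORFs.
print(mean_exons_per_orf([1, 1, 2, 4]))
```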

Comparison of genome sequences of multiple isolates within species

For 14 fungal species, two or more strains have been sequenced (Table 5). For some species, such as Fusarium graminearum, additional isolate(s) were sequenced only at low coverage (e.g., 0.4× coverage for the second strain of F. graminearum); however, even such low coverage provided some insights into the evolution of pathogenicity in this important cereal pathogen [37]. Except for Aspergillus niger, Histoplasma capsulatum, and Paracoccidioides brasiliensis, all strains within the same species showed less than 1 Mb variation in genome size (Table 5). It is possible that the 3.2 Mb difference between the two A. niger strains is in part due to different sequencing coverage: the coverage of ATCC1015 was 8.9× while that of CBS513.88 was 7× [38]. The differences among the three P. brasiliensis genomes, ranging from 29.1 Mb to 33.0 Mb, may reflect their distinct phylogenetic positions [39]. The differences among the five H. capsulatum genomes may be due to a combination of different levels of sequencing coverage and different geographical origins [40]. Three isolates of H. capsulatum and P. brasiliensis showed approximately 1% difference in GC content, whereas the degree of GC content variation among 11 strains of Coccidioides posadasii was only 0.5%. Four Cryptococcus neoformans strains, representing three different serotypes (A, B, and D), showed around 0.3% variation in GC content, and within a serotype (two serotype D strains) the difference was only 0.043% [41]. Isolates of Candida albicans, Saccharomyces bayanus, and Batrachochytrium dendrobatidis showed only 0.01% variation in GC content. These intraspecific variations in genome properties can be compared in detail via SNUGB.
Table 5

Basic properties of different strains of fungal genomes deposited in SNUGB.

Species                               # of strains   Genome size (Mb)    GC content (%)
Fungi (Kingdom)
  Ascomycota (Phylum)
    Pezizomycotina (Subphylum)
      Aspergillus fumigatus           2              29.3 ± 0.1          49.672 ± 0.178
      Aspergillus niger               2              35.6 ± 2.3          50.365 ± 0.012
      Coccidioides immitis            4              28.3 ± 0.7          46.529 ± 0.514
      Coccidioides posadasii          11             27.2 ± 0.9          46.839 ± 0.537
      Histoplasma capsulatum          5              36.2 ± 4.7          43.400 ± 1.859
      Paracoccidioides brasiliensis   3              30.7 ± 2.0          43.868 ± 0.930
      Fusarium graminearum^a          2              36.6                48.283
    Saccharomycotina (Subphylum)
      Candida albicans                2              14.4 ± 0.1          33.462 ± 0.010
      Saccharomyces cerevisiae        3              11.9 ± 0.3          38.252 ± 0.090
      Saccharomyces bayanus           2              11.7 ± 0.3          40.196 ± 0.011
      Saccharomyces mikatae^b         2              11.1 ± 0.5          37.920 ± 0.315
  Basidiomycota (Phylum)
    Agricomycotina (Subphylum)
      Cryptococcus neoformans         4              19.2 ± 0.2          48.251 ± 0.316
    Ustilaginomycotina (Subphylum)
      Ustilago maydis                 2              19.7 ± 0.0          53.995 ± 0.045
  Chytridiomycota (Phylum)
      Batrachochytrium dendrobatidis  2              24.1 ± 0.3          39.261 ± 0.011
Chloroplastida (Kingdom)
  Charophyta (Phylum)
      Oryza sativa                    2              399.2 ± 38.4        43.530 ± 0.046
  Vertebrata (Phylum)
      Homo sapiens                    4              3,052.2 ± 409.3     40.878 ± 0.042

a One of the strains is an incomplete whole-genome sequence, so the standard deviations of genome length and GC content were not calculated.

b Same strain but different versions of the assembly.


Update of SNUGB

The number of ongoing fungal genome sequencing projects is approximately 40. In addition, 37 strains of S. cerevisiae and 25 strains of S. paradoxus have already been sequenced and released by the Sanger Institute, indicating that more than 100 additional fungal genomes will be available soon. Next-generation high-throughput sequencing technologies, such as GS FLX, Solexa, and SOLiD [42,43], will further accelerate the rate of fungal genome sequencing, emphasizing the importance of frequently updating SNUGB. With the aid of the developed pipeline, SNUGB will be updated whenever new fungal genome sequences are publicly released with annotation information. A notice of updated genomes will be posted on the SNUGB web site.

Functions and tools

Taxonomy browser

To support selection of species of interest based on their taxonomic positions, a web-based tool, named the taxonomy browser, was developed. Because comparisons of genome sequences and features across multiple species to investigate evolutionary questions at the genome scale are expected to increase, such a tool is needed to give users of SNUGB an overview of the taxonomic positions of the sequenced species and their evolutionary relationships with other fungi, and to assist them in selecting appropriate species for comparative analyses. The taxonomy browser provides two methods for accessing the data archived in SNUGB. The first is a text search by species name (Figure 2A): when a user begins typing a species name in the text box, the full name is completed automatically to speed up the search. The second uses the taxonomic hierarchy (i.e., the tree of life): when a user clicks a specific taxon (e.g., a phylum), the taxonomy browser presents all subgroups within the chosen taxon for further selection (Figure 2B).
Figure 2

Taxonomy browser. A screenshot of data generated using Taxonomy browser is shown. (A) Search interface by species name shows a list of species along with inserted string. (B) Taxonomical tree shows a lineage of the chosen species and its genome datasets deposited in SNUGB.
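The two access methods of the taxonomy browser, name completion and stepwise descent through the taxonomic hierarchy, can be illustrated with the small sketch below. It is not the SNUGB implementation: the dictionary, function names, and matching rule (case-insensitive prefix match) are assumptions, and the lineages simply follow the grouping used in Table 5.

```python
# Hypothetical in-memory index; lineages follow the grouping in Table 5.
LINEAGES = {
    "Aspergillus fumigatus": ("Fungi", "Ascomycota", "Pezizomycotina"),
    "Aspergillus niger": ("Fungi", "Ascomycota", "Pezizomycotina"),
    "Candida albicans": ("Fungi", "Ascomycota", "Saccharomycotina"),
    "Cryptococcus neoformans": ("Fungi", "Basidiomycota", "Agricomycotina"),
    "Ustilago maydis": ("Fungi", "Basidiomycota", "Ustilaginomycotina"),
}


def complete_species(prefix):
    """Return species names starting with the typed text (case-insensitive)."""
    typed = prefix.strip().lower()
    if not typed:
        return []
    return sorted(name for name in LINEAGES if name.lower().startswith(typed))


def children_of(path):
    """Return the subgroups (or species) directly below a taxon path."""
    path = tuple(path)
    depth = len(path)
    found = set()
    for species, lineage in LINEAGES.items():
        if lineage[:depth] == path:
            # Below the deepest rank, the children are the species themselves.
            found.add(lineage[depth] if depth < len(lineage) else species)
    return sorted(found)


print(complete_species("asp"))
print(children_of(("Fungi", "Basidiomycota")))
```

A prefix match is the simplest rule that reproduces the described behavior; a production tool might also match on genus abbreviations or synonyms.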


Chromosome viewer and Contig/ORF browser

Three different methods can be used to access genomic information. For species with chromosomal map data (21 species), the chromosomal maps can be displayed via the Chromosome viewer (Figure 3A). The following color scheme is used to denote the level of completeness: i) chromosomes constructed using genetic or optical map data (with gaps) are shown in blue (Chromosomes 1 to 7 of M. oryzae; Figure 3A), ii) chromosome maps based on a combination of sequences and genetic/optical map data are shown in pink (e.g., chromosomes of A. niger), and iii) unassigned contigs (labeled as Chromosome Ex of M. oryzae; Figure 3A) are shown in light blue. For species without chromosomal map information, SNUGB provides the contig and ORF browsers, which display the names of contigs and ORFs, respectively, and allow users to search them by name (Figures 3B and 3C).
Figure 3

Chromosome viewer, Contig viewer, and ORF viewer. (A) The chromosome viewer displays seven chromosomes of M. oryzae with a size indicator on the right side. At the bottom, the interface allows users to jump directly to a specific region by selecting a chromosome/contig and a position. (B) The contig viewer provides a list of contigs with their lengths. Through this interface, contigs can be searched by name. (C) The ORF viewer presents the names and lengths of ORFs with a search function.


Graphical Browser with six different display formats

Gene annotation information in a selected area of a chromosome or contig, such as transcripts, ORFs, exon/intron structure, and InterPro domains [29], can be displayed in three formats: i) the 'single' format shows these features as bars; ii) the 'squish' format displays them via color-coded diagrams without description; and iii) the 'pack' format presents them as small color-coded icons with description (Figure 4A). These graphical formats are also used by the UCSC Genome Browser [2]. In addition, the GC content and AT/GC skew information for individual chromosomes can be displayed in three formats: i) color-coded bar graph, ii) line, and iii) dotted lines along with a description of the data (Figure 4B). For species with EST data (Table 1), the genomic region corresponding to each EST sequence can be displayed along with ORFs and InterPro domains to help users identify predicted gene structures and expressed regions (see Figure 4A). Presentation of these data is supported by the Fungal Expression Database.
Figure 4

Six different display methods of the genome content and properties via Graphical browser. (A) The graphical browser in SNUGB shows the genome context via three different formats: bar, squash, and pack. At the bottom, ORFs, ESTs, and InterPro domains on chromosome 1 of M. oryzae are displayed. (B) Three graphic representations, including graph, line, and Single line (S. line), of the AT-skew, GC-skew, and GC content are shown.
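The GC content and skew tracks can be thought of as per-window statistics along a chromosome. The sketch below assumes the conventional definitions, GC skew = (G - C)/(G + C) and AT skew = (A - T)/(A + T); the paper does not state the formulas or the window size used by SNUGB, so the function name, the window parameters, and the handling of ambiguous bases are all assumptions.

```python
def window_stats(seq, window=1000, step=None):
    """Compute GC content, GC skew, and AT skew in windows along a sequence.

    Assumed definitions: GC content = (G + C) / (A + C + G + T),
    GC skew = (G - C) / (G + C), AT skew = (A - T) / (A + T).
    Ambiguous bases such as N are ignored.
    """
    step = step or window
    seq = seq.upper()
    rows = []
    for start in range(0, max(len(seq) - window + 1, 1), step):
        chunk = seq[start:start + window]
        a, c, g, t = (chunk.count(base) for base in "ACGT")
        total = a + c + g + t
        rows.append({
            "start": start,  # 0-based offset of the window
            "gc_content": (g + c) / total if total else 0.0,
            "gc_skew": (g - c) / (g + c) if (g + c) else 0.0,
            "at_skew": (a - t) / (a + t) if (a + t) else 0.0,
        })
    return rows


# Toy sequence with a window of 4 bases.
for row in window_stats("GGGCAATT", window=4):
    print(row)
```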


Table browser and Text browser

Although graphical presentation of genomic features helps users view global patterns, the graphical browser does not provide sequences or a list of the elements present in a chosen area. To provide such information, we developed two additional tools named the table browser and the text browser. The table browser provides a list of the names and chromosomal/contig positions of all elements present in a selected region in CSV format, which can be opened in Excel (Figure 5A). The text browser provides the sequence of a selected region. If ORFs exist in the region, exons and introns are presented using different colors and letter cases; this function is useful for designing primers and transferring selected sequences to a different data analysis environment (Figure 5B). Additionally, all InterPro domains present on each ORF are displayed as special characters under the corresponding sequences so that putative functional domains can be easily recognized at the sequence level. The table and text browsers can display regions of up to 50 kb.
Figure 5

Table and Text browsers. (A) The table browser shows all ORFs, ESTs, and InterPro domains in a selected region as a list. (B) The text browser displays sequences showing exon/intron region as different colors and EST and InterPro domains.
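The behavior of the table and text browsers can be approximated as follows. This is a sketch under stated assumptions, not the SNUGB code: the function names, the feature tuple layout, the CSV column names, the 1-based inclusive coordinates, and the choice of upper case for exons and lower case for everything else are all hypothetical; only the 50 kb limit comes from the text.

```python
import csv
import io

MAX_REGION_BP = 50_000  # display limit stated for the table and text browsers


def features_to_csv(features, start, end):
    """List features overlapping [start, end] as CSV text.

    features: iterable of (name, contig, feature_start, feature_end),
    using 1-based inclusive coordinates (an assumption).
    """
    if end - start + 1 > MAX_REGION_BP:
        raise ValueError("region exceeds the 50 kb display limit")
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["name", "contig", "start", "end"])
    for name, contig, f_start, f_end in features:
        if f_start <= end and f_end >= start:  # any overlap with the region
            writer.writerow([name, contig, f_start, f_end])
    return buffer.getvalue()


def render_region(seq, region_start, exons):
    """Return the region sequence with exons in upper case, the rest lower.

    exons: list of (start, end) in 1-based inclusive genome coordinates.
    """
    letters = list(seq.lower())
    for exon_start, exon_end in exons:
        low = max(exon_start - region_start, 0)
        high = min(exon_end - region_start + 1, len(letters))
        for i in range(low, high):
            letters[i] = letters[i].upper()
    return "".join(letters)


# Hypothetical features and sequence.
demo = [("geneA", "contig1", 10, 50), ("geneB", "contig1", 200, 300)]
print(features_to_csv(demo, 1, 100))
print(render_region("acgtacgtac", 101, [(103, 105)]))
```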


Kingdom-wide identification of the putative orthologues of individual fungal proteins via BLAST and comparison of the genomic contexts and properties of homologous proteins among species via the Session History function

To identify putative orthologues of individual fungal proteins, BLAST searches with each of the 924,343 fungal proteins against all proteins were performed using an e-value of 1e-5 as the cut-off. The 'BLAST annotation' tab shows a list of putative orthologues of a chosen gene product in other species with their BLAST e-values (see Figure 6A). To compare the genomic contexts around orthologous genes between two or more species, users can store the genomic contexts of the genes using the Session History function, which displays the stored genomic contexts on one screen (Figure 6B). In each session, other information, such as the GC content and InterPro terms, can also be presented to further support the comparison.
Figure 6

BLAST annotation to catalog homologous proteins. (A) A result of 'BLAST annotation' is shown with the corresponding gene names, species names, and e-values of putative homologs. 'Genome Browser' button after gene name can display the genome context of the selected gene, and 'Mini GB' button will show genome contexts of the selected gene as a smaller size to provide a quick overview, supported by MiniGB. The session can be stored by clicking the save link inside the small SNUGB image. (B) Two independent sessions showing homologs of two genes, MGG_01378.5 and FGSG_01632.3, are shown. Clicking the red button X at the bottom will hide the session.
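Applying the 1e-5 e-value cut-off to BLAST results might look like the sketch below. It assumes the standard 12-column tabular BLAST output, in which the e-value is the 11th field; the paper does not say which output format was parsed, whether the cut-off was inclusive, or how self-hits were handled, so those choices and all identifiers in the example are hypothetical.

```python
EVALUE_CUTOFF = 1e-5  # cut-off stated in the text


def filter_hits(tabular_lines, cutoff=EVALUE_CUTOFF):
    """Keep (query, subject, e-value) hits at or below the cut-off.

    Assumes standard 12-column tabular BLAST output:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send
    evalue bitscore. Self-hits are dropped (an assumption).
    """
    hits = []
    for line in tabular_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        query, subject, evalue = fields[0], fields[1], float(fields[10])
        if query != subject and evalue <= cutoff:
            hits.append((query, subject, evalue))
    hits.sort(key=lambda hit: hit[2])  # most significant first
    return hits


# Hypothetical identifiers and scores, for illustration only.
example = [
    "protA\tprotA\t100.0\t300\t0\t0\t1\t300\t1\t300\t0.0\t600",
    "protA\tprotB\t55.0\t290\t120\t4\t5\t294\t3\t290\t1e-50\t190",
    "protA\tprotC\t30.0\t80\t50\t3\t10\t89\t20\t99\t2e-3\t35",
]
print(filter_hits(example))
```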


Additional functionalities of SNUGB

The 'flexible-range-select' function allows users to select a chromosomal segment by clicking the mouse at the start site and moving it over the desired segment; the selected area is displayed as a shaded box, and a subsequent click displays an enlarged view of the selected segment (Figure 3A). Through the 'high-resolution-diagram' function, users can obtain a high-resolution image (more than 3,000 pixels in width) showing various features on a whole chromosome, such as ORFs, InterPro terms, and GC content. This image can be downloaded as an image file via both the graphical genome browser and the session-storage function.
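A range selection of this kind requires mapping the pixel positions of the mouse drag back to genomic coordinates. The paper does not describe how SNUGB performs this conversion, so the following is only a hypothetical sketch that assumes a linear scale and 1-based inclusive coordinates; the function and parameter names are invented for illustration.

```python
def pixel_range_to_bp(x_start, x_end, view_start, view_end, width_px):
    """Convert a dragged pixel range into 1-based inclusive genome coordinates.

    Assumes the displayed region [view_start, view_end] is drawn on a
    linear scale across an image that is width_px pixels wide.
    """
    if width_px <= 0 or view_end < view_start:
        raise ValueError("invalid view")
    low, high = sorted((x_start, x_end))  # allow dragging right-to-left
    low = max(0, min(low, width_px))
    high = max(0, min(high, width_px))
    bp_per_px = (view_end - view_start + 1) / width_px
    sel_start = view_start + int(low * bp_per_px)
    sel_end = min(view_end, view_start + int(high * bp_per_px) - 1)
    return sel_start, max(sel_start, sel_end)


# Hypothetical 3 Mb view drawn on a 3,000-pixel-wide image.
print(pixel_range_to_bp(100, 200, 1, 3_000_000, 3000))
```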

Conclusion

The SNUGB supports efficient and versatile visualization and utilization of rapidly increasing fungal genome sequence data, as well as those from selected organisms in other kingdoms, to address various types of questions at the genome scale. Properties and features of the archived fungal genomes are available for viewing and comparison in SNUGB. The taxonomy browser helps users easily access the genomes of individual species and provides taxonomic positions of chosen species, and the chromosome map function shows the whole genome of selected species. The graphical browser, table browser, and text browser present a global view of genomic contexts in a selected chromosomal region and support analyses of sequences in the region. The 'BLAST annotation' provides lists of putatively orthologous proteins in the fungal kingdom and facilitates comparison of the genomic contexts of their genes across multiple species. The SNUGB also allows users to manage their own work histories via the SNUGB web site.

Availability and requirements

All data and functionalities described in this paper can be freely accessed through the SNUGB web site at http://genomebrowser.snu.ac.kr/. The source code, a set of programs, and the database structure of SNUGB will be publicly released once the packaging of SNUGB is finalized.

Abbreviations

PZ: the subphylum Pezizomycotina; SC: the subphylum Saccharomycotina; TP: the subphylum Taphrinomycotina; AG: the subphylum Agricomycotina; PC: the subphylum Pucciniomycotina; US: the subphylum Ustilaginomycotina; CH: the phylum Chytridiomycota; MU: the subphylum Mucoromycotina; MS: the phylum Microsporidia; OO: oomycete (the phylum Peronosporomycota); AT: the phylum Arthropoda; ST: the phylum Streptophyta; BCM: Baylor College of Medicine; BGI: Beijing Genomics Institute; BGM: Baylor College of Medicine; BI: Broad Institute; CBS: Center For Biological Sequences; DOGAN: Database Of the Genomes Analyzed at NITE; EM: Ensembl; FB: Flybase; GDB: GeneDB; GS: Genoscope; HBGP: Honey Bee Genome Project; IGM: Institut de Génétique et Microbiologie; IRGSP: International Rice Genome Sequencing Project; JBPC: Josephine Bay Paul Center for Comparative Molecular Biology and Evolution; JGI: DOE Joint Genome Institute; MGSP: Maize Genome Sequencing Project; MTGSP: Medicago Truncatula Genome Sequencing Project; OU: Oklahoma University; PGC: Procter & Gamble Co; PGSC: Papaya Genome Sequencing Consortium; SGTC: Stanford Genome Technology Center; SI: Sanger Institute; SIG: Trinity College Dublin: Smurfit Institute of Genetics; TAIR: The Arabidopsis Information Resource; VGI: Virginia Bioinformatics Institute; WB: Wormbase; WGSC: Washington University Genome Sequencing Center.

Authors' contributions

JP and YHL planned and managed this project; KJ designed the web site; KJ, JP, BP, KA, JYC, and JHC implemented various functions in SNUGB; JP, JYC, SIK, and DC processed genome sequences; and JP, SK, and YHL wrote the manuscript.