Literature DB >> 32292921

Nucleotide and protein researches on anaerobic fungi during four decades.

Abstract

Anaerobic fungi habitat in the gastrointestinal tract of foregut fermenters or hindgut fermenters and degrade fibrous plant biomass through the hydrolysis reactions with a wide variety of cellulolytic enzymes and physical penetration through fiber matrix with their rhizoids. To date, seventeen genera have been described in family Neocallimasticaceae, class Neocallimastigomycetes, phylum Neocallimastigomycota and one genus has been described in phylum Neocallimastigomycota. In National Center for Biotechnology Information (NCBI) database (DB), 23,830 nucleotide sequences and 59,512 protein sequences have been deposited and most of them were originated from Piromyces, Neocallimastix and Anaeromyces. Most of protein sequences (44,025) were acquired with PacBio next generation sequencing system. The whole genome sequences of Anaeromyces robustus, Neocallimastix californiae, Pecoramyces ruminantium, Piromyces finnis and Piromyces sp. E2 are available in Joint Genome Institute (JGI) database. According to the results of protein prediction, average Isoelectric points (pIs) were ranged from 5.88 (Anaeromyces) to 6.57 (Piromyces) and average molecular weights were ranged from 38.7 kDa (Orpinomyces) to 56.6 kDa (Piromyces). In Carbohydrate-Active enZYmes (CAZY) database, glycoside hydrolases (36), carbohydrate binding module (11), carbohydrate esterases (8), glycosyltransferase (5) and polysaccharide lyases (3) from anaerobic fungi were registered. During four decades, 1,031 research articles about anaerobic fungi were published and 444 and 719 articles were available in PubMed (PM) and PubMed Central (PMC) DB. © Copyright 2020 Korean Society of Animal Science and Technology.

Entities: CellLine Chemical Disease Gene Species

Keywords: Anaerobic fungi; Neocallimasticaceae; Neocallimastigomycota; Nucleotide; Protein

Year: 2020 PMID： 32292921 PMCID： PMC7142291 DOI： 10.5187/jast.2020.62.2.121

Source DB: PubMed Journal: J Anim Sci Technol ISSN： 2055-0391

INTRODUCTION

Herbivorous animals can utilize fibrous carbohydrates as feed resources through the actions of various carbohydrate hydrolyzing enzymes secreted by microorganisms in their gastrointestinal tracts (GIT) including rumen for foregut fermenters and cecum for hindgut fermenters. Produced volatile fatty acids and microbial cells are absorbed and used as energy and nitrogen sources by host animals. The performance and production of herbivorous animals are closely related to the microbial ecosystem and individual microorganisms in GIT and this has made scientists isolate, identify and characterize microorganisms from GIT. In the GIT of herbivores, bacteria, protozoa and fungi are the major microorganisms which degrade fibrous feed and bacteria and protozoa degrade fiber only with the reaction of fiber degrading enzymes [1,2]. However, the anaerobic fungi (AF) degrade fiber through their physical and biochemical reactions. During fiber degradation, the anaerobic fungi penetrate their rhizoid filaments into the fiber matrix [3] and secrete a wide variety of both carbohydrate hydrolyzing enzymes and protein hydrolyzing enzymes [4]. The carbohydrate hydrolyzing enzymes secreted from the anaerobic fungi during fiber degradation are both glucanases [5] and xylanases [6] and the former hydrolyzes cellulose or starch to hexose glucose and the later hydrolyzes hemicellulose to pentose xylose. More than four decades, the carbohydrate degrading enzymes from the anaerobic fungi were considered as promising resources both for animal production and other industries including food, paper, fabric and bioenergy. For this reason, many research teams have tried to isolate and identify the anaerobic fungi either from the foregut or hindgut of herbivores. However, only limited numbers of anaerobic fungi have been isolated and identified from the gastrointestinal tract of herbivores due to the difficulties in culture conditions. Besides, the sequence analysis of anaerobic fungi had been hindered with their high adenine (A) and thymine (T) contents and high A–T rich repeats in nucleotide sequences. Recently, the development of high throughput sequencing using the next generation sequencer (NGS) made it possible to analyze the fungal genome and transcriptome with or without culturing technique. In this paper, the researches on the anaerobic fungi during the last four decades were reviewed based on publicly available DB.

ABOUT ANAEROBIC FUNGI

Anaerobic fungi were classified as flagellated protozoa due to their flagellated zoospore [7,8] before Orpin’s report (1975) and they were reclassified as family Neocallimastigaceae [9], order Neocallimastigales [10], and phylum Neocallimastigomycota [11]. Finally, the anaerobic fungi are classified in the phylum Neocallimastigomycota containing one class; Neocallimastigomycetes, one order; Neocallimastigales, one family; Neocallimasticaceae and 18 genera (Fig. 1). The genera in the family Neocallimasticaceae have been classified based on the morphological characteristics including zoospore flagellation (uniflagellate vs. polyflagellate), the sporangia development (monocentric vs. polycentric) and the thallus morphology (filamentous vs. bulbous) [12]. The number of flagella on zoospores is less than 4 in uniflagellate fungi and uniflagellate fungi are Agriosomyces, Akiloshbomyces, Anaeromyces, Buwchfawromyces, Caecomyces, Capellomyces, Joblinomyces, Khoyollomyces, Liebetanzomyces, Oontomyces, Pecoramyces and Piromyces. The polyflagellate fungi including Cyllamyces, Feramyces, Ghzallomyces, Neocallimastix, and Orpinomyces have more than 4 flagella on their zoospores. In monocentric fungi including Agriosomyces, Akiloshbomyces, Buwchfawromyces, Caecomyces, Capellomyces, Cyllamyces, Feramyces, Ghzallomyces, Joblinomyces, Khoyollomyces, Liebetanzomyces, Neocallimastix, Oontomyces, Pecoramyces and Piromyces, single sporangium containing a nucleus develops on its thallus and their rhizoidal system are anucleate. On the other hand, several sporangia develop on their thallus and the nucleus migrate through rhizoid with repeated division in polycentric fungi including Anaeromyces and Orpinomyces. The genera Caecomyces and Cyllamyces are bulbous type thallus with holdfast sporangia [13,14]. However, there have been many difficulties in morphology based identification due to variations in the number of flagellate and shape and size of sporangia under different nutritional environments [13]. The development of molecular technique led to the application of gene sequence for fungal identification and several different barcode markers were suggested. Dore and Stahl [15] compared the small subunit (SSU) rRNA genes of different anaerobic fungi for the first time, however, inter-relationship between genera was not clear due to highly conserved sequences in SSU. The internal transcribed spacer 1 region (ITS1) of rRNA was suggested by Li and Heath [16] and established [17]. Later several regions within the ITS1 were suggested for identification and quantification of anaerobic fungi [18,19]. The use of large subunit (LSU) rRNA gene was reported by Hausner and colleagues [20] for the first time and practical application methods using D1/D2 region within LSU for identification were reported [21,22]. Currently, both ITS1 and LSU sequences are analyzed for the identification of anaerobic fungi to compensate for the limitations of each marker [23].

Fig. 1.

Taxonomy of anaerobic fungi.

The identified 35 species of anaerobic fungi were isolated from either ruminant (cattle, buffalo, sheep, wild goat, and camel) or non-ruminant herbivores (horse, donkey, and elephants) (Table 1). Among 35 identified species, 19 species were isolated from rumen contents and 16 species were isolated from fecal samples. During the 1980s, three filamentous type fungi including Neocallimastix, Orpinomyces and Piromyces and one bulbous type fungus, Caecomyces, with a total of 5 species were isolated and described from New Zealand (1), Canada (2) and United Kingdom (3). The new genus Anaeromyces and 3 genera with 11 species were isolated and described from Australia (1), Canada (1), United Kingdom (1), France (2), New Zealand (2) and Malaysia (4) during the 1990s and Ho and her colleagues [42] isolated 4 genera with 5 species during this period. The bulbous type fungi Caecomyces and Cyllamyces and filamentous type fungus Piromyces with a total of 3 species were isolated and described from United Kingdom (1) and Taiwan (2) during the 2000s. After the report on the use of barcode markers for identification of anaerobic fungi [15-17], Caecomyces sympodialis was the first described anaerobic fungi based on both barcode marker and morphological characteristics [49]. During the last decade, 12 new genera with a total 23 species were identified due to advanced barcode marker techniques and increased needs for new enzyme resources required for bioenergy production.

Table 1.

The isolation of anaerobic fungi from different host animals

Genus	Species	Host animal	Sample	Year	Ref.
Agriosomyces	longus	Ovis orientalis	Feces	2020	[24]
Aklioshbomyces	papillarum	Odocoileus virginianus	Feces	2020	[24]
Anaeromyces	contortus	Bos taurus	Feces	2018	[25]
	elegans	Bos taurus	Rumen	1993	[13]
	mucronatus	Bos taurus	Rumen	1990	[26]
	polycephalus	Babalus befullus	Rumen	2012	[27]
	robustus	Ovis aries	Feces	2016	[28]
Buwchfawromyces	eastonii	Bulbalus bubalis	Feces	2015	[29]
Caecomyces	churrovis	Ovis aries	Feces	2017	[30]
	communis	Ovis aries	Feces	1988	[31]
	equi	Equus ferus	Feces	1988	[31]
	hurleyensis	Ovis aries	Rumen	2012	[27]
	sympodialis	Bos indicus	Rumen	2007	[32]
Capellomyces	elongates	Capra aegagrus	Feces	2020	[24]
	foraminis	Capra aegagrus	Feces	2020	[24]
Cyllamyces	aberensis	Bos taurus	Feces	2001	[33]
Feramyces	austinii	Ammotragus lervia	Rumen	2018	[34]
Ghazallomyces	constrictus	Axis axis	Feces	2020	[24]
Joblinomyces	apicalis	Capra aegagrus	Feces	2020	[24]
Khoyollomyes	ramosus	Eauus grevyi	Feces	2020	[24]
Liebetanzomyces	polymorphus	Capra aegarrus	Rumen	2018	[35]
Neocallimastix	californiae	Capra aegarrus	Feces	2016	[28]
	cameroonii	Ovis aries	Feces	2015	[36]
	frontalis	Ovis aries	Rumen	1983	[9]
	hurleyensis	Ovis aries	Rumen	1991	[37]
	patriciarum	Ovis aries	Rumen	1986	[38]
	variabilis	Bos indicus		1993	[39]
Oontomyces	anksri	Damelus dromedarius	Rumen	2015	[40]
Orpinomyces	bovis	Bos taurus	Rumen	1989	[41]
	intercalaris	Bos indicus	Rumen	1994	[42]
	joyonii	Bos taurus	Rumen	1991	[43]
Pecoramyces	ruminantium	Bos taurus	Feces	2017	[44]
Piromyces	communis	Ovis aries	Feces	1988	[31]
	cryptodigmaticus	Bos taurus	Feces	2012	[27]
	dumbonicus	Elephas maximus	Feces	1990	[45]
	finnis	Equus ferus	Feces	2016	[28]
	irregularis	Bos taurus	Rumen	2015	[36]
	mae	Equus ferus	Feces	1990	[45]
	minutus	Cervus nippon	Rumen	1993	[46]
	polycephalus	Babalus befullus	Rumen	2002	[47]
	rhizinflatus	Equus africanus	Feces	1991	[48]
	spiralis	Capra aegarrus	Rumen	1993	[49]
Tahromyces	munnarensis	Nilgiritragus hylocrius	Feces	2020	[24]

Ref, reference.

NUCLEOTIDES OF ANAEROBIC FUNGI

After the first deposition of genomic DNA sequence from Neocallimastix ssp., by Brownlee [50], 23,830 nucleotide sequences were available in NCBI database (Table 1) and the numbers were in the order of Piromyces (82.63%), Neocallimastix (9.25%), Anaeromyces (4.87%), Orpinomyces (1.65%), Cyllamyces (0.54%), Caecomyces (0.38%), Pecoramyces (0.29%), and Feramyces (0.29%). In the early 90s, the applications of cDNA library construction method using λ-ZAP II vector system were reported [51,52] and the needs of potent carbohydrate degrading enzyme for bioenergy production triggered researches on the functional genes originated from the anaerobic fungi during. Also, genome projects of JGI supported by the U.S. Department of Energy (DOE) accelerated the researches on the anaerobic fungi and resulted in large amounts of nucleotide sequences information about the anaerobic fungi during the third decade. During this period, about 17,100 out of 18,771 nucleotide sequences were acquired with transcriptome analysis method using constructed cDNA library in efforts to get the sequence information about carbohydrate degrading enzymes. Currently, whole genome sequences of Anaeromyces robustus, Neocallimastix californiae, Pecoramyces ruminantium, Piromyces finnis and Piromyces sp. E2 are available in JGI (https://mycocosm.jgi.doe.gov/index.html). The development of NGS technologies also affected the nucleotide researches on AF during the fourth decade and total 3,865 nucleotides sequenced with NGS including Illumina system (2016), PacBio system (1833) and Roch 454 GS-FLX system (16) were deposited in NCBI DB. The number of deposited nucleotide sequences was smaller than expected because the developments of NGS technologies and bioinformatics tools made it possible to translate acquired nucleotide sequences to amino acid sequences for protein prediction. After the first report of ITS1 and 18S rRNA sequences (AF170188.10 [17], 1,160 nucleotide sequences including 210 partial sequences from Anaeromyces were deposited in NCBI DB and most of them (925) were acquired with whole genome sequence analyses. The sequencing methods were PacBio sequencing system (925), Sanger method (115) and Illumina sequencing system (19) and. The number of reported ribosomal RNA sequences from Anaeromyces was 194 including ITS1 (186), 5.8S rRNA (149) and 28S rRNA (68) sequences and those were used for identification of anaerobic fungi. Only 16 mRNA sequences including carbohydrate hydrolyzing enzymes, polysaccharide lyase (PL), esterases, phosphatase, and acyltrasferase were reported. The carbohydrate hydrolyzing enzyme sequences including cellulose (cel 1, GI 33327793), lichenase (licB, GI 33327791), glycosyl hydrolase family (GH) 1 (MH043785.1), GH10 (MH043796.1), GH32 (MH043815.1), and GH 67 (MH043829.1) were reported. In addition, three carboxyl esterases (CE) (CE7; MH043852.1, CE 12; MH043857.1, MH043856.1) and two polysaccharide lyases (PL) (PL9; MH043867.1, PL11; MH043868.1) sequences were also reported. After the first report about 18S ribosomal RNA (M62707.1) originated from Caecomyces communis in 1993, 91 nucleotide sequences including 77 partial sequences were available in NCBI DB and the sequences were acquired with Sanger method (45) and Illumina system (23). Most of the nucleotides (68) from Caecomyces were related to fungal identification including ITS1 (61), 5.8S rRNA (61), 18S rRNA (22) and 28S rRNA (9) and only 20 nucleotide sequences were related to the information about the fungal protein. The sequences related to glycosyl hydrolase families (GH3; MH043789.1, GH8; MH043793.1, GH11; MH043803.1, GH18; MH043803.1, GH43; MH043822.1, and GH115; MH043838.1), carboxyl esterases (CE2; MH043845.1, CE3; MH043846.1, and CE15; MH043860.1) and polysaccharide lyase (PL9; MH043865.1) from Caecomyces were reported. One interesting sequence was the immunity protein 51 which was not reported from other anaerobic fungi but bacteria as a hypothetical protein. The number of partial nucleotide sequences originated from genus Cyllamyces in NCBI DB was 122 out of total 130 nucleotide sequences and 113 nucleotide sequences were from uncultured Cyllamyces. The sequences related to ribosomal RNA were 122 including ITS1 (64), 5.8S rRNA (64), 18S rRNA (118), and 28S rRNA (4) and 8 sequences were related to the patent of xylose isomerase production. However, no mRNA sequences related to protein were reported yet. The number of partial nucleotide sequences originated from genus Feramyces in NCBI DB was 65 out of total 69 nucleotide sequences and the sequences were acquired with Sanger method (46) and Illumina system (23). The number of ribosomal RNA sequences were 44 including ITS1 (28), 5.8S rRNA (25) and 28S rRNA (16) sequences. The nucleotide sequences related to carbohydrate degrading proteins including glycosyl hydrolase (GH2; MH043788.1, GH5; MH043790.1, GH10; MH0437951, GH30; MH043812.1, GH43; MH043823.1, GH47; MH043824.1, GH48; MH043825.1, GH76; MH043830.1, MH043831.1 GH97; MH043835.1, and GH130; MH043840.1), carboxyl esterase (CE1; MH043842.1, CE2; MH043843.1, CE4; MH043847.1, and CE12; MH043854.1), α-amylase (MH044722.1) and galactoside-O-acetyltransferase (MH043885.1) were deposited by Murphy and coworkers (http://ncbi.nlm.nih.gov/nuccore). After the first report on AT-rich region of Neocallimastix LM-2 DNA (X14665.1) [50], 2,111 nucleotide sequences including 1,152 mRNAs and 163 rRNAs including ITS1 (140), 5.8S (110), 18S (98) and 28S (35) have been deposited in NCBI DB. In addition, 45 nucleotides were patent sequences related with AXEs, xylanases or enzyme production. The genes related with 11 glycosyl hydrolases (GH1, GH16, GH17, GH19, GH28, GH32, GH35, GH36, GH43, Gh64, and GH108), 4 carboxyl esterases (CE2, CE4, CE13, and CE16), glucanases (CelA, CelB, CelD, and Cel48) and cellobiohydrolases (CBH6, CBH20). Most nucleotide sequences were acquired with either ZAP II cDNA libraries (997) during transcriptome analyses or shotgun assembly (713) during whole genome analyses and used sequencing technologies were PacBio sequencing system (705), Sanger dideoxy sequencing method (78) or Illumina system (26). The evolutionary closeness of cyclophilin originated from Orpinomyces sp. PC-2 and human were the first report of the nucleotide information originated from the genus Orpinomyces [53] and total 394 nucleotide sequences including 195 rRNAs and 55 mRNAs in NCBI DB. In addition, 12 nucleotide sequences were related to information about patent and 47 ITS1, 115 5.8S, 141 18S, and 77 28S rRNA sequences were related to fungal classification. The nucleotide sequences of 4 cellulases, 9 cellobiohydrolases, 8 glycosyl hydrolases, 2 glucanohydrolases, beta-glucosidase and endo-glucanase were deposited. Besides, the nucleotide sequences of other 3 hexose-hydrolases including α-amylase, lichenase, mannase and 3 pentose-hydrolases including xylanase and xylose isomerase were deposited in NCBI DB. The whole genome sequences of Orpinomyces sp. strain C1A was reported by Yousseff and her colleagues in 2013 [54] and Orpinomyces sp. strain C1A was reclassified as Pecoramyces ruminatium strain C1A in 2017 [44]. Among available 70 nucleotide sequences, 32 mRNAs sequences and 21 rRNA sequences including ITS1 (15), 5.8S (7), 18S (3), and 28S (6) rRNA sequences originated from the genus Pecoramyces were deposited in NCBI DB. In addition, 16 nucleotide sequences related with carbohydrate degradation including α-amylase, β-glucosidase (BGL1 and BGL3), cellulase (Cel6A, Cel6B, and Cel48), glycosyl hydrolase (GH18, GH31, GH39, GH53, GH78, and GH88), polysaccharide lyase (PL1), and xylanase (XYL11 and xylan 1,4-β-xylosidase) originated from Orpinomyces were available. Most of Piromyces nucleotide sequences were acquired from expressed sequence tag (EST) sequences and 9 nucleotide sequences were from genome sequences including shotgun sequences, and chromosome sequences. Interestingly, 258 nucleotide sequences were from patents and 94 nucleotide sequences were ribosomal RNA sequences. In Piromyces, 10 glycosyl hydrolases (GH1, GH5, GH6, GH8, GH25, GH26, GH31, GH43, GH57, and GH 127), 3 carboxyl esterases (CE4, CE12 and CE15), 12 glucanases (Cel1B, Cel1C, Cel1D, Cel3A, Cel5, Cel6A, Cel6B, Cel6C, Cel6E, Cel6G, Cel9, and Cel48), 7 cellobiohydrolases (CBHB, CBH6, CBH29, CBH120, CBHYW23-1, CBHYW23-2, and CBHYW23-4), 2 polysaccharide lyase (PL4 and PL9), xylanases and xylose isomerases were reported. Only a limited number (12) of nucleotide sequences from genus Buwchfawromyces were available on NCBI DB and no information about functional genes from Buwchfawromyces has been reported yet. In addition, the numbers of nucleotide sequence from genus Liebetanzomyces and genus Oontomyces were 4 and 3, respectively, and all of them were ribosomal RNAs acquired with Sanger dideoxy sequencing method.

PROTEINS OF ANAEROBIC FUNGI

Until the end of the third decade, the total number of deposited protein amino acid sequences in NCBI DB was 189 even with the application of EST methods (Table 2). The length of sequence acquired from EST was ranged approximately from 500 to 800 nucleotides and this was not long enough to predict protein under the limited number of protein information. In addition, other working horse steps such as rapid amplification of cDNA ends (RACE) PCR method should be performed to get a complete amino acid sequence of the target protein [55]. It could be easy to get complete sequence information of expressed proteins with λ-ZAP II vector system, however either enough protein database or proper substrates for enzyme reactions to get the functional information about expressed proteins. The development of NGS technologies and their application to cDNA library of AF resulted in remarkable increase in sequence information of fungal proteome during the last two decades (Table 2). The number of sequence information acquired with PacBio system was 44,026 protein sequences from thee genera including Neocallimastix (20,214 proteins), Anaeromyces (12,833 proteins) and Piromyces (10,979 proteins) and that with Illumina system was 14,618 proteins from three genera including Piromyces (14,613 proteins), Neocallimastix (4 proteins) and Orpinomyces (1 proteins). On the other hand, the classical Sanger dideoxy sequencing method was also used for sequence analyses of 14,618 proteins.

Table 2.

The nuclei and proteins deposited in NCBI database during 4 decades

Fungi	DB	Year
Fungi	DB	1980s	1990s	2000s	2010s	Total
Anaeromyces	Nucleus	0	2	43	1,115	1,160
Anaeromyces	Protein	0	0	2	12,851	12,853
Buwchfawromyces	Nucleus	0	0	5	7	12
Buwchfawromyces	Protein	0	0	0	0	0
Caecomyces	Nucleus	0	1	13	77	91
Caecomyces	Protein	0	0	0	20	20
Cyllamyces	Nucleus	0	0	68	62	130
Cyllamyces	Protein	0	0	0	0	0
Feramyces	Nucleus	0	0	0	69	69
Feramyces	Protein	0	0	0	21	21
Liebetanzomyces	Nucleus	0	0	0	4	4
Liebetanzomyces	Protein	0	0	0	0	0
Neocallimastix	Nucleus	1	39	1,220	946	2,206
Neocallimastix	Protein	0	2	59	20,404	20,465
Oontomyces	Nucleus	0	0	0	3	3
Oontomyces	Protein	0	0	0	0	0
Orpinomyces	Nucleus	0	21	164	209	394
Orpinomyces	Protein	0	1	72	69	142
Pecoramyces	Nucleus	0	0	0	70	70
Pecoramyces	Protein	0	0	0	41	41
Piromyces	Nucleus	0	52	17,258	2,381	19,691
Piromyces	Protein	0	7	46	25,917	25,970
Total	Nucleus	1	115	18,771	4,943	23,830
Total	Protein	0	10	179	59,323	59,512

NCBI, National Center for Biotechnology Information; DB, database.

NCBI, National Center for Biotechnology Information; DB, database. Protein separation using two dimensional gel electrophoresis (2D-GE), sequence analysis using mass spectrometry (MS) and protein identification are three key steps for classic proteome analysis technique and this could be a quite powerful technique to collect and compare each protein information (Fig. 2) [56]. However, it requires long training time to get high quality results of 2D-GE due to lots of tricky steps and the repeatability of 2D-GE is comparatively low. Technical advances in MS, protein separation and sequence analyses could be performed with a single run of LC-MS and this method was regarded as more effective one than 2D-GE.

Fig. 2.

Comparison of fungal mat proteome (A) and culture supernatant proteome (B) of anaerobic fungus Neocallimastix frontalis PMA02 on 2D-GE.

In this paper, the protein virtual gels of individual fungus based on publically available amino acid sequences were constructed using JVirGel version 2.0 software (jvirgel.de). Among 12,853 protein sequences from Anaeromyces, 1,383 and 2,874 proteins were predicted as secretory proteins and membrane proteins and 5,452 proteins were predicted as remaining proteins that were not predicted either of them (Fig. 3, Supplementary Table S1). For the total proteome of Anaeromyces, the pI values were ranged from 3.00 (hypothetical protein, ORX83422.1) to 11.10 (hypothetical protein, ORX75296.1) and molecular weights were ranged from 200 kDa (Ketoacyl-synt-domain-containing protein, ORX78811.1) to 12.7 kDa (hypothetical protein, ORX75296.1). Among known functional proteomes, the domain of unknown function (DUF) 6 containing protein (ORX84229.1) with pI 9.82 had the highest pI and cellulose-domain containing protein (ORX78609.1) with pI 3.80 had the lowest pI among functionally known secretomes. The abi-domain containing protein (ORX55221.1) with 15 kDa was the smallest and P-loop containing nucleoside triphosphate hydrolase protein (ORX86651.1) with 182.4 kDa was the largest among functionally known secretome. The average pI and molecular weight of Anaeromyces secretome were 5.88 and 55.4 kDa, respectively. The MFS (major facilitator superfamily) transporter (ORX87586.1) with pI 9.99 had the highest pI and WD40 (β– transducin) repeat-like protein (ORX61822.1) with pI 3.88 had the lowest pI among functionally known membrane proteome of Anaeromyces. The keotacyl-synthetase domain containing protein (ORX78811.1) with 199.8 kDa was the largest and RER1 protein (ORX55221.1) which involved in the retrieval of endoplasmic reticulum membrane protein with 16 kDa was the smallest among functionally known predicted membrane proteome. The average pI and molecular weight of membrane proteome were 6.88 and 69.4 kDa, respectively.

Fig. 3.

Virtual 2D gel of Anaeromyces proteome.

Blue, predicted secretome; red, predicted membrane proteome; black, remaining proteins.

Virtual 2D gel of Anaeromyces proteome.

Blue, predicted secretome; red, predicted membrane proteome; black, remaining proteins. Among 15,745 protein sequences from Neocallimastix, 2,095 and 4,335 protein sequences were predicted as secretome and membrane proteome, respectively, and 9,315 protein sequences were predicted as remaining proteome (Fig. 4, Supplementary Table S2). The molecular weight of Neocallimastix proteome was a range from 200 kDa (hypothetical protein LY90DRAFT_666922, ORY72815.1) to 12.1 kDa (partial sequence of hypothetical protein LY90DRAFT_664663 from Neocallimastix californiae ORY79598.1) with an average of 58.8 kDa and hypothetical protein LY90DRAFT_499033 (ORY85919.1) with 13.8 kDa was the smallest among complete proteomes. One of hypothetical protein (ORY54617.1) with pI 10.0 had the highest pI, and another hypothetical protein (ORY24039.1) with pI 2.97 had the lowest pI among predicted secretomes of Neocallimastix. However, RNI-like protein (ORY54484.1) which involve in protein binding with pI 9.95 had the highest pI and invertase (ORY74006.1) with pI 3.84 had the lowest pI among functionally known secretomes of Neocallimastix. Scaffolding (ORY55229.1) which involved signaling pathway with 196 kDa was the largest and MFS general substrate transporter (ORY80567.1) with 14.3 kDa the smallest functionally known predicted secretome of Neocallimastix. The average pI and molecular weight of Neocallimastix secretome were 6.06 and 53.6 kDa, respectively. The pI 9.99 of essential protein for acyl-CoA-dependent ceramide synthesis, LAG1-domain containing protein (ORY33646.1) was the highest and the pI 3.88 of glycoside hydrolase/deacetylase (ORY79242.1) was the lowest among functionally known predicted membrane proteome of Neocallimastix. The protein related to the retrieval of early ER protein, Rer1 (ORT22199.1) with 15.7 kDa was the smallest and P-loop containing nucleoside triphosphate hydrolase protein (ORY43884.1) was the largest among functionally known predicted membrane proteomes of Neocallimastix. The average pI and molecular weight of Neocallimastix membrane proteome were 6.96 and 66.3 kDa, respectively.

Fig. 4.

Virtual 2D gel of Neocallimastix proteome.

Blue, predicted secretome; red, predicted membrane proteome; black, remaining proteins.

Virtual 2D gel of Neocallimastix proteome.

Blue, predicted secretome; red, predicted membrane proteome; black, remaining proteins. Among 126 Orpinomyces proteomes, 29 protein sequences were predicted as membrane proteome and 97 protein sequences were predicted as remaining proteins (Fig. 5, Supplementary Table S3). The molecular weight of reported Orpinomyces proteome was ranged from 133.1 kDa (partial sequence of hypothetical protein from Orpinomyces sp. OUS1, CAI29548.1) to 10.1 kDa (partial sequence of the hypothetical protein from Orpinomyces sp. OUS1, CAI29549.1) with average 38.7 kDa. In addition, the pI of reported Orpinomyces proteome was ranged from 12.35 (partial sequence of the hypothetical protein from Orpinomyces sp OUS1, CAI29565.1) to 3.23 (partial sequence of the hypothetical protein from Orpinomyces sp OUS1, CAI29558.1) with an average of 6.47. The GH 95 from Orpinomyces joyonii (AWI66988.1) with 117.2 kDa was the largest and Cyclophilin B precursor from Orpinomyces joyonii (ABC47329.1) with 22.0 kDa was the smallest Orpinomyces membrane protein. In addition, pI value of reported Orpinomyces membrane proteins was ranged from 9.29 (GH114 from Orpinomyces joyonii, AWI66991.1) to 4.08 (partial sequence of putative 5-azacytidine resistance protein from Orpinomyces sp. OUS1, CAI11365.1).

Fig. 5.

Virtual 2D gel of Orpinomyces proteome.

Blue, predicted secretome; black, remaining proteins.

Virtual 2D gel of Orpinomyces proteome.

Blue, predicted secretome; black, remaining proteins. Among 18,822 Piromyces proteomes, 2,270 and 4,863 protein sequences were predicted as secretome and membrane proteome, respectively, and 11,689 protein sequences were predicted as remaining proteins (Fig. 6, Supplementary Table S4). The molecular weight of Piromyces proteome was ranged from 200.0 kDa (Hypothetical protein from Piromyces sp. E2, OUM64425.1) to 12.9 kDa (hypothetical protein from Piromyces sp. E2, OUM60062.1) with an average of 56.9 kDa and the pIs were ranged from 11.00 (hypothetical protein from Piromyces finnis, ORX60159.1) to 2.92 (hypothetical protein from Piromyces sp. E2, OUM67877.1) with average of 6.57. The pI 10.06 of DUF6-domain-containing protein (ORX43281.1) was the highest and the pI 2.95 of non-catalytic module family DOC2 (OUM64499.1) was the lowest among functionally known predicted secretomes of Piromyces. The enzyme endoglucanase 5A with 192.9 kDa was the largest and putative terbinafine resistance locus protein (ORX49356.1) with 13.1 kDa was the smallest among functionally known predicted secretomes of Piromyces. The average pI and molecular weight of Piromyces secretome were 6.03 and 49.9 kDa, respectively. The glycosyltranferase family 2 protein (OUM61397.1) with 193.3 kDa was the largest and putative terbinafine resistance locus protein (ORX49356.1) with 13.1 kDa was the smallest among functionally known predicted membrane proteomes of Piromyces.

Fig. 6.

Virtual 2D gel of Piromyces proteome.

Blue, predicted secretome; red, predicted membrane proteome; black, remaining proteins.

Virtual 2D gel of Piromyces proteome.

Blue, predicted secretome; red, predicted membrane proteome; black, remaining proteins.

CARBOHYDRATE DEGRADING PROTEINS

The information about the proteins related to carbohydrate degradation including glycoside hydrolase (GH), glycosyl transferase (GT), polysaccharide lyase (PL), carbohydrate esterase (CE) and carbohydrate binding module (CBM) as an associated module is publically available on the CAZY DB (www.cazy.org). The GH family proteins hydrolyze the glycosidic bonds between sugar or bond between sugar and non-sugar moiety and 165 GH families have been registered to CAZY database. From anaerobic fungi, 36 GH family were registered and 5 different GH clans including (α/α)6, (β/α)8, 5-fold β-propeller and β-jelly roll were reported. The hexose hydrolyzing enzymes including α-glucosidase (EC 3.2.1.20; GH31), β-glucosidase (EC 3.2.1.21, GH1 and GH3) which hydrolyze non-reducing glucosyl residue terminal were detected from Caecomyces, Neocallimastix, Orpinomyces, Pecoramyces and Piromyces. The enzyme endoglucanases (EC 3.2.1.4; GH5, GH6, GH9, GH45, and GH 48) which hydrolyze glucosidic linkage of cellulose, lichenin and glucans were detected from Anaeromyces, Neocallimastix, Orpinomyces and Piromyces. The enzyme α-amylases (EC 3.2.1.1; GH13 and GH57) which hydrolyze α-D-glycosidic linkages of polysaccharide were detected from Orpinomyces, Pecoramyces and Piromyces. The enzyme β-mannase (EC 3.2.1.78; GH26) is a representative enzyme of GH 26 family proteins and the glucanase (EC 3.2.1.39) is the representative enzyme of GH 17 and GH 64 family proteins. The enzyme α-galactosidase (EC 3.2.1.22) and β-galactosidase (EC 3.2.1.23) were the representative enzymes of GH 36 and GH 35 family proteins, respectively. The enzymes involved in the hydrolyses of pentose sugar including endo-1,4-β-xylanase (EC 3.2.1.8, GH10 and GH11), β-xylosidase (EC 3.2.1.37, GH43 and GH 120), α-L-fucosidase (EC 3.2.1.51, GH 95 and GH 141), α-L-rhamnosidase (EC 3.2.1.40, GH 28 and GH 78), β-L-arabinofuranosidase (EC 3.2.1.185, GH 127) were detected from anaerobic fungi. In addition, chitinase (EC 3.2.1.14; GH18 and GH19) and chitosanse (EC 3.2.1.132; GH8) were also detected from anaerobic fungi. The glycosyl transferases transfer sugar during glycosidic bond synthesis and 5 glycosyl transferase were registered and 3 polysaccharide lyase (PL), pectate lyase (EC 4.2.2.2, PL1), rhamnogalacturona endolyase (EC4.2.2.23, PL4) and rhamnogalacturona exolyase (EC 4.2.2.24, PL11) were detected from anaerobic fungi. The carbohydrate esterases (CE) hydrolyze esters into an acid and an alcohol and 8 CE families were detected from anaerobic fungi. The enzyme acetyl xylan esterase (EC 3.1.1.72), feruloyl esterase (EC3.1.1.73), cinnamoyl esterase (EC 3.1.1-) and carboxylesterase belong to CE 1 family, the existence of only acetyl xylan esterase and feruloyl esterase were reported. The acetyl xylan esterase from Neocallimastix frontalis PMA02 cleaves ester bond between acetyl side group and xylan or xylo-oligosaccharides [57] and feruloyl esterase from Anaeromyces mucronatus cleaves ester bond between ferulate and polysaccharide [58]. The representative enzyme of CE 8 is pectin methylesterase (EC 3.1.1.11) and the partial sequence of CE 8 from Orpinomyces joyonii D3B (AWI67007.1) could be methylesterase. Likewise, partial sequence of CE 13 from Neocallimastix cameroonii G3 (AWI67012.1) and CE 15 from Piromyces sp could be pectin aceylesterase (EC 3.1.1.-) and 4-o-methyl-glucuronoyl methylesterase (EC3.1.1.-), respectively. The protein 3D structure of CE 1, CE3, CE6, and CE13 family were (α/β/α)-sandwich type and those of CE 4, CE 8 family were (β/α)7-barrel and (β)-helix type, respectively. The carbohydrate esterases do not directly hydrolyze glucosidic linkage of polysaccharides, however, glucose hydrolysis is promoted with the removal of non-sugar side groups by CE [57].

RESEARCHES ON ANAEROBIC FUNGI

The research articles about anaerobic fungi were searched on PubMed (PM) and PubMed Central (PMC) DB using individual genus name as keywords and the numbers of hits were 1,138 in PM and 1,344 in PMC during the period from 1980 to 2019 (Table 3). Before 1980, 5 journal articles were detected in PM and one was detected in PMC. After filtration by deleting duplicated one, the numbers were reduced to 444 in PM and 719 in PMC and the articles overlapped both PM and PMC were 132 (Supplementary Table S5). Most of the results acquired from PM were directly related to anaerobic fungi, however, some results from PMC were not directly related to keyword or unrelevant to it. According to PM results, Neocallimastix had been the most popular topic in the anaerobic fungal research until 1990 because it was the earliest genus of anaerobic fungi. In addition, the numbers of research articles about Neocallimastix were the highest both in PM and PMC database and those about Orpinomyces and Piromyces were the next. The researches on Piromyces were continuously reported last four decades. Even with their early discovery, researches on Anaeromyces, and Caecomyces were actively reported during the 2000s. The research on newly discovered anaerobic fungi including Buwchfawromyces, Feramyces, Liebetanzomyces, Oontomyces, and Pecoramyces were reported during 2010s.

Table 3.

The published research papers about anaerobic fungi during 4 decades in PubMed and PMC DB

Fungi	DB	Year
Fungi	DB	1980s	1990s	2000s	2010s	Total
Anaeromyces	PubMed	1	18	95	63	177
Anaeromyces	PMC	0	2	27	69	98
Buwchfawromyces	PubMed	0	0	0	2	2
Buwchfawromyces	PMC	0	0	0	11	11
Caecomyces	PubMed	1	25	96	63	185
Caecomyces	PMC	5	6	26	65	102
Cyllamyces	PubMed		16	96	58	170
Cyllamyces	PMC	0	2	25	54	81
Feramyces	PubMed				1	1
Feramyces	PMC	0	0	0	4	4
Liebetanzomyces	PubMed				1	1
Liebetanzomyces	PMC	0	0	0	2	2
Neocallimastix	PubMed	40	90	52	50	232
Neocallimastix	PMC	36	68	84	263	451
Oontomyces	PubMed				3	3
Oontomyces	PMC	0	0	0	9	9
Orpinomyces	PubMed		35	102	81	218
Orpinomyces	PMC	0	14	48	182	244
Pecoramyces	PubMed				4	4
Pecoramyces	PMC	0	0	0	12	12
Piromyces	PubMed	3	39	51	52	145
Piromyces	PMC	5	22	80	223	330
Fungi	PubMed	3		1		4
Fungi	PMC	10	1	2	5	18
Order	PubMed		65	79	186	330
Order	PMC	0	2	24	37	63
Phylum	PubMed		16	92	94	202
Phylum	PMC	0	2	21	180	203
Total	PubMed	48	304	664	658	1,674
	PMC	56	119	337	1,116	1,628
	Total	104	423	1,001	1,774	3,302

PMC, PubMed Central DB; DB, database.

PMC, PubMed Central DB; DB, database. The AF does not possess mitochondria but hydrogenosome for ATP production [59] and the hydrogenosome is surrounded by the double membrane [60]. The hydrogen can be produced through enzyme cascade reactions in hydrogenosome [61] and the biochemical characteristics of hydrogenosomal enzymes including pyruvate format-lyase [62], hydrogenase [63] and malic enzyme [64] were reported. The first report on the physicochemical mechanism of AF in fiber degradation [3] suggested the importance of AF during fiber digestion in the rumen and it triggered the researches on the possible use of AF themselves or their enzymes as feed additives to increase the fiber digestibility in the rumen. The glycosidase activities in the culture supernatant of Neocallimastix frontalis (N. frontalis) using filter paper or avicel as substrate were compared [65] and the substrate conditions were expanded to natural fiber including wheat straw [6], Italian ryegrass [66] and maize stem [67]. The β-glucosidase [68] and xylanases [69] from N. frontalis were purified and characterized through the biochemical procedure using chromatograms and cellulases (celA, celB, and celC) from Neocallimastix patriciarum (N. patriciarum) were produced through molecular procedure using cDNA cloning and consecutive heterologous expression in Escherichia coli [70]. Later, xylanase and mannanase from Piromyces [51], cellulase and xylanases from Orpinomyces [71], xylanases from Anaeromyces [72] and cellulase from Pecoramyces [73] were reported. Unlike other eukaryotic genes, no introns were detected from most of endoglucanases [74,75] originated from AF and the horizontal transfer of bacterial genes to AF could be the reason for the intronless endoglucanases in AF [75,76]. The horizontal gene transfer in AF is crucial for the survival of AF in the rumen [77] and it could be beneficial to researchers for the production of AF protein. The use of cDNA library constructed with vector system was considered a promising tool for the production of enzymes from AF and has been used for three decades. The development NGS system with bioinformatics enormously increased the amount of protein sequence information with a single run and assembled 27,560 transcripts were acquired with 70.2 Gb reading with Illumina HiSeq 2,000 platform [78]. The Illumina systems cover their short read length (up to 300 bp) with a huge amount of read (up to 6,000 Gb for NovaSeq), however, there might be some possibilities for misleading due to the characteristics of anaerobic fungal proteins. Recently, Pacific Biosciences developed PacBio system which armed with long read length (15–20 kb), however solid reports have not been published yet. Natural celluloses were classified as cellulose I, II, III, and IV based upon its crystalline allomorphs [79] and the possible effects of cellulose crystallinity index on cellulose degradability was proposed [80]. The adsorption and activity of cellulolytic enzymes from AF were affected by the cellulose microcrystallinity [81] and the protein, cellulose binding module (CBM) in cellulases could be responsible for ligand binding action [82]. After molecular and biochemical characterization of CBM29 from Piromyces equi [83], 11 CBM including CBM1, 6, 10, 13, 18, 22, 26, 29, 35, 52, and 66 from Anaeromyces, Caecomyces, Neocallimastix, Orpinomyces, Pecoramyces and Piromyces are available in CAZY database (www.cazy.org). Anaerobic fungi improved their survivability under various substrate conditions with a wide variety of carbohydrate degrading enzymes which consisted of combinations of GH and CBM domains and this could be the reason why AFs maintain a wide range of enzyme systems for fiber digestion. The use of enzyme cocktail has been used in the glycosylation process of fibrous biomass for biofuel production and the similar concept was proposed using enzyme cocktail form AF [71]. The research on designing chimeric enzymes using AF genes in efforts to make multipurpose with thermostability was reported [84].

SUMMARY

Anaerobic fungi produce a wide variety of powerful carbohydrate hydrolyzing enzymes that can be used for animal production, biofuel production, food production and other purposes. The difficulties in the cultivation of AF under strictly anaerobic conditions caused hesitation in AF researches, however, the social requirement of substitutional energy source to petroleum and the development of molecular techniques including NGS stimulated researches on AF. During the last decades, development in culture medium led the discovery of new 12 genera of AF, however, there might be possibilities for more hidden AF in untouched area. In addition, large amounts of protein sequences have been produced during the last two decades, however, the information was skewed to three genera including Piromyces, Neocallimastix and Anaeromyces. The researches on AF should be expanded not only to newly described genera but the target itself. With a massive amount of sequence data, it might be difficult to scrutinize each protein, and researchers normally screen target sequences mechanically. However, sometimes AFs bear unexpected gift which researchers can easily miss. Supplementary Tables

55 in total

Review 1. A higher-level phylogenetic classification of the Fungi.

Authors: David S Hibbett; Manfred Binder; Joseph F Bischoff; Meredith Blackwell; Paul F Cannon; Ove E Eriksson; Sabine Huhndorf; Timothy James; Paul M Kirk; Robert Lücking; H Thorsten Lumbsch; François Lutzoni; P Brandon Matheny; David J McLaughlin; Martha J Powell; Scott Redhead; Conrad L Schoch; Joseph W Spatafora; Joost A Stalpers; Rytas Vilgalys; M Catherine Aime; André Aptroot; Robert Bauer; Dominik Begerow; Gerald L Benny; Lisa A Castlebury; Pedro W Crous; Yu-Cheng Dai; Walter Gams; David M Geiser; Gareth W Griffith; Cécile Gueidan; David L Hawksworth; Geir Hestmark; Kentaro Hosaka; Richard A Humber; Kevin D Hyde; Joseph E Ironside; Urmas Kõljalg; Cletus P Kurtzman; Karl-Henrik Larsson; Robert Lichtwardt; Joyce Longcore; Jolanta Miadlikowska; Andrew Miller; Jean-Marc Moncalvo; Sharon Mozley-Standridge; Franz Oberwinkler; Erast Parmasto; Valérie Reeb; Jack D Rogers; Claude Roux; Leif Ryvarden; José Paulo Sampaio; Arthur Schüssler; Junta Sugiyama; R Greg Thorn; Leif Tibell; Wendy A Untereiner; Christopher Walker; Zheng Wang; Alex Weir; Michael Weiss; Merlin M White; Katarina Winka; Yi-Jian Yao; Ning Zhang
Journal: Mycol Res Date: 2007-03-13

2. Anaeromyces contortus, sp. nov., a new anaerobic gut fungal species (Neocallimastigomycota) isolated from the feces of cow and goat.

Authors: Radwa A Hanafy; Britny Johnson; Mostafa S Elshahed; Noha H Youssef
Journal: Mycologia Date: 2018-06-26 Impact factor: 2.696

3. Studies on the rumen flagellate Neocallimastix frontalis.

Authors: C G Orpin
Journal: J Gen Microbiol Date: 1975-12

4. Anaeromyces mucronatus nov. gen., nov. sp. A new strictly anaerobic rumen fungus with polycentric thallus.

Authors: A Breton; A Bernalier; M Dusser; G Fonty; B Gaillard-Martinie; J Guillot
Journal: FEMS Microbiol Lett Date: 1990-07 Impact factor: 2.742

5. The conserved noncatalytic 40-residue sequence in cellulases and hemicellulases from anaerobic fungi functions as a protein docking domain.

Authors: C Fanutti; T Ponyi; G W Black; G P Hazlewood; H J Gilbert
Journal: J Biol Chem Date: 1995-12-08 Impact factor: 5.157

6. Isolation and characterization of a ferulic acid esterase (Fae1A) from the rumen fungus Anaeromyces mucronatus.

Authors: M Qi; P Wang; L B Selinger; L J Yanke; R J Forster; T A McAllister
Journal: J Appl Microbiol Date: 2011-03-18 Impact factor: 3.772

7. Noncatalytic docking domains of cellulosomes of anaerobic fungi.

Authors: P J Steenbakkers; X L Li; E A Ximenes; J G Arts; H Chen; L G Ljungdahl; H J Op Den Camp
Journal: J Bacteriol Date: 2001-09 Impact factor: 3.490

8. D1/D2 domain of large-subunit ribosomal DNA for differentiation of Orpinomyces spp.

Authors: Sumit S Dagar; Sanjay Kumar; Priti Mudgil; Rameshwar Singh; Anil K Puniya
Journal: Appl Environ Microbiol Date: 2011-07-22 Impact factor: 4.792

9. Degradation of maize stem by two rumen fungal species, Piromyces communis and Caecomyces communis, in pure cultures or in association with cellulolytic bacteria.

Authors: V Roger; E Grenet; J Jamot; A Bernalier; G Fonty; P Gouet
Journal: Reprod Nutr Dev Date: 1992

10. Internal transcribed spacer 1 secondary structure analysis reveals a common core throughout the anaerobic fungi (Neocallimastigomycota).

Authors: Christian Koetschan; Sandra Kittelmann; Jingli Lu; Djamila Al-Halbouni; Graeme N Jarvis; Tobias Müller; Matthias Wolf; Peter H Janssen
Journal: PLoS One Date: 2014-03-24 Impact factor: 3.240

2 in total

1. Identification of the Potential Role of the Rumen Microbiome in Milk Protein and Fat Synthesis in Dairy Cows Using Metagenomic Sequencing.

Authors: Xin Wu; Shuai Huang; Jinfeng Huang; Peng Peng; Yanan Liu; Bo Han; Dongxiao Sun
Journal: Animals (Basel) Date: 2021-04-26 Impact factor: 2.752

2. Roughage biodegradation by natural co-cultures of rumen fungi and methanogens from Qinghai yaks.

Authors: Yaqin Wei; Hui Yang; Zhiye Wang; Jiang Zhao; Hongshan Qi; Chuan Wang; Jingrong Zhang; Tao Yang
Journal: AMB Express Date: 2022-09-19 Impact factor: 4.126

2 in total