Literature DB >> 15626330

Azolla--a model organism for plant genomic studies.

Yin-Long Qiu1, Jun Yu.   

Abstract

The aquatic ferns of the genus Azolla are nitrogen-fixing plants that have great potentials in agricultural production and environmental conservation. Azolla in many aspects is qualified to serve as a model organism for genomic studies because of its importance in agriculture, its unique position in plant evolution, its symbiotic relationship with the N2-fixing cyanobacterium, Anabaena azollae, and its moderate-sized genome. The goals of this genome project are not only to understand the biology of the Azolla genome to promote its applications in biological research and agriculture practice but also to gain critical insights about evolution of plant genomes. Together with the strategic and technical improvement as well as cost reduction of DNA sequencing, the deciphering of their genetic code is imminent.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 15626330      PMCID: PMC5172247          DOI: 10.1016/s1672-0229(03)01004-0

Source DB:  PubMed          Journal:  Genomics Proteomics Bioinformatics        ISSN: 1672-0229            Impact factor:   7.691


Introduction

Azolla, a genus of aquatic ferns (Fig. 1), consists of seven species that all form symbiotic relationships with N2-fixing cyanobacterium Anabaena azollae (Fig. 1, 1.), and is used as a biofertilizer in agriculture worldwide (2., 3.). Its most common utilization is the co-cultivation with rice, as water-filled rice paddies provide a perfect habitat for the water fern to propagate. The Azolla-Anabaena symbiosis is the only other major natural N2-fixing process besides the legumes-rhizobia symbiosis that is being utilized in large-scale for agricultural purposes. By a rough estimate, it supplies 40-60 kg nitrogen per hectare to the crop field (). Given the acreage of rice cultivated worldwide, the potential to exploit this low cost, self-renewable, and environmentally sustainable N2-fixing symbiosis remains enormous.
Fig. 1

The growing Azolla plant, an aquatic water fern (left, scale bar = 1 cm), and Anabaena azollae, the filamentous cyanobacteria from cavities within the leaves of Azolla (right, magnified filaments, scale bar = 10 μm).

Plant genomics has been advancing at a breathtaking pace over the past few years. With the sequencing of Arabidopsis () and rice genomes (5., 6., 7., 8.) completed and Lotus japonicus (), Medicago truncatula (http://medicago.toulouse.inra.fr/), and poplar (http://bahama.jgi-psf.org/prod/bin/populus/home.populus.cgi) genomes now underway, we will soon be able to learn more about the organization, diversity, and evolution of the nuclear genome of flowering plants. The primary goal of sequencing these genomes is to identify genes that encode economically important traits and cellular processes in crops and forest trees so that we can further improve them to meet the challenge of providing sufficient and high quality food, fiber, medicine, and shelter to the world’s growing population. A comparative approach has proven effective and efficient in genome sequence annotation and gene identification or prediction aside from libraries of experimental evidence, which are known to be both laborious and time-consuming. Moreover, to gain a full understanding of these genes and their functions as well as the genomic environment within which these genes operate, evolutionary analyses complement and enhance the power of experimental assays. From this perspective, it will be of tremendous value to have information from non-flowering plant genomes to realize the full potential of the sequenced plant genomes. The green alga Chlamydomonas reinhardtii genome has just been sequenced (http://www.biology.duke.edu/chlamy_genome/) can to some extent fill this information gap, but its distant relationship to flowering plants naturally undermines its utility. Hence, Azolla, being a fern and lying at the mid-point between algae and flowering plants in the phylogeny, will be an excellent choice for a future genome sequencing project to gather information for understanding the working and evolution of the nuclear genome in green plants.

Biology and Utilization of Azolla

Azolla is a water fern that floats on the surface of fresh water ponds, rivers, and flooded fields in the tropics and subtropics of both the New and Old Worlds. Its size is typically 1-2.5 cm in diameter, but some species can achieve the size of 15 cm or more (1., 2., 10.). It belongs to a derived group of ferns Azollaceae and is closely related to several other aquatic taxa such as Salvinia and Marsilea (). Its fossil history extends back to at least the Upper Cretaceous (1., 12.). It is one of the few pteridophyte lineages that have evolved heterospory (with male and female spores in different sizes), a condition that precedes evolution of the seed. Traditionally, seven species are recognized and they are divided into three series, with A. filiculoides, A. rubra, A. caroliniana, A. microphylla, and A. mexicana in section Azolla (found in tropical areas of the Americas, New Zealand, and Australia), A. pinnata in section Rhizosperma (occurring in Africa, Australia, and Asia), and A. nilotica in section Tetrasporocarpia (distributed only in Africa) (). A molecular phylogenetic study has now more or less confirmed this taxonomic scheme (), with the only change that A. nilotica and A. pinnata form a monophyletic instead of a paraphyletic group as in the morphological cladistic analysis (). The cyanobacteria are housed in a cavity of dorsal lobe of the leaf. All of the strains isolated from these seven Azolla species belong to Anabaena azollae. According to two molecular phylogenetic studies, the bacterial phylogeny is in parallel to that of their host plants (14., 15.). Hence, a symbiont-host coevolution has obviously occurred since establishment of the symbiotic relationship in the common ancestor of Azolla. (Salvinia, the sister group of Azolla, does not harbor any of the cyanobacterial symbiont.) Azolla has both sexual and asexual reproduction, but the latter is more common. During sexual reproduction, each (female) megaspore contains a small colony of Anabaena filaments bearing akinetes (spores of the cyanobacterium), and so the bacterium is passed down to the next generation of the water fern. Its asexual reproduction is achieved simply by multiplication of fragmented fronds (fern leaves). Under ideal growth conditions, the water fern grows and reproduces (asexually) extremely fast, often able to double its weight in 2-5 d (), making it a popular green manure species in tropical and subtropical agriculture. The Azolla-Anabaena symbiosis has been exploited by man for agricultural production for many centuries. Its earliest written record appeared in the ancient Chinese book Er-Ya, which was published about 2000 years ago. Another Chinese book, Qi-Min-Yao-Shu, on agricultural techniques published by Sixue Jia in 540 A.D., mentioned Azolla with respect to applied plant cultivation (). Today, Azolla is used extensively in China, India, Vietnam, Thailand, the Philippines, Indonesia, Italy, Senegal, West Africa, Brazil, and other tropical countries (2., 3.). Its primary use is cocultivation with rice to provide natural manure. The most common practice is that before or right after rice planting, Azolla, grown from a nursery or a pond, is dispersed into the field. In the first few weeks, the thin canopy of rice plants provides a partial shade and plenty of sunlight required for the water fern to grow. Application of some phosphorus and a small amount of nitrogen fertilizers during this period stimulates growth of both Azolla and rice. Under such condition, Azolla is able to double its mass between 2-5 d. Later in the rice-growing season, over-shading by the rice canopy and draining of water from the field cause Azolla to die. Subsequently, the nitrogen fixed by Anabaena in Azolla as well as the phosphorous, potassium, and other nutrients assimilated by the water fern are released into the environment and used by rice plants for grain formation as well as by the crop planted in the next season. Besides nitrogen, Azolla also fixes a substantial amount of carbon through photosynthesis so that organic matters in a steady amount in each rice-growing season are incorporated into the soil to constantly improve the soil composition. Therefore, the use of Azolla has a long-term positive effect on agricultural production. This long-term benefit of preserving and maintaining the soil fertility and sustainability of the agricultural ecosystem indeed becomes the single most important advantage that Azolla has over chemical fertilizers. Other than cocultivation with rice, Azolla has been used as green manure also for other crops, including wheat, taro, banana, and vegetables. For any crop that requires a waterlogged growth condition, Azolla can simply be used through intercropping. In most other cases, Azolla grown in fields as a monocrop is to be harvested and ploughed into the soil like any fertilizer before crop planting. These applications have been used in China, India, Southeast Asia, and tropical Africa (). Azolla can also be used as a food supplement for domesticated animals such as pigs, cows, ducks, chickens, and fish (). In fact, the rice-Azolla-fish and rice-Azolla-duck agricultural ecosystems have been practiced in certain parts of China for a long time (). Finally, Azolla has been used increasingly for bioremediation. The water fern is found to be able to absorb and break down antibiotics and pesticides used for insect and pathogen control in agricultural fields (18., 19.). It is also used to clean up heavy metal pollution in water (20., 21.). Thus, there are great potentials for the Azolla-Anabaena symbiosis to be exploited both as a biofertilizer and a bioremediation agent.

Why Azolla as a Model Organism?

The extensive exploitation of Azolla as a biofertilizer for rice cultivation as well as for other purposes such as animal feed and bioremediation provides a strong incentive to study many aspects of its biology. Several reasons justify selection of Azolla as a model organism for functional, comparative, and evolutionary genomic studies of plants. First, it is an economically and environmentally important plant. Model organisms are usually chosen based on either species with convenient biological traits for laboratory studies, such as short life cycle, small dimension, and feasibility for genetic manipulation, or species with economic significance, such as major species of domesticated animals and plants. For genomic studies, because it is relatively expensive to sequence the entire genome of an organism, economically important species are usually given a priority for consideration, e.g., rice, poplar, and Medicago truncatula. Azolla as a major biofertilizer that can provide an environmentally sustainable and long term self-renewable nitrogen source to agricultural production, despite being only moderately well-studied thus far (2., 3.), deserves the level of research investment that has been accorded to other economically important plants like cereals and legumes because of the ultimate enormous payback. An estimate based on the numbers from the year 2000 shows that 88 million metric tons (Tg) of nitrogen fertilizers are needed for agricultural production in the world every year, and 45-50 Tg actually comes from symbiotic N2 fixation. By 2030-2040, 120 Tg nitrogen fertilizers will then be needed in agriculture to produce food to feed the world’s population of 8-10 billions (22., 23.). If the contribution of symbiotic N2 fixation does not increase proportionally, addition of another 32 Tg chemical nitrogen fertilizers each year to the arable soil alone would have an unthinkable and probably uncontrollable impact on the earth’s aquatic, terrestrial, and atmospheric ecosystems. Thus, it is imperative that we employ all means to increase the use of symbiotic N2 fixation systems in order to reduce the use of inorganic nitrogen fertilizers. Among all natural N2 fixation symbioses, Azolla-Anabaena system is the only major one that is applicable to aquatic ecosystems (most legumes-rhizobia systems are terrestrial), especially in rice cultivation in developing countries where much of the future population growth is believed to take place. Hence, its improvement through intensive studies, ranging from acquisition of basic genomic information (such as the genome sequences and gene maps of the relevant organisms) to field applications of our understanding on its basic biological processes, will contribute enormously to our well-being in the future in terms of both food production and environmental protection. Second, its status as a fern provides another good reason for Azolla to be chosen as a model organism for plant genomic studies. Phylogenetic position is being increasingly used to evaluate whether a species makes a good model because the knowledge gained from model organisms will eventually be applied to other species to gain a full understanding of the life on the earth, and a limited number of species that represent the entire spectrum of life’s diversity are needed to achieve this goal. This is especially true in the era of genomics as demonstrated by the recent genomic studies from their comparative analyses (5., 24., 25.). At present, most plants that have been or are being sequenced are flowering plants, with the sole exception of Chlamydomonas reinhardtii (Table 1), but the latter is a unicellular green alga. Hence, a multicellular non-flowering vascular plant like Azolla has extremely good properties to fill the vast gap between green algae and angiosperms to understand evolution of the nuclear genome in plants (). The fact that Azolla is the only economically important non-seed plant provides a doubly strong argument for it to be the next target for genome sequencing.
Table 1

The Genome Size of Selected Green Plants*

PlantGenome Size in MbGenome Size in pg (1C)Reference
green algae
Chlamydomonas reinhardtii100 Mbsee text
Caulerpa mexicana100 MbMandoli
Mesostigma viride100 MbMandoli
Coleochaete orbicularis94 MbMandoli
Chara aspera7200 MbMandoli
bryophytes
Marchantia polymorpha300 MbMandoli
Andreaea rupestris203 Mb0.21Kew
Physcomitrella patens475 Mb(53)
Tortula ruralis377 Mb0.39Kew
Anthoceros sp.382 MbMandoli
pteridophytes
Lycopodium clavatum931 MbMandoli
Selaginella kraussiana58 Mb0.06Kew
Equisetum hyemale11368 Mb11.75Kew
Angiopteris evecta388 MbMandoli
Azolla sp.716 Mb0.74Arumuganathan/Qiu/Mandoli unpubl.
Marsilea quadrifolia426 Mb0.44Kew
Ceratopteris richardii4000 MbMandoli
gymnosperms
Cycas revoluta12336 Mb12.75Kew
Ginkgo biloba9627 Mb9.95Kew
Gnetum ula2177 Mb2.25Kew
Welwitschia mirabilis6966 Mb7.20Kew
Ephedra tweediana8611 Mb8.90Kew
Pinus caribaea5564 Mb5.75Kew
Abies balsamea12722 Mb13.15Kew
Podocarpus acutifolius7934 Mb8.20Kew
Taxus baccata10691 Mb11.05Kew
basal angiosperms
Amborella trichopoda900 MbMandoli
Nuphar advento3234 MbMandoli
Nymphaea caerula532 Mb0.55Kew
Illicium anisatum3241 Mb3.35Kew
Liriodendron tulipifera790 MbMandoli
Cinnamomum camphora580 Mb0.60Kew
Aristolochia fimbriata435 Mb0.45Kew
monocots
Acorus gramineus400 MbMandoli
Pistia stratiotes319 Mb0.33Kew
Dioscorea togoensis474 Mb0.48Kew
Asparagus officinmalis1306 Mb1.35Kew
Ananas bracteatus435 Mb0.45Kew
Oryza sativa400 Mb(5)
Zea mays2641 Mb2.73Kew
Triticum aestivum16767 Mb17.33Kew
basal eudicots
Papaver nudicaule1693 Mb1.75Kew
Amaranthus hypochondriacus464 Mb0.48Kew
Spinacia oleracea997 Mb1.03Kew
Beta vulgaris1209 Mb1.25Kew
rosids
Arabidopsis thaliana125 Mb(4)
Citrus aurantium368 Mb0.38Kew
Gossypium klotzschianum1161 Mb1.20Kew
Carya illinoensis803 Mb0.83Kew
Morus alba822 Mb0.85Kew
Cucurbita moschata416 Mb0.43Kew
Rosa wichuraiana126 Mb0.13Kew
Pyrus communis532 Mb0.55Kew
Prunus persica271 Mb0.28Kew
Malus communis2177 Mb2.25Kew
Manihot esculenta803 Mb0.83Kew
Populus balsamifera550 Mbsee text
Lotus japonicus472 Mb(54)
Medicago truncatula464 Mb0.48Kew
Glycine max1093 Mb1.13Kew
asterids
Helianthus annuus2351 Mb2.43Kew
Daucus carota967 Mb1.00Kew
Ipomoea triloba726 Mb0.75Kew
Solanum tuberosum851 Mb0.88Kew

The species are selected based on their economic or biological importance, arranged according to recent phylogenetic studies (55., 56., 57., 58., 59.). Most of the genome size information was obtained from the websites at Royal Botanic Gardens, Kew (http://www.rbgkew.org.uk/cval/homepage.html) and Dina Mandoli’s lab at University of Washington, Seattle (http://faculty.washington.edu/mandoli/). For the taxa listed at the Kew site, we converted the genome size measurement in picograms (pg) of DNA to million base pairs (Mb). The species in bold (or their congeners) have been or are being sequenced, or are being characterized for some aspects of their genomes. See the corresponding websites below besides those listed in the text:

Moss genomics project - http://www.plant-biotech.net/pb/pb.html

Cycas genomics project - http://www.nybg.org/pr/PPCycads3.htm

Pine genomics projects - http://pinetree.ccgb.umn.edu/, http://cc.oulu.fi/~genetwww/plants/pinegen.htm

Potato functional genomics project - http://www.tigr.org/tdb/potato/

Third, Azolla has a moderate-sized genome of 720 Mb. It is more practical to sequence its genome than the 4,000-Mb-sized one of Ceratopteris richardi (Table 1), the only other moderately well studied fern. Given the current sequencing technology, in which a large proportion of the sequencing cost is attributed to the sequencing reagents, the genome size is a critical factor in determining whether a species is a good choice. Another strong argument in favor of sequencing a smaller genome is that the larger the genome is, the higher fraction it contains of repetitive sequences that evoke many technical challenges for completing the project. Fourth, the symbiotic relationship of Azolla with N2-fixing Anabaena is another aspect of the water fern that is likely to generate a wealth of information from genomic studies that may enhance our understanding of the well-studied legumes-rhizobia symbiosis through comparative analyses. N2-fixing cyanobacteria-plant symbiosis evolved only a few times in the 500 million years of land plant evolution: in the liverworts Blasia and Cavivularia, the moss Pleurozium schreberi, the hornworts Anthoceros, Dendroceros, Notothylas, and Phaeoceros, the fern Azolla, all cycad genera (gymnosperms), and the angiosperm Gunnera (27., 28., 29.). Among these, only the Azolla-Anabaena symbiosis is likely to be intensively studied because of its wide agricultural utilization. The fact that this symbiotic relationship involves a non-flowering plant and a cyanobacterium is going to provide a new angle for understanding many aspects of the rhizobia-legumes symbiosis as well as plant-microbe interaction in general. The symbiont-host recognition, suppression of plant defense reactions, nutrient transport between the two partners, and genetic adaptation and coevolution of both the bacterium and the plant are all important topics to be addressed scientifically. The information gathered on the bacterium-plant symbiosis will also help us to understand another agriculturally and environmentally important plant-fungus symbiotic system— mycorrhizae, as the two systems seem to have involved the same set of plant genes (). Fifth, Azolla is one of the few heterosporous pteridophyte lineages, the others being Isoetes-Selaginella, Marsileaceae, and Salviniaceae (). Heterospory is an intermediate condition between homospory and seed-pollen in the evolution of reproductive dispersal units in land plants. The seed is an extremely important plant structure that provides nutrition for human and animals. As of yet, there have been only a few studies initiated to investigate evolution of the seed from a genomic perspective (32., 33.). Because none of the heterosporous pteridophyte taxa except Azolla has any economic application, they are unlikely to be subjected to intensive studies like most model organisms. Azolla would be the only plant, if its genome is sequenced, that can provide a comparative perspective for understanding evolution of heterospory and seed. Finally, because Azolla has been used in agricultural production and environmental pollution control for quite some time, a relatively large body of literature is already in existence, particularly on its physiology and ecology (2., 3., 16.). Its small size, fast growth, and easy culturing should make it fairly easy to grow in laboratories. The International Rice Research Institute in the Philippines (http://www.irri.org/hotl.htm) also has an extensive germplasm collection of all seven species. All these factors should make it easy to popularize Azolla as a model organism.

The Goals

What should be the goals for extensive investigations of Azolla, especially the large-scale genomic studies? First, a number of constraints that currently limit the use of Azolla as a biofertilizer can be relaxed or removed, so that it can be used on a larger scale in a wide variety of environmental conditions. The wild species of Azolla grow optimally at the temperature of 20-30 °C. In the summer of tropics the water fern cannot propagate fast enough to keep in pace with the growth of rice plants. Likewise, in temperate regions application of Azolla is hampered due to the low temperature. Another problem often encountered in Azolla application is its phosphor (P) requirement, as the water fern can derive nitrogen from the cyanobacterial symbiont. A fairly minimal work on mutagenesis, selection, and breeding of Azolla and their symbionts has resulted in significant improvement of both these traits. Mutant strains of Azolla are able to grow well up to a temperature of 40 °C and requiring as low as 50% P (). The third area where improvement can be made to increase the use of Azolla in rice cultivation is its propagation. The current practice is that Azolla is raised in a nursery field or a pond, where the water fern mostly reproduces by asexual reproduction, and is then transported to rice fields (). There is a fair amount of labor involved in harvesting, transporting, and dispersing the water fern in this practice. If sexual reproduction can be used to propagate Azolla, only a few handfuls of spores need to be thrown into the field and the resulting labor cost can be greatly reduced (). Yet, this method is still not available for large-scale agricultural production. For these three major bottleneck factors, a completely sequenced Azolla genome and a large number of ensuing studies can surely improve our understanding of physiology (particularly temperature sensitivity and phosphor requirement) and reproductive biology of the water fern, and thus increase our ability to genetically engineer strains that can grow in different environmental conditions and be propagated by spores. Hence, it can be predicted that Azolla will be used more commonly in agricultural production and bioremediation. The kind of extensive and intensive studies as usually applied to model organisms on Azolla is also likely to greatly enhance our understanding on the symbiotic relationship of the water fern with N2-fixing cyanobacteria. Given the paucity of N2-fixing plant-bacterium symbioses that have ever evolved in land plants (see above for cyanobacteria; ref. for rhizobia and Frankia), it is safe to say that it will still be some time before any artificially engineered plant-bacterium N2-fixing symbiotic system comes into existence. Therefore, it will be more efficient to study, improve, and make use of the naturally occurring systems like those of Azolla-Anabaena and legumes-rhizobia in order to solve the nitrogen supply problem for agricultural production. The fact that no monocot has any symbiotic association with N2-fixing bacteria further highlights the potential difficulty to genetically engineer a cereal-bacterium N2-fixing system. Thus, Azolla-Anabaena symbiosis will likely remain as the only choice for supplying the fields with nitrogen in a self-renewable and environmentally sustainable way for rice and rice-wheat growing countries. It is of a pragmatic necessity, not just an intellectual curiosity, to understand and improve this system. Initiating an Azolla genome-sequencing project will itself provide an essential amount of basic genomic information about this organism, opening a way for its future research. More importantly, the project will make the Azolla-Anabaena system more attractive to many experimental biologists for intensive studies of both the plant and bacterium, which have not been as well studied as the legumes and rhizobia. Historically, designation of an organism as a model naturally leads to intensive characterization of every aspect of the organism. The recent examples are Lotus japonicus () and Medicago truncatula (http://medicago.toulouse.inra.fr/). An increased understanding of symbiotic relationships between plants and bacteria or fungi, as can be obtained from studies of Azolla-Anabaena, legumes-rhizobia, and plant-mycorrhizae, will provide the much-needed knowledge to help formulate environmentally sustainable policies and practices in agricultural production. The last major goal of making Azolla a model organism for genomic studies is to gather information from the species that represent a major and intermediate step in plant evolution so that comparative analyses can be conducted to shed light on many aspects of the nuclear genome evolution in green plants. One such aspect concerns genome size. The plant nuclear genome seems to have increased steadily in size at every major step of phyletic evolution, e.g., from green algae to land plants, from bryophytes to vascular plants, and from pteridophytes to seed plants (Table 1). This increase may have been achieved through polyploidization (), as a gradually lengthened sporophyte (diploid) generation in the life cycle provides an opportunity for manifesting the evolutionary advantage of multiple copies of the same or similar genes. Along this major trend, occasional dramatic genome size expansion (e.g., Chara, Table 1) or reduction (e.g., Selaginella, Table 1) is seen in some lineages, possibly due to further polyploidization and/or multiplication or elimination of particular classes of transposons and other repetitive sequence elements. Such phenomena have been seen in the sequenced genomes of human () and fission yeast (). The 720-Mb Azolla genome is smaller than most pteridophyte genomes (), but significantly larger than those of Arabidopsis () and rice (5., 6.). Comparisons between the two angiosperm genomes have yielded several interesting insights into possible mechanisms of the genome size variation in plants. First, plant genomes are organized in such a way that the transposon-derived repetitive sequences are scattered between gene-islands where the plant genes are clustered; thus, the increase of genome size due to transposon expansion is largely intergenic (5., 39., 40.). Second, the rice genome has twice as many genes as predicted for the Arabidopsis genome, indicating that an ancient genome duplication event had happened after the split of monocot and eudicot plants. Surprisingly, the extra set of genes in the rice genome lacks homologs in any other known genomes sequenced so far but is definitely transcribed; some genes are expressed at high levels in rice tissues (5., 41.). Third, the average size of the rice genes is larger than that of the Arabidopsis, attributable mainly to the gradual intron size increase (not due to transposon insertions) over the evolutionary time scale. Our preliminary analysis has shown that the difference is extendable between the genes of monocot and eudicot plants in general, regardless of the actual genome sizes. Finally, a GC-content gradient increase starting from the 5’ end of gene transcripts is found in most of the rice or monocot genes (5., 41.). Such a maneuver in DNA composition suggests that evolutionary forces, such as mutation and selection at the DNA sequence level, are working on the genes constantly. Taken together, Azolla should have fewer genes than the two sequenced angiosperms, as it has not evolved many features of the latter two, e.g., seed, pollen, flower, and many secondary compounds. It may have lineage-specific transposons that are propagating or deteriorating to affect the genome size. However, we will not know the answer with certainty until the Azolla genome is fully sequenced. Another aspect regarding genome evolution involves gene content. A few studies that have examined evolution of multigene families in land plants show that the copy number increases steadily from charophytes to angiosperms, e.g., actin genes (), MADS box genes (), and phytochrome genes (). It is not known whether the gene copy number increases via individual gene duplication, chromosome segmental duplication, or polyploidization. If the last two mechanisms are responsible for evolution of the multigene families, the next question is to what extent the synteny has been maintained. Comparisons among three flowering plants, Arabidopsis, tomato, and soybean show that there are major syntenic blocks conserved on the chromosomes after duplication (44., 45.). It is tempting to ask how far back during land plant evolution this kind of large-scale syntenic relationships was maintained, and what kind of evolutionary forces were behind the maintenance. The final aspect concerning genome evolution is the role of “junk” DNA, i.e., transposons and introns. It has been known for a long time that most eukaryotic genomes, especially those of high plants and animals, are packed with highly and moderately repetitive DNAs from DNA denaturation and renaturation (Cot curve plotting) studies. Now whole genome sequence analyses have confirmed this aspect of genome organization. At least one quarter of the rice genome is of recognizable transposons in origin (), whereas for a large genome like that of human, transposons account for about half of the genome (). What are the evolutionary and functional roles of this massive amount of transposon sequences in the genome? It has been suggested that they are involved in origins and functions of centromeres (46., 47., 48.), and telomeres (49., 50.). Further, they are responsible for chromosome rearrangements (). Hence, these seemingly useless DNAs, as judged by geneticists in the traditional sense of coding capacity for phenotype, are actually fundamental forces in maintaining faithful inheritance of the information in the genome from generation to generation, and at the same time breaking up syntenic relationships among loci after replicated chromosomes are partitioned into daughter species and generating genetic diversity via recombination and independent assortment. In other words, transposons are a “twin engine” that powers the evolution of life ()! Similarly, introns have been dubbed molecular parasites, but now we know that they play an extremely important role in eukaryotes to generate protein diversity by making alternative splicing possible in animal genomes, such as the human genome (). It has been suggested that plants may not use much of the cellular alternative splicing machinery to the same extent as animals (), but this conclusion is drawn from comparisons of two small, perhaps atypical plant genomes (Arabidopsis and rice, see Table1) with the large human genome. These are just a few clues seen from a very small sample of diversity of life that has been subjected to whole genome sequencing and analysis. It would be ideal to see if the validity and generality of the conclusions drawn from these studies can be extended to all life when a larger diversity of representative organisms is investigated. Thus, a plant like Azolla makes an ideal candidate for future genome sequencing projects to help us understand the evolution of the nuclear genome in plants. In a broader perspective, as we turn over more rocks (sequenced genomes), we will certainly see more surprises (new insights into genome biology and evolution)!

Technical Planning and Cost Estimate

The Azolla genome most likely will be sequenced through a whole genome shotgun approach due to the genome size and the readiness of sequence-assembly software tools. It can be carried out in two phases, a genome survey sequencing phase and a gene-map construction phase. In the first phase of the project, sequencing reads (or sequencing traces, usually about 500 bp in average length) of a low coverage (0.1 to 2X coverage of the genome, depending on its repetitive sequence content) should be acquired and analyzed to provide independent evaluation of the genome constituents, such as the type of repeats and the fraction of gene-coding sequences. An adequate amount of expressed sequence tags (ESTs) or cDNAs (say, representing over 10,000 unique genes) from different tissues or developmental stages (to provide enough sample diversity) of the plant should also be sequenced. In order to localize genes on chromosomes, some large-insert clone libraries, such as those of bacterial artificial chromosomes (BACs) and cosmids, should be constructed concurrently for subsequent physical mapping. In the second phase of the project, significant sequence coverage should be achieved, usually in a range of 4-7X of the genome equivalents. For the Azolla genome, IX coverage is equal to 1.4 million sequencing reads. A draft sequence assembly should be obtained at this point together with some clone-end sequences from large-insert clone libraries to provide a framework in the sequence-assembling process. Genes now can be identified based on gene-prediction software and verified by the acquired EST sequences. Nearly 95% of the genes in the genome can be identified from a working draft sequence. A vigorous completion strategy has to be applied for the final 5% genes, adding significant cost to the grand total. It is still debatable to what extent a genome project is defined as “finished” or “complete” since the plant genomes desirable for sequencing are mostly too large to be even considered as tangible targets. Based on current academic institutional costs (including both direct and indirect costs) and the cost reduction trend over the past 5 years in large-scale sequencing efforts, the total cost of the project should be around 6 to 10 million US dollars for a start-date in the current fiscal year. The phase one goals of the project may only cost about 2 million US dollars to fulfill. Therefore, the genome project of this magnitude in scientific and agriculture significance is very much a policy decision for the funding agencies rather than a scientific one. Scientists who are eager to use the basic genomic information yield from the Azolla Genome Project should not shy about expressing their thoughts about the prospective scientific and practical yield of this endeavor.
  37 in total

1.  Horsetails and ferns are a monophyletic group and the closest living relatives to seed plants.

Authors:  K M Pryer; H Schneider; A R Smith; R Cranfill; P G Wolf; J S Hunt; S D Sipes
Journal:  Nature       Date:  2001-02-01       Impact factor: 49.962

Review 2.  Missing links: the genetic architecture of flowers [correction of flower] and floral diversification.

Authors:  Douglas E Soltis; Pamela S Soltis; Victor A Albert; David G Oppenheimer; Claude W dePamphilis; Hong Ma; Michael W Frohlich; Günter Theissen
Journal:  Trends Plant Sci       Date:  2002-01       Impact factor: 18.313

3.  Three retrotransposon families in the genome of Giardia lamblia: two telomeric, one dead.

Authors:  I R Arkhipova; H G Morrison
Journal:  Proc Natl Acad Sci U S A       Date:  2001-12-04       Impact factor: 11.205

Review 4.  Symbiotic nitrogen fixation and phosphorus acquisition. Plant nutrition in a world of declining renewable resources.

Authors:  C P Vance
Journal:  Plant Physiol       Date:  2001-10       Impact factor: 8.340

Review 5.  Chromosome rearrangements and transposable elements.

Authors:  Wolf-Ekkehard Lonnig; Heinz Saedler
Journal:  Annu Rev Genet       Date:  2002-06-11       Impact factor: 16.830

Review 6.  The origin and evolution of model organisms.

Authors:  S Blair Hedges
Journal:  Nat Rev Genet       Date:  2002-11       Impact factor: 53.242

7.  The significance of responses of the genome to challenge.

Authors:  B McClintock
Journal:  Science       Date:  1984-11-16       Impact factor: 47.728

8.  Genetic definition and sequence analysis of Arabidopsis centromeres.

Authors:  G P Copenhaver; K Nickel; T Kuromori; M I Benito; S Kaul; X Lin; M Bevan; G Murphy; B Harris; L D Parnell; W R McCombie; R A Martienssen; M Marra; D Preuss
Journal:  Science       Date:  1999-12-24       Impact factor: 47.728

9.  A draft sequence of the rice genome (Oryza sativa L. ssp. japonica).

Authors:  Stephen A Goff; Darrell Ricke; Tien-Hung Lan; Gernot Presting; Ronglin Wang; Molly Dunn; Jane Glazebrook; Allen Sessions; Paul Oeller; Hemant Varma; David Hadley; Don Hutchison; Chris Martin; Fumiaki Katagiri; B Markus Lange; Todd Moughamer; Yu Xia; Paul Budworth; Jingping Zhong; Trini Miguel; Uta Paszkowski; Shiping Zhang; Michelle Colbert; Wei-lin Sun; Lili Chen; Bret Cooper; Sylvia Park; Todd Charles Wood; Long Mao; Peter Quail; Rod Wing; Ralph Dean; Yeisoo Yu; Andrey Zharkikh; Richard Shen; Sudhir Sahasrabudhe; Alun Thomas; Rob Cannings; Alexander Gutin; Dmitry Pruss; Julia Reid; Sean Tavtigian; Jeff Mitchell; Glenn Eldredge; Terri Scholl; Rose Mary Miller; Satish Bhatnagar; Nils Adey; Todd Rubano; Nadeem Tusneem; Rosann Robinson; Jane Feldhaus; Teresita Macalma; Arnold Oliphant; Steven Briggs
Journal:  Science       Date:  2002-04-05       Impact factor: 47.728

10.  The gain of three mitochondrial introns identifies liverworts as the earliest land plants.

Authors:  Y L Qiu; Y Cho; J C Cox; J D Palmer
Journal:  Nature       Date:  1998-08-13       Impact factor: 49.962

View more
  3 in total

Review 1.  Field Guide to Plant Model Systems.

Authors:  Caren Chang; John L Bowman; Elliot M Meyerowitz
Journal:  Cell       Date:  2016-10-06       Impact factor: 41.582

Review 2.  Between two fern genomes.

Authors:  Emily B Sessa; Jo Ann Banks; Michael S Barker; Joshua P Der; Aaron M Duffy; Sean W Graham; Mitsuyasu Hasebe; Jane Langdale; Fay-Wei Li; D Blaine Marchant; Kathleen M Pryer; Carl J Rothfels; Stanley J Roux; Mari L Salmi; Erin M Sigel; Douglas E Soltis; Pamela S Soltis; Dennis W Stevenson; Paul G Wolf
Journal:  Gigascience       Date:  2014-09-25       Impact factor: 6.524

Review 3.  Malaria vaccines: high-throughput tools for antigens discovery with potential for their development.

Authors:  Nora Céspedes; Andrés Vallejo; Myriam Arévalo-Herrera; Sócrates Herrera
Journal:  Colomb Med (Cali)       Date:  2013-06-30
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.