10KP: A phylodiverse genome sequencing plan.

Shifeng Cheng^1,2, Michael Melkonian^3, Stephen A Smith^4, Samuel Brockington^5, John M Archibald^6, Pierre-Marc Delaux^7, Fay-Wei Li^8, Barbara Melkonian^3, Evgeny V Mavrodiev^9, Wenjing Sun^1,2, Yuan Fu^1,2, Huanming Yang^1,10, Douglas E Soltis^9,11, Sean W Graham^12, Pamela S Soltis^9,11, Xin Liu^1,2, Xun Xu^1,2, Gane Ka-Shu Wong^1,13,14.

Abstract

Understanding plant evolution and diversity in a phylogenomic context is an enormous challenge due, in part, to limited availability of genome-scale data across phylodiverse species. The 10KP (10,000 Plants) Genome Sequencing Project will sequence and characterize representative genomes from every major clade of embryophytes, green algae, and protists (excluding fungi) within the next 5 years. By implementing and continuously improving leading-edge sequencing technologies and bioinformatics tools, 10KP will catalogue the genome content of plant and protist diversity and make these data freely available as an enduring foundation for future scientific discoveries and applications. 10KP is structured as an international consortium, open to the global community, including botanical gardens, plant research institutes, universities, and private industry. Our immediate goal is to establish a policy framework for this endeavor, the principles of which are outlined here.

Year: 2018 PMID: 29618049 PMCID: PMC5869286 DOI: 10.1093/gigascience/giy013

Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524

Introduction

Based on the success of the 1KP (1000 Plants) Initiative [1,2], an international multidisciplinary consortium that sequenced and analyzed transcriptomes from more than 1000 species of green plants representing most of the known diversity within Viridiplantae, we are now aiming to sequence complete genomes from more than 10,000 plants and protists. 10KP will address fundamental questions in plant evolution and diversity, providing data on more than 10,000 species representing every major clade of embryophytes (land plants), green algae (chlorophytes and streptophytes), and protists (photosynthetic and heterotrophic). For embryophytes, we will sequence nonflowering plants (bryophytes, lycophytes, ferns, and gymnosperms) and flowering plants (angiosperms). In addition to green algae, we will also sequence diverse clades of photosynthetic and heterotrophic protists, representing some of the most enigmatic and unexplored eukaryotic microbes. This project was launched at the XIXth International Botanical Congress (2017, Shenzhen, China) and was covered by a Science News release in July 2017 [3]. Following the Bermuda Principles and the Fort Lauderdale Agreement, this project will make the resulting genomics data freely available. In the spirit of the Toronto Data Release Workshop and Statement [4] recommendation, which encourages large-scale sequencing projects to produce a citable statement for their data and intentions for downstream analyses and publications, we present this marker paper to outline our overall plans and explain how interested parties can get involved. The basic goal of 10KP is to build an annotated reference genome for a member of every genus of the Viridiplantae (land plants and green algae), as well as a phylodiverse set of species representing both photosynthetic and heterotrophic protists. 
These data will provide a wealth of information to address fundamental questions across the plant/eukaryotic tree of life, e.g., enabling studies of phylogeny, origin/acquisition and diversification of specific traits, gene and genome duplication, correlation between genomic and morphological changes, and convergent evolution of important genetic networks. The scope and quantity of data produced by 10KP will allow researchers to develop new techniques that address fundamental questions in evolution and comparative genomics. We will complete this project over the next 5 years (2018–2023), including sample acquisition, sequencing, genome assembly, analyses, and (initial) publications. Major supporters include BGI-Research, the nonprofit division of BGI-Shenzhen, and China National GeneBank (CNGB), an open nonprofit scientific platform that is managed by BGI-Shenzhen. Through 10KP, we hope to foster imaginative and high-quality research that addresses major questions in plant and protist biology while also, indirectly, demonstrating the value of both preserving and investigating biodiversity. Our effort is meant to complement, not replace, research programs supported through other funding agencies worldwide. We are open to collaborations with all interested research groups.

Species Lists and Phylogenetic Diversity

The number of extant species of Viridiplantae and protists is unknown. An estimate for embryophytes, gleaned from various databases (Table 1), is that there are at least 380,000 known species representing approximately 23,562 genera in 667 families; however, Govaerts [5] estimated a higher number. For green algae and photosynthetic protists, approximately 40,000 species have been described, and predictions for species yet to be described range from 25,000 to 100,000 [6]. In the case of heterotrophic eukaryotic microbes, estimates are much less clear; approximately 440,000 species have been described, most of which (approximately 400,000) are fungi [7]. The reality is that much of eukaryotic microbial diversity remains unexplored and the number of genera and species is essentially unknown.

Table 1:

Statistics of the described species of embryophytes (land plants) distributed among major clades

Clade	Family	Genus	Species	Three largest families
Hornworts	5	11	130	Anthocerotaceae (161)	Dendrocerotaceae (25)	Notothyladaceae (16)
Liverworts	87	387	7356	Lejeuneaceae (2270)	Jungermanniaceae (725)	Lepidoziaceae (580)
Mosses	111	874	13,000	Pottiaceae (3223)	Hypnaceae (2520)	Bryaceae (2108)
Lycophytes	3	18	1338	Lycopodiaceae (475)	Selaginellaceae (404)	Isoetaceae (51)
Ferns	48	319	10,578	Dryopteridaceae (1871)	Polypodiaceae (1601)	Pteridaceae (1226)
Gymnosperms	12	88	1104	Pinaceae (255)	Zamiaceae (216)	Cupressaceae (166)
ANA grade	6	20	193	Nymphaeaceae (88)	Schisandraceae (84)	Hydatellaceae (12)
Monocots	78	3505	76,119	Orchidaceae (28,576)	Poaceae (12,397)	Cyperaceae (6311)
Magnoliids	20	450	9528	Lauraceae (3106)	Piperaceae (2770)	Annonaceae (2174)
Asterids	143	9763	135,213	Asteraceae (38,700)	Rubiaceae (14,412)	Lamiaceae (8671)
Rosids	154	6582	101,245	Fabaceae (26,245)	Euphorbiaceae (6904)	Rosaceae (6626)
Basal eudicots	17	464	7535	Ranunculaceae (3119)	Proteaceae (1492)	Papaveraceae (1062)
Others	NA	NA	1579
Embryophyta	667	23,562	381,425

The numbers were combined from the Open Tree of Life and The Plant List (2013) [8], especially for flowering plants; for nonflowering plants, we focused on the “accepted species” because of the difficulties in their classification. Some numbers are inconsistent across databases. The aim of this table is simply to give a rough estimate of the species distribution among the major clades and of the 3 largest families within each clade.
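As a quick plausibility check, the per-clade rows of Table 1 can be summed and compared with the Embryophyta row. The sketch below only transcribes the numbers printed in the table; the variable and function names are illustrative and not part of the 10KP project. The summed columns do not reproduce the stated totals exactly, which is in line with the note that numbers differ across databases.

```python
# (families, genera, species) per clade, transcribed from Table 1.
TABLE1 = {
    "Hornworts": (5, 11, 130),
    "Liverworts": (87, 387, 7_356),
    "Mosses": (111, 874, 13_000),
    "Lycophytes": (3, 18, 1_338),
    "Ferns": (48, 319, 10_578),
    "Gymnosperms": (12, 88, 1_104),
    "ANA grade": (6, 20, 193),
    "Monocots": (78, 3_505, 76_119),
    "Magnoliids": (20, 450, 9_528),
    "Asterids": (143, 9_763, 135_213),
    "Rosids": (154, 6_582, 101_245),
    "Basal eudicots": (17, 464, 7_535),
}
# The "Others" row lists NA for family and genus, so only species are counted.
OTHERS_SPECIES = 1_579
# The "Embryophyta" row as printed in Table 1.
STATED_TOTAL = (667, 23_562, 381_425)


def column_sums(rows):
    """Sum the family, genus, and species columns over all clade rows."""
    families = sum(r[0] for r in rows.values())
    genera = sum(r[1] for r in rows.values())
    species = sum(r[2] for r in rows.values())
    return families, genera, species


fam, gen, spp = column_sums(TABLE1)
spp += OTHERS_SPECIES

for label, summed, stated in zip(
    ("families", "genera", "species"), (fam, gen, spp), STATED_TOTAL
):
    print(f"{label}: summed {summed:,} vs stated {stated:,} "
          f"(difference {stated - summed:+,})")
```

Any difference reported here should be read as a property of the merged source databases, not as an error to be corrected in the table.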


Table 2:

Technologies/platforms for DNA library construction and sequencing

Platform/strategy	Insert size	Assembler	Tissue sample requirement	Minimum DNA quantity	Minimum DNA fragment size	Assembly result
MGISEQ (hierarchical shotgun)	170/350 bp	SOAPdenovo	Net weight of fresh plant tissue ≥4 g	2 μg	>20 kb	contigs
	2–5 kb			20 μg		scaffolds
	10 kb			30 μg
	20 kb			50 μg
	40 kb			60 μg
MGISEQ + 10X		Supernova2	50–75 mg dried or 100–200 mg fresh leaf tissue	2 ng	>50/100 kb	scaffolds
MGISEQ + long fragment reads		To be developed	50–75 mg dried or 100–200 mg fresh leaf tissue	1 ng	>50/100 kb	contigs
MGISEQ + 10X/long fragment reads + Hi-C			500 mg, living tissues/cells		>50/100 kb	chromosome-scale assembly

The MGISEQ platform, a BGI-developed next-generation sequencing technology based on DNA nanoballs and combinatorial probe anchor synthesis, will be combined with evolving methodologies for improving long-range contiguity, including linked-read technologies from 10x Genomics or single-tube long fragment reads from Complete Genomics (a division of BGI). Hi-C libraries may also be used to construct chromosome-level reference genome assemblies for a member of every family or for other critical species selected by the consortium. Generally speaking, sample providers must prepare either (a) >50 μg of DNA with fragments >20 kb or (b) >5 ng of DNA with fragments >50 kb, preferably >100 kb. For (a), we will use the traditional hierarchical shotgun strategy, combining paired-end libraries with a series of mate-pair libraries. For (b), we will use the MGISEQ + 10X strategy. The details of these platforms and the library/sequencing strategies are summarized in Table 2.
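The two submission routes described above can be sketched as a small check on a DNA extraction. This is a minimal illustration under stated assumptions: the function name, the choice of units (ng and kb), and the decision to return every route a sample qualifies for are not taken from the 10KP documentation; only the thresholds for routes (a) and (b) come from the text.

```python
from typing import List

# Thresholds from the text: (a) >50 ug DNA and fragments >20 kb,
# (b) >5 ng DNA and fragments >50 kb (preferably >100 kb).
ROUTE_A_MIN_DNA_NG = 50_000.0  # 50 ug expressed in ng
ROUTE_A_MIN_FRAGMENT_KB = 20.0
ROUTE_B_MIN_DNA_NG = 5.0
ROUTE_B_MIN_FRAGMENT_KB = 50.0


def eligible_strategies(dna_ng: float, fragment_kb: float) -> List[str]:
    """Return every library strategy whose minimum requirements are exceeded.

    Hypothetical helper: the source does not say which route is preferred
    when a sample qualifies for both, so all matches are returned.
    """
    strategies = []
    if dna_ng > ROUTE_A_MIN_DNA_NG and fragment_kb > ROUTE_A_MIN_FRAGMENT_KB:
        strategies.append("hierarchical shotgun")
    if dna_ng > ROUTE_B_MIN_DNA_NG and fragment_kb > ROUTE_B_MIN_FRAGMENT_KB:
        strategies.append("MGISEQ + 10X")
    return strategies


if __name__ == "__main__":
    # A low-yield extraction with long fragments qualifies for route (b) only.
    print(eligible_strategies(dna_ng=10, fragment_kb=80))
    # A high-yield extraction with shorter fragments qualifies for route (a) only.
    print(eligible_strategies(dna_ng=60_000, fragment_kb=30))
```

Returning a list rather than a single choice keeps the sketch neutral on prioritization, which the consortium would decide.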

Fewer than 300 green plants and protists have had their genomes sequenced and published. The vast majority of the tree of life remains unexplored at the level of complete genomes. 10KP will fill this gap. Specifically, we will sequence genomes from at least 8000 seed-plant genera (Figure 1), at least 1000 nonseed plant genera (Figure 2), at least 1000 green algae, and at least 3000 photosynthetic and heterotrophic protists (Figure 3). The total number of species sequenced will exceed 10,000—the moniker 10KP is essentially a milestone toward a larger goal. We anticipate sequencing a proportionately larger number of eudicot genomes, given that this clade comprises nearly 75% of all angiosperm species (Figure 4). Many of the genomes to be sequenced are cornerstones for addressing important and longstanding questions in biology and evolution, while others represent unexplored potential for medicinal compounds and/or the discovery of high-value natural products. Some representative species from diverse clades are shown (Figures 5–7).
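The eudicot share quoted above can be checked against the species counts in Table 1. The sketch assumes that the asterid, rosid, and basal eudicot rows together make up the eudicots, that the ANA grade, monocot, and magnoliid rows make up the remaining angiosperms, and that the "Others" row is left out; these groupings are a reading of the table, not a statement from the authors.

```python
# Species counts for the angiosperm rows of Table 1.
ANGIOSPERM_SPECIES = {
    "ANA grade": 193,
    "Monocots": 76_119,
    "Magnoliids": 9_528,
    "Asterids": 135_213,
    "Rosids": 101_245,
    "Basal eudicots": 7_535,
}
# Assumption: these three rows together represent the eudicot clade.
EUDICOT_ROWS = ("Asterids", "Rosids", "Basal eudicots")


def eudicot_share(species):
    """Fraction of angiosperm species that fall in the eudicot rows."""
    eudicots = sum(species[row] for row in EUDICOT_ROWS)
    return eudicots / sum(species.values())


share = eudicot_share(ANGIOSPERM_SPECIES)
print(f"Eudicot share of angiosperm species: {share:.1%}")
```

Under these assumptions the share comes out at roughly 74%, which agrees with the "nearly 75%" figure in the text.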

Figure 1:

A phylogeny of seed plants (dated phylogeny based on Smith and Brown, in press). Colors correspond to the number of species in the subtending lineage (red = lower to blue = higher). Some larger clades are highlighted around the phylogeny along with the estimated number of genomes to be sequenced by 10KP in bold below the name. Smaller numbers and arrows inside the phylogeny indicate estimates of some of the already available genomes within the identified clade.

Figure 2:

Summary tree of nonseed plants (based on the 1KP Capstone Analysis, in press). This shows the phylogenetic relationships for the 5 major categories of seed-free plants, including bryophytes (hornworts, liverworts, mosses), lycophytes, and ferns.

Figure 3:

Summary tree of eukaryotes. Schematic diagram shows the known or predicted relationships among the major eukaryotic groups, based on multi-gene analyses, featuring diverse eukaryotic microbes (algae and protists) [10]. Lineages with 1 or more photosynthetic/plastid-bearing groups are highlighted with a square. The Archaeplastida are the eukaryotic “supergroup” to which green algae and embryophytes belong. Protist genomes sequenced as part of 10KP will come from diverse lineages but exclude true fungi and animals.

Figure 4:

Distribution of species/genus/family abundance across the major clades of embryophytes. Most species belong to the eudicot clade, for which the largest families include Asteraceae, Orchidaceae, Fabaceae, Rubiaceae, and Poaceae.

Figure 5:

Representative images of species from different clades/families of flowering plants. The species names presented here are: a. Canella winterana (L.) Gaertn. (Angiosperms, Magnoliids, Canellales, Canellaceae). Flowers and inflorescence. Photo credit: Walter Judd. b. Austrobaileya scandens C.T. White (Angiosperms, Austrobaileyales, Austrobaileyaceae). Flower. Photo credit: Walter Judd. c. Ceratophyllum demersum L. (Angiosperms, Ceratophyllales, Ceratophyllaceae). Habit and inflorescence. Photo credit adapted from Christian Fischer (CC BY-SA 3.0, Wikimedia Commons). d. Illicium floridanum J.Ellis (Angiosperms, Austrobaileyales, Schisandraceae). Flower. Photo credit: Walter Judd. e. Piper neesianum C. DC. (Angiosperms, Magnoliids, Piperales, Piperaceae). Habit and inflorescence. Photo credit: Walter Judd. f. Myrothamnus flabellifolius Welw. (Angiosperms, Eudicots, Gunnerales, Myrothamnaceae). Habit and leaves. Photo credit adapted from JMK (CC BY-SA 3.0, Wikimedia Commons). g. Dillenia indica L. (Angiosperms, Eudicots, Dilleniales, Dilleniaceae). Photo credit: Walter Judd.

Figure 6:

Representative images of species from various clades/families of nonflowering plants. The species names presented here are: a. Picea abies (L.) H. Karst. (Pinophyta, Pinales, Pinaceae). Shoots and female cones. Photo credit adapted from Magnus Manske (CC BY-SA 3.0, Wikimedia Commons). b. Ginkgo biloba L. (Ginkgophyta, Ginkgoales, Ginkgoaceae). Leaves and male inflorescence. Photo credit adapted from Sten (CC BY-SA 3.0, Wikimedia Commons). c. Cibotium barometz (L.) J.Sm. (Polypodiopsida, Cyatheales, Cibotiaceae). Fronds (leaves). Photo credit: Pi-Fong Lu. d. Adiantum caudatum Klotzsch (Polypodiopsida, Polypodiales, Pteridaceae). Fronds (leaves) and habit. Photo credit: Pi-Fong Lu. e. Marsilea crenata C.Presl (Polypodiopsida, Salviniales, Marsileaceae). Fronds (leaves) and habit. Photo credit: Pi-Fong Lu. f. Asplenium viride Huds. (Polypodiopsida, Polypodiales, Aspleniaceae). Fronds (leaves) and habit. Photo credit: Pi-Fong Lu. g. Diphasiastrum complanatum (L.) Holub (Lycopodiopsida, Lycopodiales, Lycopodiaceae). Habit. Photo credit: Pi-Fong Lu. h. Bryum capillare Hedwig (Bryopsida, Bryales, Bryaceae). Gametophyte and sporophyte. Photo credit adapted from Lairich Rig (CC BY-SA 2.0, Wikimedia Commons). i. Marchantia polymorpha L. (Marchantiopsida, Marchantiales, Marchantiaceae). Thalli with gemmae (asexual reproductive structures). Photo credit adapted from Holger Casselmann (CC BY-SA 3.0, Wikimedia Commons).

Figure 7:

Light micrographs of diverse protists, including members of different eukaryotic “supergroups.” a. Stylonema (Archaeplastida [Plantae], red alga). b. Cyanoptyche (Archaeplastida [Plantae], glaucophyte). c. Scherffelia (Archaeplastida [Plantae], Viridiplantae, chlorophyte). d. Stephanosphaera (Archaeplastida [Plantae], Viridiplantae, chlorophyte). e. Chaetosphaeridium (Archaeplastida [Plantae], Viridiplantae, streptophyte). f. Mallomonas (stramenopiles, chrysophyte). g. Coscinodiscus (stramenopiles, diatom). h. Synedra (stramenopiles, diatom). i. Sphacelaria (stramenopiles, brown alga). j. Trithigmostoma (alveolates, ciliate). k. Cryptomonas (Cryptista). l. Paramoeba (Amoebozoa). Micrographs courtesy of Gerd Günther (http://www.mikroskopia.de/index.html), Sebastian Hess (Halifax; Scherffelia), and Ivan Fiala (Czech Republic; Paramoeba).

Sequencing Priority and Data Release

For embryophytes, we are expecting a community effort with sample submission and processing online (Figure 8). The species sampling will be coordinated by Douglas E. Soltis and Pamela S. Soltis (flowering plants) and Sean W. Graham (nonflowering plants). For this part, we will prioritize the sequencing in 2 stages:

Figure 8:

Sample submission portal from the CNGB/10KP website. Figure shows the sample submission portal (as well as the underlying database management) on the 10KP website https://db.cngb.org/10kp/; this website (version 1.0) is still evolving. This sample submission portal is prepared specifically for land plants; most samples will come from botanical gardens or botany research centers/laboratories worldwide. A global community effort is crucial to help supply all families and all genera. For stage 1, we anticipate more active involvement of highly motivated and skilled laboratories, whereas for stage 2, we anticipate more of a community effort to supply the majority of the remaining genera. For algae and protists, all samples will come from public collections and be channelled through the Culture Collection of Algae at the University of Cologne, where quality controls will be enforced.

Stage 1. Create family-level high-quality reference genomes, ideally with chromosome-scale assemblies, to facilitate comparative and evolutionary genomics research across the green tree of life.

Stage 2. Increase the sample density to the genus level, while recognizing that many genera are likely not monophyletic. For some of the larger genera, we may sequence 2 or more distantly related species.

Note that we will accept samples for genus-level sequencing even during the first stage of the project, when we are focused on family-level sequencing, but these samples may not be processed immediately. We will, however, conduct appropriate quality controls before freezing the samples for later sequencing.

For green algae (chlorophytes and streptophytes) and photosynthetic protists, all samples will be channelled through public culture collections, specifically the Culture Collection of Algae at the University of Cologne (http://www.ccac.uni-koeln.de/) managed by Michael Melkonian and Barbara Melkonian, to ensure uniform quality control. For heterotrophic protists, project coordination will be led by researchers at the Centre for Comparative Genomics & Evolutionary Bioinformatics at Dalhousie University, Halifax, Canada (J. M. Archibald).

Annotated genome sequences will be released through the CNGB website (http://db.cngb.org/cnsa) and accompanied by regular submissions of peer-reviewed GigaScience “data release” publications that provide independent quality assessment and give credit/authorship to the appropriate sample providers. Data releases will occur on a regular basis (e.g., quarterly) once the data satisfy one of a series of tiered quality assessments (e.g., gene-sized contigs, sufficient for synteny analysis, chromosome-scale assembly). Additional increases in sample density may be coordinated with the larger Earth Biogenome Project [9], which will likely encompass at least another half decade of effort. A brief workflow for the 10KP is described (Figure 9).

Figure 9:

An overview of the 10KP strategy.

Sample Requirements and Sequencing Technologies

It is essential that proper sample documentation be provided. This includes vouchers for taxonomic verification (embryophytes and eukaryotic microbes are dealt with differently), provenance data (detailing the source, origin, and geo-location of the species for sequencing), and prior informed consent for genome sequencing and data release (from appropriate authorities in compliance with the Nagoya Protocol). For green algae and photosynthetic protists, the biological specimen is the strain (with unique numerical identifier) kept in a public repository (culture collection); no strain will be sequenced in 10KP that has not previously been deposited in a public culture collection. For heterotrophic protists, the situation is more complicated, as many are difficult to grow and protist collections often lack the resources to keep such strains in a living state. Good communication within the 10KP consortium and with external partners will be critical.

All of the sequencing will be conducted on BGI’s low-cost, high-accuracy, high-throughput MGISEQ platform, in combination with newly developed linked-read technologies (e.g., 10x Genomics or BGI’s single-tube long fragment reads phasing technology). These approaches require much smaller amounts of DNA (only 2 ng/library) than traditional methods, but they also require high-quality, high-molecular-weight (>50 kb) extractions. The dominant alternative is the traditional “hierarchical shotgun” strategy with a series of mate-pair large-insert libraries. However, this approach requires a much larger amount of DNA (e.g., >100 μg), which is prohibitive for many plant species. We note that while our stated goal is to build high-quality reference genomes, transcriptome data are important for genome annotation; if live tissues are available, we will also sequence transcriptomes.
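The documentation requirements above can be expressed as a simple completeness check on a sample record. This is a sketch only: the `SampleRecord` class, its field names, and the example identifiers are hypothetical and do not describe the actual CNGB/10KP submission portal. It also assumes that, for green algae and photosynthetic protists, the deposited culture-collection strain takes the place of a voucher, following the statement that the strain is the biological specimen.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SampleRecord:
    """Hypothetical record for one submitted sample (field names are illustrative)."""
    species: str
    is_alga_or_photosynthetic_protist: bool = False
    voucher_id: Optional[str] = None           # voucher for taxonomic verification
    strain_id: Optional[str] = None            # strain ID in a public culture collection
    provenance: Optional[str] = None           # source, origin, geo-location
    prior_informed_consent: bool = False       # consent for sequencing and data release


def missing_documentation(sample: SampleRecord) -> List[str]:
    """List the documentation items that are still missing for a sample."""
    missing = []
    if sample.is_alga_or_photosynthetic_protist:
        # Assumption: the deposited strain serves as the reference specimen.
        if not sample.strain_id:
            missing.append("public culture collection strain")
    elif not sample.voucher_id:
        missing.append("voucher")
    if not sample.provenance:
        missing.append("provenance data")
    if not sample.prior_informed_consent:
        missing.append("prior informed consent")
    return missing


if __name__ == "__main__":
    # Identifiers below are made up for illustration.
    plant = SampleRecord("Ginkgo biloba", voucher_id="HERB-0001",
                         provenance="botanical garden accession")
    print(missing_documentation(plant))
```

A record would be considered ready for processing only when the returned list is empty.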

Collaborative Proposals

10KP will also consider collaborative proposals that build on the existing dataset and generate new data using the MGISEQ platform. The aim of these projects should be to go beyond covering the diversity of species and, instead, to address important questions in basic and applied science. Up to 20% of the 10KP sequencing capacity will be devoted to these collaborations. Potential collaborators who wish to launch major subprojects within 10KP should provide a brief (maximum 5 pages) proposal, as exemplified in the supplementary template (proposal template for subprojects included as supplementary file).

Additional file

Supplementary file, proposal template.docx.

Abbreviations

10KP: 10,000 Plants Genome Sequencing Project; CNGB: China National GeneBank.

Competing interests

S.C., W.S., Y.F., H.Y., X.L., and X.X. are employees of BGI-Shenzhen. The authors otherwise declare that they have no competing interests.

5 in total

1. Towards a working list of all known plant species.

Authors: Eimear Nic Lughadha
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2004-04-29 Impact factor: 6.237

2. Diversity, nomenclature, and taxonomy of protists.

Authors: Sina M Adl; Brian S Leander; Alastair G B Simpson; John M Archibald; O Roger Anderson; David Bass; Samuel S Bowser; Guy Brugerolle; Mark A Farmer; Sergey Karpov; Martin Kolisko; Christopher E Lane; Deborah J Lodge; David G Mann; Ralf Meisterfeld; Leonel Mendoza; Øjvind Moestrup; Sharon E Mozley-Standridge; Alexey V Smirnov; Frederick Spiegel
Journal: Syst Biol Date: 2007-08 Impact factor: 15.683

3. Phylotranscriptomic analysis of the origin and early diversification of land plants.

Authors: Norman J Wickett; Siavash Mirarab; Nam Nguyen; Tandy Warnow; Eric Carpenter; Naim Matasci; Saravanaraj Ayyampalayam; Michael S Barker; J Gordon Burleigh; Matthew A Gitzendanner; Brad R Ruhfel; Eric Wafula; Joshua P Der; Sean W Graham; Sarah Mathews; Michael Melkonian; Douglas E Soltis; Pamela S Soltis; Nicholas W Miles; Carl J Rothfels; Lisa Pokorny; A Jonathan Shaw; Lisa DeGironimo; Dennis W Stevenson; Barbara Surek; Juan Carlos Villarreal; Béatrice Roure; Hervé Philippe; Claude W dePamphilis; Tao Chen; Michael K Deyholos; Regina S Baucom; Toni M Kutchan; Megan M Augustin; Jun Wang; Yong Zhang; Zhijian Tian; Zhixiang Yan; Xiaolei Wu; Xiao Sun; Gane Ka-Shu Wong; James Leebens-Mack
Journal: Proc Natl Acad Sci U S A Date: 2014-10-29 Impact factor: 11.205

4. HOW MANY SPECIES OF ALGAE ARE THERE?

Authors: Michael D Guiry
Journal: J Phycol Date: 2012-09-20 Impact factor: 2.923

5. Prepublication data sharing.

Authors: Ewan Birney; Thomas J Hudson; Eric D Green; Chris Gunter; Sean Eddy; Jane Rogers; Jennifer R Harris; S Dusko Ehrlich; Rolf Apweiler; Christopher P Austin; Lisa Berglund; Martin Bobrow; Chas Bountra; Anthony J Brookes; Anne Cambon-Thomsen; Nigel P Carter; Rex L Chisholm; Jorge L Contreras; Robert M Cooke; William L Crosby; Ken Dewar; Richard Durbin; Stephanie O M Dyke; Joseph R Ecker; Khaled El Emam; Lars Feuk; Stacey B Gabriel; John Gallacher; William M Gelbart; Antoni Granell; Francisco Guarner; Tim Hubbard; Scott A Jackson; Jennifer L Jennings; Yann Joly; Steven M Jones; Jane Kaye; Karen L Kennedy; Bartha Maria Knoppers; Nikos C Kyrpides; William W Lowrance; Jingchu Luo; John J MacKay; Luis Martín-Rivera; W Richard McCombie; John D McPherson; Linda Miller; Webb Miller; Don Moerman; Vincent Mooser; Cynthia C Morton; James M Ostell; B F Francis Ouellette; Julian Parkhill; Parminder S Raina; Christopher Rawlings; Steven E Scherer; Stephen W Scherer; Paul N Schofield; Christoph W Sensen; Victoria C Stodden; Michael R Sussman; Toshihiro Tanaka; Janet Thornton; Tatsuhiko Tsunoda; David Valle; Eero I Vuorio; Neil M Walker; Susan Wallace; George Weinstock; William B Whitman; Kim C Worley; Cathy Wu; Jiayan Wu; Jun Yu
Journal: Nature Date: 2009-09-10 Impact factor: 49.962
