| Literature DB >> 35042805 |
.
Abstract
The goals of the Earth Biogenome Project-to sequence the genomes of all eukaryotic life on earth-are as daunting as they are ambitious. The Darwin Tree of Life Project was founded to demonstrate the credibility of these goals and to deliver at-scale genome sequences of unprecedented quality for a biogeographic region: the archipelago of islands that constitute Britain and Ireland. The Darwin Tree of Life Project is a collaboration between biodiversity organizations (museums, botanical gardens, and biodiversity institutes) and genomics institutes. Together, we have built a workflow that collects specimens from the field, robustly identifies them, performs sequencing, generates high-quality, curated assemblies, and releases these openly for the global community to use to build future science and conservation efforts.Entities:
Keywords: assembly; biodiversity; genome; sequencing
Mesh:
Year: 2022 PMID: 35042805 PMCID: PMC8797607 DOI: 10.1073/pnas.2115642118
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
The eukaryotic biota of Britain and Ireland
| Kingdom | Phyla | Classes | Orders | Families | Genera | Species |
| All UKSI taxa | 63 | 218 | 880 |
| 19,257 | 72,572 |
| Fungi | 7 | 35 | 166 |
| 3,198 | 18,512 |
| Metazoa | 32 | 96 | 405 |
| 13,169 | 41,613 |
| Viridiplantae | 9 | 38 | 156 |
| 2,083 | 8,072 |
| Chromista and other microbial eukaryotes | 15 | 49 | 153 |
| 807 | 4,375 |
| Global taxa | 70 | 274 | 1,295 |
| 175,901 | 1,879,687 |
| DToL proportion | 0.90 | 0.80 | 0.68 | 0.39 | 0.11 | 0.04 |
Data from the Natural History Museum UK Species Inventory (UKSI; https://www.nhm.ac.uk/our-science/data/uk-species.html) and Species2000 Catalogue of Life (https://www.catalogueoflife.org/) (for Global taxa) (1). Despite their apparent precision, these numbers should be regarded as first estimates. DToL initially aims to sequence one representative of each taxonomic family (bold numbers in column "Families").
The partners in the Darwin Tree of Life Project
| Partner | Roles in project | Location | Lead Investigators |
| Natural History Museum | Sample collection (especially terrestrial animals); Archiving; DNA barcoding; Analysis | London, UK | Ian Barnes |
| Royal Botanic Garden Edinburgh | Sample collection (plants and lichens); Archiving; DNA barcoding; Analysis | Edinburgh, UK | Michelle Hart |
| Royal Botanic Gardens Kew | Sample collection (plants and fungi); Archiving; DNA barcoding; Analysis | London, UK | Paul Kersey |
| Marine Biological Association | Sample collection (marine and littoral species); DNA barcoding; Analysis | Plymouth, UK | Nova Mieszkowska |
| University of Oxford | Sample collection (Wytham Woods genomic observatory; protists); Analysis | Oxford, UK | Peter Holland |
| Earlham Institute | Sample collection (protists); Data infrastructure; Sequencing (protists); Analysis | Norwich, UK | Neil Hall |
| Wellcome Sanger Institute | Sample collection; Data infrastructure; Sequencing; Assembly; Analysis | Cambridge, UK | Mark Blaxter |
| University of Edinburgh | Sequencing; Assembly; Analysis | Edinburgh, UK | Alex Twyford |
| University of Cambridge | Assembly; Analysis | Cambridge, UK | Richard Durbin |
| EMBL-EBI | Databasing; Data infrastructure; Analysis | Cambridge, UK | Paul Flicek |
| Connecting Science | Outreach and engagement | Cambridge, UK | Kenneth Skeldon |
Fig. 1.Sequencing eukaryotic life at scale. The DToL partners have developed a cohesive end-to-end process to take organisms from the field through to publication of high-quality assembled and annotated genomes in the public domain. By collating specimen metadata and tracking information through identification, DNA barcoding, extraction, sequencing, assembly, curation, annotation, and submission, the process assures that genomes are published with rich, informative, and accurate descriptors.
Fig. 2.DToL: sequencing across the diversity of eukaryotes. A terabase of genomes: Eukaryotic species sequenced or undergoing sequencing in DToL to October 2021. The outer circle (black) shows estimated genome size and the middle ring (purple) the BUSCO (Eukaryota ortholog set) completeness of preliminary assemblies. The inner ring (red) shows the 200 species for which genomes have already been submitted to the ENA by DToL. The tree shows species relationships from the National Center for Biotechnology Information TaxonomyDB (https://www.ncbi.nlm.nih.gov/taxonomy) obtained using ETE (http://etetoolkit.org/). The figure was generated using IToL (https://itol.embl.de/) and postprocessed in Adobe Illustrator. The tree is available at https://itol.embl.de/shared/mblaxter2.