| Literature DB >> 35042802 |
Mara K N Lawniczak1, Richard Durbin1,2, Paul Flicek1,3, Kerstin Lindblad-Toh4,5, Xiaofeng Wei6, John M Archibald7, William J Baker8, Katherine Belov9, Mark L Blaxter1, Tomas Marques Bonet10,11,12,13, Anna K Childers14, Jonathan A Coddington15, Keith A Crandall16, Andrew J Crawford17, Robert P Davey18, Federica Di Palma19, Qi Fang20, Wilfried Haerty21, Neil Hall19,21, Katharina J Hoff22, Kerstin Howe1, Erich D Jarvis23,24, Warren E Johnson25,26, Rebecca N Johnson15, Paul J Kersey27, Xin Liu6, Jose Victor Lopez28, Eugene W Myers29, Olga Vinnere Pettersson30, Adam M Phillippy31, Monica F Poelchau32, Kim D Pruitt33, Arang Rhie31, Juan Carlos Castilla-Rubio34, Sunil Kumar Sahu6,20, Nicholas A Salmon1, Pamela S Soltis35, David Swarbreck21, Françoise Thibaud-Nissen33, Sibo Wang6, Jill L Wegrzyn36,37, Guojie Zhang38,39,40,41, He Zhang42, Harris A Lewin43,44, Stephen Richards45.
Abstract
A global international initiative, such as the Earth BioGenome Project (EBP), requires both agreement and coordination on standards to ensure that the collective effort generates rapid progress toward its goals. To this end, the EBP initiated five technical standards committees comprising volunteer members from the global genomics scientific community: Sample Collection and Processing, Sequencing and Assembly, Annotation, Analysis, and IT and Informatics. The current versions of the resulting standards documents are available on the EBP website, with the recognition that opportunities, technologies, and challenges may improve or change in the future, requiring flexibility for the EBP to meet its goals. Here, we describe some highlights from the proposed standards, and areas where additional challenges will need to be met.Entities:
Keywords: Earth BioGenome Project; ethics; genome assembly; genomics
Mesh:
Year: 2022 PMID: 35042802 PMCID: PMC8795494 DOI: 10.1073/pnas.2115639118
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
Fig. 1.An example of the documentation that should occur as a sample is being processed. The SPECIMEN_ID (the Natural History Museum, London [NHMUK] barcode under the fly) is photographed alongside the specimen and the barcoded tubes into which different samples of that specimen will be placed. The metadata tracking sheet would have three entries for this fly, where collection-related information would be identical, but tissue type and tissue size would vary (e.g., head, thorax, and abdomen, each in a separate tube). Image credit: M.K.N.L.
Fig. 2.Recommended EBP INSDC assembly BioProject submission structure. Alternatives on this theme could include maternal assembly and paternal assembly at the DataBioProject levels, when complete genome assemblies are generated for both haplotypes of a diploid sexually reproducing species. “Your Project” represents a project sequencing multiple genomes, such as the VGP or DToL.
Fig. 3.Recommended future EBP data service portal. Researchers would easily apply for the hardware resources that are shared by the EBP affiliated project networks.