| Literature DB >> 25944057 |
Christophe Plomion1,2, Jean-Marc Aury3, Joëlle Amselem4, Tina Alaeitabar4, Valérie Barbe3, Caroline Belser3, Hélène Bergès5, Catherine Bodénès1,2, Nathalie Boudet6, Christophe Boury1,2, Aurélie Canaguier6, Arnaud Couloux3, Corinne Da Silva3, Sébastien Duplessis7, François Ehrenmann1,2, Barbara Estrada-Mairey3, Stéphanie Fouteau3, Nicolas Francillonne4, Christine Gaspin8, Cécile Guichard6, Christophe Klopp8, Karine Labadie3, Céline Lalanne1,2, Isabelle Le Clainche6, Jean-Charles Leplé9, Grégoire Le Provost1,2, Thibault Leroy1,2, Isabelle Lesur1,2, Francis Martin7, Jonathan Mercier3, Célia Michotey4, Florent Murat10, Franck Salin1,2, Delphine Steinbach4, Patricia Faivre-Rampant6, Patrick Wincker3,11,12, Jérôme Salse10, Hadi Quesneville4, Antoine Kremer1,2.
Abstract
The 1.5 Gbp/2C genome of pedunculate oak (Quercus robur) has been sequenced. A strategy was established for dealing with the challenges imposed by the sequencing of such a large, complex and highly heterozygous genome by a whole-genome shotgun (WGS) approach, without the use of costly and time-consuming methods, such as fosmid or BAC clone-based hierarchical sequencing methods. The sequencing strategy combined short and long reads. Over 49 million reads provided by Roche 454 GS-FLX technology were assembled into contigs and combined with shorter Illumina sequence reads from paired-end and mate-pair libraries of different insert sizes, to build scaffolds. Errors were corrected and gaps filled with Illumina paired-end reads and contaminants detected, resulting in a total of 17,910 scaffolds (>2 kb) corresponding to 1.34 Gb. Fifty per cent of the assembly was accounted for by 1468 scaffolds (N50 of 260 kb). Initial comparison with the phylogenetically related Prunus persica gene model indicated that genes for 84.6% of the proteins present in peach (mean protein coverage of 90.5%) were present in our assembly. The second and third steps in this project are genome annotation and the assignment of scaffolds to the oak genetic linkage map. In accordance with the Bermuda and Fort Lauderdale agreements and the more recent Toronto Statement, the oak genome data have been released into public sequence repositories in advance of publication. In this presubmission paper, the oak genome consortium describes its principal lines of work and future directions for analyses of the nature, function and evolution of the oak genome.Entities:
Keywords: Quercus robur; genome sequence; genomic resources
Mesh:
Year: 2015 PMID: 25944057 DOI: 10.1111/1755-0998.12425
Source DB: PubMed Journal: Mol Ecol Resour ISSN: 1755-098X Impact factor: 7.090