| Literature DB >> 24699266 |
David Chagné1, Ross N Crowhurst2, Massimo Pindo3, Amali Thrimawithana2, Cecilia Deng2, Hilary Ireland2, Mark Fiers4, Helge Dzierzon1, Alessandro Cestaro3, Paolo Fontana3, Luca Bianco3, Ashley Lu4, Roy Storey2, Mareike Knäbel5, Munazza Saeed6, Sara Montanari7, Yoon Kyeong Kim8, Daniela Nicolini3, Simone Larger3, Erika Stefani3, Andrew C Allan9, Judith Bowen2, Isaac Harvey2, Jason Johnston2, Mickael Malnoy3, Michela Troggio3, Laure Perchepied10, Greg Sawyer1, Claudia Wiedow1, Kyungho Won8, Roberto Viola3, Roger P Hellens2, Lester Brewer11, Vincent G M Bus12, Robert J Schaffer9, Susan E Gardiner1, Riccardo Velasco3.
Abstract
We present a draft assembly of the genome of European pear (Pyrus communis) 'Bartlett'. Our assembly was developed employing second generation sequencing technology (Roche 454), from single-end, 2 kb, and 7 kb insert paired-end reads using Newbler (version 2.7). It contains 142,083 scaffolds greater than 499 bases (maximum scaffold length of 1.2 Mb) and covers a total of 577.3 Mb, representing most of the expected 600 Mb Pyrus genome. A total of 829,823 putative single nucleotide polymorphisms (SNPs) were detected using re-sequencing of 'Louise Bonne de Jersey' and 'Old Home'. A total of 2,279 genetically mapped SNP markers anchor 171 Mb of the assembled genome. Ab initio gene prediction combined with prediction based on homology searching detected 43,419 putative gene models. Of these, 1219 proteins (556 clusters) are unique to European pear compared to 12 other sequenced plant genomes. Analysis of the expansin gene family provided an example of the quality of the gene prediction and an insight into the relationships among one class of cell wall related genes that control fruit softening in both European pear and apple (Malus × domestica). The 'Bartlett' genome assembly v1.0 (http://www.rosaceae.org/species/pyrus/pyrus_communis/genome_v1.0) is an invaluable tool for identifying the genetic control of key horticultural traits in pear and will enable the wide application of marker-assisted and genomic selection that will enhance the speed and efficiency of pear cultivar development.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24699266 PMCID: PMC3974708 DOI: 10.1371/journal.pone.0092644
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Basic statistics on the Pyrus communis ‘Bartlett’ genome sequence.
|
| |
| Number of bases used for assembly | 8,204,442,728 |
| Sequenced reads used for assembly | 23,058,965 |
| Non paired end | 12,979,485 |
| Paired end | 10,079,480 |
| Estimated average coverage | 11.4× |
|
| |
| Number of contigs | 182,196 |
| Total size of contigs (bp) | 507,689,959 |
| N50 contig length (bp) | 6,569 |
| Longest contig (bp) | 127,414 |
| Mean contig size (bp) | 2,787 |
| Median contig size (bp) | 1,188 |
| Number of contigs in scaffolds | 47,404 |
| Number of contigs not in scaffolds | 134,792 |
|
| |
| Number of scaffolds | 142,083 |
| Total size of scaffolds (bp) | 577,335,413 |
| N50 scaffold length (bp) | 88,114 |
| Longest scaffold (bp) | 1,291,680 |
| Shortest scaffold (bp) | 501 |
| Number of scaffolds >1K bp | 69,460 |
| Number of scaffolds >10K bp | 4,916 |
| Number of scaffolds >100K bp | 1,262 |
| Number of scaffolds >1M bp | 4 |
| Number of scaffolds >10M bp | 0 |
| Mean scaffold size (bp) | 4,063 |
| Median scaffold size (bp) | 983 |
| Scaffold %N | 12.06% |
Figures are given in bp.
*: the assumed genome size of pear is 600 Mb.
Anchoring of the Pyrus communis ‘Bartlett’ assembly v1.0 genome sequence.
| LG | Length anchored (bp) | Number of anchored scaffolds (unique) | Number of anchoring markers | Median number of markers per scaffold |
| 1 | 8,550,412 | 46 | 115 | 2.0 |
| 2 | 11,234,491 | 58 | 194 | 3.0 |
| 3 | 12,642,036 | 69 | 163 | 2.0 |
| 4 | 8,044,179 | 40 | 105 | 2.0 |
| 5 | 10,949,710 | 57 | 159 | 2.0 |
| 6 | 8,104,341 | 45 | 117 | 1.0 |
| 7 | 8,833,777 | 53 | 102 | 1.0 |
| 8 | 8,189,737 | 36 | 92 | 2.0 |
| 9 | 10,984,512 | 53 | 145 | 2.0 |
| 10 | 9,331,439 | 54 | 113 | 2.0 |
| 11 | 10,224,161 | 53 | 134 | 2.0 |
| 12 | 8,857,939 | 44 | 122 | 2.0 |
| 13 | 10,282,711 | 38 | 127 | 2.5 |
| 14 | 10,094,382 | 51 | 117 | 2.0 |
| 15 | 17,650,274 | 75 | 222 | 2.0 |
| 16 | 8,177,493 | 44 | 124 | 2.0 |
| 17 | 9,204,799 | 52 | 128 | 2.0 |
| Total | 171,356,393 | 868 | 2,279 | 2.0 |
Gene prediction summary for Pyrus communis and comparison with P. bretschneideri, Fragaria vesca and Malus×domestica.
|
|
|
|
| |
| Predicted genes | 43,419 | 42,812 | 34,809 | 54,921 |
| Average gene length (including introns)(nt) | 3,320 | 2,776 | 2,792 | 2,802 |
| Average CDS length (nt) | 1,209 | 1,172 | 1,160 | 1,155 |
| Exons | 221,804 | 202,169 | 174,376 | 273,226 |
| Average exon length | 237 | 248 | 232 | 273 |
| Single exon genes | 10,909 | 12,310 | 5,915 | 10,378 |
| Introns | 178,385 | 159,357 | 139,567 | 218,353 |
| Introns per gene (multi-exon genes only) | 5.49 | 5.22 | 4.83 | 4.90 |
| Average intron length | 398 | 386 | 409 | 491 |
| Genes per 100 Kb | 7.5 | 8.4 | 14.5 | 7.3 |
Gene predictions were performed using Augustus for European pear and GeneMark-ES for strawberry. The apple gene models were estimated as the total number of gene predictions minus an estimation of duplications generated by contig overlaps. The redundancy was filtered out using similarity among predictions and positional considerations.
Figure 1Phylogenetic tree of six rosids, four malvids, and three asterids constructed with 83 euKaryote Orthologous Genes (KOGs).
Bootstrap values are listed on each branch. Nodes represent speciation events and branch length represents the degree of evolutional changes over time. The unit for the scale bar at the bottom is nucleotide substitutions per site. The high bootstrap values strongly support that the species in Rosaceae cluster together to the exclusion of any other, and that the European pear and Chinese pear separation event happened after apple speciation.
Figure 2Protein-protein comparison between European pear and 12 other species: Chinese pear, apple, grape, strawberry, papaya, sweet orange, clementine, kiwifruit, tomato, potato, poplar and Arabidopsis.
The figure shows every possible combination of species included in this proteome ortholog analysis, using concentric circles. Each ring represents a single plant species and is depicted in a unique colour. For the 13 species shown, there are hence a total of 213–1 combination cases, from 556 ortholog groups found in European pear only, 682 clades in Chinese pear only, to 5393 clusters present in all thirteen species. For each combination, the number of ortholog groups discovered is labelled outside the outermost ring and the number of proteins for a species inside a coloured, circular cell that represents the particular species. As the angular width of the cells for each case is drawn proportional to its number of groups, there is no labelling where the angular width is too small. A complete list of all combination cases with detected ortholog genes is provided in Table S4.
Comparison of de novo predicted repeated elements between the European pear, Chinese pear and apple genomes. na: not available from [18].
| European Pear | Chinese pear | Apple | |||
| Class | Count | Size(bp) | Size (bp) | Count | Size(bp) |
| DNA | 87,258 | 16,034,061 | 19,622,007 | 62,032 | 12,036,031 |
| DNA/CMC-EnSpm | 5,245 | 2,498,065 | 1,994,298 | 3,238 | 1,651,963 |
| DNA/MULE-MuDR | 11,081 | 1,654,134 | 2,812,256 | 8,411 | 1,299,397 |
| DNA/PIF-Harbinger | 33,920 | 10,195,913 | 13,681,892 | 22,675 | 6,976,753 |
| DNA/hAT-Ac | 18,863 | 4,763,497 | 10,510,482 | 15,194 | 5,286,418 |
| DNA/hAT-Tag1 | 8,271 | 1,973,418 | 2,689,092 | 10,877 | 3,050,698 |
| DNA/hAT-Tip1 | 9,603 | 2,394,869 | 3,298,090 | 7,398 | 2,034,152 |
| LINE/L1 | 8,410 | 4,077,094 | 10,075,654 | 5,716 | 3,269,671 |
| LINE/RTE-BovB | 7,697 | 2,200,632 | 4,408,634 | 12,750 | 10,996,014 |
| SINE | 425 | 124,106 | 208,975 | 2,191 | 359,108 |
| SINE/tRNA | 20,796 | 2,863,949 | na | 13,153 | 1,805,785 |
| LTR | 37,405 | 9,249,657 | 38,166,045 | 20,479 | 6,282,994 |
| LTR/Caulimovirus | 2,796 | 2,550,963 | 2,625,049 | 1,227 | 1,544,695 |
| LTR/Copia | 73,727 | 42,805,711 | 86,429,855 | 46,798 | 35,591,207 |
| LTR/Copia-like | 110 | 20,863 | na | 97 | 26,513 |
| LTR/Gypsy | 145,766 | 84,633,023 | 130,449,009 | 94,218 | 68,540,726 |
| Low_complexity | 121,061 | 5,222,319 | na | 84,700 | 3,780,221 |
| RC/Helitron | 6,851 | 2,104,677 | na | 5,036 | 1,448,948 |
| Satellite | 207 | 30,594 | 350,647 | 178 | 40,455 |
| Simple_repeat | 63,595 | 2,428,632 | 1,131,844 | 47,558 | 1,891,325 |
| Unknown | 6,610 | 1,545,468 | 4,296,548 | 4,650 | 1,079,420 |
| rRNA | 351 | 100,288 | na | 158 | 31,199 |
|
|
|
|
| ||
Classification of repeated elements in European pear based on the ‘Rosaceae’ clade from RepBase.
| Copies | Assembled (Mb) | Assembly % | |
|
| 241,316 | 149.345 | 25.87 |
| SINEs | 20,442 | 2.865 | 0.50 |
| LINE/RTE | 6,669 | 1.724 | 0.30 |
| LINE/L1 | 7,851 | 4.300 | 0.74 |
| LTR/Ty1/Copia | 56,724 | 44.240 | 7.66 |
| LTR/Gypsy | 114,127 | 85.382 | 14.79 |
|
| 173,387 | 42.058 | 7.28 |
| hobo-Activator | 36,417 | 10.007 | 1.73 |
| Tc1-IS630-Pogo | 107 | 0.015 | 0 |
| En-Spm | 296 | 0.278 | 0 |
| Tourist/Harbinger | 30,859 | 9.770 | 1.69 |
|
| 11,995 | 3.427 | 0.59 |
|
| 194.830 | 33.75 | |
|
| 20,415 | 2.873 | 0.50 |
|
| 186 | 2.873 | 0.01 |
|
| 2,497 | 0.021 | 0.04 |
|
| 226 | 0.000 | 0.00 |
|
| 197.724 | 34.14 |
Figure 3Phylogenetic and gene expression analysis of the expansin-like genes from apple and European pear.
A) Phylogenetic tree of predicted expansin-like genes from apple and European pear. Predicted expansin-like protein models from apple (MDP prefix) and European pear (PCP prefix) were aligned, and a conserved region of alignment of 313 residues was used to construct the phylogenetic tree Geneious 6.1.6 (Biomatters Ltd, Auckland, NZ). The linkage group (LG) of each model is shown where possible; some models are not anchored (LG-NA) to the genome. Models that represent the best hit for published expansins are labelled additionally as such. DdEXP2 from Dictyostelium discoideum was used as an out-group. Bootstrap proportions for 100 trees were calculated and bootstrap values ≥50 are shown. Scale indicates 0.4 substitutions per site. EXPA, α-expansins; EXPB, β-expansins; EXLA, alpha-like expansins; EXLB, beta-like expansins [50]. mRNA-seq expression levels in ‘Comice’ melting pear (CM), ‘Nijisseki’ (NJ) crisp pear and ‘Royal Gala’ (RG) crisp apple, undergoing fruit ripening in storage show that one clade is strongly associated with fruit ripening (coloured green). The inserted graph shows the expression analysis by qPCR of EXP2 in fruit at harvest and during storage, which corresponds to the mRNA-seq data. Yellow bars: RG, red bars CM, orange bars NJ). RPKM: Reads Per Kilobase per Million mapped reads. Single arrow shows the apple expansin (MdEXPA7) mapped to a quantitative trait locus for fruit texture. B) Alignment of the first 170 bp of apple and pear homologues, demonstrating genome duplication preceded speciation.