| Literature DB >> 29795526 |
Valentino Ruggieri1,2, Konstantinos G Alexiou1,2, Jordi Morata1, Jason Argyris1,2, Marta Pujol1,2, Ryoichi Yano3, Satoko Nonaka3, Hiroshi Ezura3, David Latrasse4, Adnane Boualem4, Moussa Benhamed4, Abdelhafid Bendahmane4, Riccardo Aiese Cigliano5, Walter Sanseverino5, Pere Puigdomènech1, Josep M Casacuberta1, Jordi Garcia-Mas6,7.
Abstract
We report an improved assembly (v3.6.1) of the melon (Cucumis melo L.) genome and a new genome annotation (v4.0). The optical mapping approach allowed correcting the order and the orientation of 21 previous scaffolds and permitted to correctly define the gap-size extension along the 12 pseudomolecules. A new comprehensive annotation was also built in order to update the previous annotation v3.5.1, released more than six years ago. Using an integrative annotation pipeline, based on exhaustive RNA-Seq collections and ad-hoc transposable element annotation, we identified 29,980 protein-coding loci. Compared to the previous version, the v4.0 annotation improved gene models in terms of completeness of gene structure, UTR regions definition, intron-exon junctions and reduction of fragmented genes. More than 8,000 new genes were identified, one third of them being well supported by RNA-Seq data. To make all the new resources easily exploitable and completely available for the scientific community, a redesigned Melonomics genomic platform was released at http://melonomics.net . The resources produced in this work considerably increase the reliability of the melon genome assembly and resolution of the gene models paving the way for further studies in melon and related species.Entities:
Mesh:
Year: 2018 PMID: 29795526 PMCID: PMC5967340 DOI: 10.1038/s41598-018-26416-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison between v3.5.1 and v3.6.1 assemblies in terms of chromosome length and gaps extension (number of Ns).
| Chromosome | v3.5.1 | v3.6.1 | ||
|---|---|---|---|---|
| length (bp) | number of Ns (bp) | length (bp) | number of Ns (bp) | |
| Chr01 | 35,383,099 | 4,224,098 | 37,037,532 | 5,746,973 |
| Chr02 | 26,193,771 | 2,467,907 | 27,064,691 | 3,272,777 |
| Chr03 | 29,387,469 | 3,090,555 | 31,666,927 | 5,286,237 |
| Chr04 | 33,123,230 | 3,484,828 | 34,318,044 | 4,557,078 |
| Chr05 | 28,337,775 | 3,556,980 | 29,324,171 | 4,428,209 |
| Chr06 | 35,939,859 | 4,056,829 | 38,297,372 | 6,313,150 |
| Chr07 | 26,773,857 | 2,703,377 | 28,958,359 | 4,821,595 |
| Chr08 | 32,513,408 | 3,874,538 | 34,765,488 | 5,785,898 |
| Chr09 | 24,107,567 | 2,533,680 | 25,243,276 | 3,572,934 |
| Chr10 | 25,362,315 | 2,705,038 | 26,663,822 | 3,957,926 |
| Chr11 | 31,442,130 | 3,210,864 | 34,457,057 | 5,604,798 |
| Chr12 | 26,400,393 | 2,792,021 | 27,563,660 | 3,864,732 |
|
|
|
|
|
|
Figure 1Examples of OpGen correction. The chromosomes are laid out horizontally and homologous segments between v3.5.1 and v3.6.1 are shown as colored blocks. Each color block stands for a scaffold or consecutive merged scaffolds. Blocks that are shifted downwards represent re-oriented segments in v3.6.1. Blocks that are swiped/translocated in another position represent re-ordered segments. Gap size correction is shown as white gaps inside the blocks.
Figure 2AED analyses of the the Melon v4.0 gene models (orange line) in comparison with the Tomato ITAG3.2 (green line) and Maize 6a (grey line) annotations. Shown on the y axis is the cumulative distribution of AED for each dataset. In the upper part of the graph is shown the ratio of the proteins with a known function versus proteins with an unknown function for each quartile of the AED range.
Figure 3Busco completeness comparison between v3.5.1 and v4.0 melon annotations based on Embriophyta odb9 dataset. Colors in the bar represent the different classes of the Busco assessment results.
Figure 4The JBrowse navigator included in the new designed Melonomics.net web page. The browser integrates gene structures and various kinds of evidences at the genomic, transcriptomic and epigenomic level, including the v4.0 gene models, two TE annotations (MeloV4_Repet and MeloV4_RM_Masking), a cumulative RNA-Seq expression track (Merged RNA-Seq), epigenetic marks derived from methylation (MeDIP) of leaf and root tissues jointly with the corresponding RNA-Seq tracks and a variome set including the re-sequencing of seven melon accessions.