Jérôme Grimplet, John Van Hemert, Pablo Carbonell-Bejerano, José Díaz-Riquelme, Julie Dickerson, Anne Fennell, Mario Pezzotti, José M Martínez-Zapater.
Abstract
BACKGROUND: The first draft assembly and gene prediction of the grapevine genome (8X base coverage) was made available to the scientific community in 2007, and functional annotation was developed on this gene prediction. Since then, additional Sanger sequences were added to the 8X sequence pool, and a new version of the genomic sequence with higher base coverage (12X) was produced.
Year: 2012 PMID: 22554261 PMCID: PMC3419625 DOI: 10.1186/1756-0500-5-213
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Number of predicted gene sequences from the 12Xv1 grapevine genome coverage showing cardinality values higher than one when compared with predicted genes in other versions and assemblies
| | | | | | | |
|---|---|---|---|---|---|---|
| Comparison 8X | 623 | 1429 | 428 | 1774 | 147 | 122 |
| Comparison mRNA | 54 | 14 | 7 | | | |
| Comparison 12Xv0 | 2735 | 846 | 5 | | | |
Redundant: multiple genes match the same portion of one gene in the other set. Overlap, Split, Merged, and To split: multiple genes match different portions of one gene in the other set. Overlap and To split: the 12Xv1 genes need to be modified (merged with another gene or split, respectively).
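The distinction drawn in the table note, between several genes matching the same portion of one gene and several genes matching different portions of it, can be illustrated with a minimal Python sketch. This is not the authors' procedure: the interval representation, the function names, and the `same_portion` threshold of 0.5 are assumptions made for illustration. Separating the second group into Overlap, Split, Merged, and To split is not modeled here.

```python
from typing import List, Tuple

# (start, end) of the matched region on the reference gene, inclusive.
# Hypothetical coordinates; the source does not give a data format.
Interval = Tuple[int, int]


def overlap_fraction(a: Interval, b: Interval) -> float:
    """Fraction of the shorter interval that is covered by the intersection."""
    inter = min(a[1], b[1]) - max(a[0], b[0]) + 1
    if inter <= 0:
        return 0.0
    shorter = min(a[1] - a[0], b[1] - b[0]) + 1
    return inter / shorter


def classify_matches(hits: List[Interval], same_portion: float = 0.5) -> str:
    """Label the genes of one set that all match a single gene in the other set.

    hits: region of the reference gene matched by each query gene.
    same_portion: assumed threshold (not from the paper) above which two
    hits are treated as covering the same portion of the reference gene.
    """
    if len(hits) <= 1:
        return "single match"  # cardinality not higher than one
    pairs = [
        (hits[i], hits[j])
        for i in range(len(hits))
        for j in range(i + 1, len(hits))
    ]
    if all(overlap_fraction(a, b) >= same_portion for a, b in pairs):
        return "redundant"  # every pair covers the same portion
    return "different portions"  # candidate Overlap/Split/Merged/To split
```

Under these assumptions, `classify_matches([(1, 100), (10, 95)])` is labeled as redundant, whereas two hits on disjoint regions such as `(1, 100)` and `(150, 300)` fall into the different-portions group.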
Figure 1 Representation of overlap between the different sets of predicted gene sequences (51476) available for grapevine. 8X: genes identified in the 8X coverage genome sequence; DFCI v5: mRNA sequences identified in the DFCI gene index EST sequence repository version 5; v1: genes identified in the 12X coverage genome assembly, version 1 of the gene prediction; V0: genes identified in the 12X coverage genome assembly, version 0 of the gene prediction; VR: predicted genes from the repeat track of the 12X coverage genome sequence, version 1 of the gene prediction; GrapeGen: mRNA sequences identified in the set of mRNA used to construct the GrapeGen Affymetrix microarray; Grey: genes present in the latest update of the protein prediction (12Xv1).
Figure 2 Plots of the relative position of predicted genes between the 12Xv1 and the 8X coverage assemblies of the grapevine genome sequence. The color code representing chromosomes and genes is identical in the four images. Axes represent the percentage of the total length of each chromosome. Labels indicate the chromosome and the corresponding assembly. A) Relative position of genes on the same chromosome number in both assemblies. Colors of the links are identical to the 12X chromosome of origin. B) Relative position of genes in the unknown chromosome in at least one assembly. Small chromosomes marked with “r” represent random chromosomes and were arbitrarily set to 1/10th the regular chromosome size. Unknown chromosomes are magnified 20X relative to regular chromosomes. Black links represent genes belonging to the unknown chromosome in both assemblies. Grey links represent genes belonging to the unknown chromosome in only one assembly. C) Relative position of genes in two different chromosomes in the two assemblies. Small chromosomes marked with “r” represent random chromosomes and were arbitrarily set to 1/10th the regular chromosome size. Colors of the links are identical to the 12X chromosome of origin. D) Relative position of genes on the same chromosome number in both assemblies. To avoid confusion between similar colors, stars represent genes from an odd-numbered chromosome and circles represent genes from an even-numbered chromosome.
Figure 3 Classification of predicted genes in the consensus set of sequences and the sets of orphan sequences based on their sequence matching results. Blue: predicted genes matching sequences that have an assigned molecular function. Red: predicted genes matching sequences that lack assigned molecular function. Green: predicted genes not matching any known sequence from other species. Purple: predicted genes matching genes from other species considered as viral, transposable elements or related sequences. Consensus: genes of the 12Xv1 shared with another set. 12Xv1: genes unique to the 12X coverage genome sequencing version 1 of the gene prediction. 8X: genes unique to the 8X coverage genome sequencing. DFCIv5: mRNA sequences unique to the DFCI gene index v5. GrapeGen: mRNA sequences unique to mRNA sequences used to construct the GrapeGen Affymetrix microarrays. 12Xv0: genes unique to the 12Xv0.
Figure 4 Functional category distribution of the total non-redundant set of predicted genes. The left pie chart shows the higher-level categories. The right pie chart shows the secondary-level categories within the metabolism category.