| Literature DB >> 21871077 |
Giuliana Giannuzzi1, Pietro D'Addabbo, Marica Gasparro, Maurizio Martinelli, Francesco N Carelli, Donato Antonacci, Mario Ventura.
Abstract
BACKGROUND: Segmental duplications (SDs) are blocks of genomic sequence of 1-200 kb that map to different loci in a genome and share a sequence identity > 90%. SDs show at the sequence level the same characteristics as other regions of the human genome: they contain both high-copy repeats and gene sequences. SDs play an important role in genome plasticity by creating new genes and modeling genome structure. Although data is plentiful for mammals, not much was known about the representation of SDs in plant genomes. In this regard, we performed a genome-wide analysis of high-identity SDs on the sequenced grapevine (Vitis vinifera) genome (PN40024).Entities:
Mesh:
Year: 2011 PMID: 21871077 PMCID: PMC3179966 DOI: 10.1186/1471-2164-12-436
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1FISH results and WSSD outputs for a single and a duplicated BAC clone. FISH signals on metaphase and interphase chromosomes (left panel), and WSSD coverages and segmentally duplicated regions according to the nodiv_N method of the genomic loci (right panel) of VV40024H153C18 single (A) and VV40024H153D21 duplicated (B) clones.
Figure 2Comparison of three repeat masking settings. Left panel - The fit results of the WSSD coverage data sample from 39 single BAC-anchored loci (black dots) obtained using the "div10_low", "nodiv_low" and "nodiv_N" masking settings are displayed. The data are fitted to a model of four Gaussian distributions (G1, G2, G3 and G4) drawn in green, red, yellow and magenta, respectively, with their sum drawn in blue. Mean (red vertical line) and sigma (magenta vertical line) values in the box refer to Gaussian G1. On top, the normalized residuals distribution (Pull = (Ndata-Nfit)/σdata, N and σ being the number of events and the error for each bin) is shown. When the pull distribution is fully between -3 and 3, fluctuations are only statistical. Right panel - WSSD outputs and segmentally duplicated regions according to the three masking methods of VV40024H153A08, 153A22, 153C20, 153C22, 153D03 and 153D07 BAC clones.
Segmental duplications in the grapevine genome
| chr | chr_size (bp) | dup_size (bp) | perc_dup | # dup intervals |
|---|---|---|---|---|
| chr1 | 23,037,639 | 2,733,492 | 11.87% | 94 |
| chr2 | 18,779,844 | 3,032,687 | 16.15% | 104 |
| chr3 | 19,341,862 | 3,860,663 | 19.96% | 132 |
| chr4 | 23,867,706 | 3,301,331 | 13.83% | 118 |
| chr5 | 25,021,643 | 3,688,952 | 14.74% | 125 |
| chr6 | 21,508,407 | 2,864,816 | 13.32% | 89 |
| chr7 | 21,026,613 | 2,334,595 | 11.10% | 80 |
| chr8 | 22,385,789 | 2,028,352 | 9.06% | 86 |
| chr9 | 23,006,712 | 5,694,956 | 24.75% | 167 |
| chr10 | 18,140,952 | 3,413,632 | 18.82% | 111 |
| chr11 | 19,818,926 | 2,155,608 | 10.88% | 82 |
| chr12 | 22,702,307 | 4,358,297 | 19.20% | 143 |
| chr13 | 24,396,255 | 4,592,630 | 18.83% | 139 |
| chr14 | 30,274,277 | 4,521,418 | 14.93% | 159 |
| chr15 | 20,304,914 | 4,479,245 | 22.06% | 134 |
| chr16 | 22,053,297 | 5,531,029 | 25.08% | 149 |
| chr17 | 17,126,926 | 2,453,620 | 14.33% | 91 |
| chr18 | 29,360,087 | 4,551,888 | 15.50% | 147 |
| chr19 | 24,021,853 | 5,810,980 | 24.19% | 173 |
| chr1_random | 568,933 | 136,397 | 23.97% | 2 |
| chr3_random | 1,220,746 | 297,331 | 24.36% | 9 |
| chr4_random | 76,237 | 0 | 0.00% | 0 |
| chr5_random | 421,237 | 0 | 0.00% | 0 |
| chr7_random | 1,447,032 | 140,206 | 9.69% | 5 |
| chr9_random | 487,831 | 137,370 | 28.16% | 4 |
| chr10_random | 789,605 | 303,818 | 38.48% | 3 |
| chr11_random | 282,498 | 0 | 0.00% | 0 |
| chr12_random | 1,566,225 | 403,711 | 25.78% | 10 |
| chr13_random | 3,268,264 | 1,079,980 | 33.04% | 28 |
| chr16_random | 740,079 | 672,372 | 90.85% | 8 |
| chr17_random | 829,735 | 126,189 | 15.21% | 4 |
| chr18_random | 5,170,003 | 1,365,584 | 26.41% | 36 |
| chrUn | 43,154,196 | 8,876,440 | 20.57% | 210 |
Size and percentage of segmentally duplicated regions and the number of duplicated intervals for: each chromosome, all nonrandom chromosomes (Tot_nonrandom, from chr1 to chr19), all random chromosomes (Tot_random, from chr1_random to chr18_random), all nonrandom and random chromosomes (Tot_placed), and the whole genome (Tot_whole).
Comparison of NUMT and NUPT content between grapevine whole-genome and segmentally duplicated regions
| Whole chromosome | Segmental duplications | |||||||
|---|---|---|---|---|---|---|---|---|
| chr | NUMTs | NUPTs | NUMTs | NUPTs | ||||
| bp | percentage | bp | percentage | bp | percentage | bp | percentage | |
| chr1 | 35,148 | 0.15% | 31,915 | 0.14% | 7,942 | 0.29% | 2,216 | 0.08% |
| chr2 | 23,352 | 0.12% | 37,460 | 0.20% | 5,029 | 0.17% | 16,919 | 0.56% |
| chr3 | 46,530 | 0.24% | 21,546 | 0.11% | 3,746 | 0.10% | 1,888 | 0.05% |
| chr4 | 40,855 | 0.17% | 26,304 | 0.11% | 8,651 | 0.26% | 2,803 | 0.08% |
| chr5 | 36,974 | 0.15% | 25,890 | 0.10% | 6,936 | 0.19% | 2,263 | 0.06% |
| chr6 | 42,918 | 0.20% | 35,543 | 0.17% | 14,186 | 0.50% | 10,820 | 0.38% |
| chr7 | 29,780 | 0.14% | 29,239 | 0.14% | 1,282 | 0.05% | 2,409 | 0.10% |
| chr8 | 52,538 | 0.23% | 38,971 | 0.17% | 16,355 | 0.81% | 10,005 | 0.49% |
| chr9 | 40,017 | 0.17% | 41,931 | 0.18% | 12,529 | 0.22% | 13,459 | 0.24% |
| chr10 | 40,034 | 0.22% | 29,263 | 0.16% | 6,255 | 0.18% | 9,896 | 0.29% |
| chr11 | 32,762 | 0.17% | 40,550 | 0.20% | 8,227 | 0.38% | 8,354 | 0.39% |
| chr12 | 79,290 | 0.35% | 42,133 | 0.19% | 46,728 | 1.07% | 11,608 | 0.27% |
| chr13 | 59,784 | 0.25% | 40,806 | 0.17% | 22,390 | 0.49% | 12,465 | 0.27% |
| chr14 | 59,292 | 0.20% | 32,831 | 0.11% | 12,094 | 0.27% | 5,976 | 0.13% |
| chr15 | 25,964 | 0.13% | 25,086 | 0.12% | 5,899 | 0.13% | 4,076 | 0.09% |
| chr16 | 55,547 | 0.25% | 41,895 | 0.19% | 21,245 | 0.38% | 10,531 | 0.19% |
| chr17 | 26,574 | 0.16% | 28,029 | 0.16% | 3,702 | 0.15% | 2,812 | 0.11% |
| chr18 | 60,444 | 0.21% | 40,038 | 0.14% | 22,523 | 0.49% | 8,360 | 0.18% |
| chr19 | 39,838 | 0.17% | 30,991 | 0.13% | 12,876 | 0.22% | 6,879 | 0.12% |
| 827,641 | 640,421 | 238,595 | 143,739 | |||||
| chr1_random | 1,763 | 0.31% | 2,762 | 0.49% | 78 | 0.06% | 33 | 0.02% |
| chr3_random | 872 | 0.07% | 1,082 | 0.09% | 0 | 0.00% | 0 | 0.00% |
| chr4_random | 0 | 0.00% | 0 | 0.00% | - | - | - | - |
| chr5_random | 642 | 0.15% | 366 | 0.09% | - | - | - | - |
| chr7_random | 2,355 | 0.16% | 2,195 | 0.15% | 352 | 0.25% | 55 | 0.04% |
| chr9_random | 525 | 0.11% | 81 | 0.02% | 93 | 0.07% | 0 | 0.00% |
| chr10_random | 111 | 0.01% | 1,096 | 0.14% | 0 | 0.00% | 0 | 0.00% |
| chr11_random | 667 | 0.24% | 191 | 0.07% | - | - | - | - |
| chr12_random | 1,522 | 0.10% | 2,588 | 0.17% | 674 | 0.17% | 1,756 | 0.43% |
| chr13_random | 8,607 | 0.26% | 5,168 | 0.16% | 4,914 | 0.46% | 2,903 | 0.27% |
| chr16_random | 636 | 0.09% | 977 | 0.13% | 636 | 0.09% | 977 | 0.15% |
| chr17_random | 94 | 0.01% | 770 | 0.09% | 0 | 0.00% | 0 | 0.00% |
| chr18_random | 5,302 | 0.10% | 3,476 | 0.07% | 2,709 | 0.20% | 509 | 0.04% |
| 23,096 | 20,752 | 9,456 | 6,233 | |||||
| 850,737 | 661,173 | 248,051 | 149,972 | |||||
| chrUn | 417,301 | 0.97% | 69,457 | 0.16% | 159,039 | 1.79% | 21,834 | 0.25% |
| 1,268,038 | 730,630 | 407,090 | 171,806 | |||||
Comparison of gene content between grapevine whole-genome and segmentally duplicated regions
| chr | total genes | full dup genes | chr gene density | full dup gene density | ratio of densities |
|---|---|---|---|---|---|
| chr1 | 1,399 | 72 | 6.07 | 2.63 | 0.43 |
| chr2 | 976 | 93 | 5.20 | 3.07 | 0.59 |
| chr3 | 1,104 | 143 | 5.71 | 3.70 | 0.65 |
| chr4 | 1,363 | 95 | 5.71 | 2.88 | 0.50 |
| chr5 | 1,435 | 138 | 5.74 | 3.74 | 0.65 |
| chr6 | 1,289 | 66 | 5.99 | 2.30 | 0.38 |
| chr7 | 1,357 | 59 | 6.45 | 2.53 | 0.39 |
| chr8 | 1,488 | 67 | 6.65 | 3.30 | 0.50 |
| chr9 | 1,136 | 166 | 4.94 | 2.91 | 0.59 |
| chr10 | 842 | 100 | 4.64 | 2.93 | 0.63 |
| chr11 | 1,082 | 57 | 5.46 | 2.64 | 0.48 |
| chr12 | 1,263 | 175 | 5.56 | 4.02 | 0.72 |
| chr13 | 1,281 | 150 | 5.25 | 3.27 | 0.62 |
| chr14 | 1,625 | 153 | 5.37 | 3.38 | 0.63 |
| chr15 | 957 | 122 | 4.71 | 2.72 | 0.58 |
| chr16 | 1,048 | 140 | 4.75 | 2.53 | 0.53 |
| chr17 | 1,006 | 54 | 5.87 | 2.20 | 0.37 |
| chr18 | 1,796 | 143 | 6.12 | 3.14 | 0.51 |
| chr19 | 1,200 | 192 | 5.00 | 3.30 | 0.66 |
| chr1_random | 7 | 1 | 1.23 | 0.73 | 0.60 |
| chr3_random | 28 | 7 | 2.29 | 2.35 | 1.03 |
| chr4_random | 5 | - | 6.56 | - | - |
| chr5_random | 10 | - | 2.37 | - | - |
| chr7_random | 73 | 8 | 5.04 | 5.71 | 1.13 |
| chr9_random | 4 | 2 | 0.82 | 1.46 | 1.78 |
| chr10_random | 52 | 12 | 6.59 | 3.95 | 0.60 |
| chr11_random | 11 | - | 3.89 | - | - |
| chr12_random | 36 | 6 | 2.30 | 1.49 | 0.65 |
| chr13_random | 156 | 47 | 4.77 | 4.35 | 0.91 |
| chr16_random | 28 | 26 | 3.78 | 3.87 | 1.02 |
| chr17_random | 15 | 3 | 1.81 | 2.38 | 1.32 |
| chr18_random | 211 | 44 | 4.08 | 3.22 | 0.79 |
| chrUn | 2,063 | 248 | 4.78 | 2.79 | 0.58 |
Number of total genes (total genes), number of fully duplicated genes (full dup genes), chromosome gene density (chr gene density), density of fully duplicated genes in duplicated regions (full dup gene density), and ratio between full dup gene density and chr gene density (ratio of densities) are shown for each chromosome, all nonrandom chromosomes, all random chromosomes, all nonrandom and random chromosomes, and for the whole genome. Gene density is calculated as the average number of genes present in 100 kb of genomic sequence.