| Literature DB >> 25653106 |
Koji Kashima1,2, Mio Mejima3, Shiho Kurokawa4, Masaharu Kuroda5, Hiroshi Kiyono6,7, Yoshikazu Yuki8,9.
Abstract
BACKGROUND: We have developed a rice-based oral cholera vaccine named MucoRice-CTB (Cholera Toxin B-subunit) by using an Agrobacterium tumefaciens-mediated co-transformation system. To assess the genome-wide effects of this system on the rice genome, we compared the genomes of three selection marker-free MucoRice-CTB lines with those of two wild-type rice lines (Oryza sativa L. cv. Nipponbare). Mutation profiles of the transgenic and wild-type genomes were examined by next-generation sequencing (NGS).Entities:
Mesh:
Substances:
Year: 2015 PMID: 25653106 PMCID: PMC4320824 DOI: 10.1186/s12864-015-1285-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of sequence reads for each line
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 50A | 384,384,291 | 375,891,543 | 97.8 | 371,762,264 | 99.6 | 101.3 |
| 51A | 362,591,974 | 352,782,360 | 97.3 | 371,570,800 | 99.6 | 95.1 |
| 55A | 356,620,373 | 349,808,296 | 98.1 | 369,691,614 | 99.0 | 94.7 |
| WT1 | 342,044,404 | 332,246,878 | 97.1 | 371,589,770 | 99.6 | 89.4 |
| WT2 | 323,436,829 | 314,249,500 | 97.2 | 371,575,970 | 99.6 | 84.3 |
Mapping rate represents the ratio of the number of mapped reads to that of total reads. Covered length represents the number of genome bases covered with at least one read. Coverage rate is the ratio of covered length to the total length of the rice reference genome (373,245,519 bps, IRGSP-1.0, build 5 [21]). Depth was calculated by dividing the total length of all mapped reads (100 bps for each read) by covered length.
Distribution of variants among MucoRice-CTB and WT lines
|
| ||||||
|---|---|---|---|---|---|---|
|
| ||||||
|
| ||||||
|
|
|
|
|
| ||
| 50A | 19,802 | 12,706 | 798 (2.15 × 10-6) | 88 | 222 | 488 |
| 51A | 19,612 | 12,586 | 619 (1.67 × 10-6) | 54 | 224 | 341 |
| 55A | 19,398 | 12,736 | 842 (2.28 × 10-6) | 96 | 263 | 483 |
| WT1 | 19,103 | 12,477 | 524 (1.41 × 10-6) | 47 | 170 | 307 |
| WT2 | 20,623 | 12,927 | 826 (2.22 × 10-6) | 79 | 282 | 465 |
Three filters for improving the accuracy of each variant (described in the Methods section) were applied to the total variants (shown as “Total” in the second column). Line-specific variants such as Ins, insertions; Del, deletions and SNPs were selected in accordance with the sharing profile from filtered variants. Mutation rates represent probability of a mutation per nucleotide.
Figure 1Variant distribution profiles over the 12 chromosomes of MucoRice-CTB and WT rice lines. (A) Variant distribution calculated in consecutive non-overlapping 500-kbp windows was averaged for three MucoRice-CTB lines or two WT lines. Blue lines in the upper half of each graph represent average distributions for MucoRice-CTB lines, orange lines in the bottom half represent average distributions for WT lines. The vertical axis represents the number of variants; the values increase upward for MucoRice-CTB lines and downward for WT lines. Black vertical lines in each graph indicate centromere positions. (B) The maximum differences in the numbers of variants per 500-kbp window between MucoRice-CTB and WT lines on each chromosome.
Figure 2Line-specific transitions and transversions in MucoRice-CTB and WT. (A) Numbers of line-specific substitutions in each MucoRice-CTB line and WT line. (B) Average numbers and percentage of transitions (Ts) and transversions (Tv), and the Ts/Tv ratio in MucoRice-CTB and WT lines.
Figure 3Variant-sharing profiles. Number of variants in each MucoRice-CTB or WT line. Dots and dashes at the bottom indicate the presence or the absence of variants, respectively. More than one dot beneath a bar indicates that corresponding lines share the variants. The five leftmost bars show line-specific variants, whereas the rightmost bar shows common variants shared by all lines.
Prediction of the effects of variants
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
| ||
|
| FRAME_SHIFT | 5 | 8 | 12 | 10 | 5 |
| SPLICE_SITE_ACCEPTOR | 1 | 0 | 0 | 1 | 0 | |
| SPLICE_SITE_DONOR | 2 | 0 | 0 | 0 | 0 | |
| START_LOST | 0 | 0 | 0 | 0 | 0 | |
| STOP_GAINED | 1 | 0 | 0 | 0 | 0 | |
| STOP_LOST | 2 | 0 | 0 | 0 | 0 | |
|
| CODON_CHANGE_PLUS_CODON_DELETION | 0 | 1 | 1 | 0 | 2 |
| CODON_CHANGE_PLUS_CODON_INSERTION | 0 | 0 | 0 | 0 | 0 | |
| CODON_DELETION | 2 | 1 | 5 | 1 | 5 | |
| CODON_INSERTION | 0 | 1 | 0 | 0 | 0 | |
| NON_SYNONYMOUS_CODING | 21 | 15 | 21 | 8 | 7 | |
|
| NON_SYNONYMOUS_START | 0 | 0 | 0 | 0 | 0 |
| SPLICE_SITE_REGION | 3 | 0 | 0 | 0 | 3 | |
| START_GAINED | 3 | 1 | 0 | 1 | 0 | |
| SYNONYMOUS_CODING | 13 | 4 | 7 | 7 | 7 | |
| SYNONYMOUS_STOP | 1 | 0 | 0 | 0 | 0 | |
|
| DOWNSTREAM | 307 | 237 | 337 | 205 | 323 |
| INTERGENIC | 395 | 305 | 404 | 250 | 407 | |
| INTRAGENIC | 0 | 0 | 0 | 0 | 0 | |
| INTRON | 30 | 31 | 40 | 29 | 56 | |
| UPSTREAM | 8 | 6 | 10 | 7 | 4 | |
| UTR_3_PRIME | 1 | 4 | 2 | 0 | 2 | |
| UTR_5_PRIME | 3 | 5 | 3 | 5 | 5 | |
Line-specific variants (SNPs and InDels) that may affect protein function were categorized into 23 types. These types were further grouped into HIGH, MODERATE, LOW, and MODIFIER according to potential severity. The assignment criteria were pre-defined in the annotation program (SNPEff).