| Literature DB >> 24996600 |
Yeong Deuk Jo, Yoomi Choi, Dong-Hwan Kim, Byung-Dong Kim, Byoung-Cheorl Kang1.
Abstract
BACKGROUND: Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearrangements caused by recombination. However, the mitochondrial genome structure and the DNA rearrangements that may be related to CMS have not been characterized in Capsicum spp.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24996600 PMCID: PMC4108787 DOI: 10.1186/1471-2164-15-561
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
General features of mitochondrial genomes of two pepper lines and one tobacco line
| Features | FS4401 | Jeju |
|
|---|---|---|---|
| Genome size (bp) | 507,452 | 511,530 | 430,597 |
| GC content (%) | 44.5 | 44.6 | 45.0 |
| Coding sequences (bp)a | 40,085 (7.9%) | 39,524 (7.7%) | 43,642 (10.1%) |
| Plastid-derived sequences (bp)b | 59,873 (11.8%) | 64,815 (12.7%) | 19,492 (4.5%) |
| Repeated sequences (bp)c | 42,505 (8.4%) | 70,122 (13.7%) | 73,511 (17.1%) |
| Gene content (number) | 66 | 64 | 61 |
| Protein coding genesd | 38 | 37 | 37 |
| rRNAs | 3 | 3 | 3 |
| tRNAse | 25 (13) | 24 (12) | 21 (9) |
aAll of the copies of duplicated genes were included.
bComplete sequences of chloroplast genomes of FS4401 [33] and Nicotiana tabacum [34] were used to screen for plastid-derived sequences in mitochondrial genomes of FS4401/Jeju and N. tabacum, respectively, based on the BLASTN algorithm. The value for N. tabacum was different from that of Sugiyama et al. [35] probably due to differences in methodology used to isolate plastid-derived sequence.
cSequences longer than 100 bp and showing similarity higher than 95% between copies were considered repeated. In case repeated sequence units were overlapping, nucleotides that were included in repeated sequences at least one time were counted without repetition.
d rpl10, which was reported to be a conserved mitochondrial gene by Kubo and Arimura [32] was added to the list of genes described by Sugiyama et al. in tobacco [35].
eThe number of tRNA genes included in plastid-derived sequences is given in parenthesis.
The tobacco line is ‘Bright Yellow 4′ for which the mitochondrial genome was analyzed by Sugiyama et al. [35].
Figure 1Gene maps of the mitochondrial genomes of CMS and male-fertile pepper lines. (a) Gene map of FS4401 (CMS) (b) Gene map of Jeju (male-fertile). The genes drawn outside of the circle are transcribed clockwise and those inside, counterclockwise. The colors of the genes denote the functions of the gene products. Large repeat sequences (>1 kb) are shown as colored arrows on the outer circle. Sequence blocks that were syntenic between genomes (>2 kb; > 95% similarity) are depicted on the inner circles. They were drawn in two lines of inner circles to separate blocks in different directions.
Differences between Jeju and FS4401 in sequences of known genes
| Genes in Jeju | Gene length in Jeju | Polymorphism in FS4401 | Corresponding sequence in tobacco |
|---|---|---|---|
|
| 1977 | 904 gcA (A) → 904 gcG (A) | 904 gcG (A) |
|
| 597 | 16 acGAATATGCAg (T | 16 acGAATATGCAg (TNMQ) |
| → 16 acg (T) | |||
|
| 462 | 178 ccCAACAGTTTg (P | 178 ccCAACTGTTTg (PN |
| → 178 ccg (P) | |||
|
| 999 | 337 ccCGGGAAGGGggat (P | 337 ccCGGGAAGGGggat (PGKGD) |
| → 337 ccggat (PD) | |||
|
| 333 | 178 tTCttc ( | 178 tCTttc ( |
| → 178 tCTttc ( | |||
|
| 1296 | ψ | Higher similarity with |
| 283 gGt (G) → gCt (A) | |||
| 316 ACa (T) → CAa (Q) | |||
| 454 aaAGaa (KE) → aaCCaa (NQ) | |||
| no similarity in downstream of 931th bp due to DNA rearrangement | |||
|
| |||
| no similarity in upstream of 497th bp due to DNA rearrangement |
aThe SNP polymorphism can potentially be eliminated if the plant mitochondrial.
C-to-U RNA editing occur.
The pattern of base changes and corresponding amino acid changes is described. The polymorphic site (the number indicates the position of the first nucleotide), with the polymorphic nucleotide capitalized, and the corresponding amino acid in parentheses, is given to the left of the arrow for Jeju and to the right of the arrow for FS4401.
Figure 2Structure of the gene copies in Jeju and FS4401. The sequences correspond to gene-coding region are drawn as the wider rectangles and the upstream or downstream regions as the narrower bars. The sequence units that show high similarity (>99%) to each other and included in the same category of sequence characteristics (non-coding region/coding region, atp6 region showing high conservation/poor conservation among plant taxa) are depicted as the same color. The overall scheme of figure was adopted from Kim et al. [26].
Figure 3Distribution of specific s, sequences showing similarity with the other pepper mtDNA, tobacco mtDNA and FS4401 plastid genome, repeated sequences in FS4401 and Jeju mtDNA. Locations of ORFs (longer than 300 bp) that are specific to FS4401 (above) or Jeju (below) are shown on FS4401 or Jeju mtDNA, respectively. Red-colored ORFs are specifically present only in one of genomes or carry structural rearrangements. Blue-colored ORFs show polymorphism in length or sequence compared to its counterpart. Known genes are depicted in grey. The sequences showing similarity between genomes were determined based on alignment generated using default parameters of the BLASTN algorithm and is depicted by black rectangles or bars. The distribution of repeated sequences in each genome (>100 bp; > 95%) is depicted with black bars and rectangles. The name of ORFs indicates the number of amino acids in encoded proteins except for the case of ‘orf507’ for which the number of nucleotides in the ORF was adopted to name the ORF in consistent with the previous research [27].
Figure 4Comparison of sequence structure around and ψ between FS4401, Jeju, and CM334. The sequence blocks conserved between two lines are depicted in the same colors.
Figure 5Localization of syntenic sequence blocks of mitochondrial genomes in other crops. Sequence blocks showing synteny (>2 kb, > 95%) between a CMS line and a different line were depicted as blue-green color. mtDNAs of CMS lines were used as the reference genome in each comparison. Distribution of repeated sequences (>100 bp, > 95%) in CMS lines is shown with brown bars and boxes. The CMS-associated genes in each crop are indicated above the alignments. Sequence blocks and repeated sequences are depicted in two layers to show the direction and length.