| Literature DB >> 23792890 |
Reetu Tuteja1, Rachit K Saxena, Jaime Davila, Trushar Shah, Wenbin Chen, Yong-Li Xiao, Guangyi Fan, K B Saxena, Andrew J Alverson, Charles Spillane, Christopher Town, Rajeev K Varshney.
Abstract
The hybrid pigeonpea (Cajanus cajan) breeding technology based on cytoplasmic male sterility (CMS) is currently unique among legumes and displays major potential for yield increase. CMS is defined as a condition in which a plant is unable to produce functional pollen grains. The novel chimeric open reading frames (ORFs) produced as a results of mitochondrial genome rearrangements are considered to be the main cause of CMS. To identify these CMS-related ORFs in pigeonpea, we sequenced the mitochondrial genomes of three C. cajan lines (the male-sterile line ICPA 2039, the maintainer line ICPB 2039, and the hybrid line ICPH 2433) and of the wild relative (Cajanus cajanifolius ICPW 29). A single, circular-mapping molecule of length 545.7 kb was assembled and annotated for the ICPA 2039 line. Sequence annotation predicted 51 genes, including 34 protein-coding and 17 RNA genes. Comparison of the mitochondrial genomes from different Cajanus genotypes identified 31 ORFs, which differ between lines within which CMS is present or absent. Among these chimeric ORFs, 13 were identified by comparison of the related male-sterile and maintainer lines. These ORFs display features that are known to trigger CMS in other plant species and to represent the most promising candidates for CMS-related mitochondrial rearrangements in pigeonpea.Entities:
Keywords: cytoplasmic male sterility; mitochondria; next-generation sequencing; open reading frames; pigeonpea
Mesh:
Substances:
Year: 2013 PMID: 23792890 PMCID: PMC3789559 DOI: 10.1093/dnares/dst025
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1.A scheme showing linking scaffold with the help of graph in a preliminary view of the assembly. Assembly graphs were used as a guide to connect the scaffolds. Each box represent a scaffold, ‘||’ represent the 3′ end, and ‘|>’ represent the 5′ end of each scaffold. Number on each scaffold represents the scaffold number and size of the scaffold. The thick black lines indicate that the scaffolds are attached in correct orientation and spotted line indicate that the scaffolds are attached in reverse orientation in the assembly. Numbers on these lines represents the sequence coverage. The orientation of each scaffold was confirmed through Sanger sequencing.
Generation of 454/FLX data and assembly statistics of ICPW 29, ICPA 2039, ICPB 2039, and ICPH 2433
| Genotypes | Number of sequence reads (length) generated | Newbler | Celera | CLC Bio | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Number of scaffolds | Bases in scaffolds (bp) | N50 scaffold size (bp) | Number of large contigsa | Bases in large contigs (bp) | Number of scaffolds | Bases in scaffolds (bp) | Number of big contigsb | Big contig length (bp) | Number of contigs | Bases in contigs (bp) | ||
| ICPW 29 | 74 109 (23.8 Mb)/164 071 (53.4 Mb)c | 156/18c (4)d | 723 814/672 137c (575 487)e | 108 193/159 243c | 202/425c | 694 265/858 696c | 34 | 618 692 | 15 | 501 689 | 392 | 951 760 |
| ICPA 2039 | 121 170 (38.8 Mb) | 30 (7)d | 672 918 (532 372)e | 169 595 | 345 | 828 279 | 84 | 662 357 | 20 | 475 091 | 1348 | 1 782 550 |
| ICPB 2039 | 51 723 (15.6 Mb)/117 163 (36.4 Mb)c | 387/52c (34)d | 415 181/459 802c (335 926)e | 1153/12 823c | 430/669c | 404 882/716 244c | 113 | 199 602 | 0 | 0 | 1032 | 529 436 |
| ICPH 2433 | 116 021 (37.1 Mb) | 108 (9)d | 681 810 (539 865)e | 169 903 | 184 | 677 158 | 44 | 577 054 | 18 | 468 611 | 564 | 1 227 305 |
aNewbler assembler classifies contigs >500 bp as large contigs.
bCelera assembler classifies contigs >10 kb as big contigs.
cData and assembly statistics of ICPW 29 and ICPB 2039 additional reads.
dNumber of scaffolds from the mitochondrial genome.
eBase in scaffolds from the mitochondrial genome.
Genome coverage by coding features in ICPA 2039 mitochondrial genome assembly
| Class | Feature | ICPA 2039a (%) |
|---|---|---|
| Total size | 545 742 bp | |
| Coding | Protein exons | 29 346 bp (5.4) |
| Introns | 31 018 bp (5.6) | |
| rRNA | 5 255 bp (0.9) | |
| tRNA | 1477 bp (0.2) | |
| Non-coding | Mitochondria-like | 220 747 bp (40.5) |
| Nuclear-like | 40 330 bp (7.4) |
aFigure in parentheses represents the percentage of total size.
Figure 2.Correlation of gene order between the mitochondrial gene maps of C. cajan and V. radiata. Left-hand side is represented by genes identified in C. cajan and top side is represented by genes of V. radiata. Shaded blocks in the image represent the correlation of gene orders.
Figure 3.Alignments of Cajanus mitochondrial genomes. The outer circle represents the finalized mitochondrial genome assembly and gene annotation of male-sterile line ICPA 2039. Second, third, and fourth circles from the outer circle represent the scaffolds of ICPH 2433, ICPW 29, and ICPB 2039 mapped on ICPA 2039 assembly. Numbers on each circle represent the scaffolds of each line. Hn represent the scaffolds for ICPH 2433, Wn represent the scaffolds for ICPW 29 and Bn represent the scaffolds for ICPB 2039, where n is the scaffold number.
Potential chimeric ORFs identified from the no-coverage and rearrangement regions between the ICPA 2039 and ICPB 2039 lines
| ORF start | ORF stop | ORF length | Nearest gene | Subject start | Subject stop | Chimera length | Identity | Subject features | No of transmembrane helices |
|---|---|---|---|---|---|---|---|---|---|
| 260 331 | 260 702 | 371 | 260 342 | 260 702 | 361 | 98 | ORF | 1 | |
| 164 867 | 165 424 | 557 | 165 424 | 165 353 | 72 | 100 | ORF | 1 | |
| 420 342 | 420 902 | 560 | — | 233 322 | 232 862 | 461 | 96 | 0 | |
| 534 744 | 535 115 | 371 | — | 352 424 | 352 403 | 361 | 98 | 1 | |
| 264 435 | 265 745 | 1310 | — | 60 093 | 59 813 | 281 | 100 | 2 | |
| 165 464 | 165 853 | 389 | 265 742 | 265 981 | 241 | 97 | ORF | 3 | |
| 164 867 | 165 424 | 557 | 165 424 | 165 353 | 72 | 100 | ORF | 1 | |
| 276 037 | 276 405 | 368 | — | 468 809 | 468 752 | 58 | 98 | 3 | |
| 396 025 | 396 876 | 851 | 264 909 | 265 396 | 488 | 97 | ORF | 2 | |
| 396 285 | 396 641 | 356 | 265 144 | 265 396 | 253 | 99 | ORF | 1 | |
| 264 069 | 264 434 | 365 | — | 44 935 | 45 092 | 158 | 100 | 1 | |
| 165 633 | 166 088 | 455 | 265 981 | 265 886 | 96 | 100 | ORF | 0 | |
| 8534 | 8842 | 308 | 476 033 | 476 052 | 20 | 100 | — | 1 |