| Literature DB >> 23521852 |
Md Shofiqul Islam, Bruno Studer, Stephen L Byrne, Jacqueline D Farrell, Frank Panitz, Christian Bendixen, Ian Max Møller, Torben Asp.
Abstract
BACKGROUND: Perennial ryegrass (Lolium perenne L.) is one of the most important forage and turf grass species of temperate regions worldwide. Its mitochondrial genome is inherited maternally and contains genes that can influence traits of agricultural importance. Moreover, the DNA sequence of mitochondrial genomes has been established and compared for a large number of species in order to characterize evolutionary relationships. Therefore, it is crucial to understand the organization of the mitochondrial genome and how it varies between and within species. Here, we report the first de novo assembly and annotation of the complete mitochondrial genome from perennial ryegrass.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23521852 PMCID: PMC3664089 DOI: 10.1186/1471-2164-14-202
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characterization of mitochondrial enrichment and intactness
| Protein | Total (mg) | 731 ± 40 | 42.0 ± 7 |
| CCO activity | Total (μmol min-1) | 13.0 ± 2.90 | 5.87 ± 1.41 (45%) |
| Specific (nmol min-1 mg-1) | 18.0 ± 2.83 | 139 ± 9.90 (7.7-fold) | |
| Latency (%) | 90.0 ± 1.41 | 89.5 ± 0.71 | |
| MDH activity | Total (μmol min-1) | 214 ± 61.5 | 26.5 ± 0.71 (12%) |
| Specific (nmol min-1 mg-1) | 291 ± 68.6 | 639 ± 91.2 (2.2-fold) | |
| Latency (%) | 22.0 ± 1.41 | 77.5 ± 0.71(3.5-fold) | |
| Yield of mtDNA | Total (μg) | - | 3.47 ± 0.27 |
*Data are means ± SD of two separate isolations each of 30 g fresh leaves.
Summary of the perennial ryegrass mitochondrial genome sequencing and assembly
| Total number of single reads | 287,367 | 403 | 115,768,584 |
| Total number of single reads after quality control* | 281,706 | 403 | 113,583,420 |
| Number of initial contigs | 2,403 | 703 | 1,689,209 |
| Number of mitochondrial contigs | 9 | 80,314 | 722,827 |
| Number of final contigs after closing all gaps: Genome size | 1 | - | 678,580 |
| Coverage | - | - | 167-fold |
*Removal of reads matching chloroplast DNA.
Figure 1Map of the perennial ryegrass mitochondrial genome. Protein, tRNA and rRNA-coding genes are shown inside and outside the circles. The second outer circle represents the circular master molecule. Genes and exons are indicated by arrowheads, p indicates a pseudogene. The forward and reverse DNA strands are shown in clockwise and anticlockwise orientation, respectively. The middle black peaked circle represents the G+C content of the master molecule. The inner circle shows the size markers in kbp in clockwise orientation. The first nucleotide of the cox1 gene is the starting point of the circular master molecule. This figure was generated using the CGView Server [17].
Main features of the assembled perennial ryegrass mitochondrial genome
| Genome size (nt) | 678,580 |
| G+C content (%) | 44.1 |
| Gene number | 73a |
| -Protein coding genes | 39 (34)b (1)c |
| -tRNA genes | 28 (14)b (1)c |
| -rRNA genes | 6 (3)b |
| Proteins | |
| -Protein coding sequences (nt) | 35,514 |
| -Protein coding sequence (%) | 5.23 |
| RNAs | |
| -RNA gene coding sequences (nt) | 13,209 |
| -RNA gene coding sequences (%) | 1.95 |
| Introns | |
| -Cis-spliced group II introns | 22 |
| -Trans-spliced group II introns | 4 |
| -Introns sequences of these genes (nt) | 26,631 |
| -Introns of these genes sequences (%) | 3.92 |
| ORFs | |
| -No. of ORFs | 149 |
| -ORFs (>300 bp) sequences (nt) | 67,410 |
| -ORFs sequences (%) | 9.93 |
| Total (proteins, RNAs and ORFs) sequences | 142,764 |
| Total (proteins, RNAs and ORFs) sequences (%) | 21.03 |
| Repeats | |
| -Large repeat sequences (nt) | 116,177 |
| -Large repeat sequences (%) | 17.12 |
| Transposable element-related sequences (%) | 3.76 |
aTotal numbers including duplications and pseudogenes, bIn parentheses, number of unique genes, c In parentheses, number of pseudogenes.
Genes identified in the perennial ryegrass mitochondrial genome
| A. Protein and rRNA-coding genes | ||||||
| 1. Complex I genes | | | | | | |
| - | - | x | ||||
| | 387 | 284,188 | 284,574 | + | 129 | |
| | 81 | 247,595 | 247,675 | - | 27 | |
| | 192 | 245,995 | 246,186 | - | 64 | |
| | 60 | 512,368 | 512,427 | - | 20 | |
| | 258 | 76,540 | 76,797 | - | 85 | |
| - | - | x | ||||
| | 153 | 324,853 | 325,005 | - | 51 | |
| | 396 | 323,224 | 323,619 | - | 132 | |
| | 159 | 588,650 | 588,809 | + | 53 | |
| | 573 | 591,214 | 591,786 | + | 191 | |
| | 186 | 593,194 | 593,381 | + | 61 | |
| 357 | 137,432 | 137,788 | - | 118 | ||
| - | - | + | ||||
| | 461 | 43,953 | 44,413 | + | 153 | |
| | 515 | 45,442 | 45,956 | + | 172 | |
| | 423 | 48,576 | 48,998 | + | 141 | |
| | 89 | 51,790 | 51,878 | + | 29 | |
| 315 | 263,099 | 263,413 | + | 104 | ||
| - | - | - | ||||
| | 228 | 637,049 | 637,277 | - | 76 | |
| | 1,218 | 634,967 | 636,186 | - | 406 | |
| | 21 | 76,157 | 76,177 | - | 7 | |
| | 396 | 237,378 | 237,773 | - | 132 | |
| | 150 | 236,297 | 236,446 | - | 49 | |
| 945 | 510,437 | 511,381 | - | 314 | ||
| - | - | + | ||||
| | 143 | 213,945 | 214,087 | + | 47 | |
| | 69 | 214,890 | 214,958 | + | 23 | |
| | 467 | 216,269 | 216,735 | + | 155 | |
| | 244 | 217,735 | 217,978 | + | 81 | |
| | 262 | 219,684 | 219,945 | + | 86 | |
| - | - | - | ||||
| | 143 | 449,072 | 449,214 | - | 47 | |
| | 69 | 448,201 | 448,269 | - | 23 | |
| | 467 | 446,424 | 446,890 | - | 155 | |
| | 244 | 445,181 | 445,424 | - | 81 | |
| | 262 | 443,214 | 443,475 | - | 86 | |
| 573 | 669,561 | 670,133 | + | 190 | ||
| 2. Complex III gene | | | | | | |
| 1,194 | 232,540 | 233,733 | + | 397 | ||
| 3. Complex IV genes | | | | | | |
| 1,989 | 1 | 1,989 | + | 662 | ||
| - | - | + | ||||
| | 388 | 484,835 | 485,222 | + | 130 | |
| | 395 | 486,506 | 486,897 | + | 130 | |
| 843 | 670,403 | 671,245 | + | 280 | ||
| 4. Complex V genes | | | | | | |
| 1,530 | 312,019 | 313,548 | - | 509 | ||
| 588 | 335,648 | 336,235 | + | 195 | ||
| 714 | 415,086 | 415,799 | + | 237 | ||
| 468 | 292,308 | 292,775 | + | 155 | ||
| 225 | 35,605 | 35,829 | + | 74 | ||
| 5. Cytochrome | | | | | | |
| biogenesis genes | | | | | | |
| 651 | 132,035 | 132,685 | - | 216 | ||
| 723 | 549,113 | 549,835 | - | 240 | ||
| 1,770 | 83,571 | 85,340 | - | 589 | ||
| - | - | + | ||||
| | 755 | 519,648 | 520,402 | + | 252 | |
| | 679 | 521,399 | 522,077 | + | 225 | |
| 6. Ribosomal proteins | | | | | | |
| genes | | | | | | |
| 636 | 82,801 | 83,436 | - | 211 | ||
| 1,464 | 575,833 | 577,296 | + | 487 | ||
| - | - | + | ||||
| | 72 | 228,586 | 228,657 | + | 24 | |
| | 1,578 | 229,590 | 231,167 | + | 525 | |
| 1,035 | 93,583 | 94,617 | - | 344 | ||
| 1,032 | 603,531 | 604,562 | + | 343 | ||
| 447 | 177,808 | 178,254 | - | 148 | ||
| 447 | 380,279 | 380,725 | + | 148 | ||
| 378 | 137,010 | 137,387 | - | 125 | ||
| 351 | 248,209 | 248,559 | - | 116 | ||
| 159 | 190,863 | 191,021 | + | 52 | ||
| 159 | 472,138 | 472,296 | - | 52 | ||
| 552 | 190,178 | 190,729 | + | 183 | ||
| 552 | 472,430 | 472,981 | - | 183 | ||
| 558 | 231,058 | 231,615 | + | 185 | ||
| 7. Other protein coding genes | | | | | | |
| 1,977 | 77,314 | 79,290 | - | 658 | ||
| 816 | 604,582 | 605,397 | + | 271 | ||
| 8. rRNA genes | | | | | | |
| 122 | 500,941 | 501,062 | + | - | ||
| 122 | 568,955 | 569,076 | + | - | ||
| 1,966 | 498,862 | 500,827 | + | - | ||
| 1,966 | 566,875 | 568,841 | + | - | ||
| 3,461 | 115,757 | 119,217 | - | - | ||
| 3,457 | 348,904 | 352,360 | + | - | ||
| B. tRNA-coding genes | ||||||
| 72 | 616,184 | 616,255 | - | Asn | GTT | |
| 74 | 129,749 | 129,822 | - | Asp | GTC | |
| 74 | 338,298 | 338,371 | + | Asp | GTC | |
| 74 | 615,471 | 615,544 | + | Asp | GTC | |
| 71 | 221,721 | 221,791 | - | Cys | GCA | |
| 71 | 441,367 | 441,437 | + | Cys | GCA | |
| 72 | 151,087 | 151,158 | + | Gln | TTG | |
| 72 | 185,132 | 185,203 | + | Gln | TTG | |
| 72 | 407,374 | 407,445 | - | Gln | TTG | |
| 72 | 477,956 | 478,027 | - | Gln | TTG | |
| 73 | 649,329 | 649,401 | + | Glu | TTC | |
| 74 | 304,917 | 304,990 | + | His | GTG | |
| - | - | - | Leu | CAA | ||
| | 37 | 549,042 | 549,076 | - | | |
| | 34 | 548,992 | 549,029 | - | | |
| 73 | 174,964 | 175,036 | - | Lys | TTT | |
| 73 | 383,497 | 383,569 | + | Lys | TTT | |
| 74 | 11,905 | 11,978 | - | Met | CAT | |
| 74 | 615,054 | 615,127 | - | Met | CAT | |
| 73 | 630,497 | 630,569 | + | Met | CAT | |
| 73 | 371,486 | 371,558 | - | Phe | GAA | |
| 74 | 172,177 | 172,250 | - | Pro | TGG | |
| 75 | 282,933 | 283,007 | + | Pro | TGG | |
| 74 | 386,282 | 386,355 | + | Pro | TGG | |
| 88 | 138,301 | 138,388 | - | Ser | GCT | |
| 86 | 379,430 | 379,515 | + | Ser | TGA | |
| 74 | 171,961 | 172,034 | - | Trp | CCA | |
| 74 | 386,498 | 386,571 | + | Trp | CCA | |
| 83 | 587,933 | 588,015 | + | Tyr | GTA | |
| 73 | 262,935 | 263,007 | + | Phe | GAA | |
A total of 39 protein-coding genes and 6 rRNA-coding genes (A) and 28 tRNA-coding genes (B) were identified. ap, pseudogene; bBoldface, sum of all exons; lower-case letters, exons of protein-coding genes; hyphenated, duplicate genes; cPlus and minus, encoded by the forward and reverse DNA strand, respectively; x, trans-spliced.
Differences in the tRNA gene content in sequenced mitochondrial genomes of grasses
| - | - | - | - | - | - | - | - | - | |
| - | - | - | - | - | - | - | - | - | |
| + | + | + | + | + | + | + | + | + | |
| - | - | - | - | - | - | - | - | - | |
| - | - | - | - | - | - | - | - | - | |
| + | + | + | + | + | + | + | + | + | |
| - | - | - | - | - | - | - | - | - | |
| + | - | - | - | - | - | - | - | - | |
| + | + | + | + | + | + | + | + | + | |
| + | + | + | + | + | + | + | + | + | |
| + | + | + | + | + | + | + | + | + | |
| + | + | + | + | + | + | + | + | + | |
| + | + | + | + | + | + | + | + | + | |
| + | + | + | - | + | + | + | + | + | |
| + | + | + | + | + | + | + | + | + | |
| - | + | + | + | + | + | + | + | + | |
| + | + | + | + | + | + | + | + | + | |
| + | + | + | + | + | + | + | - | - | |
| + | + | + | + | + | + | + | + | + | |
| + | + | + | + | + | + | + | + | + | |
| + | + | + | + | + | + | + | + | + | |
The accession numbers of the mitochondrial genomes are: Ferrocalamus rimosivaginus (JQ235166 to JQ235179), Bambusa oldhamii (EU365401), Triticum aestivum (AP008982), Oryza sativa (BA000029), O. rufipogon (NC_013816), Sorghum bicolor (NC_008360), Zea mays (AY506529) and Z. perennis (NC_008331). +, present; -, absent.
Figure 2Dot plot of the perennial ryegrass mitochondrial genome. Five repeats including four large inverted repeat pairs, IR1-IR4 and one direct repeat, DR (green) are marked by arrows. The inverted and direct repeat coordinates in the master molecule are: IR1, (178,334-228,600 bp; 484,825-434,554 bp); IR2, (147,726-178,558 bp; 410,808-379,975 bp); IR3, (106,923-131,907 bp; 361,199-336,212 bp); IR4 (94,022-95,555 bp; 604,126-602,594 bp) and DR (498,590-507,147 bp; 566,604-575,159 bp). This figure was generated using the PipMaker software [20].
Transposable elements found in the perennial ryegrass mitochondrial genome
| Transposable element | | | | |
| 1.DNA transposon | ||||
| -EnSpm | 3 | 220 | 3 | 324 |
| -Helitron | 4 | 329 | - | - |
| -MuDR | 2 | 118 | - | - |
| -hAT | 1 | 253 | - | - |
| 2.LTR retrotransposon | ||||
| -Copia | 20 | 4,939 | 3 | 153 |
| -Gypsy | 36 | 11,819 | 14 | 2,413 |
| 3.Non-LTR retrotransposon | ||||
| -L1 | 27 | 4,662 | 2 | 118 |
| Total | ||||
aReference DNA; Bold face, sum of fragments and lengths.
Expression profile of the protein-coding genes in the perennial ryegrass mitochondrial genome
| 2,392 | 0.975 | 2,453 | |
| 4,573 | 1.464 | 3,124 | |
| 2,816 | 0.354 | 7,955 | |
| 5,967 | 1.485 | 4,018 | |
| 1,391 | 0.312 | 4,458 | |
| 8,251 | 2.010 | 4,105 | |
| 2,426 | 0.942 | 2,575 | |
| 2,766 | 1.182 | 2,340 | |
| 2,766 | 1.182 | 2,340 | |
| 6,974 | 0.570 | 12,236 | |
| 12,226 | 1.191 | 10,265 | |
| 17,574 | 1.986 | 8,849 | |
| 6,083 | 0.780 | 7,799 | |
| 9,713 | 0.840 | 11,564 | |
| 15,214 | 1.527 | 9,963 | |
| 2,388 | 0.585 | 4,082 | |
| 4,633 | 0.711 | 6,516 | |
| 3,454 | 0.465 | 7,429 | |
| 5,361 | 0.222 | 24,150 | |
| 235 | 0.648 | 363 | |
| 1,327 | 0.720 | 1,843 | |
| 1,702 | 1.431 | 1,190 | |
| 3,739 | 1.767 | 2,116 | |
| 1,645 | 0.633 | 2,599 | |
| 1,635 | 1.461 | 1,119 | |
| 9,172 | 1.647 | 5,569 | |
| 503 | 1.032 | 487 | |
| 424 | 1.029 | 412 | |
| 205 | 0.444 | 461 | |
| 205 | 0.444 | 461 | |
| 5,072 | 0.375 | 13,526 | |
| 388 | 0.348 | 1,115 | |
| 7 | 0.156 | 42 | |
| 7 | 0.156 | 42 | |
| 52 | 0.549 | 96 | |
| 52 | 0.549 | 96 | |
| 4,963 | 0.555 | 8,942 | |
| 416 | 1.974 | 211 | |
| 698 | 0.813 | 859 | |
| Total | 149,414 | 35.514 |
Gene name, RNA-seq read counts, protein-coding gene length in kbp, and the relative expression.
Copy number of mitochondrial genes that differ in perennial ryegrass, bamboo, wheat, rice and maize
| 1 | 1 | 1 | 2 | 2 | |
| 1 | 1 | 1 | 2 | 1 | |
| 1 | 1 | 2 | 1 | 1 | |
| 1 | 1 | 2 | 1 | 1 | |
| 1 | 1 | 1 | 2 | 1 | |
| 1 | 1 | 1 | 2 | 2 | |
| 1 | 1 | 1 | 2 | 1 | |
| 1 | 1 | 1 | 2 | 2 | |
| 1 | 1 | 1 | 3 | 1 | |
| 1 | 1 | 1 | 2 | 1 | |
| 2 | 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 2 | 1 | |
| 0 | 0 | 0 | 3 | 0 | |
| 2 | 1 | 1 | 2 | 0 | |
| 1 | 1 | 1 | 1 | 2 | |
| 1 | 1 | 1 | 1 | 2 | |
| 2 | 1 | 1 | 1 | 1 | |
| 2 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 1 | 0 | |
| 2 | 1 | 3 | 2 | 1 | |
| 2 | 1 | 3 | 2 | 1 | |
| 2 | 1 | 2 | 2 | 1 | |
| 2 | 0 | 0 | 0 | 0 | |
| 3 | 1 | 2 | 1 | 2 | |
| 1 | 1 | 1 | 1 | 2 | |
| 1 | 1 | 3 | 1 | 1 | |
| 0 | 1 | 1 | 1 | 2 | |
| 2 | 1 | 3 | 1 | 1 | |
| 2 | 0 | 1 | 1 | 0 | |
| 1 | 0 | 0 | 1 | 1 | |
| 3 | 1 | 2 | 1 | 2 | |
| 4 | 1 | 3 | 1 | 1 | |
| 2 | 0 | 0 | 0 | 0 |
Gene fragments, pseudogenes and chloroplast-derived genes are excluded. Genes with the same number of copies in all five species are not included. aFrom bamboo (Acc. No. JQ235166 to JQ235179) [22], rice (Acc. No. BA000029) [21], maize (Acc. No. AY506529) [19], and wheat (Acc. No. AP008982) [27]. Lower case letter indicates exons of this gene.
Figure 3Amino acid sequence alignment of the perennial ryegrass 4 and 4-p genes with the 4 gene of maize, sorghum, rice, wheat and bamboo. Amino acid sequence of the rps4 gene of maize (Acc. No. YP_588274.1), sorghum (Acc. No. YP_762343.1), rice (Acc. No. YP_514660.1), wheat (Acc. No. ADE08097.1) and bamboo (Acc. No. AEK66732.1). Color code: white, conserved; green, identical; cyan, similar and yellow, different residues. This alignment was generated using the SDSC Biology WorkBench [42].