| Literature DB >> 24647560 |
Jungeun Lee1, Yoonjee Kang1, Seung Chul Shin1, Hyun Park1, Hyoungseok Lee1.
Abstract
BACKGROUND: Antarctic hairgrass (Deschampsia antarctica Desv.) is the only natural grass species in the maritime Antarctic. It has been researched as an important ecological marker and as an extremophile plant for studies on stress tolerance. Despite its importance, little genomic information is available for D. antarctica. Here, we report the complete chloroplast genome, transcriptome profiles of the coding/noncoding genes, and the posttranscriptional processing by RNA editing in the chloroplast system.Entities:
Mesh:
Year: 2014 PMID: 24647560 PMCID: PMC3960257 DOI: 10.1371/journal.pone.0092501
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Map of the chloroplast genome of Deschampsia antarctica.
Genes lying outside of the outer circle are transcribed clockwise, while those inside the circle are transcribed counterclockwise. Genes belonging to different functional groups are color coded. The innermost darker gray corresponds to GC, while the lighter gray corresponds to AT content. IR, inverted repeat; LSC, large single copy region; SSC, small single copy region.
Genes present in the Deschampsia antarctica chloroplast genome.
| Products | Genes | |
| 1 | Photosystem I |
|
| 2 | Photosystem II |
|
| 3 | Cytochrome b6/f |
|
| 4 | ATP synthase |
|
| 5 | Rubisco |
|
| 6 | NADH oxidoreductase |
|
| 7 | Large subunit ribosomal proteins |
|
| 8 | Small subunit ribosomal proteins |
|
| 9 | RNAP |
|
| 10 | Other proteins |
|
| 11 | Proteins of unknown function |
|
| 12 | Ribosomal |
|
| 13 | Transfer RNAs |
|
Gene containing two introns.
Gene containing a single intron.
Two gene copies in the IRs.
Gene divided into two independent transcription units.
Pseudogene.
Genes containing introns in the Deschampsia antarctica chloroplast genome and the length of the exons and introns.
| Gene | Location | Length (bp) | ||||
| Exon I | Intron I | Exon II | Intron II | Exon III | ||
|
| LSC | 40 | 830 | 209 | ||
|
| LSC | 159 | 802 | 408 | ||
|
| LSC | 126 | 749 | 228 | 728 | 159 |
|
| LSC | 6 | 760 | 642 | ||
|
| LSC | 9 | 686 | 525 | ||
|
| LSC | 9 | 893 | 402 | ||
|
| LSC | 117 | - | 231 | ||
|
| IR | 393 | 660 | 432 | ||
|
| IR | 777 | 712 | 756 | ||
|
| SSC | 549 | 1012 | 540 | ||
|
| LSC | 38 | 2486 | 33 | ||
|
| LSC | 37 | 537 | 50 | ||
|
| LSC | 39 | 605 | 37 | ||
|
| IR | 42 | 801 | 35 | ||
|
| IR | 38 | 811 | 35 | ||
*rps12 is trans-spliced gene with 59 end exon located in the LSC region and the duplicated 39 end exon located in IR regions.
The codon–anticodon recognition pattern and codon usage in the Deschampsia antarctica chloroplast genome.
| Amino acid | Codon | No. | tRNA | Amino acid | Codon | No. | tRNA |
|
| UUU | 790 |
| UAU | 599 | ||
|
| UUC | 448 | trnF-GAA |
| UAC | 211 | trnY-GUA |
|
| UUA | 790 | trnL-UAA |
| UAA | 48 | |
|
| UUG | 445 | trnL-CAA |
| UAG | 20 | |
|
| CUU | 492 |
| CAU | 371 | ||
|
| CUC | 226 |
| CAC | 164 | trnH-GUG | |
|
| CUA | 363 | trnL-UAG |
| CAA | 572 | trnQ-GUU |
|
| CUG | 150 |
| CAG | 235 | ||
|
| AUU | 874 |
| AAU | 647 | ||
|
| AUC | 379 | trnI-GAU |
| AAC | 274 | trnN-GUU |
|
| AUA | 562 | trnI-CAU |
| AAA | 865 | trnK-UUU |
|
| AUG | 522 | trn(f)M-CAU |
| AAG | 367 | |
|
| GUU | 473 |
| GAU | 619 | ||
|
| GUC | 182 | trnV-GAC |
| GAC | 209 | trnD-GUC |
|
| GUA | 505 | trnV-UAC |
| GAA | 807 | trnE-UUC |
|
| GUG | 196 |
| GAG | 372 | ||
|
| UCU | 458 |
| UGU | 215 | ||
|
| UCC | 328 | trnS-GGA |
| UGC | 106 | trnC-GCA |
|
| UCA | 296 | trnS-UGA |
| UGA | 17 | |
|
| UCG | 161 |
| UGG | 430 | trnW-CCA | |
|
| CCU | 375 |
| CGU | 312 | trnR-ACG | |
|
| CCC | 243 |
| CGC | 152 | ||
|
| CCA | 291 | trnP-UGG |
| CGA | 311 | |
|
| CCG | 151 |
| CGG | 152 | ||
|
| ACU | 507 |
| AGA | 436 | trnR-UCU | |
|
| ACC | 236 | trnT-GGU |
| AGG | 221 | |
|
| ACA | 331 | trnT-UGU |
| AGU | 349 | |
|
| ACG | 173 |
| AGC | 176 | trnS-GCU | |
|
| GCU | 593 |
| GGU | 539 | ||
|
| GCC | 242 |
| GGC | 225 | trnG-GCC | |
|
| GCA | 413 | trnA-UGC |
| GGA | 653 | trnG-UCC |
|
| GCG | 202 |
| GGG | 359 |
*Numerals indicate the frequency of usage of each codon in 23430 in codons in 81 potential protein-coding genes.
Figure 2Sequence alignment of eight Poaceae chloroplast genomes.
The top line shows genes in order (transcriptional direction indicated by arrows). The sequence similarity of the aligned regions between Deschampsia antarctica and the other seven species is shown as horizontal bars indicating the average percent identity between 50% and 100% (shown on the y-axis of the graph). The x-axis represents the coordinate in the chloroplast genome. Genome regions are color coded as protein-coding (exon), tRNA or rRNA, and conserved noncoding sequences (CNS).
Figure 3Comparison of the rbcL-psaI region among eight Poaceae species.
The genes and intergenic regions between rbcL and psaI are indicated by boxes, with the length presented in bp. (Lp: Lolium perenne, Fa: Festuca arundinacea, As: Agrostis stolonifera, Hv: Hordeum vulgare, Ta: Triticum aestivum, Bd: Brachypodium distachyon, Os: Oryza sativa subsp. japonica).
Figure 4Maximum parsimony analysis of nine Poaceae species based on the whole plastome sequence.
The plastome sequences of Oryza sativa and Bambusa oldhamii were included as outgroup species. The phylogenetic tree was drawn using MEGA5, and bootstrap support was achieved using 1,000 replicates.
Figure 5Repeat analysis in the Deschampsia antarctica chloroplast genome.
Repeat sequences are compared among eight chloroplast genomes in the Poaceae family. To identify repeat sequences, the REPuter program was used. Repeats with length >20 bp and sequence identity e-value <10−3 were selected and categorized to four types based on their orientations (F: forward, P: palindromic, R: reverse).
RNA Expression of protein coding genes in the Deschampsia antarctica chloroplast genome.
| locus ID | gene name | locus | FPKM | locus ID | gene name | locus | FPKM |
| DeanCp027 |
| 48846–49209 | 87311 | DeanCp032 |
| 56279–56411 | 326 |
| DeanCp037 |
| 60890–61013 | 19529 | DeanCp073 |
| 100781–101054 | 314 |
| DeanCp064 |
| 79951–80233 | 13440 | DeanCp076 |
| 104531–104714 | 309 |
| DeanCp043 |
| 64095–64224 | 10915 | DeanCp025 |
| 47535–48015 | 288 |
| DeanCp002 |
| 83–1145 | 10274 | DeanCp016 |
| 30197–30941 | 283 |
| DeanCp042 |
| 63234–63348 | 9557 | DeanCp074 |
| 101191–101431 | 254 |
| DeanCp051 |
| 70154–70286 | 7916 | DeanCp004 |
| 4488–5567 | 250 |
| DeanCp011 |
| 17020–17110 | 7370 | DeanCp080 |
| 109031–109337 | 202 |
| DeanCp018 |
| 32063–33432 | 6499 | DeanCp036 |
| 59093–60056 | 188 |
| DeanCp039 |
| 61282–61402 | 6371 | DeanCp065 |
| 80495–81980 | 185 |
| DeanCp038 |
| 61143–61260 | 4812 | DeanCp069 |
| 85606–87851 | 165 |
| DeanCp010 |
| 16638–16743 | 3750 | DeanCp070 |
| 88150–88621 | 159 |
| DeanCp006 |
| 7354–7465 | 3335 | DeanCp060 |
| 76697–77069 | 152 |
| DeanCp050 |
| 69989–70106 | 3089 | DeanCp023 |
| 41199–43189 | 152 |
| DeanCp040 |
| 61412–61664 | 3020 | DeanCp068 |
| 83879–84167 | 149 |
| DeanCp052 |
| 70389–70611 | 2998 | DeanCp045 |
| 65145–65658 | 144 |
| DeanCp020 |
| 35628–35940 | 2224 | DeanCp058 |
| 75691–76042 | 144 |
| DeanCp017 |
| 31349–31595 | 2138 | DeanCp046 |
| 65815–66175 | 144 |
| DeanCp005 |
| 6761–6947 | 1970 | DeanCp024 |
| 44159–44765 | 142 |
| DeanCp009 |
| 11675–11864 | 1948 | DeanCp048 |
| 67131–67782 | 141 |
| DeanCp030 |
| 53858–55292 | 1782 | DeanCp081 |
| 109549–110080 | 136 |
| DeanCp007 |
| 8635–9697 | 1333 | DeanCp034 |
| 57149–57707 | 134 |
| DeanCp033 |
| 56726–56837 | 1308 | DeanCp003 |
| 1685–3221 | 133 |
| DeanCp041 |
| 62963–63059 | 1282 | DeanCp061 |
| 77186–78490 | 132 |
| DeanCp049 |
| 68293–69820 | 1197 | DeanCp059 |
| 76143–76554 | 128 |
| DeanCp071 |
| 93397–93832 | 1096 | DeanCp063 |
| 79429–79873 | 113 |
| DeanCp079 |
| 108276–108522 | 1052 | DeanCp035 |
| 58162–58861 | 110 |
| DeanCp019 |
| 33523–35047 | 971 | DeanCp075 |
| 101464–103684 | 102 |
| DeanCp054 |
| 72341–73561 | 956 | DeanCp077 |
| 105547–106507 | 87 |
| DeanCp057 |
| 75472–75586 | 956 | DeanCp082 |
| 110198–110741 | 85 |
| DeanCp044 |
| 64666–64867 | 922 | DeanCp055 |
| 73770–74796 | 74 |
| DeanCp001 |
| 66870–89475 | 840 | DeanCp014 |
| 24536–28943 | 66 |
| DeanCp021 |
| 36086–38291 | 676 | DeanCp015 |
| 29236–29947 | 62 |
| DeanCp022 |
| 38316–40569 | 558 | DeanCp083 |
| 110838–112939 | 58 |
| DeanCp008 |
| 9644–11066 | 513 | DeanCp078 |
| 106654–108157 | 57 |
| DeanCp084 |
| 112940–114122 | 510 | DeanCp072 |
| 99622–100414 | 52 |
| DeanCp053 |
| 70745–72153 | 495 | DeanCp056 |
| 74860–75292 | 48 |
| DeanCp031 |
| 55577–55853 | 419 | DeanCp062 |
| 78636–79356 | 44 |
| DeanCp047 |
| 125837–126080 | 402 | DeanCp067 |
| 82674–83874 | 39 |
| DeanCp066 |
| 81998–82280 | 400 | DeanCp090 |
| 131435–132638 | 39 |
| DeanCp029 |
| 51509–53006 | 385 | DeanCp013 |
| 22302–24333 | 31 |
| DeanCp026 |
| 48118–48856 | 342 | DeanCp012 |
| 19034–22265 | 29 |
| DeanCp028 |
| 51099–51513 | 329 |
RNA editing sites in the Deschampsia antarctica chloroplast genome.
| Gene | length | location from start | codonchange | amino acid change | Nucleotidechange | Number of reads |
|
| 1536 | 1258 | CAU>UAU | His>Tyr | C>U | U;11 (28.9%), C; 27 (71.1%) |
|
| 3231 | 398 | CGC>CAC | Arg>His | G>A | A:4 (25%), G:10 (62.5%), U:1(6.3%), |
|
| 2031 | 603 | GAA-GAU | Glu>Asp | A>U | A:19 (74.1%), U:7 (25.9%) |
|
| 2031 | 612 | GCG-GCA | Ala->Ala | G>A | A:7 (24.1%), G:22 (75.9%) |
|
| 4407 | 650 | AUA>AGA | Ile>Arg | U>G | G:2 (20.0%. U:8 (80%) |
|
| 1524 | 334 | UUG>CUG | Leu>Leu | U>C | C:13 (43.3%), U:17(56.7%) |
|
| 1524 | 367 | AUA>GUA | Ile>Val | A>G | A:23 (76.7%), G:7(23.3%) |
|
| 1524 | 933 | GAA>GAC | Glu>Asp | A>C | A:41(54.7%), C:34(45.3%) |
|
| 1524 | 1148 | UCA>UUA | Ser>Leu | C>U | C:2(2.9%), U:66 (97.1%) |
|
| 513 | 44 | UCC>UUC | Ser>Phe | C>U | U:19 (100%) |
|
| 606 | 588 | UAU>UAA | Tyr>stop | U>A | A:55 (66.3%),U:28 (33.7%) |
|
| 606 | 580 | GUG>CUG | Val>Leu | G>C | G:31(36%), C:55 (64%) |
|
| 606 | 370 | AAU>GAU | Asn>Asp | A>G | G:3 (42.9%), A:4 (57.1%) |
|
| AAU>AAC | Asn>Asn | A>C | A:4 (30.8%), C: 9 (69.2%) | ||
|
| 480 | 480 | UGA>UGG | stop>Trp | A>G | A:4 (30.8%), G: 9 (64.3%) |
|
| 738 | 125 | CCA>CUA | Pro>Leu | C>U | C:2(9.5%), U:19 (90.5%) |
|
| 363 | 13 | CAC>UAC | His>Tyr | C>U | C:3(50%). U:3 (50%) |
|
| 117 | 111 | UUC>UUU | Phe>Phe | C>U | C:2 (15.4%), U: 10 (76.9%), G: 1(7.7%) |
|
| 96 | 56 | CCA>CUA | Pro>Leu | C>U | U:2 (100%) |
|
| 360 | 308 | UCA>UUA | Ser>Leu | C>U | C:5 (45.5%), U:6 (54.5%) |
|
| 5127 | 867 | AGC>AGU | Ser>Ser | C>U | C:25 (83.3%), U:5 (16.7%) |
|
| 648 | 611 | CCA>CUA | Pro>Leu | C>U | U:19 (100%) |
|
| 1026 | 527 | UCC>UUC | Ser>Phe | C>U | C:2 (18.2%), U:9 (81.8%) |
|
| 411 | 182 | UCA>UUA | Ser>Leu | C>U | C:1 (7.7%), U:12 (92.3%) |
|
| 411 | 250 | GGC>ACG | Gly>Ser | G>A | G:2 (33.3%), A:4 (66.7%) |
|
| 720 | 30 | UUC>UUU | Phe>Phe | C>U | C:6 (30%), U:14 (70%) |
|
| 1503 | 878 | UCA>UUA | Ser>Leu | C>U | C:1 (9.1%), U:10 (90.9%) |
|
| 531 | 347 | CCA>CUA | Pro>Leu | C>U | U: 29 (100%) |
|
| 1089 | 722 | GCA>GUA | Ala>Val | C>U | C: 9 (81.8%), U: 2 (18.2%) |
|
| 1089 | 474 | UCA>UUA | Ser>Leu | C>U | C: 2 (10.5%), U: 17 (89.5%) |
*indicates the number of reads with an alternate base and the number of reads with the same base as the reference.
Figure 6Distribution of plastid small RNAs in the Deschampsia antarctica chloroplast genome.
The reads from small RNA-seq were divided into two groups according to the length (20–24 nt and >30 nt) and aligned to the D. antarctica chloroplast genome with 100% identity. The distributions of reads were compared between the two groups. In total, 12,753,636 reads were distributed unevenly in the chloroplast genome with high density in the coding regions of psbA and rbcL, intergenic regions, and inverted repeat regions in which most of the rRNA genes exist. The 27 loci enriched with 20–24 nt RNAs are indicated in red, along with the number of reads. The y-axis shows the number of reads (from 0 to 1000).
Distribution of small RNAs in the chloroplast genome of Deschampsia antarctica.
| Loca-tion | start | end | F/R | core sequence | length | reads number | At | Os | Hv | ||
|
| 1229 | 1209 | R |
| 20 | 36 | + | ||||
|
| 4378 | 4356 | R |
| 23 | 819 | + | + | |||
|
| 8129 | 8150 | F |
| 21 | 1200 | |||||
|
| 12314 | 12333 | F |
| 20 | 1056 | |||||
|
| 14786 | 14806 | F |
| 21 | 148 | |||||
|
| 21313 | 21332 | F |
| 20 | 166 | |||||
|
| 29130 | 29147 | F |
| 18 | 20314 | + | ||||
|
| 31305 | 31325 | F |
| 21 | 34100 | + | + | + | ||
|
| 34274 | 34297 | F |
| 21 | 1558 | |||||
|
| 38291 | 38315 | R |
| 21 | 1224 | |||||
|
| 41108 | 41088 | R |
| 21 | 7428 | + | ||||
|
| 43251 | 43232 | R |
| 20 | 450 | + | + | |||
|
| 47285 | 47265 | R |
| 21 | 102512 | |||||
|
| 55406 | 55426 | F |
| 21 | 111 | + | + | |||
|
| 55424 | 55431 | F |
| 21 | 160 | + | + | |||
|
| 57004 | 57024 | F |
| 21 | 120 | |||||
|
| 59574 | 59553 | R |
| 22 | 230 | |||||
|
| 63758 | 63739 | R |
| 20 | 2740 | |||||
|
| 70702 | 70721 | F |
| 20 | 11965 | + | + | + | ||
|
| 73658 | 73677 | F |
| 20 | 130600 | + | + | |||
|
| 80081 | 80098 | F |
| 18 | 2770 | |||||
|
| 87859 (127454) | 87839 (127474) | R |
| 21 | 7196 | + | + | + | ||
|
| 87863 (127450) | 87843 (127470) | R |
| 21 | 5203 | + | + | + | ||
|
| 92921 (122393) | 92941 (122373) | F |
| 21 | 4056 | |||||
|
| 95093 (120221) | 95113(120201) | F |
| 21 | 982 | |||||
|
| 98927 (116387) | 98946 (116368) | F |
| 20 | 19903 | + | ||||
|
| 101690 | 101709 | F |
| 20 | 1149 | |||||
*F/R: Direction of transcripts (F: forward, R: reverse).
Figure 7Relative locations of small RNAs in the Deschampsia antarctica chloroplast genome.
a Relative locations of plastid small RNAs according to the gene structure; b examples of small RNAs located proximal to the 5′ ends of the coding genes; c examples of small RNAs located proximal to the 3′ end of the coding genes.
Figure 8Sequence conservation among orthologs of plastid small RNAs.
To determine if the identified sRNAs are evolutionarily conserved, Deschampsia antarctica sRNAs were compared with the plastid sRNAs identified in Arabidopsis, rice, or barley [28], [29]. The sequence aligments of sRNAs which have >90% sequence homology are shown. The multiple sequence alignments were performed with ClustalW2 algorithm (http://www.ebi.ac.uk/Tools/msa/clustalw2/) and visualized with Jalview program [41]. The consensus sequences between ortholog sRNAs were shown at the bottom of each alignment.