| Literature DB >> 18687674 |
Tingting Lu1, Shuliang Yu, Danlin Fan, Jie Mu, Yingying Shangguan, Zixuan Wang, Yuzo Minobe, Zhixin Lin, Bin Han.
Abstract
A huge amount of cDNA and EST resources have been developed for cultivated rice species Oryza sativa; however, only few cDNA resources are available for wild rice species. In this study, we isolated and completely sequenced 1888 putative full-length cDNA (FLcDNA) clones from wild rice Oryza rufipogon Griff. W1943 for comparative analysis between wild and cultivated rice species. Two cDNA libraries were constructed from 3-week-old leaf samples under either normal or cold-treated conditions. Homology searching of these cDNA sequences revealed that >96.8% of the wild rice cDNAs were matched to the cultivated rice O. sativa ssp. japonica cv. Nipponbare genome sequence. However, <22% of them were fully matched to the cv. Nipponbare genome sequence. The comparative analysis showed that O. rufipogon W1943 had greater similarity to O. sativa ssp. japonica than to ssp. indica cultivars. In addition, 17 novel rice cDNAs were identified, and 41 putative tissue-specific expression genes were defined through searching the rice massively parallel signature-sequencing database. In conclusion, these FLcDNA clones are a resource for further function verification and could be broadly utilized in rice biological studies.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18687674 PMCID: PMC2575888 DOI: 10.1093/dnares/dsn018
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Mapping of the 1888 FLcDNAs onto Oryza sativa genomic sequences.
List of 15 Oryza rufipogon W1943 genes with specific alternative splicing patterns
| Accession Number | Length (bp) | Chromosome | Number of exon | Protein |
|---|---|---|---|---|
| CT841942 | 978 | 07 | 6 (1st intron: GC-AG) | |
| CU406810 | 958 | 06 | 6 (1st intron: GT-TG) | Dual-specificity phosphatase protein |
| CT841893 | 1011 | 01 | 6 | Drought-induced protein |
| CT841874 | 1369 | 01 | 4 | Vesicle transport protein |
| CU405853 | 1377 | 05 | 1 | Dehydration-responsive protein |
| CU405923 | 639 | 07 | 1 | IAA amidohydrolase |
| CU406279 | 648 | 05 | 1 | |
| CU406025 | 839 | 02 | 1 | |
| CT841561 | 740 | 06 | 2 | |
| CU406579 | 468 | 09 | 2 | |
| CU406935 | 1345 | 01 | 2 | |
| CU406600 | 1107 | 01 | 2 | |
| CU405570 | 952 | 01 | 2 | |
| CU406091 | 893 | 01 | 3 | |
| CU406134 | 665 | 10 | 3 |
Figure 2Total 17 W1943 cDNAs had alternative splicing patterns different from previous ESTs or mRNAs in public database. It revealed four typical splicing patterns in wild rice species.
List of 10 novel cDNA transcripts of Oryza rufipogon W1943
| Accession Number | Protein | Length (bp) | Chromosome | Identity (%) |
|---|---|---|---|---|
| CU405785 | — | 727 | 05 | 99 |
| CU406138 | — | 568 | 02 | 99 |
| CU406022 | — | 543 | 12 | 99 |
| CU405757 | — | 477 | 04 | 100 |
| CU406921 | — | 414 | 02 | 100 |
| CU406535 | — | 389 | 02 | 100 |
| CU406832 | — | 530 | 10 | 92 |
| CU406871 | — | 458 | 01 | 84 |
| CU861804 | — | 383 | 06 | 99 |
| CU861721 | — | 554 | 01 | 100 |
List of seven sense–antisense cDNA transcripts of Oryza rufipogon W1943
| Accession Number | Length (bp) | Protein | Location (chr) | Identity (%) | Antisense gene | Location (chr) | Protein |
|---|---|---|---|---|---|---|---|
| CU405785 | 727 | — | 05 | 99 | CA764081 | 01 | DNA-directed RNA polymerase 3 |
| CU861795 | 475 | — | 09 | 79 | CT858901 | unsure | Unknown |
| CU406355 | 837 | — | 12 | 97 | AK107125 | 12 | AP2 domain, putative |
| CU406396 | 520 | — | 02 | 99 | AK103485 | 02 | Hypothetical |
| CT841800 | 941 | — | 11 | 99 | AK121962 | 11 | Patatin, putative |
| CU861688 | 693 | — | 08 | 99 | AK109182 | 08 | Hypothetical |
| CT841937 | 1552 | — | 08 | 98 | AK106713 | 08 | Unknown |
List of 24 no-hit Oryza sativa ssp. japonica genome sequences
| Number | Accession Number | 93–11 location | ESTs or mRNA hits | Protein | |
|---|---|---|---|---|---|
| 1 | CT842002 | — | Contig005912 | AK241925.1 | — |
| 2 | CT842007 | — | Contig008507 | CT856206 | — |
| 3 | CU405940 | — | Contig001402 | AK103326 | Unknown protein |
| 4 | CU406172 | — | Contig014596 | AK242967.1 | — |
| 5 | CT842006 | — | Contig000383 | AK111647 | GTP-binding protein |
| 6 | CU861753 | — | Contig000750 | AK099287 | Ring-box protein |
| 7 | CU406308 | — | Contig000444 | AK070131 | Unknown protein |
| 8 | CT841996 | — | Contig002576 | CT834800 | Unknown protein |
| 9 | CU406568 | — | Contig003848 | AK064050 | Bowman Birk trypsin inhibitor |
| 10 | CU406582 | — | Contig000444 | AK107776 | Unknown protein |
| 11 | CU406596 | — | Contig001277 | AK242711.1 | Hypothetical protein |
| 12 | CT842008 | — | Contig008507 | CT856206 | Unknown protein |
| 13 | CU406895 | — | Contig003011 | CT859459 | Hypothetical protein |
| 14 | CU861744 | — | Contig000750 | AK099287 | Ring-box protein |
| 15 | CU405657 | — | — | CT856885 | — |
| 16 | CT841712 | — | — | CA766528 | — |
| 17 | CU405768 | — | — | CT836656 | 60S ribosomal protein L7A |
| 18 | CU405675 | — | — | CA756235 | 60S ribosomal protein L17 |
| 19 | CU406202 | — | — | NM_001063334 | Unknown |
| 20 | CU406924 | — | — | AC145809 | — |
| 21 | CU405898 | — | — | CN130755.1 ( | Ribulose-bisphosphate carboxylase |
| 22 | CU406778 | — | — | BE429292.1 ( | Hydrophobin |
| 23 | CU861677 | — | — | FF534517.1 ( | Hypothetical protein |
| 24 | CT841912 | — | — | EH277383.1 ( | Unknown protein |
Figure 3Chromosomal distributions of the three different rice cDNAs (W1943, KOME, NCGR) along the ssp. japonica cv. Nipponbare chromosomal pseudomolecule sequences. Though relative small quantities of W1943 cDNAs, it had about similar trace trends and no visible large bias comparing with KOME and NCGR (KOME, Oryza sativa ssp. japonica Nipponbare cDNAs; NCGR, Oryza sativa ssp. indica Guangluai 4 cDNAs.).
Figure 4The first five highest frequency SSR motifs in the overall cDNA sequences, 5′-UTR sequences, ORF sequences and 3′-UTR sequences, respectively.
Figure 5Comparative analysis with Oryza sativa cDNA sequences in public databases. (A) The relationships of ORFs among 823 W1943, KOME and NCGR co-cDNA groups at amino acid level. (B) The synonymous divergent (Ks) relationships of 194 ORF identical cDNA groups.
List of 4 miRNAs
| Accession Number | Gene length (bp) | Pre-miRNA length (bp) | Hit-miRNA | miRNA seq | Chromosome |
|---|---|---|---|---|---|
| CU406292 | 1416 | 262 (220–490) | osa-MIR159a | uuuggauugaagggagcucug | 01 |
| CU405943 | 1511 | 101 (160–280) | osa-MIR156j | ugacagaagagagugagcac | 06 |
| CU861819 | 561 | 80 (390–470) | osa-miR818e | aaucccuuauauuuugggacgg | 04 |
| CU861752 | 727 | 150 (325–475) | osa-miR446 | aucaauaugaaugugggaaau | 10 |
List of Oryza rufipogon W1943 tissue-specific genes (unit: tpm)
| Clone Acc. | Leaf | Root | NGS | NCA | NGD | NME | NPO | PFAM Acc. | Description | E-value |
|---|---|---|---|---|---|---|---|---|---|---|
| CU406902 | 44 199 | 0 | 101 | 0 | 19 | 0 | 0 | PF07207 | Lir1 | 4.8e–85 |
| CU405979 | 36 785 | 0 | 894 | 0 | 256 | 9 | 0 | |||
| CT841733 | 25 112 | 41 | 120 | 0 | 241 | 0 | 0 | PF00101 | RuBisCO_small | 2.5e–45 |
| CU405975 | 15 421 | 1278 | 0 | 0 | 650 | 223 | 0 | |||
| CT841994 | 9140 | 0 | 10 | 0 | 18 | 0 | 0 | |||
| CU406521 | 3504 | 6 | 0 | 0 | 0 | 0 | 0 | PF01070 | FMN_dh | 2.8e–31 |
| CU405996 | 3069 | 0 | 27 | 0 | 28 | 15 | 0 | PF00430 | ATP-synt_B | 3.4e–28 |
| CU405670 | 2653 | 0 | 11 | 5 | 21 | 4 | 23 | PF00085 | Thioredoxin | 7.8e–43 |
| CU406006 | 2337 | 0 | 0 | 0 | 0 | 0 | 10 | |||
| CU406668 | 2126 | 3 | 17 | 0 | 16 | 0 | 0 | |||
| CT841650 | 1997 | 0 | 0 | 0 | 0 | 0 | 0 | PF00112 | Peptidase_C1 | 6e–109 |
| CT841731 | 1942 | 0 | 0 | 0 | 12 | 0 | 0 | PF02507 | PSI_PsaF | 0 |
| CT841902 | 1486 | 0 | 24 | 0 | 31 | 0 | 0 | |||
| CU405952 | 1253 | 7 | 110 | 5 | 2 | 5 | 0 | |||
| CU406199 | 1235 | 0 | 16 | 0 | 0 | 0 | 0 | |||
| CU406624 | 1012 | 0 | 60 | 58 | 0 | 3 | 5 | PF05899 | DUF861 | 2.1e–37 |
| CU406431 | 0 | 189 | 0 | 0 | 0 | 18 | 17 | |||
| CU405706 | 1456 | 15 907 | 0 | 183 | 803 | 0 | 0 | PF01439 | Metallothio_2 | 2.7e–32 |
| CU406330 | 0 | 358 | 4 | 0 | 0 | 1 | 31 | |||
| CT841629 | 217 | 2721 | 157 | 36 | 80 | 25 | 86 | PF01124 | MAPEG | 3.1e–63 |
| CU406513 | 18 | 230 | 0 | 0 | 0 | 0 | 0 | PF01439 | Metallothio_2 | 1.6e–34 |
| CU406576 | 0 | 231 | 0 | 11 | 0 | 0 | 0 | |||
| CU406281 | 29 | 449 | 0 | 0 | 0 | 14 | 0 | |||
| CT841966 | 15 | 520 | 0 | 0 | 0 | 0 | 0 | PF00188 | SCP | 5.7e–55 |
| CU405942 | 0 | 185 | 0 | 5 | 0 | 0 | 0 | PF00967 | Barwin | 3e–84 |
| CU406520 | 5 | 1209 | 0 | 0 | 0 | 0 | 0 | |||
| CU406670 | 0 | 189 | 0 | 0 | 0 | 0 | 0 | PF00280 | Potato_inhibit | 1.4e–20 |
| CU406238 | 41 | 0 | 987 | 33 | 31 | 0 | 0 | PF04398 | DUF538 | 4.9e–41 |
| CT841875 | 16 | 0 | 0 | 162 | 3 | 3 | 15 | |||
| CT841950 | 119 | 135 | 76 | 3079 | 107 | 19 | 0 | |||
| CT841815 | 107 | 135 | 76 | 3087 | 107 | 19 | 0 | |||
| CU406940 | 59 | 68 | 19 | 31 | 1393 | 4 | 0 | PF02065 | Melibiase | 3.5e–13 |
| CU406598 | 565 | 0 | 606 | 757 | 16 965 | 0 | 0 | PF00234 | Tryp_alpha_amyl | 1.6e–31 |
| CU406533 | 7 | 0 | 14 | 30 | 4662 | 119 | 0 | PF00234 | Tryp_alpha_amyl | 5.5e–33 |
| CU406609 | 0 | 0 | 0 | 0 | 143 | 0 | 0 | |||
| CU406264 | 0 | 0 | 0 | 0 | 237 | 0 | 0 | |||
| CU405759 | 0 | 0 | 0 | 0 | 779 | 0 | 0 | |||
| CU406038 | 14 | 14 | 0 | 0 | 247 | 0 | 0 | |||
| CU405951 | 0 | 25 | 0 | 0 | 13 | 1347 | 0 | PF01439 | Metallothio_2 | 6.5e–22 |
| CU406698 | 13 | 0 | 0 | 0 | 0 | 0 | 289 | PF00481 | PP2C | 2.4e–14 |
| CU406351 | 103 | 4 | 36 | 66 | 48 | 42 | 3228 |
NGS, 3 days—Germinating seed; NCA, 35 days—Callus; NGD, 10 days—Germinating seedlings grown in dark; NME, 60 days—Crown vegetative meristematic tissue; NPO, mature pollen.
List of seven cDNAs preferentially expressed under cold-stress, drought-stress and salinity in leaf (unit: tpm)
| Clone Acc. | Normal leaf | NCL | NDL | NSL | PFAM Acc. | Description | E-value |
|---|---|---|---|---|---|---|---|
| CU406310 | 96 | 2872 | 3 | 255 | Null | Null | Null |
| CT841781 | 96 | 3089 | 3 | 257 | Null | Null | Null |
| CT841558 | 102 | 2404 | 2 | 365 | Null | Null | Null |
| CU406554 | 11 | 568 | 0 | 83 | Null | Null | Null |
| CT841576 | 303 | 0 | 3435 | 68 | PF00234 | Tryp_alpha_amyl | 4.6e–33 |
| CU406485 | 0 | 0 | 1477 | 0 | Null | Null | Null |
| CU405946 | 0 | 113 | 0 | 591 | PF00257 | Dehydrin | 2.2e–54 |
NCL, 14 days—Young leaves stressed in 4°C cold for 24 h; NDL, 14 days—Young leaves stressed in drought for 5 days; NSL, 14 days—Young leaves stressed in 250 mM NaCl for 24 h.