| Literature DB >> 18053163 |
Tokihiko Nanjo1, Tetsuya Sakurai, Yasushi Totoki, Atsushi Toyoda, Mitsuru Nishiguchi, Tomoyuki Kado, Tomohiro Igasaki, Norihiro Futamura, Motoaki Seki, Yoshiyuki Sakaki, Kazuo Shinozaki, Kenji Shinohara.
Abstract
BACKGROUND: Populus is one of favorable model plants because of its small genome. Structural genomics of Populus has reached a breakpoint as nucleotides of the entire genome have been determined. Reaching the post genome era, functional genomics of Populus is getting more important for well-comprehended plant science. Development of bioresorce serving functional genomics is making rapid progress. Huge efforts have achieved deposits of expressed sequence tags (ESTs) in various plant species consequently accelerating functional analysis of genes. ESTs from full-length cDNA clones are especially powerful for accurate molecular annotation. We promoted collection and annotation of the ESTs from Populus full-length enriched cDNA clones as part of functional genomics of tree species.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18053163 PMCID: PMC2222646 DOI: 10.1186/1471-2164-8-448
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Distribution of the predicted insert size in the second version of the P. nigra full-length cDNA library (PnFL2). The fragment sizes of 17,273 P. trichocarpa CDS that were substituted for the P. nigra ESTs were determined.
Figure 2Flow chart of the functional annotation of PnFL cDNA clones. In total, 19,841 nonredundant PnFL clones were subjected to functional annotation. Parallelogrammatic elements with a left number indicate the results of each adjacent procedure (see Results and discussion). The annotation lists are summarized in aAdditional file 2, bAdditional file 3 and cTable 1.
Domain detection by InterProScan for characterizing of no-hit clones a
| Clone name b | Accession | Name | |
| PnFL1-083_I10 | IPR012336 | Thioredoxin-like fold | 0.0069 |
| PnFL2-001_C15 | IPR000480 | Glutelin | 5.40E-06 |
| PnFL2-003_H05 | IPR001810 | Cyclin-like F-box | 0.0014 |
| PnFL2-004_G12 | IPR007836 | Ribosomal protein L41 | 1.60E-08 |
| PnFL2-009_G19 | IPR006031 | XYPPX repeat | 19 |
| PnFL2-009_I06 | IPR003882 | Pistil-specific extensin-like protein | 1.60E-07 |
| PnFL2-010_B02 | IPR006031 | XYPPX repeat | 64 |
| PnFL2-010_P09 | IPR006121 | Heavy metal transport/detoxification protein | 1.00E-09 |
| PnFL2-013_P01 | IPR000772 | Ricin B lectin | 2.20E-23 |
| IPR008997 | Ricin B-related lectin | 1.40E-30 | |
| PnFL2-016_G15 | IPR010978 | tRNA-binding arm | 0.0046 |
| PnFL2-021_F12 | IPR000480 | Glutelin | 9.90E-05 |
| PnFL2-021_J01 | IPR000167 | Dehydrin | 0.00011 |
| PnFL2-026_J14 | IPR001627 | Semaphorin/CD100 antigen | 9.042 |
| PnFL2-028_L04 | IPR000480 | Glutelin | 9.90E-05 |
| PnFL2-032_H01 | IPR000048 | IQ calmodulin-binding region | 7.401 |
| PnFL2-034_F13 | IPR000480 | Glutelin | 6.50E-07 |
| IPR000976 | Wilm's tumour protein | 9.60E-05 | |
| IPR006706 | Extensin-like region | 1.20E-31 | |
| PnFL2-034_J01 | IPR010978 | tRNA-binding arm | 0.0046 |
| PnFL2-036_J14 | IPR001810 | Cyclin-like F-box | 5.50E-05 |
| PnFL2-046_A21 | IPR000772 | Ricin B lectin | 2.20E-23 |
| IPR008997 | Ricin B-related lectin | 1.40E-30 | |
| PnFL2-046_B04 | IPR000480 | Glutelin | 4.50E-07 |
| PnFL2-048_D17 | IPR006031 | XYPPX repeat | 64 |
| PnFL2-048_H02 | IPR000048 | IQ calmodulin-binding region | 7.401 |
| PnFL2-055_F11 | IPR003267 | Small proline-rich | 3.10E-05 |
| PnFL2-064_O17 | IPR000480 | Glutelin | 1.90E-05 |
| IPR003882 | Pistil-specific extensin-like protein | 5.00E-06 | |
| PnFL2-067_N14 | IPR009424 | Protein of unknown function DUF1070 | 1.10E-27 |
| PnFL2-075_P11 | IPR001810 | Cyclin-like F-box | 5.90E-05 |
| PnFL2-076_H24 | IPR006031 | XYPPX repeat | 19 |
| PnFL2-077_G19 | IPR001878 | Zinc finger, CCHC-type | 1.70E-06 |
| PnFL2-078_N14 | IPR001179 | Peptidylprolyl isomerase, FKBP-type | 0.00067 |
| PnFL2-079_I20 | IPR006077 | Vinculin/alpha-catenin | 2.40E-05 |
| PnFL2-087_M22 | IPR000048 | IQ calmodulin-binding region | 7.401 |
| PnFL2-090_P08 | IPR008011 | Complex 1 LYR protein | 3.70E-15 |
| PnFL2-098_J19 | IPR002885 | Pentatricopeptide repeat | 2.80E-08 |
| PnFL2-102_L04 | IPR000480 | Glutelin | 1.30E-06 |
| IPR003882 | Pistil-specific extensin-like protein | 1.90E-07 |
a Substituted P. trichocarpa CDS for PnFL ESTs were subjected to the InterProScan program. This table corresponds to 'annotation list c ' in Fig. 2.
b PnFL clones whose substitutive sequences have at least one hit are listed.
Figure 3Overview of the functional classification of the P. nigra ESTs. In total, 10,829 of the 19,841 nonredundant ESTs that comprised the 5' or 3' reads that yielded the lowest E value for each clone were assigned to eukaryotic clusters of KOGs. Designations of functional categories and the proportion of each: A, RNA processing and modification; B, chromatin structure and dynamics; C, energy production and conversion; D, cell cycle control and mitosis; E, amino acid transport and metabolism; F, nucleotide transport and metabolism; G, carbohydrate transport and metabolism; H, coenzyme transport and metabolism; I, lipid transport and metabolism; J, translation, ribosomal structure, and biogenesis; K, transcription; L, replication and repair; M, cell wall/membrane/envelope biogenesis; O, posttranslational modification, protein turnover, and chaperone functions; P, inorganic ion transport and metabolism; Q, secondary metabolite biosynthesis, transport, and catabolism; T, signal transduction; U, intracellular trafficking, secretion, and vesicular transport; V, defense mechanisms; W, extracellular structures; Y, nuclear structure; Z, cytoskeleton; R, general functional prediction only; and S, function unknown.
Figure 4Cumulative count of homologs of Arabidopsis and rice. All the CDS of both Arabidopsis and rice were compared with the PnFL ESTs by using the tblastx program. The curves depict the percentages of genes in the Arabidopsis (solid line) and the rice (broken line) genomes that have greater sequence similarity than the E value ascribed to a corresponding sequence in the PnFL ESTs. For instance, as shown in the inset, 50% of the genes have a hit with an E value of < 10-31 in Arabidopsis and of < 10-9 in rice.
Figure 5The putative physical distribution of PnFL clones in 19 P. trichocarpa chromosomes (top: north; bottom: south). Each red pixel shows a locus that corresponds to a PnFL clone. The number underneath each chromosome is the chromosome number and that above is the distribution index (see Methods).