| Literature DB >> 21149391 |
Shusei Sato1, Hideki Hirakawa, Sachiko Isobe, Eigo Fukai, Akiko Watanabe, Midori Kato, Kumiko Kawashima, Chiharu Minami, Akiko Muraki, Naomi Nakazaki, Chika Takahashi, Shinobu Nakayama, Yoshie Kishida, Mitsuyo Kohara, Manabu Yamada, Hisano Tsuruoka, Shigemi Sasamoto, Satoshi Tabata, Tomoyuki Aizu, Atsushi Toyoda, Tadasu Shin-i, Yohei Minakuchi, Yuji Kohara, Asao Fujiyama, Suguru Tsuchimoto, Shin'ichiro Kajiyama, Eri Makigano, Nobuko Ohmido, Nakako Shibagaki, Joyce A Cartagena, Naoki Wada, Tsutomu Kohinata, Alipour Atefeh, Shota Yuasa, Sachihiro Matsunaga, Kiichi Fukui.
Abstract
The whole genome of Jatropha curcas was sequenced, using a combination of the conventional Sanger method and new-generation multiplex sequencing methods. Total length of the non-redundant sequences thus obtained was 285 858 490 bp consisting of 120 586 contigs and 29 831 singlets. They accounted for ~95% of the gene-containing regions with the average G + C content was 34.3%. A total of 40 929 complete and partial structures of protein encoding genes have been deduced. Comparison with genes of other plant species indicated that 1529 (4%) of the putative protein-encoding genes are specific to the Euphorbiaceae family. A high degree of microsynteny was observed with the genome of castor bean and, to a lesser extent, with those of soybean and Arabidopsis thaliana. In parallel with genome sequencing, cDNAs derived from leaf and callus tissues were subjected to pyrosequencing, and a total of 21 225 unigene data have been generated. Polymorphism analysis using microsatellite markers developed from the genomic sequence data obtained was performed with 12 J. curcas lines collected from various parts of the world to estimate their genetic diversity. The genomic sequence and accompanying information presented here are expected to serve as valuable resources for the acceleration of fundamental and applied research with J. curcas, especially in the fields of environment-related research such as biofuel production. Further information on the genomic sequences and DNA markers is available at http://www.kazusa.or.jp/jatropha/.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21149391 PMCID: PMC3041505 DOI: 10.1093/dnares/dsq030
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1.The strategy and status of sequencing and assembly.
Assembly statistics
| Total length of contigs and singlets | 285 858 490 |
| Total number of contigs and singlets | 150 417 |
| Average length of contigs and singlets | 1900 |
| Maximum length of contigs and singlets | 29 744 |
| N50 | 3833 |
| G + C content (%) | 34.3 |
| Number of contigs (JcCA) | 32 212 |
| Number of contigs (JcCB) | 60 363 |
| Number of contigs (JcCC) | 2483 |
| Number of contigs (JcCD) | 25 528 |
| Number of singlets (JcSR) | 26 819 |
| Number of singlets (JcPR) | 1347 |
| Number of BAC end sequences (JHL/JHS/JMS) | 1665 |
| Contigs | |
| Total number of contigs | 120 586 |
| Total length of contigs | 276 710 623 |
| Average length of contigs | 2295 |
| Singlets | |
| Total number of singlets | 29 831 |
| Total length of singlets | 9 147 867 |
| Average length of singlets | 307 |
Repetitive sequences in the Jatropha genomic sequences
| Repeat type | Jatropha genomic sequences | ||
|---|---|---|---|
| Number of elements | Coverage (kb) | Percentage of sequence | |
| Class I | |||
| LINEs | 195 | 136.9 | 0.05 |
| LTR: Copia | 31 740 | 22 318.2 | 8.03 |
| LTR: Gypsy | 67 658 | 56 655.7 | 19.60 |
| LTR: other | 13 454 | 6436.6 | 2.23 |
| Total class I | 113 047 | 86 447.4 | 29.91 |
| Class II | |||
| Coding class II | 5709 | 4102.9 | 1.42 |
| MITE | 5980 | 1802.8 | 0.62 |
| Total class II | 11 689 | 5905.7 | 2.04 |
| Short tandem repeats | 2092 | 148.1 | 0.05 |
| Unclassified | 25 977 | 14 953.3 | 5.17 |
Figure 2.GO category classification. The percentages of number of genes classified into each GO slim category in J. curcas, R. communis, and A. thaliana are, respectively, shown in blue, red, and yellow bars. (A) GO terms; (B) biological process; (C) cellular component; and (D) molecular function.