Michihiko Shimomura, Hiroyuki Kanamori, Setsuko Komatsu, Nobukazu Namiki, Yoshiyuki Mukai, Kanako Kurita, Kaori Kamatsuki, Hiroshi Ikawa, Ryoichi Yano, Masao Ishimoto, Akito Kaga, Yuichi Katayose.
Abstract
We elucidated the genome sequence of Glycine max cv. Enrei to provide a reference for the characterization of Japanese domestic soybean cultivars. The whole genome sequence, obtained using a next-generation sequencer, was mapped to the current reference genome assembly of G. max cv. Williams 82 produced by the Soybean Genome Sequencing Consortium in the USA. After sequencing and assembling the whole genome shotgun reads, we obtained a data set of about 928 Mb in total with 60,838 gene models. Phylogenetic analysis provided glimpses into the ancestral relationships of both cultivars and their divergence from the complex that includes the wild relatives of soybean. The gene models were analyzed in relation to traits associated with anthocyanin and flavonoid biosynthesis and to an overall profile of the proteome. The sequence data are available in DAIZUbase to provide a comprehensive informatics resource for comparative genomics of a wide range of soybean cultivars in Japan and a reference tool for the improvement of soybean cultivars worldwide.
Year: 2015 PMID: 26199933 PMCID: PMC4493290 DOI: 10.1155/2015/358127
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Genome assembly and annotation of G. max cv. Enrei.
| Reference mapping length | With gaps [bp] | Without gaps [bp] | Ratio [%] |
|---|---|---|---|
| Chromosome | 946,877,581 | 904,901,085 | 95.6 |
| Scaffold | 31,116,190 | 22,803,649 | 73.3 |
| Total | 977,993,771 | 927,704,734 | 94.9 |

| Gene models | Value |
|---|---|
| Number of gene models | 60,838 |
| Mean coding sequence length | 1,455.3 bp |
| Mean number of exons per gene | 4.5 |
| Mean exon length | 323.4 bp |
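
The Ratio column of the reference mapping rows is not defined in the record itself. The minimal Python sketch below assumes it is the gap-free length divided by the gapped length, expressed as a percentage. This reading is an inference from the numbers, not something the source states, and the names `mapping_lengths` and `gap_free_ratio` are illustrative only.

```python
# Base counts copied from the reference mapping rows of the table above.
mapping_lengths = {
    "Chromosome": {"with_gaps": 946_877_581, "without_gaps": 904_901_085},
    "Scaffold": {"with_gaps": 31_116_190, "without_gaps": 22_803_649},
}


def gap_free_ratio(with_gaps: int, without_gaps: int) -> float:
    """Gap-free bases as a percentage of the gapped length (assumed definition)."""
    return round(100.0 * without_gaps / with_gaps, 1)


# The "Total" row is the column-wise sum of the chromosome and scaffold rows.
total = {
    key: sum(row[key] for row in mapping_lengths.values())
    for key in ("with_gaps", "without_gaps")
}

ratios = {name: gap_free_ratio(**row) for name, row in mapping_lengths.items()}
ratios["Total"] = gap_free_ratio(**total)

for name, pct in ratios.items():
    print(f"{name}: {pct}%")
```

Under this assumption the computed percentages agree with the tabulated values, and the gap-free total of 927,704,734 bp corresponds to the "about 928 Mb" quoted in the abstract.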
Figure 1. Phylogenetic tree of G. max cv. Williams 82 (Gmw), G. max cv. Enrei (Gme), A. thaliana (Ath), A. lyrata (Aly), and M. truncatula (Mtr). The pink bar represents the 95% probability density. Mya denotes million years ago.
Figure 2. Enzymes involved in the major pathway for anthocyanin and flavonoid biosynthesis and the corresponding genes in Gmax275 and G. max cv. Enrei. PAL (phenylalanine ammonia-lyase), 4CL (4-coumaroyl-CoA-ligase), C4H (cinnamate-4-hydroxylase), CHS (chalcone synthase), CHI (chalcone isomerase), F3H (flavanone 3-hydroxylase), FLS (flavonol synthase), DFR (dihydroflavonol 4-reductase), and ANS (anthocyanidin synthase).
Figure 3. A region of soybean chromosome 8 showing the position of CHS gene clusters. The region spanning 8.3 to 8.5 Mb of chromosome 8 in Williams 82 Gmax275 (a) and in the Enrei cultivar (b) is characterized by CHS gene clusters. Most of the CHS genes correspond between the two cultivars, as indicated by the position and UniProt annotation of the identified genes. However, many CHS genes could not be localized in the Enrei cultivar because its sequence is fragmented.
Composition of storage proteins in G. max cv. Enrei.
| Chromosome | Number of related genes | Weight % (mass) | mol % |
|---|---|---|---|
| Chr10 | 6 | 19.8 | 14.27 |
| Chr20 | 4 | 15.7 | 14.58 |
| Chr03 | 1 | 6.6 | 4.31 |
| Chr13 | 1 | 4.6 | 3.06 |
| Chr19 | 1 | 2.8 | 1.88 |
| Chr04 | 1 | 2.4 | 1.99 |
| Chr02 | 1 | 2.1 | 1.36 |
| Chr11 | 1 | 1.2 | 0.85 |
| Chr01 | 1 | 0.1 | 0.07 |
| Total | 17 | 55.4 | 42.4 |