| Literature DB >> 28028039 |
Naoya Urasaki1, Hiroki Takagi2, Satoshi Natsume2, Aiko Uemura2, Naoki Taniai1, Norimichi Miyagi1, Mai Fukushima3, Shouta Suzuki3, Kazuhiko Tarora1, Moritoshi Tamaki1, Moriaki Sakamoto1, Ryohei Terauchi2, Hideo Matsumura3.
Abstract
Bitter gourd (Momordica charantia) is an important vegetable and medicinal plant in tropical and subtropical regions globally. In this study, the draft genome sequence of a monoecious bitter gourd inbred line, OHB3-1, was analyzed. Through Illumina sequencing and de novo assembly, scaffolds of 285.5 Mb in length were generated, corresponding to ∼84% of the estimated genome size of bitter gourd (339 Mb). In this draft genome sequence, 45,859 protein-coding gene loci were identified, and transposable elements accounted for 15.3% of the whole genome. According to synteny mapping and phylogenetic analysis of conserved genes, bitter gourd was more related to watermelon (Citrullus lanatus) than to cucumber (Cucumis sativus) or melon (C. melo). Using RAD-seq analysis, 1507 marker loci were genotyped in an F2 progeny of two bitter gourd lines, resulting in an improved linkage map, comprising 11 linkage groups. By anchoring RAD tag markers, 255 scaffolds were assigned to the linkage map. Comparative analysis of genome sequences and predicted genes determined that putative trypsin-inhibitor and ribosome-inactivating genes were distinctive in the bitter gourd genome. These genes could characterize the bitter gourd as a medicinal plant.Entities:
Keywords: Cucurbitaceae; Momordica charantia; bitter gourd; de novo sequencing; draft genome
Mesh:
Substances:
Year: 2017 PMID: 28028039 PMCID: PMC5381343 DOI: 10.1093/dnares/dsw047
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Summary of assembly results in OHB3-1 genome sequence
| Nuclear genome | |
| Scaffold number | 1,029 |
| Total length (bp) | 285,543,823 |
| N50 (bp) | 1,100,631 |
| Maximum length (bp) | 7,185,522 |
| GC content (%) | 36.4 |
| Mitochondrial genome | |
| Scaffold number | 1 |
| Total length (bp) | 312,781 |
| GC content (%) | 41.1 |
| Chloroplast genome | |
| Scaffold number | 3 |
| Total length (bp) | 140,659 |
| Maximum length (bp) | 131,815 |
| GC content (%) | 35.8 |
Summary of predicted genes in bitter gourd (OHB3-1) scaffold sequence
| Total predicted genes | Average length (aa) of encoded protein | Annotated genes | Putative transposable elements | ||
|---|---|---|---|---|---|
| BLAST (NR) | BLAST (UniProt) | InteProScan | |||
| 45,859 | 358 | 34,986 | 25,348 | 24,183 | 8,839 |
aPrediction of protein-coding genes and their translated sequences were conducted by FGENESH.
bEncoded amino acid sequences of the predicted genes were applied to BLASTP searching against non-redundant protein database in NCBI and UniProtKB/Swiss-Prot database or InterProscan analysis for conserved domain searching.
cTransposable elements in the predicted genes were surveyed by TranposonPSI.
Figure 1Phylogenic analysis of putative trypsin inhibitor (A) and ribosome inactivating protein (B) in bitter gourd. Based on 34 amino acid sequences of trypsin inhibitor proteins and 18 amino acid sequences of ribosome inactivating proteins of bitter gourd (Supplementary Table S12, S14, and S15), phylogenetic tree for each protein family was constructed using the Neighbor-Joining method by MEGA7.0.18. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) are shown next to the branches. The tree is drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances were computed using the p-distance method and are in the units of the number of amino acid differences per site.