| Literature DB >> 21599934 |
Christian Clepet1, Tarek Joobeur, Yi Zheng, Delphine Jublot, Mingyun Huang, Veronica Truniger, Adnane Boualem, Maria Elena Hernandez-Gonzalez, Ramon Dolcet-Sanjuan, Vitaly Portnoy, Albert Mascarell-Creus, Ana I Caño-Delgado, Nurit Katzir, Abdelhafid Bendahmane, James J Giovannoni, Miguel A Aranda, Jordi Garcia-Mas, Zhangjun Fei.
Abstract
BACKGROUND: Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. RESULT: We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but longer than many other dicot plants. Codon usages of melon full-length transcripts were largely similar to those of Arabidopsis coding sequences.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21599934 PMCID: PMC3118787 DOI: 10.1186/1471-2164-12-252
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Description of melon cDNA libraries and summary of melon ESTs
| Library | Cultivar | Tissue | No. 5' sequences | No. 3' sequences | Total sequences |
|---|---|---|---|---|---|
| PFTP2 | PI161375 | mixture of fruits in four developmental stages | 12,673 | 547 | 13,220 |
| SFTP2 | Piel de Sapo T-111 | mixture of fruits in four developmental stages | 3,630 | 139 | 3,769 |
| VFTP2 | Vedrantais | mixture of fruits in four developmental stages | 3,621 | 149 | 3,770 |
| DFTP2 | Dulce | mixture of fruits in four developmental stages | 3,549 | 70 | 3,619 |
| VFLP2 | Vedrantais | mixture of flowers in three developmental stages | 19,261 | 638 | 19,899 |
| PFLP2 | PI161375 | mixture of flowers in three developmental stages | 3,548 | 62 | 3,610 |
| DFLP2 | Dulce | mixture of flowers in three developmental stages | 3,411 | 126 | 3,537 |
| SFLP2 | Piel de Sapo T-111 | mixture of flowers in three developmental stages | 320 | 0 | 320 |
| MNFG2 | Piel de Sapo T-111 | leaf infected by melon necrotic spot virus (MNSV) | 7,776 | 295 | 8,071 |
| MNRP2 | Piel de Sapo T-111 | root infected by melon necrotic spot virus (MNSV) | 7,953 | 297 | 8,250 |
| MNCP2 | Piel de Sapo T-111 | cotyledon infected by melon necrotic spot virus (MNSV) | 3,454 | 58 | 3,512 |
| 69,196 | 2,381 | 71,577 | |||
| CM-DEa | Dulce | callus | 5,485 | 0 | 5,485 |
| CM-PEa | PI161375 | callus | 5,527 | 0 | 5,527 |
| CM-TEa | Piel de Sapo | callus | 5,700 | 0 | 5,700 |
| CM-VEa | Vedrantais | callus | 5,467 | 0 | 5,467 |
| 22,179 | 0 | 22,179 | |||
| 91,375 | 2,381 | 93,756 | |||
Statistics of melon unigenes
| Singleton | Contig | Unigene | |
|---|---|---|---|
| 12,791 | 11,653 | 24,444 | |
| 598.7 | 972.0 | 776.7 | |
| 7,658,604 | 11,326,166 | 18,984,770 |
Figure 1Histogram of number of ESTs in each melon unigene.
Most abundant melon unigenes (>500 EST members)
| Unigene ID | No. of ESTs | GenBank nr hit description | E value |
|---|---|---|---|
| MU46026 | 2054 | ELP (EXTENSIN-LIKE PROTEIN); lipid binding | 1e-31 |
| MU45978 | 1120 | type I proteinase inhibitor-like protein | 9e-06 |
| MU46015 | 915 | acyl carrier protein | 1e-25 |
| MU45913 | 827 | type-2 metallothionein | 2e-19 |
| MU45877 | 819 | No hits found | |
| MU45416 | 786 | No hits found | |
| MU45994 | 785 | B12D-like protein | 3e-38 |
| MU46019 | 683 | Wound-induced proteinase inhibitor 1 | 1e-10 |
| MU45854 | 626 | No hits found | |
| MU45964 | 619 | lipid binding protein | 3e-16 |
| MU43757 | 618 | 60s acidic ribosomal protein | 4e-23 |
| MU47776 | 594 | histone cluster 2, H3c2-like | 2e-60 |
| MU45763 | 591 | type-2 metallothionein | 2e-19 |
| MU47828 | 580 | chloroplast photosystem II 10 kDa protein | 1e-52 |
| MU45654 | 568 | chlorophyll A/B binding protein | 7e-147 |
| MU45963 | 554 | histone H4 | 3e-38 |
| MU45282 | 514 | ascorbate peroxidase | 1e-120 |
| MU45991 | 509 | ubiquitin carrier-like protein | 4e-83 |
Figure 2Size distribution of cDNAs, CDS (A) and 5' and 3' UTRs (B) of melon full-length transcripts.
Figure 3Venn diagram of ortholog group distribution in melon, cucumber, Arabidopsis, grape, and rice. Numbers in individual sections indicate the numbers of ortholog groups
Figure 4Heatmap representation of expression profiles of melon tissue-specific genes.
Statistics of melon simple sequence repeats (SSRs)
| Unit size | Number of SSRs |
|---|---|
| di-nucleotide | 1657 |
| tri-nucleotide | 2157 |
| tetra-nucleotide | 124 |
| penta-nucleotide | 51 |
| hexa-nucleotide | 79 |
Statistics of melon single nucleotide polymorphisms (SNPs)
| SNP | No. SNPs | type | total |
|---|---|---|---|
| A -> G | 864 | Transition | 1972 |
| C -> T | 1108 | ||
| A -> C | 255 | ||
| A -> T | 289 | transversion | 976 |
| C -> G | 210 | ||
| G -> T | 222 | ||
| T -> - | 40 | ||
| A -> - | 38 | ||
| G -> - | 23 | indel | 125 |
| C -> - | 24 | ||