| Literature DB >> 23167289 |
Arwa Shahin1, Martijn van Kaauwen, Danny Esselink, Joachim W Bargsten, Jaap M van Tuyl, Richard G F Visser, Paul Arens.
Abstract
BACKGROUND: Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23167289 PMCID: PMC3576253 DOI: 10.1186/1471-2164-13-640
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
General statistics of 454 sequencing and assembly for lily and tulip
| 139,480 | 104,323(75%) | 336 | 77,097(74%) | 27,226(26%) | 14,773 | 615 | |
| 326,539 | 221,597(68%) | 338 | 182,393(82%) | 39,204(18%) | 21,898 | 663 | |
| 374,240 | 255,081(68%) | 341 | 202,707(79.5%) | 52,374(20.5%) | 24,700 | 688 | |
| 442,476 | 299,655(69%) | 343 | 241,782(81%) | 57,873(19%) | 26,075 | 694 | |
| 1,282,735 | 880,656(69%) | 340 | 471,378(53.5%) | 409,278(46.5%) | 52,172 | 555 | |
| 310,973 | 207,229(67%) | 389 | 158,007(76%) | 49,222(24%) | 17,646 | 625 | |
| 316,372 | 211,380(67%) | 386 | 165,282(78%) | 46,098(22%) | 17,007 | 632 | |
| 627,345 | 418,609(67%) | 388 | 293,043(70%) | 125,566(30%) | 24,713 | 629 | |
| 592,034 | 407,392(69%) | 281 | 303,558(74.5%) | 103,834(25.5%) | 38,716 | 559 | |
| 263,175 | 185,464(70%) | 283 | 125,293(67.6%) | 60,171(32%) | 24,557 | 517 | |
| 221,334 | 149,768(67%) | 278 | 109,309(34%) | 40,459(27%) | 14,325 | 522 | |
| 1,076,543 | 742,624(69%) | 281 | 536,776 (74%) | 205,848(28%) | 54,575 | 557 | |
| 1,703,888 | 1,378,898 | 314 | 827,772(60%) | 551,126(40%) | 81,791 | 514 |
Transcriptome coverage of lily and tulip genomes
| 10 | 11.2 | 21.2 | 25.8 | |
| 14.5 | 13.2 | 27.7 | 33.8 | |
| 17 | 17.9 | 34.9 | 42.6 | |
| 18 | 20 | 38 | 46.3 | |
| 11 | 19.5 | 30.5 | 37.2 | |
| 10.8 | 18 | 28.8 | 35 | |
| 16 | 50.7 | 66.7 | 81.3 | |
| 21.6 | 30 | 51.6 | 63 | |
| 12.7 | 17 | 29.7 | 36 | |
| 7.5 | 11 | 18.5 | 22.6 | |
| 30.4 | 60 | 90.4 | 110 |
The estimated percentage of transcriptome coverage for each genotype was calculated based on the number of genes and average gene size in rice.
SNP markers identification for lily and tulip
| 14,773 | 4,309 (29%) | 406 (9.4%) | 1,171 (27%) | |
| 21,898 | 9,261 (42%) | 558 (6%) | 1,292 (14%) | |
| 24,700 | 10,024 (41%) | 730 (7%) | 2,026 (20%) | |
| 26,075 | 11,298 (43%) | 607 (5%) | 2,075 (18%) | |
| 17,646 | 7,456 (42%) | 722 (10%) | 2,371 (32%) | |
| 17,007 | 7,587 (45%) | 690 (10%) | 2,510 (33%) | |
| 24,713 | 11,787 (48%) | 1,002 (8.5%) | 3,265 (28%) | |
| 38,716 | 13,832 (36%) | 595 (4.3%) | 1,646 (12%) | |
| 24,557 | 6,347 (26%) | 310 (5%) | 776 (12%) | |
| 14,325 | 4,476 (31%) | 223 (5%) | 535 (12%) | |
| 54,575 | 20,661 (38%) | 822 (4%) | 2,033 (10%) |
* Percentages of contigs that contain at least one SNP calculated according to the total number of contigs.
** Percentage of SNP markers from total nr. of contigs that contain at least one SNP.
Number of contigs that contain at least one SNP was calculated. SNP markers with 50 bps flanking sequences free of secondary SNP, and with one secondary SNP allowance in the flanking sequences were identified.
SSR repeat description in lily and tulip
| 14,773 | 271 (1.9%) | 85 (31%) | 161 (59%) | 4 | 6 | 15 | |
| 21,898 | 603 (2.8%) | 216 (36%) | 301 (50%) | 51 | 12 | 23 | |
| 24,700 | 735 (3%) | 299 (41%) | 330 (45%) | 66 | 13 | 27 | |
| 26,075 | 745 (2.8%) | 312 (42%) | 341 (46%) | 50 | 17 | 25 | |
| 17,646 | 696 (3.9%) | 168 (24%) | 449 (65%) | 30 | 9 | 40 | |
| 17,007 | 683 (4%) | 146 (21%) | 468 (69%) | 28 | 11 | 30 | |
| 24,713 | 957 (3.9%) | 216 (23%) | 642 (67%) | 45 | 15 | 39 | |
| 38,716 | 881 (2.3%) | 262 (30%) | 491 (56%) | 58 | 19 | 51 | |
| 24,557 | 521 (2%) | 140 (27%) | 317 (61%) | 33 | 12 | 19 | |
| 14,325 | 302 (2%) | 80 (28%) | 184 (64%) | 9 | 11 | 18 | |
| 54,575 | 1,302 (2.9%) | 393 (30%) | 719 (55%) | 95 | 35 | 60 |
* Percentage of SSR repeats calculated according to the total number of contigs.
** Percentage of SSR repeats from total nr. of SSRs.
Location of di- and tri-nucleotide repeats in lily and tulip contigs presented as a percentage of the total number of identified SSR in each cultivar in coding and UTR regions identified using ORF-Predictor software
| | ||||
|---|---|---|---|---|
| 35 | 65 | 50 | 50 | |
| 28 | 72 | 44 | 56 | |
| 32 | 68 | 58 | 42 | |
| 33 | 67 | 58 | 42 | |
| 32 | 68 | 57 | 43 | |
| 26 | 74 | 57 | 43 | |
| 32 | 68 | 48 | 52 | |
| 35 | 65 | 50 | 50 | |
| 43 | 57 | 52 | 48 | |
Figure 1Venn diagram of the distribution of orthologous groups in lily, tulip and rice, calculated with OrthoMCL. Overlapping regions denote groups with at least two proteins of all species that are part of the intersection. All circles and overlapping areas are scaled to the number of groups in the respective region.
Figure 2Representation of transcriptome ontology assignments for Lily-All and Tulip-All assemblies of 454 sequencing data. GO terms: A, molecular function; B, biological process; C, cellular compound.
Figure 3Representation of transcriptome ontology assignments (GO term) for the orthologous sequences between lily and tulip from 454 sequencing data. A, molecular function, B, biological process and C, cellular compound.
Figure 4Common SNP and SSR markers between lily and tulip. Common markers identified among the parents of the lily and tulip mapping populations developed based on orthologous sequences of each genotype combination. Common markers represent orthologous genetic loci with a polymorphism. Actual polymorphisms may differ between the parents.