| Literature DB >> 23672264 |
Nicolas Sierro1, Jan van Oeveren, Michiel J T van Eijk, Florian Martin, Keith E Stormo, Manuel C Peitsch, Nikolai V Ivanov.
Abstract
Genomics-based breeding of economically important crops such as banana, coffee, cotton, potato, tobacco and wheat is often hampered by genome size, polyploidy and high repeat content. We adapted sequence-based whole-genome profiling (WGP™) technology to obtain insight into the polyploidy of the model plant Nicotiana tabacum (tobacco). N. tabacum is assumed to originate from a hybridization event between ancestors of Nicotiana sylvestris and Nicotiana tomentosiformis approximately 200,000 years ago. This resulted in tobacco having a haploid genome size of 4500 million base pairs, approximately four times larger than the related tomato (Solanum lycopersicum) and potato (Solanum tuberosum) genomes. In this study, a physical map containing 9750 contigs of bacterial artificial chromosomes (BACs) was constructed. The mean contig size was 462 kbp, and the calculated genome coverage equaled the estimated tobacco genome size. We used a method for determination of the ancestral origin of the genome by annotation of WGP sequence tags. This assignment agreed with the ancestral annotation available from the tobacco genetic map, and may be used to investigate the evolution of homoeologous genome segments after polyploidization. The map generated is an essential scaffold for the tobacco genome. We propose the combination of WGP physical mapping technology and tag profiling of ancestral lines as a generally applicable method to elucidate the ancestral origin of genome segments of polyploid species. The physical mapping of genes and their origins will enable application of biotechnology to polyploid plants aimed at accelerating and increasing the precision of breeding for abiotic and biotic stress resistance.Entities:
Keywords: Nicotiana tabacum; genome; next-generation sequencing; physical map; polyploidy; tobacco; whole-genome profiling
Mesh:
Year: 2013 PMID: 23672264 PMCID: PMC3824204 DOI: 10.1111/tpj.12247
Source DB: PubMed Journal: Plant J ISSN: 0960-7412 Impact factor: 6.417
Metrics for the physical map construction using a 51 nt tag length
| Number of BACs tested | 425 088 |
| Genome equivalents | 10.4 |
| Number of deconvolutable reads ( | 907.7 |
| Number of unique WGP tags | 1 239 733 |
| Number of tagged BACs (FPC-ready) | 361 034 |
| Percentage of tagged BACs (FPC-ready) | 85 |
| Mean number of tags per BAC | 32.1 |
| Number of contigs | 9750 |
| Number of BACs in contigs | 330 632 |
| Number of singletons | 30 402 |
| Mean number of BACs per contig | 34 |
| N50 BACs per contig | 60 |
| Mean contig size (Mbp) | 0.462 |
| N50 contig size (Mbp) | 0.689 |
| Genome coverage (Mbp) | 4508 |
| Percentage genome coverage | 100 |
Figure 1Log10 frequency of the weighted proportion of S tags. A value of 0 indicates a T origin (red bars) and a value of one indicates an S origin (blue bars), according to the enrichment P values (see Experimental procedures). The gray bars indicate counts of: (a) BACs. (b) WGP contigs of undefined origin, i.e. for which unequivocal assignment to S or T was not possible. Absolute counts are given above each bar.
Ancestral origin of the WGP tags, BACs and contigs
| S origin (%) | T origin (%) | Undefined origin (%) | Unknown origin (%) | |
|---|---|---|---|---|
| WGP tags | 494 973 (39.9) | 311 399 (25.1) | 0 (0.0) | 433 361 (35.0) |
| BACs | 219 143 (60.7) | 134 933 (37.4) | 1375 (0.4) | 5583 (1.5) |
| WGP contigs | 5236 (53.7) | 4465 (45.8) | 45 (0.5) | 4 (0.0) |
Number and proportion of BACs classified according to their ancestral origin compared to the ancestral origin of the WGP contigs to which they belong
| WGP contig | ||||
|---|---|---|---|---|
| BAC | S origin (%) | T origin (%) | Undefined origin (%) | Unknown origin (%) |
| S origin | 199 254 (98.0) | 3155 (1.6) | 861 (0.4) | 0 (0.0) |
| T origin | 6647 (5.3) | 117 989 (94.0) | 833 (0.7) | 0 (0.0) |
| Undefined origin | 491 (43.8) | 595 (53.1) | 35 (3.1) | 0 (0.0) |
| Unknown origin | 150 (19.4) | 603 (78.1) | 10 (1.3) | 9 (1.2) |
Comparison of determined ancestral origins of BACs and WGP contigs with the putative origin assigned to linkage group regions
| S origin (%) | T origin (%) | Undefined origin (%) | Unknown origin (%) | |
|---|---|---|---|---|
| S linkage group regions | ||||
| BACs | 331 (78.8) | 88 (21.0) | 1 (0.2) | 0 (0.0) |
| WGP contigs | 308 (79.0) | 81 (20.8) | 1 (0.3) | 0 (0.0) |
| T linkage group regions | ||||
| BACs | 118 (30.7) | 262 (68.2) | 3 (0.8) | 1 (0.3) |
| WGP contigs | 128 (35.8) | 227 (63.4) | 3 (0.8) | 0 (0.0) |
Figure 2Strategies to scaffold DNA sequences using the WGP physical map and to scaffold WGP contigs using DNA sequences.
(a) Scaffolding of DNA sequences using the WGP physical map. The orientation of the scaffolded DNA sequences is indicated by the mapping position of WGP tags from different bins of the same WGP contigs.
(b) Scaffolding of WGP contigs using DNA sequences. Tags from end bins of different WGP contigs that map to the same DNA sequence are used to link WGP contigs.