| Literature DB >> 24116150 |
Ramesh Buyyarapu1, Ramesh V Kantety, John Z Yu, Zhanyou Xu, Russell J Kohel, Richard G Percy, Simone Macmil, Graham B Wiley, Bruce A Roe, Govind C Sharma.
Abstract
Although new and emerging next-generation sequencing (NGS) technologies have reduced sequencing costs significantly, much work remains to implement them for de novo sequencing of complex and highly repetitive genomes such as the tetraploid genome of Upland cotton (Gossypium hirsutum L.). Herein we report the results from implementing a novel, hybrid Sanger/454-based BAC-pool sequencing strategy using minimum tiling path (MTP) BACs from Ctg-3301 and Ctg-465, two large genomic segments in A12 and D12 homoeologous chromosomes (Ctg). To enable generation of longer contig sequences in assembly, we implemented a hybrid assembly method to process ~35x data from 454 technology and 2.8-3x data from Sanger method. Hybrid assemblies offered higher sequence coverage and better sequence assemblies. Homology studies revealed the presence of retrotransposon regions like Copia and Gypsy elements in these contigs and also helped in identifying new genomic SSRs. Unigenes were anchored to the sequences in Ctg-3301 and Ctg-465 to support the physical map. Gene density, gene structure and protein sequence information derived from protein prediction programs were used to obtain the functional annotation of these genes. Comparative analysis of both contigs with Arabidopsis genome exhibited synteny and microcollinearity with a conserved gene order in both genomes. This study provides insight about use of MTP-based BAC-pool sequencing approach for sequencing complex polyploid genomes with limited constraints in generating better sequence assemblies to build reference scaffold sequences. Combining the utilities of MTP-based BAC-pool sequencing with current longer and short read NGS technologies in multiplexed format would provide a new direction to cost-effectively and precisely sequence complex plant genomes.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24116150 PMCID: PMC3792896 DOI: 10.1371/journal.pone.0076757
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of 454, ABI-Sanger and hybrid assemblies for Ctg-465 and Ctg-3301 in cotton.
| ABI-Sanger Assembly | 454 Assembly | Sanger-454 Hybrid Pool Assembly | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ctg/BAC-Pool | BAC Clone | Estimated TM-1 insert size in Kb | GenBank ID | No. of Contigs | Assembly Length | GenBank ID | No. of Contigs | Assembly Length | GenBank ID | No. of Contigs | Assembly Length |
| Pool-4 | CBV101N18 | 125 | AC245509 | 16 | 109,312 | - | 158 | 208,437 | SRR346495 | 87 | 203,770 |
| CBV103A21 | 130 | AC245640 | 13 | 126,266 | |||||||
| CBV056O13 | 85 | AC245649 | 18 | 164,636 | |||||||
| CBV102M21 | 90 | AC245508 | 23 | 133,998 | |||||||
| Pool-5 | CBV103L18 | 120 | AC245652 | 46 | 114,868 | - | 173 | 130,029 | SRR346497 | 100 | 189,133 |
| CBV123K05 | 150 | - | 30 | 31,469 | |||||||
| CBV198M14 | 90 | AC245641 | 16 | 112,554 | |||||||
|
|
|
|
|
|
|
|
| ||||
| Pool-1 | CBV133F04 | 97 | AC245642 | 26 | 221,760 | - | 55 | 227,629 | SRR346488 | 54 | 241,782 |
| CBV064M05 | 100 | AC245654 | 18 | 148,473 | |||||||
| CBV165G03 | 100 | AC245655 | 16 | 174,508 | |||||||
| CBV103I14 | 105 | AC245648 | 29 | 150,308 | |||||||
| Pool-2 | CBV004P17 | 110 | AC245643 | 15 | 138,762 | - | 139 | 252,198 | SRR346491 | 70 | 275,267 |
| CBV155K01 | 160 | AC245653 | 0 | 0 | |||||||
| CBV015C19 | 110 | AC245645 | 12 | 131,490 | |||||||
| CBV188K21 | 90 | AC245644 | 15 | 139,937 | |||||||
| Pool-3 | CBV006G16 | 120 | AC245651 | 34 | 108,719 | - | 307 | 194,195 | SRR346493 | 79 | 218,525 |
| CBV031N01 | 110 | AC245647 | 0 | 0 | |||||||
| CBV076H08 | 160 | AC245650 | 23 | 135,599 | |||||||
| CBV176F03 | 120 | AC245646 | 30 | 173,886 | |||||||
|
|
|
|
|
|
|
|
| ||||
Figure 1A schematic diagram representing MTP based BAC-pool sequencing for complex genomes.
454 and Sanger hybrid sequencing approach was used to derive high quality sequence assemblies in allotetraploid cotton.
Figure 2Diagrammatic representation of BAC-pools and their homology across the pools and respective BAC clone ABI assemblies in Ctg-3301 and Ctg-465.
Figure 3Comparative genomic analysis of Ctg-3301 (Chromosome A12) and Ctg-465 (Chromosome D12) of Upland cotton with Arabidopsis genome.