| Literature DB >> 20128932 |
Shaun D Jackman1, Inanç Birol.
Abstract
Gigabase-scale genome assemblies are now feasible using short-read sequencing technology, bringing the cost of such projects below the million-dollar mark.Entities:
Mesh:
Year: 2010 PMID: 20128932 PMCID: PMC2847710 DOI: 10.1186/gb-2010-11-1-202
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Assembly statistics for maize, horse, panda, blue-stain fungus (G. clavigera) and P. syringae genomes and their cost
| B73 maize | Domestic horse* | Giant panda† |
|
| |
|---|---|---|---|---|---|
| Genome length | 2.3 Gb | 2.5-2.7 Gb | 2.4-2.5 Gb | 32.5 Mb | 6.1 Mb |
| Sequencing technology/ies | Sanger | Sanger | Illumina | Sanger, 454, Illumina | Illumina |
| Number of contigs | 125,325 | 55,316 | 198,274 | 3,361 | 1,346 |
| Contig N50 | 40 kb | 112 kb | 40 kb | 32 kb | 11 kb |
| Number of scaffolds | 61,161 | 9,687 | 81,469 | 2,322 | 71 |
| Scaffold N50 | 76 kb | 46 Mb | 1.3 Mb | 782 kb | 317 kb |
| Estimated sequencing cost | $30 million | $15 million | $0.6 million | $100,000 | $4,000 |
Contiguity statistics are calculated for *contigs and scaffolds 1 kb or longer and †contigs and scaffolds 100 bp or longer.
Effect of the choice of k-mer size on the single-end contig N50 for the giant panda assembly using ABySS 1.1.0
| k-mer size | 27* | 30 | 32 | 34 | 35 | 36 | 37 | 38 |
|---|---|---|---|---|---|---|---|---|
| Contig N50 (bp) | 1,381 | 1,724 | 1,863 | 1,940 | 1,952 | 1,942 | 1,924 | 1,860 |
*The k-mer size used for the reported giant panda genome assembly [1].