| Literature DB >> 19393050 |
Yue Liu1, Xiang Qin, Xing-Zhi Henry Song, Huaiyang Jiang, Yufeng Shen, K James Durbin, Sigbjørn Lien, Matthew Peter Kent, Marte Sodeland, Yanru Ren, Lan Zhang, Erica Sodergren, Paul Havlak, Kim C Worley, George M Weinstock, Richard A Gibbs.
Abstract
BACKGROUND: We present here the assembly of the bovine genome. The assembly method combines the BAC plus WGS local assembly used for the rat and sea urchin with the whole genome shotgun (WGS) only assembly used for many other animal genomes including the rhesus macaque.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19393050 PMCID: PMC2686734 DOI: 10.1186/1471-2164-10-180
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The Genome Assembly process. Sequence from pooled BACs, individual BACs and Whole Genome Shotgun was combined in a number of different ways as outlined here. At the top left, pooled BACs were deconvoluted and assembled as individual BACs. On the top right, individually sequenced BACs were also assembled as individual BACs. Overlapping WGS data was added to all BACs and each was assembled as an enriched BAC (eBAC) using three different assembly methods. The best assembly of each eBAC was used in the merging and scaffolding steps. On the left, WGS data was assembled as a WGS assembly to produce Btau_2.0. Contigs from this assembly that were not contained within the eBACs were used to fill gaps in the BAC assembly. The sequence scaffolds were placed on chromosomes using the composite map (Integrated Bovine Map) to produce Btau_3.1. Some scaffolds were split and a multi-step placement procedure described in the text was used to place scaffolds on chromosomes for version Btau_4.0. A more complete description can be found in the text.
Read Statistics
| All WGS | 14.09 | 1.41 | NA | NA | 26.98 | 0.21 | 26.98 | 0.21 |
| Total | 15.51 | 27.2 | 27.19 | 27.19 | ||||
| Trimmed | 11.72 | 1.15 | 19 | 4 | 23.09 | 0.17 | 23.09 | 0.17 |
| Total | 12.88 | 23 | 23.26 | 23.26 | ||||
| Paired | 11.01 | 1.07 | 5 | 5.2 | NA | NA | NA | NA |
| Total | 12.08 | 10.2 | NA | NA | ||||
| Assembled | 8.98 | 0.89 | 14.9 | 3.2 | 18.74 | 0.11 | 18.74 | 0.11 |
| Total | 9.87 | 18.1 | 18.86 | 18.86 | ||||
| Unassembled | 0.39 | 0.04 | 0.31 | 0.06 | NA | NA | NA | NA |
| Total | 0.43 | 0.37 | NA | NA | ||||
| All | NA | NA | NA | NA | 6.42 | NA | 6.42 | NA |
| Total | NA | NA | 6.42 | 6.42 | ||||
| Trimmed | NA | NA | NA | NA | 5.11 | NA | 5.11 | NA |
| Total | NA | NA | 5.11 | 5.11 | ||||
| Assembled | NA | NA | NA | NA | 4.28 | NA | 4.28 | NA |
| Total | NA | NA | 4.28 | 4.28 | ||||
| All | NA | NA | NA | NA | 5.38 | NA | 5.38 | NA |
| Total | NA | NA | 5.38 | 5.38 | ||||
| Trimmed | NA | NA | NA | NA | 5.12 | NA | 5.12 | NA |
| Total | NA | NA | 5.12 | 5.12 | ||||
| Assembled | NA | NA | NA | NA | 2.92 | NA | 2.92 | NA |
| Total | NA | NA | 2.92 | 2.92 | ||||
Basepair Statistics
| Trimmed | 8.34 | 0.84 | 14.5 | 3.2 | 24.2 | 0.1 | 24.2 | 0.1 |
| Total | 9.18 | 17.7 | 24.3 | 24.3 | ||||
| Asssembled | 6.48 | 0.65 | 11.6 | 2.5 | 19 | 0.1 | 19 | 0.1 |
| Total | 7.13 | 14.1 | 19.1 | 19.1 | ||||
| Unassembled | 0.2 | 0.023 | 0.13 | 0.04 | N/A | N/A | N/A | N/A |
| Total | 0.22 | 0.17 | N/A | N/A | ||||
| 3.09× | 0.31× | 5.2× | 1.1× | 7.0× | N/A | 7.0× | N/A | |
| Total | 3.4× | 6.3× | 7.0× | 7.0× | ||||
| 6.02× | 1.00× | 10.3× | 3.6× | N/A | N/A | N/A | N/A | |
| Total | 7.02× | 13.9× | N/A | N/A | ||||
Scaffold Placement Statistics for Btau_4.0
| Scaffolds/Contigs | Number | N50 (kbp) | Total (Gbp) | Percent* |
| Anchored & Oriented | 2,194 | 2,187 | 2.54 | 89 |
| Anchored Not Oriented | 137 | 500 | 0.04 | 1 |
| Unanchored Scaffolds | 11,830 | 94 | 0.28 | 10 |
*Percentage of genome sequence in each category. Scaffolds includes single contig scaffolds as well as multiple contig scaffolds.
Assembly Completeness
| Percentage Matched | Btau 1.0 | Btau 2.0 | Btau 3.1 | Btau 4.0 |
| Finished BACs | 18 | N/A | 73 | 73 |
| Contigs | 94.57 | N/A | 98.50 | 98.50 |
| Markers | N/A | 10,387 | 21,971 | 21,971 |
| Scaffolds | N/A | 95.70 | 98.61 | 98.61 |
| Unigene sets or ESTs | 23,924 | 23,924 | 1,040,000 | 1,040,000 |
| Scaffolds | 83.82 | 92.40 | 95.00 | 95.00 |
| BES | N/A | 321,287 | N/A | N/A |
| Scaffolds | N/A | 95.20 | N/A | N/A |
The table gives the numbers of BACS, markers, ESTs or BES used for comparison, and the percentage of Contigs or Scaffolds that matched each set. Additional file 2 gives the same comparisons for the unassembled reads as well as the assembled sequences.
Comparison to Independent Chromosome Maps
| Misplaced in Scaffolds | ||||
| Total Shared | Btau_4.0 | Btau_3.1 | Integrated Bovine Map[ | |
| Chr6 | 61 | 0 | 15 | 7 |
| Chr19 | 45 | 0 | 6 | 9 |
| Chr29 | 28 | 0 | 7 | 7 |
Assembly contig and scaffold statistics
| | 795,212 | 321,107 | 131,620 | 131,620 |
| | 4.20 | 18.90 | 48.70 | 48.70 |
| | 2.26 | 2.62 | 2.73 | 2.73 |
| | 2.26 | 2.62 | 2.73 | 2.73 |
| | 85 | 95.1 | 95 | |
| | 0 | 2215 | 2,055 | 2,194 |
| | 712 | 1,393 | 2,187 | |
| | 1.07 | 2.08 | 2.54 | |
| | 0.89 | 1.99 | 2.43 | |
| | 0 | 34.5 | 72.7 | 89 |
| | 0 | 2194 | 998 | 137 |
| | 535 | 547 | 500 | |
| | 0.63 | 0.32 | 0.04 | |
| | 0.51 | 0.3 | 0.04 | |
| | 0 | 20.3 | 11 | 1 |
| | 449,727 | 98,058 | 13,045 | 11,830 |
| | 13.5 | 189 | 166 | 94 |
| | 2.34 | 1.4 | 0.47 | 0.28 |
| | 2.26 | 1.2 | 0.44 | 0.26 |
| | 100 | 45.2 | 16.3 | 10 |
| | 449,727 | 102,467 | 16,098 | 14,161 |
| | 13.5 | 434 | 997 | 1,922 |
| | 2.34 | 3.1 | 2.87 | 2.87 |
| | 2.26 | 2.62 | 2.73 | 2.73 |
| | 100 | 100 | 100 | 100 |
Marker Statistics for Btau_3.1
| Marker | Number in Btau_3.1 |
| Total assessed | 21,971 |
| With matches | 21,666 |
| Without matches | 305 |
| Low identity matches | 1,670 |
| Repeat matches | 1,606 |
| Removed due to conflicts | 595 |
| Total used in final map | 17,795 |