| Literature DB >> 21269493 |
Stephanie Hammond1, Jeffrey C Swanberg, Mihailo Kaplarevic, Kelvin H Lee.
Abstract
BACKGROUND: Chinese hamster ovary (CHO) cells are among the most widely used hosts for therapeutic protein production. Yet few genomic resources are available to aid in engineering high-producing cell lines.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21269493 PMCID: PMC3038171 DOI: 10.1186/1471-2164-12-67
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Alignment of CHO short reads to mouse and rat reference genomes
| Sequence Type | Mouse | Rat | ||
|---|---|---|---|---|
| Total aligned | 6,582,209 | (8.71%) | 6,281,589 | (8.31%) |
| Aligned to unique regions | 3,202,228 | (4.47%) | 3,046,308 | (4.28%) |
| Aligned to repeat regions | 3,379,981 | (4.24%) | 3,235,281 | (4.03%) |
| Aligned to protein-coding genes* | 2,678,662 | (3.52%) | 2,359,457 | (3.12%) |
* Pseudogenes, RNA genes, and genes on Y chromosome were excluded from this analysis. Genes include exons, introns, and 5'- and 3'-untranslated regions
Figure 1Distribution of CHO short reads along the mouse and rat reference genomes.
Figure 2Coverage of genes by short reads. Normalized read counts of CHO sequences aligned to known protein-coding genes in the mouse and rat reference genomes. The raw number of short reads aligned to each gene is normalized by gene size. Gene coverage is classified as low, with fewer than 100 normalized read counts, medium, between 100-500 normalized read counts, or high, with greater than 500 normalized read counts.
Summary of CHO contigs extracted from consensus sequences
| Contig Size | ||||||||
|---|---|---|---|---|---|---|---|---|
| Reference | Total | < 100 bp | 100 to 500 bp | > 500 bp | ||||
| Size (bp) | % GC | Size (bp) | % GC | Size (bp) | % GC | Size (bp) | % GC | |
| Mouse | 54 | 43.0 | 46 | 42.9 | 140 | 44.0 | 637 | 45.2 |
| Rat | 53 | 42.9 | 46 | 42.8 | 139 | 44.2 | 616 | 46.5 |
The average size and average GC content of CHO contigs generated from reference-guided alignment
Summary of assembly and BLAST analysis of CHO sequences
| Mouse | Rat | |
|---|---|---|
| Total contigs | 1,864,122 | 1,707,312 |
| Contigs with BLAST hits | 559,545 | 482,888 |
| Average% similarity of BLAST hits | 96.46% | 96.39% |
| Contigs hit known genes | 264,917 | 214,353 |
| Total unique genes hit | 17,883 | 19,481 |
Figure 3Genomic landscape of CHO contigs. CHO consensus sequences for A) DHFR and B) Alppl2 genes aligned to mouse reference chromosomes. Grey bars represent nucleotides identical to the reference sequence and red bars represent both nucleotides that differ from the reference sequence and gaps in the consensus sequences. CHO contigs aligned to an Alppl2 exon (highlighted in green in B) shown at the C) chromosome and D) nucleotide level. Red nucleotides indicate differences between the CHO consensus and reference sequences. Consensus sequences aligned to the mouse reference genome using BLAT [30] at the UCSC Genome Browser [31].
Figure 4Functional classification of genes identified in BLAST analysis.