| Literature DB >> 21492448 |
Peng Xu1, Jiongtang Li, Yan Li, Runzi Cui, Jintu Wang, Jian Wang, Yan Zhang, Zixia Zhao, Xiaowen Sun.
Abstract
BACKGROUND: Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES) are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. RESULT: To develop such valuable resources in common carp (Cyprinus carpio), a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of common carp genome. The first survey of common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding region of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs have genes on both ends. To evaluate the similarity to the genome of closely related zebrafish, BES of common carp were aligned against zebrafish genome. A total of 39,335 BES of common carp have conserved homologs on zebrafish genome which demonstrated the high similarity between zebrafish and common carp genomes, indicating the feasibility of comparative mapping between zebrafish and common carp once we have physical map of common carp.Entities:
Mesh:
Year: 2011 PMID: 21492448 PMCID: PMC3083359 DOI: 10.1186/1471-2164-12-188
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Sequence statistics of the BES of common carp
| BAC end sequencing reads | 80,448 |
|---|---|
| BES a | 75,744 |
| BES after trimming b | 72,789(96.1%) |
| Redundant BES c | 7,069 |
| BES after filtering redudance | 65,720 |
| Average read length (bp) | 647 |
| BES mate-pairs d | 29,046(88.4%) |
| Total bases sequenced | 42,522,168 |
| Vertebrates Repeat masked bases | 7,356,797(17.3%) |
a Number of sequences with phred base calling (quality score ≥ 20) with edited read length ≥50 bp
b Number of sequences free of E. coli and vector DNA, with edited read length ≥50 bp
c Number of sequences with 95% identity and full-length covered by selfblasting
d Number of clones with both ends successfully sequenced
Figure 1Read length distribution of BES after base calling, trimming for .
Figure 2Distribution of major microsatellite types in common carp BES.
Figure 3Distribution of identity scores of the carp-BES alignments to zebrafish genome assembly 8 (zv8). Frequencies of BES with percent identity scores from 75 to 100% are shown.
Summary of BES mapping
| Number of BAC clones | Number of BES | ||
|---|---|---|---|
| Mate-pairs | 29,046 | ||
| Mapped | 26,809 | 38,736 | |
| Paired endsa | 3,133 | 6,266 | |
| Long BES pairsb | 3,182 | 6,364 | |
| mate-reversec | 267 | 534 | |
| Unmated | 5,345 | 10,690 | |
| Singletone | 14,882 | 14,882 | |
| Unmapped | 2,237 | 4,474 | |
a. Two ends (≤ 300 kb apart) derived from the same BAC clone were mapped to the same chromosome with the same orientation.
b. Two BES (>300 kb apart) in one BAC located in the same chromosome with the same orientation.
c. Two BES in one BAC located in the same chromosome with the different orientation.
d. Two ends from the same BAC were mapped to different chromosomes.
e. Only one end in carp BAC was aligned to zebrafish chromosome.
Estimated coverage of chromosomes by the common carp BACs. Zebrafish genome assembly 8 (zv8) were used for the calculation.
| Chromosome | Size (bp) | Covered (bp) | Number of clones* | Coverage |
|---|---|---|---|---|
| 1 | 59,305,620 | 28,385,015 | 132 | 47.86% |
| 2 | 58,009,534 | 22,677,316 | 102 | 39.09% |
| 3 | 60,907,308 | 31,396,678 | 148 | 51.55% |
| 4 | 71,658,100 | 32,427,816 | 168 | 45.25% |
| 5 | 74,451,498 | 35,726,625 | 164 | 47.99% |
| 6 | 61,647,013 | 32,747,166 | 149 | 53.12% |
| 7 | 76,918,211 | 46,016,985 | 220 | 59.83% |
| 8 | 55,568,185 | 27,732,046 | 123 | 49.91% |
| 9 | 54,736,511 | 37,568,203 | 176 | 68.63% |
| 10 | 43,467,561 | 18,729,705 | 83 | 43.09% |
| 11 | 44,116,856 | 24,200,149 | 108 | 54.85% |
| 12 | 46,853,116 | 23,019,149 | 102 | 49.13% |
| 13 | 50,748,729 | 33,185,367 | 153 | 65.39% |
| 14 | 52,930,158 | 21,972,192 | 105 | 41.51% |
| 15 | 47,237,297 | 23,826,254 | 111 | 50.44% |
| 16 | 51,890,894 | 22,116,433 | 100 | 42.62% |
| 17 | 49,469,313 | 35,083,736 | 161 | 70.92% |
| 18 | 49,271,716 | 24,262,639 | 113 | 49.24% |
| 19 | 48,708,673 | 19,679,618 | 87 | 40.40% |
| 20 | 51,884,995 | 34,768,791 | 151 | 67.01% |
| 21 | 47,572,505 | 18,049,644 | 89 | 37.94% |
| 22 | 41,415,389 | 20,094,889 | 100 | 48.52% |
| 23 | 44,714,728 | 24,485,475 | 115 | 54.76% |
| 24 | 40,403,431 | 22,721,787 | 106 | 56.24% |
| 25 | 38,768,535 | 10,635,174 | 51 | 27.43% |
| Total | 1,322,655,876 | 671,508,852 | 3,117 | 50.77% |
*BAC clones are counted only when they have two sequenced ends in the same chromosome with the correct orientation.
Five categories of conserved microsyntenies
| Type | Number of BAC clones |
|---|---|
| Type1 (protein-coding | 697 |
| Type2 (protein-coding | 4 |
| Type3 (non-coding | 13 |
| Type4 (protein-coding | 2,025 |
| Type5 (non-coding | 4 |