| Literature DB >> 27516089 |
Alexey I Makunin1,2, Ilya G Kichigin3, Denis M Larkin4, Patricia C M O'Brien5, Malcolm A Ferguson-Smith5, Fengtang Yang6, Anastasiya A Proskuryakova3, Nadezhda V Vorobieva3, Ekaterina N Chernyaeva7, Stephen J O'Brien7, Alexander S Graphodatsky3,8, Vladimir A Trifonov3,8.
Abstract
BACKGROUND: B chromosomes are dispensable and variable karyotypic elements found in some species of animals, plants and fungi. They often originate from duplications and translocations of host genomic regions or result from hybridization. In most species, little is known about their DNA content. Here we perform high-throughput sequencing and analysis of B chromosomes of roe deer and brocket deer, the only representatives of Cetartiodactyla known to have B chromosomes.Entities:
Keywords: B chromosome Evolution; DOP-PCR; Grey Brocket Deer; High-throughput Sequencing; Protooncogenes; Siberian Roe Deer
Mesh:
Substances:
Year: 2016 PMID: 27516089 PMCID: PMC4982142 DOI: 10.1186/s12864-016-2933-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Statistics of sequencing and mapping of sorted chromosome-specific DNA
| Sample | Reference | # reads | Reads, bp | % contam | % target |
|---|---|---|---|---|---|
| CFA12 | canFam3 | 746,338 | 198,383,243 | 1.3 | 40.2 |
| BTAMix | bosTau7 | 708,228 | 184,977,360 | 1.1 | 49.3 |
| CPYB1 | bosTau7 | 1,033,322 | 304,920,244 | 1.7 | 13.8 |
| CPYB2 | bosTau7 | 886,978 | 268,895,542 | 1.1 | 12.7 |
| MGOB | bosTau7 | 716,158 | 239,022,974 | 4.2 | 31.9 |
Samples: CFA12 – dog chromosome 12; BTAMix – cattle sorting peak with chromosomes 23, 26, 28, and 29; CPYB1, CPYB2 – Siberian roe deer B chromosomes (technical replicates); MGOB – grey brocket deer B chromosomes. # reads, reads bp – initial number and volume of sequence reads. # contam – percent of reads mapped to human genome (see text for details). # target – percent of reads mapped to target genome with MAPQ > 20 after contamination removal
Positions occupied by DOP-clones (DOP-positions) and target regions
| Sample | Positions (bp) in genome | Target region size, bp | % positions (occupancy) in target regions |
|---|---|---|---|
| CFA12 | 36,819 (5,285,070) | 72,498,081 | 53.4 (4.4) |
| BTAMix | 50,035 (6,997,158) | 202,419,725 | 82.1 (3.0) |
| CPYB1 | 12,158 (1,530,385) | 1,979,679 | 8.3 (14.6) |
| CPYB2 | 9,665 (1,182,269) | 1,979,679 | 9.4 (13.2) |
| MGOB | 12,530 (1,934,874) | 9,311,710 | 43.5 (14.3) |
Sample names as in previous; Positions (bp) in genome – number of non-repetitive DOP-positions throughout the reference genome and their cumulative size; Target region size, bp – size of regions present on chromosome determined with our method; % positions (occupancy/coverage) in target regions: % positions – percent of DOP-positions in target region relative to the whole genome (column 2), occupancy – percent of target region size (column 3) occupied by DOP-positions
Fig. 1Mean pairwise distances between consecutive DOP-positions (a) and position coverage with reads (b) in the flow sorting peak containing bovine chromosomes 23, 26, 28, and 29. Note the signal on chromosome 6 corresponding to KIT region translocation to BTA29
Fig. 2Localization of canine BAC clone with KIT gene on chromosomes of (a) cattle and (b) grey brocket deer. Note that in brocket deer, autosomal copies of KIT are located on MGO24 corresponding to a region of BTA6 [27]. Translocation of a 480-kbp regions from BTA6 to BTA29 is a previously characterized rearrangement associated with colour sidedness in cattle [24]
Regions detected on B chromosomes of Siberian roe deer and grey brocket deer with flow-sorted chromosome sequencing
| Region | Genes |
|---|---|
| Siberian roe deer ( | |
| chr3:74,548,027-76,486,586 |
|
| chr28:11,359,721-11,400,841 | - |
| Grey brocket deer ( | |
| chr1:80,557,678-80,815,945 |
|
| chr1:132,195,520-132,248,825 |
|
| chr3:53,939,956-54,035,661 |
|
| chr3:97,346,492-97,530,439 |
|
| chr5:112,038,039-112,092,762 |
|
| chr6:67,475,289-69,302,666 |
|
| chr6:72,725,708-72,802,304 |
|
| chr6:101,762,919-101,983,089 |
|
| chr6:115,148,215-115,688,073 |
|
| chr7:11,112,904-11,187,619 |
|
| chr7:21,377,048-21,645,864 | - |
| chr8:78,145-782,808 |
|
| chr10:28,363,651-29,133,083 |
|
| chr14:5,823,596-6,377,396 |
|
| chr17:46,546,364-46,775,437 |
|
| chr19:59,129,525-59,366,644 |
|
| chr22:1,765,826-1,920,639 |
|
| chr22:11,626,319-11,984,586 |
|
| chr23:8,994,141-9,084,277 |
|
| chr23:50,678,412-50,701,238 |
|
| chr25:37,085,021-37,276,232 | - |
| chr25:41,480,442-41,787,784 | - |
| chr28:628,155-915,003 |
|
| chr28:12,110,619-12,174,951 |
|
| chr29:34,728,621-35,503,130 |
|
| chr29:49,338,779-50,252,639 |
|
Coordinates in cattle genome assembly (bosTau7) are given. Only RefSeq cattle genes are listed, (f) – partial gene fragments located at region margins, (f?) – genes overlapping putative deletions not covered by reads
Sequence variation of DOP-PCR amplified chromosome-specific DNA relative to bosTau7 genome classified relative to bovine RefSeq genes
| BTAMix | CPYB | CPYB-CCA | MGOB | |
|---|---|---|---|---|
| A | ||||
| Total | 6,068,678 | 344,775 | 297,042 | 1,332,441 |
| Intergenic | 4,518,518 | 262,448 | 226,421 | 991,640 |
| Intron | 1,501,565 | 80,987 | 69,282 | 327,012 |
| 5` UTR | 1,337 | 0 | 0 | 27 |
| 3` UTR | 15,213 | 0 | 0 | 2,780 |
| Coding | 32,045 | 1,340 | 1,339 | 10,982 |
| B | ||||
| Total | 19188 (6374) | 15267 (2915) | 3086 (1929) | 46592 (1735) |
| Intergenic | 14829 (4970) | 12187 (2349) | 2476 (1530) | 35292 (1334) |
| Intron | 4556 (1474) | 3134 (558) | 605 (392) | 10102 (320) |
| 5` UTR | 4 (2) | - | - | 2 (0) |
| 3` UTR | 49 (18) | - | - | 101 (8) |
| Coding syn. | 46 (8) | 17 (3) | 1 (1) | 113 (2) |
| Coding non-syn. | 44 (15) | 26 (10) | 12 (9) | 103 (12) |
| C | ||||
| Total | 316 (952) | 23 (118) | 96 (154) | 29 (768) |
| Intergenic | 305 (909) | 22 (112) | 91 (148) | 28 (743) |
| Intron | 330 (1,019) | 26 (145) | 115 (177) | 32 (1,022) |
| 5` UTR | 334 (669) | - | - | 14 (-) |
| 3` UTR | 310 (845) | - | - | 28 (348) |
| Coding | 356 (1,393) | 31 (103) | 103 (134) | 51 (732) |
A. Lengths of cattle genome regions covered by reads. Samples: BTAMix – mixed peak of cattle chromosomes 23, 26, 28 and 29; CPYB – Siberian roe deer B chromosomes (combination of samples CPYB1 and CPYB2); CPYB-CCA – same, but regions not covered by European roe deer contigs are excluded; MGOB – grey brocket deer B chromosomes
B. Number of total and heterozygous (in brackets) variants called and their annotation. Numbers of variants do not add up due to overlapping annotations and excluded NMD_target_transcript annotation. Intergenic variants also include up/downstream variants; coding non-synonymous variants include missense, stop codon gain/loss, frameshift, inframe indels. Sample descriptions – see 3A, except for CPYB-CCA – CPYB variants excluding the variants observed in Capreolus capreolus genomic contigs
C. Variant density (bp per 1 variant) calculated as length of sequence covered by reads divided by number of called variants. Numbers are given for all and heterozygous (in brackets) variants. Sample descriptions – see 3B
Fig. 3Region of BTA3 present on B chromosomes of Siberian roe deer. BAC clone localization (green – present, red – absent) and cDNA data taken from [7]. Two samples from one flow-sorting experiment were used to produce reads 1 and 2. Read coverage data omitted. Note two regions lacking reads – one is covered by BACs, the other corresponds to an assembly gap