| Literature DB >> 32328205 |
Tomasz Szmatoła1,2, Artur Gurgul1,2, Igor Jasielczuk1,2, Weiwei Fu3, Katarzyna Ropka-Molik2.
Abstract
In the present study, runs of homozygosity (ROH) detected with the use of a standard bovine 54k single nucleotide polymorphism (SNP) genotyping assay and two different ROH detection approaches, based on 50 (M1) or 15 (M2) consecutive SNPs, were compared with results of whole genome sequencing. Both microarray-based methods accurately recognised medium-sized ROH, however, it was found that M2 method seemed to better than M1 identify short ROH, but highly overestimated their number, leading to numerous false positive calls. Moreover, long ROH identified with microarray data tended to break into shorter segments in sequencing data because of the presence of regions with high heterozygosity within the ROH sequences. This may indicate, that these long ROH are formed by closely positioned shorter homozygous segments that may be of older origin or may be created by two similar but not identical haplotypes, showing minor internal recombination signs. Such finding also suggests that at least some of the results of previous studies in regard to long ROH may be biased leading to inaccurate estimations of genomes autozygosity via ROH classification into length categories. © The author(s).Entities:
Keywords: autozygosity; microarray; next generation sequencing; runs of homozygosity
Year: 2020 PMID: 32328205 PMCID: PMC7171384 DOI: 10.7150/jgen.39147
Source DB: PubMed Journal: J Genomics
Figure 1Visualization of ROH identified with all used methods. Black lines for each chromosome represent average homozygosity in a sliding window of 15 consecutive SNPs. Blue colour represents S1 approach; violet represents M1 approach; red represents S2 approach and brown represents M2 approach.
Figure 2Zoom on the long ROH identified on chromosome 3 and 26 for all used methods. Black lines for each chromosome represent average homozygosity in a sliding window of 15 consecutive SNPs. Blue colour represents S1 approach; violet represents M1 approach; red represents S2 approach and brown represents M2 approach.
The detailed characteristics of gaps found within long ROH.
| Gap in adjacent ROH regions | Gap length (bp) | Neighbouring heterozygous SNPs in the gap | GC content in the gap (%) | GC content in the whole chromosome | Mean adjusted gap coverage | Adjusted gap coverage SD | ||
| Chromosome | Start (kb) | End (kb) | ||||||
| 3 | 96894 | 96897 | 3278 | 12 | 35.44 | 41.80 | 18.60 | 5.59 |
| 26 | 31581 | 31603 | 23306 | 21 | 44.21 | 42.77 | 16.25 | 3.87 |
| 26 | 35808 | 35811 | 3144 | 15 | 37.66 | 42.77 | 14.66 | 2.31 |
| Gap in adjacent ROH regions | Gap length (bp) | Neighbouring heterozygous SNPs in the gap | GC content in the gap (%) | GC content in the whole chromosome | Mean adjusted gap coverage | Adjusted gap coverage SD | ||
| Chromosome | Start (kb) | End (kb) | ||||||
| 3 | 91581 | 91582 | 1009 | 4 | 48.12 | 41.80 | 14.80 | 2.30 |
| 3 | 95575 | 95583 | 7573 | 4 | 57.53 | 41.80 | 14.29 | 2.21 |
| 3 | 96894 | 96897 | 3278 | 12 | 35.44 | 41.80 | 18.60 | 5.59 |
| 26 | 28115 | 28116 | 894 | 6 | 42.79 | 42.77 | 15.50 | 3.83 |
| 26 | 31580 | 31603 | 23306 | 21 | 44.21 | 42.77 | 16.25 | 3.87 |
| 26 | 33806 | 33808 | 2630 | 5 | 47.08 | 42.77 | 18.17 | 3.19 |
| 26 | 35808 | 35811 | 3144 | 15 | 37.66 | 42.77 | 14.66 | 2.31 |