| Literature DB >> 31015470 |
Shweta Ramdas1, Ayse Bilge Ozel2, Mary K Treutelaar3, Katie Holl4, Myrna Mandel5, Leah C Solberg Woods6, Jun Z Li7,8.
Abstract
We performed whole-genome sequencing for eight inbred rat strains commonly used in genetic mapping studies. They are the founders of the NIH heterogeneous stock (HS) outbred colony. We provide their sequences and variant calls to the rat genomics community. When analyzing the variant calls we identified regions with unusually high levels of heterozygosity. These regions are consistent across the eight inbred strains, including Brown Norway, which is the basis of the rat reference genome. These regions show higher read depths than other regions in the genome and contain higher rates of apparent tri-allelic variant sites. The evidence suggests that these regions may correspond to duplicated segments that were incorrectly overlaid as a single segment in the reference genome. We provide masks for these regions of suspected mis-assembly as a resource for the community to flag potentially false interpretations of mapping or functional results.Entities:
Mesh:
Year: 2019 PMID: 31015470 PMCID: PMC6478900 DOI: 10.1038/s41597-019-0041-6
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Consistent heterozygosity patterns across the eight lines. Shown are the fractions of heterozygous genotypes (y-axis) in non-overlapping 1000-SNV windows in chromosome 1, displayed for the eight inbred lines.
Fig. 2Recurrence pattern of heterozygous genotypes across 8 founders. All variant sites were categorized as having heterozygous genotypes in 0, 1,…, up to 8 lines. The bar graph displays the observed site counts in the 9 categories, while the dots show the expected number of sites if recurrence is random, estimated under a simple binomial model.
Fig. 3Higher read depths in high-heterozygosity windows. (a) Boxplot of per-window average read depth stratified on the x-axis by the number of lines for which a given window is classified as high-het. It shows that windows with high-het in all eight lines tend to show higher average read depth. (b) An example of a 5 Mb region in Chromosome 1 and line AC, with a high-het region in the middle. Y axis is the read depth for individual sites, showing that in the high-het window, both the heterozygosity calls (upper panel) and homozygosity calls (lower panel) show higher read depth and often higher variance of read depth.