| Literature DB >> 27595709 |
Adam T H Utsunomiya1, Daniel J A Santos2, Solomon A Boison3, Yuri T Utsunomiya2, Marco Milanesi4, Derek M Bickhart5, Paolo Ajmone-Marsan6,7, Johann Sölkner8, José F Garcia4,9, Ricardo da Fonseca2,10, Marcos V G B da Silva11.
Abstract
BACKGROUND: Misassembly signatures, created by shuffling the order of sequences while assembling a genome, can be detected by the unexpected behavior of marker linkage disequilibrium (LD) decay. We developed a heuristic process to identify misassembly signatures, applied it to the bovine reference genome assembly (UMDv3.1) and presented the consequences of misassemblies in two case studies.Entities:
Keywords: Bos indicus; Bos taurus; GWAS; Imputation; Linkage disequilibrium; Misassembly
Mesh:
Year: 2016 PMID: 27595709 PMCID: PMC5011828 DOI: 10.1186/s12864-016-3049-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Flowchart of the pipeline to detect Candidate Misplaced Markers (CMM). Oval symbols denote the beginning and end of the pipeline; the hexagon indicates the beginning of a loop structure; rectangles indicate a computation; diamonds indicate the points where decisions are made; parallelograms indicate output of information. The arrows indicate the flow of the pipeline and F and T denote FALSE and TRUE, respectively
Fig. 2Histogram of the number of SNPs with unexpected Linkage Disequilibrium (LD) pattern detected by chromosome
Fig. 3Distribution of frequencies of misassembled contig sizes. The red line represents the N50 of the UMDv3.1 bovine genome assembly
Fig. 4Circos plot of misassemblies. The 29 bovine autosomes are represented by polygons of different colors. Arrows within the inner track indicate the flow of misassemblies from their current to their estimated true locations (arrow heads). a Annotations in the outer track refer to the SNP probe name in Additional file 4: Table S1. b Annotations in the outer track refer to the MisSeg aliases in Additional file 4: Table S1. c Annotations in the outer track refer to the contig name in Additional file 4: Table S1
Fig. 5Manhattan plot of -log10(p-value) of SNPs for the Polled/Horned (PH) phenotypes in Nellore cattle
Fig. 6Linkage disequilibrium (LD) decay and SNP imputation accuracies for the R237 segment in Italian Holstein (HOL) and Nellore (NEL). From the top to the bottom of the picture, before (a) and after (b) correcting the segment using LD information, we plotted the LD decay of the segment in Italian Holstein (a.1, b.1) and Nellore (a.2, b.2), the SNP imputation accuracies in Italian Holstein (a.3, b.3) and Nellore (a.4, b.4), and a magnification of the segment in Italian Holstein (a.5, b.5) and Nellore (a.6, b.6)