| Literature DB >> 35615028 |
Pei Li1, Liu-Qiong Wei1, Yi-Fan Pan1, Yuan-Ming Zhang1.
Abstract
Although methodologies and software packages for bulked segregant analysis (BSA) are well established, it is difficult to detect extremely over-dominant and small-effect genes for quantitative traits in F2 population. To address this issue, we proposed a combinatorial strategy to identify all types of quantitative trait loci (QTLs) using extreme phenotype individuals in F2. To popularize this strategy, we developed an R software package dQTG.seq v1.0.1. It has some features not found in other BSA software packages: 1) new (dQTG-seq1 and dQTG-seq2) and existing (G', deltaSNP, Euclidean distance (ED), and SmoothLOD) methods are available to identify all types of QTLs in bi-parental segregation populations, one data file with two BSA and three QTL-mapping data formats was inputted, and two *.csv files and one figure were outputted; 2) main smoothing methods (AIC, Window size, and Block) have been incorporated into each of the above-mentioned methods; 3) the threshold value of LOD score for significant QTLs is determined by permutation experiments. To save running time, vroom function was used to read the dataset, and parallel operation was used to estimate parameters. In real data analyses, users should select a suitable initial value of window size, depending on the species, and appropriate smoothing methods to obtain the best result. dQTG-seq2 detects more known loci and genes for rice grain number per panicle than composite interval mapping (CIM) and inclusive CIM, especially extremely over-dominant and small-effect genes. A handbook for our software package (https://cran.r-project.org/web/packages/dQTG.seq/index.html) has been provided in the supplemental materials for the users' convenience.Entities:
Keywords: BC, backcross; BSA, bulked segregant analysis; Bi-parental segregation populations; Bulked segregant analysis; CIM, composite interval mapping; DH, doubled haploid; ED, Euclidean distance; Extremely over-dominant QTLs; GCIM, genome-wide composite interval mapping; ICIM, inclusive composite interval mapping; QTG, quantitative trait gene; QTL, quantitative trait locus; R; RIL, recombinant inbred line; SNP, single nucleotide polymorphism; Small-effect QTLs; dQTG-seq
Year: 2022 PMID: 35615028 PMCID: PMC9120062 DOI: 10.1016/j.csbj.2022.05.009
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Fig. 1A combinatorial strategy of mapping all types of QTLs for quantitative traits in bi-parental segregation populations via combination of BSA and whole-genome sequencing.
Fig. 2BSA (A) and Extreme individual (B) formats for input file.
Fig. 3Previously reported genes for rice grain number per panicle in immortalized F2 using the dQTG-seq2 (A), SmoothLOD (B), G' (C), deltaSNP (D), ED (E), composite interval mapping (CIM, F) and inclusive CIM (ICIM, G) methods. Horizontal dotted lines indicate thresholds of significant QTLs. Various statistics of genome-wide scanning using new and existing methods are indicated by black curves. The genes with absolute dominant ratio |d/a| < 2.0, small-effects, and |d/a| ≥ 2.0 are indicated by blue, pink, and red colors, respectively. If |d/a| ≥ 2.0 and its size is small, the gene name is in pink color and its corresponding solid line is in red color. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)