| Literature DB >> 34979909 |
Sara Casu1, Mario Graziano Usai2, Tiziana Sechi1, Sotero L Salaris1, Sabrina Miari1, Giuliana Mulas1, Claudia Tamponi3, Antonio Varcasia3, Antonio Scala3, Antonello Carta1.
Abstract
BACKGROUND: Gastroinestinal nematodes (GIN) are one of the major health problem in grazing sheep. Although genetic variability of the resistance to GIN has been documented, traditional selection is hampered by the difficulty of recording phenotypes, usually fecal egg count (FEC). To identify causative mutations or markers in linkage disequilibrium (LD) to be used for selection, the detection of quantitative trait loci (QTL) for FEC based on linkage disequilibrium-linkage analysis (LDLA) was performed on 4097 ewes (from 181 sires) all genotyped with the OvineSNP50 Beadchip. Identified QTL regions (QTLR) were imputed from whole-genome sequences of 56 target animals of the population. An association analysis and a functional annotation of imputed polymorphisms in the identified QTLR were performed to pinpoint functional variants with potential impact on candidate genes identified from ontological classification or differentially expressed in previous studies.Entities:
Mesh:
Year: 2022 PMID: 34979909 PMCID: PMC8722200 DOI: 10.1186/s12711-021-00690-7
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Dates of sampling, number of animals sampled, mean and standard deviation of Fec and LnFec [ln(Fec + 14)]
| Date of sampling | Number of animals | Mean Fec EPG | SD (Fec) | Mean lnFec | SD (lnFec) |
|---|---|---|---|---|---|
| 2000-01-21 | 939 | 349.25 | 563.20 | 5.11 | 1.33 |
| 2000-06-02 | 916 | 110.60 | 180.43 | 4.25 | 1.05 |
| 2000-10-02 | 918 | 387.23 | 545.3 | 5.45 | 1.09 |
| 2001-03-30 | 890 | 177.42 | 199.09 | 4.77 | 1.06 |
| 2001-09-07 | 866 | 301.33 | 528.23 | 4.98 | 1.24 |
| 2002-02-02 | 866 | 142.41 | 284.31 | 4.26 | 1.21 |
| 2003-03-06 | 699 | 48.90 | 118.08 | 3.54 | 0.96 |
| 2003-07-01 | 1207 | 127.93 | 220.00 | 4.13 | 1.28 |
| 2004-09-20 | 748 | 271.22 | 308.72 | 5.15 | 1.09 |
| 2005-07-07 | 226 | 177.28 | 138.97 | 4.97 | 0.81 |
| 2005-09-05 | 468 | 359.55 | 627.65 | 4.96 | 1.42 |
| 2006-01-23 | 857 | 218.37 | 355.68 | 4.68 | 1.29 |
| 2006-07-17 | 521 | 356.00 | 531.96 | 5.14 | 1.34 |
| 2007-11-13 | 655 | 267.16 | 578.94 | 4.46 | 1.51 |
| 2008-07-22 | 645 | 402.67 | 797.06 | 4.95 | 1.53 |
| 2009-06-24 | 763 | 848.77 | 1300.64 | 5.77 | 1.57 |
| 2010-02-08 | 554 | 265.98 | 446.89 | 4.83 | 1.31 |
| 2010-05-18 | 672 | 161.61 | 275.56 | 4.35 | 1.26 |
| 2011-04-06 | 653 | 346.36 | 477.36 | 5.16 | 1.29 |
| 2012-06-16 | 609 | 1590.02 | 2019.34 | 6.54 | 1.56 |
| 2015-09-03 | 528 | 162.57 | 246.80 | 4.45 | 1.24 |
| 2016-07-20 | 473 | 506.18 | 773.90 | 5.21 | 1.58 |
| 2017-05-20 | 554 | 204.57 | 302.27 | 4.72 | 1.18 |
| 2018-05-08 | 674 | 247.60 | 465.71 | 4.29 | 1.60 |
| 2019-02-12 | 693 | 389.99 | 961.97 | 5.26 | 1.25 |
EPG = eggs per g; lnFec = [ln(Fec + 14)]
Estimates and standard errors of genetic, permanent environment (Pe) and residual variances and repeatability (Rp) and heritability (h2) estimates for LnFec
| Component | Standard error | |
|---|---|---|
| Genetic (ln2) | 0.341 | 0.026 |
| Pe (ln2) | 0.097 | 0.016 |
| Residual (ln2) | 1.187 | 0.015 |
| Rp | 0.270 | 0.012 |
| h2 | 0.210 | 0.015 |
Fig. 1Manhattan plot of the corresponding to the null hypothesis that the effects of principal components that explain 99% of the variability due to the Sarda base gametes at each locus are zero. The grey line indicates the 0.05 genome-wide significance threshold determined by Bonferroni correction for 43,390 tests
QTL regions from the LDLA analysis
| QTLR | OAR | Range (bp) | nSNP | Pos (bp) | ||
|---|---|---|---|---|---|---|
| Q_01_1 | 1 | 99,988,914–99,988,914 | 1 | 99,988,914 | 6.256 | 0.0087 |
| Q_02_1 | 2 | 135,598,790–135,598,790 | 1 | 135,598,790 | 6.078 | 0.0093 |
| Q_02_2 | 2 | 213,089,849–213,166,468 | 3 | 213,135,270 | 6.554 | 0.0093 |
| Q_04_1 | 4 | 5,252,005–9,837,812 | 6 | 5,386,849 | 7.404 | 0.0111 |
| Q_06_1 | 6 | 12,638,149–12,695,314 | 3 | 12,695,314 | 6.864 | 0.0099 |
| Q_07_1 | 7 | 88,040,710–88,085,726 | 2 | 88,040,710 | 6.762 | 0.0098 |
| Q_12_1 | 12 | 35,973,543–41,153,795 | 19 | 39,430,517 | 9.191 | 0.0125 |
| Q_15_1 | 15 | 34,024,406–34,081,200 | 2 | 34024406 | 7.362 | 0.0110 |
| Q_19_1 | 19 | 18,933,658–31,429,916 | 11 | 18,983,777 | 7.302 | 0.0104 |
| Q_20_1 | 20 | 16,796,770–36,098,292 | 154 | 21,170,858 | 12.861 | 0.0176 |
QTLR: QTL region identifier; OAR Ovis aries autosome; Range: position, in bp from the ovine genome assembly v4.0, of the first and last significant SNP of the QTL region; nSNP number of SNPs from the OvineSNP50 Beadchip exceeding the 0.05 genome-wide significance threshold ; Pos: position of the most significant SNP in bp; : negative logarithm of the p-value of the most significant SNP; : ratio between the ANOVA estimator of the QTL variance at peak position () and the total phenotypic variance of the pseudo-phenotypes ()
Description of the QTL regions from whole-genome sequences and results of the imputation procedure
| QTLR | Interval | IC | Accuracy | |||
|---|---|---|---|---|---|---|
| Range (Mb) | nSNPsa | MR (nSNPs/Mb) | Mean ± SD | nSNPsb | Mean ± SD | |
| Q_01_1 | 99–100 | 15,410 | 7711 | 0.983 ± 0.017 | 33 | 0.983 ± 0.026 |
| Q_02_1 | 135–137 | 21,708 | 10,859 | 0.973 ± 0.015 | 39 | 0.986 ± 0.009 |
| Q_02_2 | 212–214 | 24,361 | 12,181 | 0.982 ± 0.013 | 35 | 0.995 ± 0.004 |
| Q_04_1 | 4–10 | 86,568 | 14,428 | 0.967 ± 0.020 | 113 | 0.987 ± 0.011 |
| Q_06_1 | 12–14 | 26,466 | 13,234 | 0.973 ± 0.014 | 40 | 0.990 ± 0.005 |
| Q_07_1 | 87–89 | 26,375 | 13,188 | 0.974 ± 0.016 | 43 | 0.979 ± 0.034 |
| Q_12_1 | 35–42 | 54,785 | 7827 | 0.985 ± 0.014 | 119 | 0.995 ± 0.006 |
| Q_15_1 | 33–35 | 21,528 | 10,764 | 0.972 ± 0.018 | 34 | 0.986 ± 0.009 |
| Q_19_1 | 18–32 | 167,487 | 11,963 | 0.972 ± 0.019 | 222 | 0.987 ± 0.017 |
| Q_20_1 | 16–37 | 268,299 | 12,778 | 0.976 ± 0.021 | 365 | 0.987 ± 0.028 |
| Total | – | 712,987 | 11,884 | 0.976 ± 0.017 | 1043 | 0.988 ± 0.021 |
QTLR: QTL region identifier; Range: start and end position of the explored interval in Mb from the ovine genome assembly v4.0; nSNPa: number of polymorphic sites from sequence data; MR: mutation rate, i.e. number of SNPs per Mb; nSNPb: number of polymorphic sites from both the OvineSNP50 Beadchip and sequence data used for calculating the accuracy of imputation; IC: information content and Accuracy: accuracy of imputation of gametes carried by phenotyped ewes
Results of the association analysis based on imputed alleles at the polymorphic sites from WGS
| QTLR | OAR | SNP ID | Pos | |
|---|---|---|---|---|
| Q_01_1 | 1 | rs430083769 | 100,326,208 | 6.4402 |
| Q_02_1 | 2 | rs416887054 | 135,331,051 | 6.6044 |
| Q_02_2 | 2 | rs418850058 | 213,396,760 | 5.5896 |
| Q_04_1 | 4 | rs430604472 | 8,686,421 | 8.2004 |
| Q_06_1 | 6 | rs400533049 | 13,159,311 | 6.9217 |
| Q_07_1 | 7 | rs398163681 | 87,832,764 | 6.6496 |
| Q_12_1 | 12 | rs405838891 | 41,043,088 | 8.0020 |
| Q_15_1 | 15 | rs427631513 | 34,330,615 | 4.9680 |
| Q_19_1 | 19 | rs420265308 | 19,228,546 | 8.8439 |
| Q_20_1 | 20 | rs404860665 | 26,158,816 | 13.2036 |
QTLR: QTL region identifier; OAR Ovis aries autosome; SNP ID: SNP identifier; Pos: position of the most significant SNP in bp based on the ovine genome assembly v4.0.; -log10 (Pvalue) negative logarithm of the P-value of the most significant SNP
Summary of the genomic features in the investigated regions
| QTLR | tRNA | lncRNA | miRNA | misc_RNA | Pseudogene | Protein coding (n transcr) |
|---|---|---|---|---|---|---|
| Q_01_1 | 1 | 10 | 1 | 12 | 65 (153) | |
| Q_02_1 | 1 | 2 | 1 | 11 (38) | ||
| Q_02_2 | 1 | 1 | 3 (16) | |||
| Q_04_1 | 1 | 6 | 2 | 30 (74) | ||
| Q_06_1 | 1 | 5 | 1 | 10 (83) | ||
| Q_07_1 | 2 | 6 | 3 (22) | |||
| Q_12_1 | 3 | 12 | 1 | 7 | 77 (170) | |
| Q_15_1 | 3 | 2 | 23 (43) | |||
| Q_19_1 | 1 | 12 | 1 | 5 | 27 (118) | |
| Q_20_1 | 43 | 46 | 1 | 47 | 423 (761) | |
| Total | 54 | 103 | 1 | 3 | 77 | 672 (1478) |
QTLR: QTL region identifier; lncRNA: long non coding RNAs; miRNA; microRNAs; misc_RNA: miscellaneous RNA; pseudogene: genes with pseudogene biotype; protein coding: number of genes (and number of transcripts) that contains an open reading frame (ORF)