Iulia Blaj, Jens Tetens, Jörn Bennewitz, Georg Thaller, Clemens Falker-Gieske.
Abstract
BACKGROUND: Structural variants and tandem repeats are relevant sources of genomic variation that are not routinely analyzed in genome-wide association studies, mainly because they are difficult to identify and genotype. Here, we profiled these variants with state-of-the-art strategies in the founder animals of four F2 pig crosses, using whole-genome sequence data (20x coverage). At the founder level, the variants were compared with the commonly screened SNPs and small indels. At the F2 level, we carried out an association study of imputed structural variants and tandem repeats with four growth and carcass traits, followed by a comparison with a previously conducted association study based on SNPs and small indels.
Keywords: Genome wide association studies; Imputation; Pig; Structural variants; Tandem repeats; Whole-genome sequencing; lncRNA
Year: 2022 PMID: 36057580 PMCID: PMC9440560 DOI: 10.1186/s12864-022-08716-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Number of variants per chromosome
| Chromosome | SNPs | Small indels | SVs | TRs | pTRs |
|---|---|---|---|---|---|
| 1 | 2,100,792 | 492,397 | 1,268 | 104,628 | 10,893 |
| 2 | 1,447,573 | 343,180 | 908 | 56,192 | 6,574 |
| 3 | 1,386,966 | 291,597 | 716 | 49,565 | 5,897 |
| 4 | 1,342,424 | 290,615 | 765 | 49,228 | 5,929 |
| 5 | 1,157,084 | 267,350 | 714 | 37,884 | 4,988 |
| 6 | 1,631,542 | 370,136 | 896 | 64,079 | 7,295 |
| 7 | 1,255,167 | 284,413 | 769 | 44,794 | 5,424 |
| 8 | 1,422,204 | 338,937 | 798 | 52,268 | 6,549 |
| 9 | 1,423,977 | 328,226 | 926 | 50,282 | 6,340 |
| 10 | 1,002,412 | 222,468 | 537 | 23,164 | 5,613 |
| 11 | 955,612 | 214,708 | 603 | 28,679 | 4,183 |
| 12 | 771,362 | 170,298 | 462 | 21,773 | 3,062 |
| 13 | 1,617,902 | 410,438 | 956 | 81,922 | 8,385 |
| 14 | 1,374,393 | 321,456 | 812 | 51,769 | 6,160 |
| 15 | 1,210,587 | 298,692 | 757 | 55,135 | 5,892 |
| 16 | 912,732 | 210,198 | 571 | 29,850 | 4,852 |
| 17 | 788,319 | 174,886 | 484 | 22,770 | 3,139 |
| 18 | 603,633 | 123,587 | 259 | 20,574 | 2,555 |
| Total | 22,404,681 | 5,153,582 | 13,201 | 844,556 | 103,730 |
Co-localization of different types of variation. Pearson correlation coefficient r-values based on 500 kb windows in autosomes are shown in the upper triangle; p-values are shown in the lower triangle
| | SNPs | Small indels | SVs | TRs | pTRs |
|---|---|---|---|---|---|
| SNPs | | 0.87 | 0.48 | -0.43 | 0.57 |
| Small indels | | | 0.49 | -0.43 | 0.55 |
| SVs | | | | -0.23 | 0.36 |
| TRs | | | | | 0.01 |
| pTRs | | | | | |
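The windowed correlation described in the caption can be illustrated with a minimal sketch. It assumes each variant is given as a (chromosome, position) pair and bins variants into 500 kb windows before computing Pearson's r. The function names and the toy coordinates are hypothetical and are not taken from the study's pipeline. As a simplification, only windows containing at least one variant of either type are compared.

```python
from collections import Counter
from math import sqrt

WINDOW = 500_000  # 500 kb, the window size named in the table caption


def window_counts(positions, window=WINDOW):
    """Count variants per (chromosome, window index) bin."""
    return Counter((chrom, pos // window) for chrom, pos in positions)


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)


def colocalization_r(positions_a, positions_b, window=WINDOW):
    """Correlate per-window counts of two variant types.

    Simplification (assumption): only windows that hold at least one
    variant of either type are compared; a missing count is taken as 0.
    """
    a = window_counts(positions_a, window)
    b = window_counts(positions_b, window)
    bins = sorted(set(a) | set(b))
    return pearson_r([a[k] for k in bins], [b[k] for k in bins])


# Toy, made-up coordinates for illustration only; not data from the study.
toy_snps = [("1", 100), ("1", 200), ("1", 300),
            ("1", 600_000), ("1", 1_100_000), ("1", 1_200_000)]
toy_indels = [("1", 150), ("1", 250), ("1", 1_150_000)]
r = colocalization_r(toy_snps, toy_indels)
```

In the toy input the per-window counts are (3, 1, 2) and (2, 0, 1), which differ by a constant, so the two variant types are perfectly correlated across windows.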
Fig. 1 Variant annotation and impact classification. Percentage breakdown for the SNPs, small indels, SVs, TRs, and pTRs. A Percentage of effects classified by region for each type of variation. B Percentage of effects classified by impact for each type of variation
Fig. 2 Gene enrichment analysis. A Enriched GO Biological Processes for SVs overlapping genes (n=1,448). B Top 30 enriched GO Biological Processes for TRs overlapping genes (n=1,229)
Fig. 3 Manhattan plots of the genome-wide association studies for the four traits. The genome-wide significance threshold is given by –log10(0.05/54,075). Significant SVs are shown in red and significant TRs in blue
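The threshold in the Fig. 3 caption can be worked out numerically. The short sketch below only evaluates the expression given in the caption. Reading 0.05 as the significance level and 54,075 as the number of tests (a Bonferroni-style correction) is an assumption, since the caption does not say how the denominator was obtained.

```python
from math import log10

alpha = 0.05       # numerator from the caption
n_tests = 54_075   # denominator from the caption; assumed to be the number of tests

threshold_p = alpha / n_tests             # p-value cutoff, roughly 9.25e-07
threshold_neg_log10 = -log10(threshold_p) # height of the line on the Manhattan plot

print(f"p-value cutoff: {threshold_p:.3e}")
print(f"-log10 threshold: {threshold_neg_log10:.2f}")
```

Under this reading, a variant is genome-wide significant when its –log10(p) lies above roughly 6.03.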
Top five associated genes per chromosome for each trait: average daily gain (ADG), backfat thickness (BFT), meat to fat ratio (MFR), and carcass length (CRCL). Gene type in brackets. The variants overlapping or in the proximity of these genes were selected not to be in high LD (r² < 0.8) with the previously associated SNPs and small indels
| 1 | ||
| 2 | ||
| 4 | ||
| 7 | ||
| 16 | ||
| 1 | ||
| 2 | ||
| 4 | ||
| 7 | ||
| 16 | ||
| 1 | ||
| 2 | ||
| 4 | ||
| 5 | ||
| 7 | ||
| 14 | ||
| 15 | ||
| 18 | ||
| 4 | ||
| 7 | ||
| 8 | ||
| 9 | ||
| 14 | ||
| 16 | ||
| 17 |
Fig. 4 Venn diagram displaying all possible relations among the gene sets overlapping or in the vicinity of significantly associated SVs and TRs. These variants were not tagged (r² < 0.8) by previously identified SNPs or small indels
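The "not tagged (r² < 0.8)" criterion can be sketched as follows. The sketch assumes genotypes are coded as allele dosages (0, 1, 2) per animal and uses the squared Pearson correlation of dosages as a stand-in for LD r². The helper names, the handling of monomorphic variants, and the toy genotypes are all hypothetical and do not reproduce the software used in the study.

```python
def genotype_r2(g1, g2):
    """Squared Pearson correlation between two genotype dosage vectors."""
    n = len(g1)
    m1, m2 = sum(g1) / n, sum(g2) / n
    cov = sum((a - m1) * (b - m2) for a, b in zip(g1, g2))
    v1 = sum((a - m1) ** 2 for a in g1)
    v2 = sum((b - m2) ** 2 for b in g2)
    if v1 == 0 or v2 == 0:
        # Monomorphic variant: r2 is undefined; treated as 0 here (assumption).
        return 0.0
    return cov * cov / (v1 * v2)


def is_untagged(variant, associated_markers, threshold=0.8):
    """True if the variant has r2 below the threshold with every associated marker."""
    return all(genotype_r2(variant, m) < threshold for m in associated_markers)


# Toy dosages for six animals; made up for illustration only.
toy_sv = [0, 1, 2, 0, 1, 2]
toy_snp_linked = [0, 1, 2, 0, 1, 2]    # identical pattern, r2 = 1
toy_snp_unlinked = [0, 0, 1, 2, 2, 1]  # uncorrelated pattern, r2 = 0

untagged_alone = is_untagged(toy_sv, [toy_snp_unlinked])
untagged_with_linked = is_untagged(toy_sv, [toy_snp_unlinked, toy_snp_linked])
```

With the toy data, the SV counts as untagged when only the uncorrelated SNP is considered and as tagged once the identical SNP is included.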
Summary of published data on functional biological roles of the top five gene selection
| ADG | Large loss of muscle mass [ | |
| ADG | Regulation of lipolysis in adipose tissue [ | |
| ADG, CRCL | Ketogenesis [ | |
| ADG, BFT | Obesity in humans [ | |
| ADG, BFT, MFR, and CRCL | Body length in fish [ | |
| ADG, BFT, MFR | Autism spectrum disorders [ | |
| BFT, MFR | Regulate muscle contraction, required for growth and postnatal survival [ | |
| BFT, MFR | Residual Feed Intake in beef cattle [ | |
| BFT, CRCL | Regulating pancreatic development and insulin secretion [ | |
| MFR | Muscle function, muscle growth [ | |
| MFR | Growth in fish [ | |
| MFR | Body fat ratio [ | |
| MFR | Plasma triglyceride concentration [ | |
| MFR | Body fat [ | |
| MFR | Body weight [ | |
| MFR | Fatty acid composition in intramuscular fat [ | |
| MFR | Lipid metabolism and insulin sensitivity [ | |
| MFR | Fat deposition in chicken [ | |
| CRCL | Decreased body length [ | |
| CRCL | Susceptibility to Diet-Induced Obesity [ | |
| CRCL | Susceptibility to Diet-Induced Obesity [ | |
| CRCL | Feeding behavior and metabolic rate [ | |
| CRCL | Decreased body weight [ | |
| CRCL | Decreased body size [ |