| Literature DB >> 35189950 |
Tomas Fitzgerald1, Ian Brettell1, Adrien Leger1, Nadeshda Wolf2, Natalja Kusminski2, Jack Monahan1, Carl Barton1, Cathrin Herder2, Narendar Aadepu2,3, Jakob Gierten3, Clara Becker3, Omar T Hammouda3, Eva Hasel3, Colin Lischik3, Katharina Lust3, Natalia Sokolova3, Risa Suzuki3, Erika Tsingos3, Tinatini Tavhelidse3, Thomas Thumberger3, Philip Watson3, Bettina Welz3, Nadia Khouja2, Kiyoshi Naruse4, Ewan Birney1, Joachim Wittbrodt3, Felix Loosli5.
Abstract
BACKGROUND: Unraveling the relationship between genetic variation and phenotypic traits remains a fundamental challenge in biology. Mapping variants underlying complex traits while controlling for confounding environmental factors is often problematic. To address this, we establish a vertebrate genetic resource specifically to allow for robust genotype-to-phenotype investigations. The teleost medaka (Oryzias latipes) is an established genetic model system with a long history of genetic research and a high tolerance to inbreeding from the wild.Entities:
Keywords: Copy number variation; Genetics; Genome sequencing; Inbred panel; Medaka; Population genetics; Quantitative traits; eQTL
Mesh:
Year: 2022 PMID: 35189950 PMCID: PMC8862526 DOI: 10.1186/s13059-022-02623-z
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 17.906
Fig. 1Inbreeding, fecundity and eye size in the MIKK panel lines. A Status of all MIKK panel lines during the first 14 generations of inbreeding, showing cause of death for non-extant lines. B Average fecundity of MIKK panel lines in generation F16, as measured during peak egg production in July 2020. C Distribution of mean relative eye size for male and female medaka across all MIKK panel lines
Fig. 2Homozygosity of the MIKK panel. A The overall proportion of homozygous 10 kb genomic regions comparing the MIKK panel to classical inbred and wild Kiyosu fish. B The proportion of homozygous 10-kb genomic windows across the MIKK panel for each chromosome. C The proportion of homozygous 10-kb genomic windows (excluding chr1) for all MIKK panel lines
Fig. 3Heterozygosity across chromosome 1 in the MIKK panel defines the sex-determining region. A Homozygosity state from the HMM for all 10-kb windows across all MIKK panel lines (the color ordering white, gray, black, and red represent increasing levels of heterozygosity). B Two different estimates of the proportion of MIKK panel lines that are classified as heterozygous across all 10-kb windows on chromosome 1 (blue: based on heterozygous SNP count; red: based on HMM state space)
Fig. 4Identity by state (IBS) across the MIKK panel and classical inbred strains. A 1-IBS from all medaka samples, including classical inbred strains. B 1-IBS for all MIKK panel lines that have an available “sibling line”
Fig. 5Fixation Index (F) comparing allele frequencies of the MIKK panel against wild Kiyosu medaka. A Mean FST in 10-kb windows across the medaka genome. B log10 of the number of SNPs per 10-kb window. C log10 of number of SNPs per window vs mean F
Fig. 6ABBA-BABA analysis. A Phylogenetic tree generated from the Ensembl release 102 50-fish multiple alignment, showing only the medaka lines used in the ABBA-BABA analysis. B Schema of the comparisons carried out in the ABBA-BABA analysis. C Circos plot comparing introgression (d) between the MIKK panel and either iCab (yellow), HNI (orange), or HSOK (purple), calculated within 500-kb sliding windows using a minimum of 250 SNPs per window
Fig. 7Minor allele frequency distributions and LD decay for biallelic, non-missing SNPs in the 1000 Genomes Phase 3 variant calls (N = 2504) (1KG), and the MIKK panel Illumina-based calls excluding one of each pair of sibling lines (N = 63), across all autosomes (1KG: chrs 1-22; MIKK: chrs 1-24). A Density of allele frequencies in the 1KG and MIKK panel calls. B LD decay for each autosome, calculated by taking the mean r2 of pairs of SNPs with MAF > 0.1 within non-overlapping 100-bp windows of distance from one another, up to a maximum of 10 kb. Inset: mean r2 within 100-bp windows, up to a maximum of 1 kb. LD decays faster on chromosome 2 for the MIKK panel due to its higher recombination rate, consistent with the genetic linkage map described in [58]
Fig. 8eQTL analysis results. A Manhattan plot reporting only the 5 best p-values per transcript. For visual comfort, the p-value threshold (dashed line) was set to 1e−6. B Zoomed-in slice of the Manhattan plot, showing all p-values in the genomic window chr18:0-1200000. Transcripts associated with at least 1 p-value lower than 10e−6 are highlighted in color, with all SNPs associated shown in the same color. The width of the colored bands corresponds to the full length of the transcript. C Normalized TPM expression of MIKK lines for 3 significant eQTL SNPs associated with transcript ENSORLT00000002786.2 (cyp17a2 gene). Lines are sorted according to their genotypes (AA, AB, or BB). Each point represents one individual line, overlaid with letter-value plot boxes showing the quantile distribution [59]. D Metagene density plot with upstream and downstream 500k regions showing the density of significant eQTL hits according to their positions relative to a prototypic gene. Values are aggregated in 100 bins of 5000 bases for the adjacent regions and 50 bins dynamically computed for p-values falling inside gene bodies. Densities are weighted by bin length to allow comparisons across the different regions