| Literature DB >> 31114855 |
Vinicius H da Silva1,2,3, Veronika N Laine4, Mirte Bosse1, Lewis G Spurgin5, Martijn F L Derks1, Kees van Oers2, Bert Dibbits1, Jon Slate6, Richard P M A Crooijmans1, Marcel E Visser1,2, Martien A M Groenen1.
Abstract
Chromosome inversions have clear effects on genome evolution and have been associated with speciation, adaptation, and the evolution of the sex chromosomes. In birds, these inversions may play an important role in hybridization of species and disassortative mating. We identified a large (≈64 Mb) inversion polymorphism in the great tit (Parus major) that encompasses almost 1,000 genes and more than 90% of Chromosome 1A. The inversion occurs at a low frequency in a set of over 2,300 genotyped great tits in the Netherlands with only 5% of the birds being heterozygous for the inversion. In an additional analysis of 29 resequenced birds from across Europe, we found two heterozygotes. The likely inversion breakpoints show considerable genomic complexity, including multiple copy number variable segments. We identified different haplotypes for the inversion, which differ in the degree of recombination in the center of the chromosome. Overall, this remarkable genetic variant is widespread among distinct great tit populations and future studies of the inversion haplotype, including how it affects the fitness of carriers, may help to understand the mechanisms that maintain it.Entities:
Keywords: zzm321990 Parus majorzzm321990 ; CNVs; songbird; structural variation
Mesh:
Year: 2019 PMID: 31114855 PMCID: PMC6609730 DOI: 10.1093/gbe/evz106
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—(A) PCA: based on the SNPs located on Chromosome 1A, a principal component analysis revealed two distinct subpopulations. The distinction is given by Eigenvector One, which gave the initial evidence of inversion carriers. (B) F: these two subpopulations display highly differentiated SNPs across the whole of Chromosome 1A, except at regions near to telomeres. (C) Heterozygosity: each subpopulation exhibits a particular heterozygosity level across the Chromosome 1A. The inv-norm subpopulation has many SNPs with high heterozygosity within the region bounded by the tentative breakpoints given by F analysis (3–68 Mb, delimited by the red-dashed lines). The purple dashed line represents the maximum expected in norm-norm birds. SNPs above this threshold are considered informative.
. 2.—The pairwise LD on the Chromosome 1A. (A) measured in 2,296 great tits. (B) measured in 2,179 norm-norm birds. Figures in the lower panels (C and D) support possible recombination events in the center of the inversion. In other words, possible recombination in the center of the inversion is supported by the distinct genotype distribution in comparison with the rest of the inversion and confirmed by R2. As R2 metric has reduced power to detect LD among SNPs with low allele frequency, the LD is reflected only in the center of the inversion. (C) R2 measured in 2,296 great tits reveals an LD block only in the middle of the chromosome. The full inversion does not show elevated LD, due to the limitation of R2 at dealing with low-frequency SNP alleles outside the center of the inversion. (D) Genotype frequency of informative SNPs (heterozygosity >0.6) across Chromosome 1A in the inv-norm subpopulation. The vertical dotted line roughly indicates the genomic region of middle block which harbors a higher number of birds with “AA” genotypes when compared with the rest of the inversion. Along with the LD pattern from R2 method, the genotype frequencies suggest a different genetic structure at the center of the inversion.
. 3.—Genotype distribution within/outside the center of the inversion (20–55 Mb) in inversion carriers. The number of genotypes is represented on a log2 scale to improve the visualization but untransformed values are shown on the upper x axis. Based on the number of “AA” genotypes it is possible to identify inv-norm-birds which harbor a different genotype distribution at the center of the inversion and therefore possibly have different inversion haplotypes (black bars among the dashed lines).
. 4.—CNVs in the inversion breakpoint. (A) CNV frequency across the Chromosome 1A and the genomic interval of the previously identified CNV region “2802” (64.83–67.67 Mb; da Silva et al. 2018), which is located at the inversion breakpoint. (B) F values across the chromosome. A red circle is highlighting the SNP used to the PCR-RFLP analysis. (C) A CNV in the inversion breakpoint is present in the vast majority of inv-norm birds whereas is rarely found in norm-norm birds. (D) Digestion pattern of the PCR-RFLP at the SNP AX-100689781. The black bars represent the expected gel patterns alongside each of the two observed patterns in each subpopulation (i.e. norm-norm and inv-norm). Distinct copy number genotypes are evidenced by the allele intensities in the gel after electrophoresis. The values above each gel picture depicts the fingerprint name and the degree of confidence to tag a specific karyotype state (i.e. percent of the birds with concordant inversion genotype between SNP array and PCR-RFLP). Green was used in highly confident profiles, blue in the medium confidence one, and red for B4, which has high heterozygosity (expected in inv-norm) but was only identified in two norm-norm birds. To differentiate between fingerprints note the distinct intensities of subsets of bands; between B1 and B2 the greatest difference is mainly at the 300/169 bp bands and between B3 and B4 the greatest difference is between the 469/300 bp bands.
. 5.—Representation of the whole Chromosome 1A with the complex structural rearrangement in the downstream breakpoint of the inversion. Blocks in gray represent the inversion region whereas those in black are genomic regions outside the inversion. CNVs identified by sequencing in the two inv-norm birds which were sequenced are labeled as CNV1-3 for simplicity. Horizontal curly brackets define the structural complex which encompasses CNVs 1-3. The above chromosomal representation displays the chromosome as shown in the reference genome (Laine et al. 2016). The below representation displays the expected genomic structure in the inversion. CNVs are relatively larger than their real length for schematic purposes.