| Literature DB >> 28552356 |
Thomas W Marchant1, Edward J Johnson1, Lynn McTeir1, Craig I Johnson1, Adam Gow1, Tiziana Liuti1, Dana Kuehn2, Karen Svenson3, Mairead L Bermingham4, Michaela Drögemüller5, Marc Nussbaumer6, Megan G Davey1, David J Argyle1, Roger M Powell7, Sérgio Guilherme8, Johann Lang9, Gert Ter Haar10, Tosso Leeb5, Tobias Schwarz1, Richard J Mellanby1, Dylan N Clements1, Jeffrey J Schoenebeck11.
Abstract
In morphological terms, "form" is used to describe an object's shape and size. In dogs, facial form is stunningly diverse. Facial retrusion, the proximodistal shortening of the snout and widening of the hard palate is common to brachycephalic dogs and is a welfare concern, as the incidence of respiratory distress and ocular trauma observed in this class of dogs is highly correlated with their skull form. Progress to identify the molecular underpinnings of facial retrusion is limited to association of a missense mutation in BMP3 among small brachycephalic dogs. Here, we used morphometrics of skull isosurfaces derived from 374 pedigree and mixed-breed dogs to dissect the genetics of skull form. Through deconvolution of facial forms, we identified quantitative trait loci that are responsible for canine facial shapes and sizes. Our novel insights include recognition that the FGF4 retrogene insertion, previously associated with appendicular chondrodysplasia, also reduces neurocranium size. Focusing on facial shape, we resolved a quantitative trait locus on canine chromosome 1 to a 188-kb critical interval that encompasses SMOC2. An intronic, transposable element within SMOC2 promotes the utilization of cryptic splice sites, causing its incorporation into transcripts, and drastically reduces SMOC2 gene expression in brachycephalic dogs. SMOC2 disruption affects the facial skeleton in a dose-dependent manner. The size effects of the associated SMOC2 haplotype are profound, accounting for 36% of facial length variation in the dogs we tested. Our data bring new focus to SMOC2 by highlighting its clinical implications in both human and veterinary medicine.Entities:
Keywords: GWAS; QTL; SMOC2; brachycephaly; craniofacial; dog; morphology; retrotransposon; selection; whole-genome sequencing
Mesh:
Substances:
Year: 2017 PMID: 28552356 PMCID: PMC5462623 DOI: 10.1016/j.cub.2017.04.057
Source DB: PubMed Journal: Curr Biol ISSN: 0960-9822 Impact factor: 10.834
Figure 1Capturing Gross Interbreed and Subtle Intrabreed Variation in Skull Shape
(A) Three-dimensional isosurfaces of canine skulls are reconstructed from computed tomography (CT) scans of referral patients.
(B–E) Lateral images of a smooth collie (B; dolichocephalic), Bernese mountain dog (C; mesocephalic), border terrier (D; mesocephalic), and pug (E; brachycephalic) with corresponding isosurfaces were included in our analysis. Head images and isosurfaces are not to scale.
(F) Lateral and dorsoventral views of the canine skull with wireframe diagrams superimposed, representing the changes in viscerocranium shape for negative and positive viscerocranium PC1 scores (“ve PC”). Red circles indicate surface landmarks of the rostrum. Connecting blue lines are added to provide visual context to shape. Circles connected by black dotted lines indicate landmarks of the hard palate.
(G) Individual breed members cluster together when viscerocranium shape (viscerocranium PC1) is plotted against body size (neurocranium centroid). BMD-, Bernese mountain dog; BORD, border collie; BORT, border terrier; BOX-, boxer; COLL, smooth collie; PUG-, pug; YORK, Yorkshire terrier.
See also Figures S1 and S2 and Table S1.
Figure 2Morphology of Skull Substructures Are Associated with Multiple QTL
Manhattan plots for neurocranium centroid size (A) and viscerocranium PC1 GWASs (B). Red dashed line (3.6 × 10−7) indicates threshold for multiple testing with significant SNPs colored red. The associated SNPs and candidate genes at each locus are summarized in Table 1. Insets: skull schematics indicate the region of landmarks used for datasets. Expected (x axis) and observed (y axis) −log10(p) values are plotted for all SNPs (black circles) and pruned SNPs (gray circles). Histograms depict the frequency (y axis) of viscerocranium PC1 and neurocranium centroid, respectively. See also Figures S1 and S3 and Table S2.
SNPs Showing Genome-wide Significance with Skull Datasets
| Dataset | Chromosome | Index SNP | Position | Candidate Gene | Allele | p Value |
|---|---|---|---|---|---|---|
| Viscerocranium | 1 | BICF2P250912 | 55,983,871 | G > A | 1.91 × 10−20 | |
| Mandible | 1 | BICF2P250912 | 55,983,871 | G > A | 8.43 × 10−10 | |
| Neurocranium | 3 | TIGRP2P56799_rs8666557 | 91,103,945 | G > T | 3.64 × 10−9 | |
| Neurocranium | 7 | BICF2S23352941 | 43,719,549 | A > G | 5.71 × 10−13 | |
| Neurocranium | 10 | G580f46S240 | 8,183,593 | C > T | 3.06 × 10−15 | |
| Neurocranium | 15 | BICF2P355320 | 41,257,020 | C > T | 1.73 × 10−19 | |
| Neurocranium | 18 | BICF2S23615757 | 20,272,961 | A > C | 3.31 × 10−8 |
See also Table S2. For intragenic SNPs, genes are denoted by asterisks.
Only index SNPs are listed. A complete list of significant SNPs is shown in Table S2.
Derived alleles are shown after ancestral alleles
Figure 3Regional Association and Critical Interval Determination of the CFA1 Viscerocranium QTL
(A) SNP associations with viscerocranium PC1 are shown for ∼1 Mb on either side of significant SNPs on CFA1. SNPs are colored depending on the degree of LD (r2) with the index SNP (BICF2P250912; 1.91 × 10−20).
(B) Ten-SNP sliding window haplotype association.
(C) Genotypes between 55,881,672 and 56,020,217 (including ∼500 kb of flanking sequence) were phased and ranked by their viscerocranium PC1 value. Only haplotypes from brachycephalic dogs (viscerocranium PC1 ≤ −0.2; see Figure S3) were considered. Haplotypes are paired by subject and ranked by viscerocranium PC1 value. Alleles colored light gray match the consensus haplotype; dark gray alleles are variant. A 187.7-kb critical interval is defined by at least three meiotic recombinations (indicated above by black bar). The 12 SNPs that constitute the associated haplotype (red bar) are distributed within or up to ∼44 kb downstream of SMOC2. Black arrows indicate 3 of 37 dogs that have a homozygous variant haplotype. These dogs are registered as two Dogues de Bordeaux and a Chihuahua. The red arrow indicates a Japanese Chin that is homozygosed for a recombinant haplotype within the critical interval.
See also Figures S2 and S6 and Table S3.
Variant Filtering within the Viscerocranium Critical Interval
| Software | GATK/SnpSft | Pindel |
|---|---|---|
| Variant type | SNPs/INDELS | Structural variants |
| Base pairs analyzed | 187,377 | 187,377 |
| Pre-filtering | 3,674 | 162 |
| Post-filtering | 4 | 1 |
Filtering criteria are listed in Table S4. See also Figure S4 and Table S4.
Figure 4Characterization of the Intronic LINE-1 Retrotransposon within SMOC2
(A) Schematic of a full-length canine LINE-1 element consisting of 5′ UTR/3′ UTR, open reading frames 1 and 2 (ORF1/ORF2), and a polyadenylated tail (AAAAn) flanked by target site duplications (TSD). The structural variant within SMOC2 is 1,506 bp in length (in addition to a poly(A) tail) and has a 99.1% match to the consensus sequence of canine LINE-1.
(B and C) Distribution of the SMOC2 LINE-1 fragment for (B) viscerocranium PC1 and neurocranium centroid size (C) across all individuals.
(D) Ventral-dorsal view of the canine hard palate and its constituent bones.
(E) Length and width of the canine palate and constituent bones normalized by the neurocranium centroid for homozygous ancestral (white), heterozygotes (gray), and homozygous-derived (black) individuals for the SMOC2 LINE-1 insertion.
(F) Relative expression levels of SMOC2 both up- and downstream of the LINE insertion (<0.05 ∗; <0.01 ∗∗; <0.001 ∗∗∗). Error bars represent SEM.
(G) RNA sequencing (RNA-seq) data reveal four genes with significant changes in mRNA levels (red) for homozygous SMOC2 LINE-1 carriers compared to non-carriers (three each). Neighboring genes to SMOC2 are colored green.
(H) Schematic of genomic DNA (gDNA) spanning exon 8 and 9 of SMOC2, including the LINE-1 fragment. mRNA transcripts include the canonical splicing of SMOC2 (i) followed by the three most abundant SMOC2 isoforms when the LINE-1 element is present (ii–iv). All isoforms have premature stop codons prior to exon 9. C/T indicates the SNP in exon 8. Schematic is not to scale.
(I) Exons 1–13 of SMOC2 contribute to a follistatin-like module (FS), thyroglobulin-like modules (TY), a unique SMOC module, an extracellular calcium-binding module (EC), and a signal peptide (SP).
See also Figures S5 and S7 and Tables S5 and S6.
Figure 5Size Effects of the Viscerocranium Shape and Neurocranium Centroid Size QTL
(A and C) Boxplots depicting the distribution of normalized size-corrected viscerocranium PC1 (A) and normalized neurocranium centroid size (C) for 11 loci linked with body size and skull shape. Distributions are subdivided by genotype —homozygous ancestral (AA), heterozygotes (AD), and homozygous derived (DD). ∗∗∗ denotes p < 0.001 in Mann-Whitney-Wilcoxon and Kolmogorov-Smirnov tests.
(B and D) A stepwise linear regression model for viscerocranium PC1 (B) and neurocranium centroid (D) determined the best explanatory model for ancestral (left) and derived (right) genotypes for each positional candidate.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Various veterinary referral hospitals | N/A | |
| Trizol | Life Technologies | 15596026 |
| RNAlater | Life Technologies | AM7020M |
| CanineHD Whole-Genome Genotyping SNP BeadChip | Illumina | WG-440-1001 |
| Truseq DNA nano kit | Illumina | FC-121-4001 |
| TruSeq Stranded mRNA Library Prep Kit High Throughput | Illumina | RS-122-2103 |
| Illumina TruSeq Nano DNA library prep HT | SeqLab | 20000903 |
| RNA and DNA sequencing data | This paper | ENA: PRJEB17926, |
| Dog reference genome (CanFam3.1, ENSEMBL release-85) | ENSEMBL | |
| Dog genotypes | This paper | |
| Dog genetic variants | Dog Biomedical Variant Database Consortium ( | N/A |
| Mouse: Smoc2tm1.1(KOMP)Vlcg | The Jackson Laboratory | |
| Mouse embryos | Roslin Institute Biological Research Facility | N/A |
| Chicken embryos | NARF | |
| gDNA targeted primer: | This paper | N/A |
| gDNA targeted primer: | This paper | N/A |
| gDNA targeted primer: | This paper | N/A |
| gDNA targeted primer: | [ | N/A |
| gDNA targeted primer: | [ | N/A |
| gDNA targeted primer: | [ | N/A |
| gDNA targeted primer: | [ | N/A |
| gDNA targeted primer: | This paper | N/A |
| gDNA targeted primer: | This paper | N/A |
| gDNA targeted primer: | This paper | N/A |
| gDNA targeted primer: | This paper | N/A |
| gDNA targeted primer: | This paper | N/A |
| gDNA targeted primer: | This paper | N/A |
| gDNA targeted primer: | [ | N/A |
| gDNA targeted primer: | [ | N/A |
| gDNA targeted primer: CFA30 QTL Forward: AGGGATAGTCCAGCTCCAAGGCTGGTAT | This paper | N/A |
| gDNA targeted primer: CFA30 QTL Reverse: CTCTTTCAGGCTTCCCCAGTTGTACCTA | This paper | N/A |
| gDNA targeted primer: | [ | N/A |
| gDNA targeted primer: | [ | N/A |
| gDNA targeted primer: | [ | N/A |
| gDNA targeted primer: | [ | N/A |
| cDNA targeted primer: | This paper | N/A |
| cDNA targeted primer: | This paper | N/A |
| cDNA targeted primer: | This paper | N/A |
| cDNA targeted primer: | This paper | N/A |
| cDNA targeted primer: | [ | N/A |
| cDNA targeted primer: MRPS7 Reverse: CAGCAGCTCGTGTGACAACT | [ | N/A |
| Read alignment: bwa v0.7.8 | [ | |
| Variant caller: GATK v3.7 | [ | |
| WGS utility: Picard | ||
| Structural variant caller: Pindel v0.2.3 | [ | |
| Annotation: SNPsift v4.0 | [ | |
| Effect prediction: HaploReg v4.1 | [ | N/A |
| Effect prediction: CADD v3.1 | [ | N/A |
| Utility: PLINK v1.07 | [ | |
| Utility: PLINK v1.90 beta | [ | |
| Admixture assessments: STRUCTURE v2.3 | [ | |
| Linear mixed model: GEMMA v0.94.1 | [ | |
| Phasing: SHAPEIT v2.r837 | [ | |
| Graphics and data analysis: R v3.3.0 | The Comprehensive R Archive Network (CRAN) | |
| RNaseq alignment: STAR v2.5.1b | [ | |
| RNaseq analysis: DESeq2 v1.12.14 | [ | |
| RNaseq analysis: RSubread v1.22.3 | [ | |
| Data visualization: Integrative Genomics Viewer v2.3.59 | [ | |
| DICOM reconstruction and landmarking: CheckPoint v2016.11.21.0711 WIN x64 | Stratovan | |
| ImageJ v1.50 g | [ | |
| Geometric morphometrics: MorphoJ v1.06c | [ | |
| KOD Xtreme HotStart Polymerase | Merck | 71975-3 |
| Saliva sample collection kit | Peformagene | PG-100 |
| Lysing matrix D 2mL tube | MPBio | 116913050 |
| RNeasy Minikit | QIAGEN | C-74104 |
| SuperScript III First- Strand Synthesis SuperMix | Life Technologies | 11752050 |