| Literature DB >> 26747202 |
Shumaila Sayyab1, Agnese Viluma2, Kerstin Bergvall3, Emma Brunberg4, Vidhya Jagannathan5, Tosso Leeb5, Göran Andersson6, Tomas F Bergström6.
Abstract
Over 250 Mendelian traits and disorders, caused by rare alleles have been mapped in the canine genome. Although each disease is rare in the dog as a species, they are collectively common and have major impact on canine health. With SNP-based genotyping arrays, genome-wide association studies (GWAS) have proven to be a powerful method to map the genomic region of interest when 10-20 cases and 10-20 controls are available. However, to identify the genetic variant in associated regions, fine-mapping and targeted resequencing is required. Here we present a new approach using whole-genome sequencing (WGS) of a family trio without prior GWAS. As a proof-of-concept, we chose an autosomal recessive disease known as hereditary footpad hyperkeratosis (HFH) in Kromfohrländer dogs. To our knowledge, this is the first time this family trio WGS-approach has been used successfully to identify a genetic variant that perfectly segregates with a canine disorder. The sequencing of three Kromfohrländer dogs from a family trio (an affected offspring and both its healthy parents) resulted in an average genome coverage of 9.2X per individual. After applying stringent filtering criteria for candidate causative coding variants, 527 single nucleotide variants (SNVs) and 15 indels were found to be homozygous in the affected offspring and heterozygous in the parents. Using the computer software packages ANNOVAR and SIFT to functionally annotate coding sequence differences, and to predict their functional effect, resulted in seven candidate variants located in six different genes. Of these, only FAM83G:c155G > C (p.R52P) was found to be concordant in eight additional cases, and 16 healthy Kromfohrländer dogs.Entities:
Keywords: FAM83G; NGS; WGS; canine; whole-genome-sequencing
Mesh:
Substances:
Year: 2016 PMID: 26747202 PMCID: PMC4777115 DOI: 10.1534/g3.115.025643
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Schematic representation of the filtering pipeline used to evaluate genetic variants.
Figure 2The family trio used for WGS and the validation of the missense variant at FAM83G (c155G > C, p.R52P) by Sanger sequencing. (A) The genome of Kromfohrländer dogs from a family trio consisting of an offspring affected by HFH and both its healthy parents was sequenced. (B) Both parents were heterozygous G/C (S) at position 155 of the FAM83G gene, and the affected offspring was homozygous C/C, resulting in a change at amino acid position 52 from an arginine to proline. The nucleotide sequence of codons 51–53 from the canine genome reference sequence is shown at the top.
Validation of the genotypes obtained by WGS of the family trio
| Gene | |||||||
|---|---|---|---|---|---|---|---|
| Position | chr5: 41014230 | chr5: 41014231 | chr5: 41055619 | chr6: 24951743 | chr37: 30832376 | chr12: 14853811 | chr20: 39951727 |
| Affected offspring | T/T | T/T | C/C | T/T | T/T | T/T | T/T |
| Healthy sire | G/T | C/T | C/G | G/T | C/T | C/T | G/T |
| Healthy dam | G/T | C/T | C/G | G/T | C/T | C/T | G/T |
Genotyping was made by Sanger sequencing of the seven candidate causative variants located in six different genes: GRAPL, FAM83G, PDILT, DLGAP2, TDRD6, and USP4. The genes and their respective chromosomal positions are indicated above. Genotypes obtained from the affected offspring, the healthy sire, and the healthy dam are indicated.
Genotyping by Sanger sequencing of the seven candidate causative variants located in six different genes to validate the concordance among cases
| Differences in: | GRAPL | GRAPL | FAM83G | PDILT | DLGAP2 | TDRD6 | USP4 |
|---|---|---|---|---|---|---|---|
| Position | chr5:41,014,230 | chr5:41,014,231 | chr5:41,055,619 | chr6:24,951,743 | chr37:30,832,376 | chr12:14,853,811 | chr20:39,951,727 |
| Case 1 | T/T | T/T | C/C | G/T | C/T | C/C | T/T |
| Case 2 | T/T | T/T | C/C | G/T | C/C | C/C | G/G |
| Case 3 | T/T | T/T | C/C | G/T | C/T | C/T | G/G |
| Case 4 | T/T | T/T | C/C | T/T | C/C | T/T | G/G |
| Case 5 | T/T | T/T | C/C | G/T | C/T | C/T | T/T |
| Case 6 | T/T | T/T | C/C | G/G | C/T | T/T | T/T |
| Case 7 | T/T | T/T | C/C | G/G | C/T | C/T | G/G |
| Case 8 | T/T | T/T | C/C | G/G | C/T | C/T | G/T |
| Control 1 | G/G | C/C | G/G | G/T | T/T | C/T | T/T |
| Control 2 | G/G | C/C | G/G | G/G | T/T | C/T | G/G |
| Control 3 | G/G | C/C | G/G | G/T | T/T | C/T | G/G |
| Control 4 | G/G | C/C | G/G | G/G | C/T | C/C | G/G |
| Control 5 | G/T | C/T | C/G | G/T | C/T | C/T | G/G |
| Control 6 | G/T | C/T | G/G | G/T | C/T | C/T | G/T |
| Control 7 | G/G | C/C | G/G | G/G | C/T | C/C | G/G |
| Control 8 | G/G | C/C | G/G | G/G | C/T | C/C | G/G |
| Control 9 | G/T | C/T | C/G | G/G | T/T | C/C | G/T |
| Control 10 | G/G | C/C | G/G | G/G | C/T | C/C | G/T |
| Control 11 | G/T | C/T | C/G | G/T | C/C | C/T | G/G |
| Control 12 | G/G | C/C | G/G | G/T | C/T | C/T | G/G |
| Control 13 | G/G | C/C | G/G | G/T | C/C | C/T | G/G |
| Control 14 | G/G | C/C | G/G | G/G | C/T | C/C | G/G |
| Control 15 | G/G | C/C | G/G | G/T | C/T | C/T | G/G |
| Control 16 | G/G | C/C | G/G | G/T | C/C | C/T | G/T |
| CanFam3.1 | G/G | C/C | G/G | G/G | C/C | C/C | G/G |
The GRAPL, FAM83G, PDILT, DLGAP2, TDRD6, and USP4 genes and their respective chromosomal positions with genotypes are indicated. The eight additional affected individuals (cases 1–8), the 16 healthy controls (controls 1–16), and in the CanFam3.1 reference genome sequence are indicated to the left.