| Literature DB >> 21967120 |
Anna V Kukekova1, Jennifer L Johnson, Clotilde Teiling, Lewyn Li, Irina N Oskina, Anastasiya V Kharlamova, Rimma G Gulevich, Ravee Padte, Michael M Dubreuil, Anastasiya V Vladimirova, Darya V Shepeleva, Svetlana G Shikhevich, Qi Sun, Lalit Ponnala, Svetlana V Temnykh, Lyudmila N Trut, Gregory M Acland.
Abstract
BACKGROUND: Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression to humans. To understand the genetic differences underlying these behavioral phenotypes fox-specific genomic resources are needed.Entities:
Mesh:
Year: 2011 PMID: 21967120 PMCID: PMC3199282 DOI: 10.1186/1471-2164-12-482
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Silver fox transcriptome: summary statistics of reads and assemblies.
| Run | Total number of reads | Number of filtered bases (Gb) | Fully aligned (%)* | Partially aligned (%)* | Number of Isogroups* | Number of Isotigs* | Average Isotig Length | N50 Isotig Length | Number of Contigs* (length >= 100 bp) | Total number of bases in contigs at least 100 bp in length | Singletons* (%) | Repeats* (%)* | Outliers* (%)* | Too Short* (%)* | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2,565,892 | 0.96 | 376 +/-148 (433) | 1,921,335 (74.9%) | 250,138 (9.7%) | 37,679 | 50,311 | 1,565 | 2,473 | 55,656 | 52,072,035 | 287,577 (11.2%) | 1,321 (0.1%) | 78,309 (3.1%) | 26,993 (1.1%) | |
| 3,379,343 | 1.21 | 358 +/-146 (403) | 2,458,444 (72.7%) | 338,660 (10.0%) | 44,791 | 60,492 | 1,593 | 2,647 | 67,078 | 60,587,572 | 426,108 (12.6%) | 1,023 (0.1%) | 118,205 (3.6%) | 36,637 (1.1%) | |
| 5,945,235 | 2.17 | 4,551,142 (76.6%) | 571,726 (9.6%) | 59,713 | 87,400 | 1,820 | 3,293 | 96,461 | 83,773,326 | 562,591 (9.5%) | 14,192 (0.2%) | 181,469 (3.1%) | 63,630 1.1% | ||
The number of reads is tabulated for both samples (Tame, Aggressive), both individually and all together, with the total number of bases for each set that passed filtering criteria and were used for mapping against the canine RefSeq database, together with analytic results of the corresponding fox transcriptome assemblies. Isogroups: an isogroup is a collection of contigs containing reads with implied connections between them, analogous to individual genes; Isotigs: an isotig is meant to be analogous to an individual transcript or splice variant; Contigs: a contig represents the consensus sequence of a self-consistent set of overlapping reads, roughly analogous to exons; Singletons: reads that did not overlap with any other reads in the input; Repeats: reads that were either inferred to be repetitive early in the assembly process or determined to partially overlap a contig; Outliers: reads that were identified by the GS De Novo Assembler as problematic, and were excluded from the final contigs; Too Short: the trimmed read was too short (< 50bp) to used in the computation. (see also Methods).
Figure 1Comparison of read length distributions for transcriptome sequences of one tame and one agressive fox. Reads were generated from cDNA from prefrontal cortex using the Roche 454 FLX Titanium platform. Data represents a total of 6 runs comprising 5 half full runs for one tame individual (green) and 6 half full runs for one aggressive fox (red). Reads were quality filtered with standard parameters and cleaned of adaptor sequences. For statistics of reads see table 1.
Summary statistics from mapping fox sequencing reads against the Canine RefSeq database.
| Total number of reads | Fully mapped | Partially mapped | Unmapped | Repeats | Chimeric | Too short to map |
|---|---|---|---|---|---|---|
| 5,945,235 | 21.23% | 2.32% | 56.11% | 17.91% | 1.38% | 1.04% |
Figure 2Mapping fox sequencing reads against the canine RefSeq database: gene yield vs sequence depth. The yield of identifiable genes from fox transcriptome sequencing is plotted as a function of sequencing depth. Sets of reads, each set representing half a sequencing run (approximately 500,000 reads) were combined variously into 9 datasets each representing from half a run (9 datapoints, circled cluster at left extremity of plot) to 11 half-runs (1 datapoint, circled, rightmost extremity of plot). The vertical axis gives the percentage of genes in the canine RefSeq database hit by at last one read for each datapoint. Green diamonds = datapoints comprised of tame reads only; red squares = aggressive reads only; brown circles = datapoint combines tame and aggressive reads.
Figure 3Mapping fox sequencing reads against the canine RefSeq database: breadth of coverage. For each accession number in the canine RefSeq database to which at least one fox read mapped, the breadth of coverage was calculated as the percent length of the canine transcript covered by all fox reads mapping to that accession number. The breadth of coverage appears to be bivariate.
Summary statistics from mapping fox sequencing reads against the repeatmasked CanFam2 sequence assembly of the dog genome.
| Total number of reads | Fully mapped | Partially mapped | Unmapped | Repeats | Chimeric | Too short to map |
|---|---|---|---|---|---|---|
| 5,945,235 | 29.97% | 5.12% | 32.38% | 3.13% | 28.36% | 1.04% |
Numbers of SNPs detected, and their informativeness, in tame and aggressive fox samples.
| Homozygous Tame | Heterozygous Tame | |
|---|---|---|
| Homozygous Aggressive | 7,260 | 7,315 |
| Heterozygous Aggressive | 11,530 | 4,386 |
SNPs were classified by the observed genotypes in the tame and aggressive individuals, as either homozygous in both (i.e. for opposite alleles), heterozygous in both, or homozygous in one individual and heterozygous in the other.
Figure 4Portion of fox chromosome 12 (VVU12) demonstrating SNP distribution and informativeness. The region of VVU12 that is homogous to canine chromosome 35 is graphed to indicate the position and zygosity of SNPs detected by transcriptome analysis of one tame and one aggressive fox. The top row/lines indicates the alignment of CFA35 with this part of VVU12; the next row (Fox map cM) provides fox meiotic map distances; and the bottom row indicates estimated position in megabases, based on extrapolation from the canine genome sequence assembly. On the four central tracks, each SNP is represented by a single tick mark (+) for each individual (Aggressive or Tame), on either the heterozygous or homozygous track for that individual. Similar data is presented for all fox autosomes in Supplementary Figure 3. The grey box highlights a region that has previously been shown [32] to harbor a quantitative trait locus (QTL) associated with specific components of fox behavior with an association peak at 23 cM. The aggressive and tame individuals are homozygous for opposite SNP alleles in this interval. The outlined box on the Tame Het row indicates how the region of homozygosity is extended in the tame individual.
Mapping of SNPs, identified from the fox transcriptome, onto the dog genome sequence.
| Number of SNPs | Type of genomic region |
|---|---|
| 37 | coding |
| 9 | intron |
| 40 | utr |
| 8 | non-gene |
| 5 | repetitive |
| 1 | unknown |
100 randomly selected high-confidence fox SNPs were mapped onto the dog genome sequence assembly (CanFam2) to identify their distribution among different types of genome sequence.
Figure 5Biofunction groups of genes expressed differentially between tame and aggressive foxes. Expression differences between the tame and aggressive individuals were evaluated as the proportionate read count per gene. Genes with at least a 2-fold difference in expression (at p < 0.05) were sorted into Biofunction groups by Ingenuity IPA (Version 8.6). Biofunction groups were then ranked by the negative logarithm of the biofunction p-value, which estimates the probability that the biofunction group is over-represented in the set of differentially expressed genes. The vertical axis gives the statistical support for each bioinformatic group being upregulated in the tame (green) or aggressive individual (red) (a). The 10 highest ranked biofunction groups representing genes with higher expression in the tame individual. (b). The 10 highest ranked biofunction groups representing genes with higher expression in the aggressive individual.
RT-qPCR validation of expression differences identified by transcriptome sequence analysis.
| Transcriptome Sequencing | Relative RT-qPCR | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| XM_848547 | SCGN | 2.24 | 6.28 | 3.28E-10 | 136 | 402 | A>T | 2.44 | A>T |
| XM_544247 | ITGA8 | 4.48 | 4.85 | 1.24E-06 | 68 | 20 | T>A | 5.55 | T>A |
| XM_546414 | CDON | 5.95 | 3.51 | 4.44E-04 | 6 | 47 | A>T | 1.09 | A>T |
| NM_001006648 | HTR2C | 9.22 | 2.69 | 0.01 | 14 | 2 | T>A | 2.89 | T>A |
| NM_001003300 | KCNMA1 | 2.19 | 2.65 | 0.01 | 26 | 75 | A>T | 1.43 | A>T |
| XM_543946 | LCOR | 10.63 | 2.14 | 0.03 | 1 | 14 | A>T | 1.75 | A>T |
| XM_849641 | PRRG2 | 11.71 | 2.11 | 0.03 | 8 | 0 | T>A | 1.31 | T>A |
| XM_846863 | LRRC20 | 10.54 | 2.08 | 0.04 | 8 | 1 | T>A | 1.12 | T>A |
| XM_542630 | SLITRK6 | 9.87 | 2.04 | 0.04 | 1 | 13 | A>T | ||
Nine genes, identified as differentially expressed between the tame and aggressive fox by transcriptome sequence analysis, were selected for expression evaluation by relative RT-qPCR. Results are sorted by Z test p-value. Except for one gene (SLITRK6, result in bold and italic), which had the least significant Z test p-value, RT-qPCR consistently confirmed a difference of expression between the tame and aggressive samples in the same direction as had transcriptome analysis (compare columns 8 and 10).