| Literature DB >> 27651241 |
Robin Dowell1,2,3, Aaron Odell4, Phillip Richmond4,5, Daniel Malmer6, Eitan Halper-Stromberg7, Beth Bennett8, Colin Larson8, Sonia Leach7, Richard A Radcliffe9.
Abstract
The Inbred Long- and Short-Sleep (ILS, ISS) mouse lines were selected for differences in acute ethanol sensitivity using the loss of righting response (LORR) as the selection trait. The lines show an over tenfold difference in LORR and, along with a recombinant inbred panel derived from them (the LXS), have been widely used to dissect the genetic underpinnings of acute ethanol sensitivity. Here we have sequenced the genomes of the ILS and ISS to investigate the DNA variants that contribute to their sensitivity difference. We identified ~2.7 million high-confidence SNPs and small indels and ~7000 structural variants between the lines; variants were found to occur in 6382 annotated genes. Using a hidden Markov model, we were able to reconstruct the genome-wide ancestry patterns of the eight inbred progenitor strains from which the ILS and ISS were derived, and found that quantitative trait loci that have been mapped for LORR were slightly enriched for DNA variants. Finally, by mapping and quantifying RNA-seq reads from the ILS and ISS to their strain-specific genomes rather than to the reference genome, we found a substantial improvement in a differential expression analysis between the lines. This work will help in identifying and characterizing the DNA sequence variants that contribute to the difference in ethanol sensitivity between the ILS and ISS and will also aid in accurate quantification of RNA-seq data generated from the LXS RIs.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27651241 PMCID: PMC5110614 DOI: 10.1007/s00335-016-9663-6
Source DB: PubMed Journal: Mamm Genome ISSN: 0938-8990 Impact factor: 2.957
Summary of genome-wide variants identified in the ILS and ISS using the short insert paired-end library
| Variant typea | ILSb | ISSc | Commond |
|---|---|---|---|
| Total | 1,582,616 | 1,114,887 | 1,434,163 |
| SNPs | 1,346,137 | 943,224 | 1,226,435 |
| Deletions | 118,831 | 87,076 | 107,161 |
| Insertions | 117,648 | 84,587 | 100,567 |
| Coding total | 15,472 | 10,732 | 15,219 |
| Synonymous | 9833 | 7140 | 9563 |
| Non-synonymous | 5391 | 3421 | 5451 |
| Coding deletions | 57 | 39 | 53 |
| Coding insertions | 47 | 39 | 44 |
| Coding frameshift | 108 | 74 | 87 |
| Coding stop | 36 | 19 | 21 |
| dbSNP | 1,372,946 | 949,884 | 1,281,181 |
| Private (not in dbSNP) | 209,670 | 165,003 | 152,982 |
aThe variant type is in comparison to the reference
bDifferent than both ISS and reference
cDifferent than both ILS and reference
dSame in ILS and ISS, different than reference
Fig. 1Differences between ILS and ISS strains: Variant Breakdown. a Combined totals of variants that differ between the ILS and ISS strains. Synonymous mutations refer to a SNP(s) that alters the codon sequence but not the amino acid produced. Synonymous and non-synonymous mutations arise from the impact of a SNP on the underlying codon. Frameshifts are any indel not divisible by three, whereas deletions and insertions retain the reading frame. Variants are counted once per gene. b Combined totals of structural variations that differ between ILS and ISS strains. Losses and gains refer to changes in copy number, whereas a deletion is a complete loss/absence of a region. Chromosomal translocations refer to exchanges of large segments between chromosomes and inversions are reversals. c Distribution of structural variations by type, summarized by a sliding window approach (500 k windows with 100 k step size) (Color figure online)
Summary of genome-wide structural variants identified in the ILS and ISS
| Variant typea | ILSb | ISSc |
|---|---|---|
| Total | 4271 | 2882 |
| Deletions | 1488 | 1091 |
| Gains | 527 | 246 |
| Losses | 2168 | 1467 |
| Inversions | 29 | 31 |
| Translocations | 59 | 47 |
aThe type of variant, in comparison to the reference
bDifferent than both ISS and reference
cDifferent than both ILS and reference
Fig. 2Chromosomal breakdown of ancestor inference. For each chromosome, distributions of inferred ancestry in the ILS (on left) and ISS (on right) genomes. QTL regions (as listed in Table S6) are boxed (Color figure online)
Fig. 3QTLs have an elevated variant density. Histogram summarizing the variant density (variants/kb) throughout the genome on autosomes. Red for non-QTL regions; blue for QTL regions (Color figure online)
Fig. 4Impact of distinct reference genomes on differential expression. a Venn diagram comparing differential expression results between ILS and ISS whole brain RNA-seq samples when mapped to the reference genome (mm10, red) versus strain-specific genomes created from SeqNature (green). Bar charts show fraction of genes that were borderline to cutoff, increased or decreased when showing mapping-specific significance. b Venn diagram comparing DEXSeq-annotated exonic binned differential expression results when mapped to mm10 versus the strain-specific genomes. c CDF plot of strain-distinguishing variants over DEXSeq-annotated exonic bins for genes considered differentially expressed only when mapped to mm10 (red), only when mapped to strain-specific genomes (green), and common to both mappings (blue). d DEXSeq gene plot showing expression level (y-axis) for each exon (x-axis) for ENSMUSG00000021156 mapped to mm10 (above) and strain-specific genomes (below). Exonic bins considered differentially expressed against mm10 are no longer considered differentially expressed when mapped to the strain-specific genomes (Color figure online)