| Literature DB >> 29242387 |
Thomas D Brekke1, Katherine A Steele2, John F Mulley3.
Abstract
Nonmodel rodents are widely used as subjects for both basic and applied biological research, but the genetic diversity of the study individuals is rarely quantified. University-housed colonies tend to be small and subject to founder effects and genetic drift; so they may be highly inbred or show substantial genetic divergence from other colonies, even those derived from the same source. Disregard for the levels of genetic diversity in an animal colony may result in a failure to replicate results if a different colony is used to repeat an experiment, as different colonies may have fixed alternative variants. Here we use high throughput sequencing to demonstrate genetic divergence in three isolated colonies of Mongolian gerbil (Meriones unguiculatus) even though they were all established recently from the same source. We also show that genetic diversity in allegedly "outbred" colonies of nonmodel rodents (gerbils, hamsters, house mice, deer mice, and rats) varies considerably from nearly no segregating diversity to very high levels of polymorphism. We conclude that genetic divergence in isolated colonies may play an important role in the "replication crisis." In a more positive light, divergent rodent colonies represent an opportunity to leverage genetically distinct individuals in genetic crossing experiments. In sum, awareness of the genetic diversity of an animal colony is paramount as it allows researchers to properly replicate experiments and also to capitalize on other genetically distinct individuals to explore the genetic basis of a trait.Entities:
Keywords: Meriones; genetic diversity; inbreeding; laboratory rodents; replication
Mesh:
Year: 2018 PMID: 29242387 PMCID: PMC5919727 DOI: 10.1534/g3.117.300495
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Relative coverage in females and males can be used to identify sex chromosomes. Plotted here is standardized coverage in females against standardized coverage in males; histograms show the density of markers along each axis. The markers shown are those with low overall coverage; a long tail exists in both females and males and is not shown here. Markers with <10× total standardized coverage were annotated as unknown (gray) because those have too little coverage to reliably distinguish X- and Y-linkage from autosomes. Of the remaining tags, those with <1× standardized coverage in females are annotated as Y-linked (blue). Those that satisfy the inequality: coveragemale < 3/4 coveragefemale − 5 were identified as X-linked (red). This line has a slope designed to discriminate points in the X-linked cluster (slope = 1/2) from those in the autosomal cluster (slope = 1) while remaining fairly conservative near the origin. All remaining tags were annotated as autosomal (black).
Diversity metrics in gerbils across the genome
| Colony | Autosomal | X-Linked | Y-Linked | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bases | Private | Poly. | π | Het. | Bases | Private | Poly. | π | Het. | Bases | Private | Poly. | π | Het. | ||
| Bangor | 12 | 2,131,103 | 5702 | 24,516 | 0.0054 | 0.459 | 53,540 | 134 | 822 | 0.0039 | 0.619 | 32,014 | 32 | 489 | 0.0067 | 0.809 |
| Sheffield | 5 | 1,485,832 | 592 | 11,409 | 0.0049 | 0.421 | 45,414 | 26 | 547 | 0.0060 | 0.630 | 20,850 | 12 | 265 | 0.0086 | 0.772 |
| Edinburgh | 5 | 1,688,545 | 1336 | 8,422 | 0.0032 | 0.313 | 50,487 | 40 | 564 | 0.0046 | 0.558 | 18,560 | 20 | 253 | 0.0100 | 0.846 |
| All colonies and F1’s | 24 | 2,150,776 | n/a | 28,885 | 0.0059 | 0.447 | 54,464 | n/a | 929 | 0.0036 | 0.606 | 32,752 | n/a | 551 | 0.0093 | 0.806 |
N, number of individuals evaluated; bases, number of bases with sufficient coverage to evaluate nucleotide diversity; private, number of sites with private alleles; poly., number of sites that are polymorphic; π, nucleotide diversity in each colony; het., average heterozygosity at the polymorphic sites.
Figure 2PCA of 28,885 autosomal SNPs. Eigenvector 1 explains the majority of the diversity in these samples and strongly differentiates the Edinburgh colony from the others. Sheffield contains a subset of the genetic diversity found within Bangor colony. F1 offspring between Edinburgh female and Sheffield male are intermediate.
Figure 3Structure plots of gerbil colonies. Structure was run for 20,000 iterations with 10,000 iterations of burn-in for K = 2, 3, and 4. In general, the different colonies have similar assignment especially at low K values. Edinburgh animals fall out uniquely at higher K values (i.e., blue). The F1 hybrids have mixed ancestry as expected for the offspring of a Sheffield × Edinburgh cross.
Nucleotide diversity in various rodent colonies
| Species | π | Number of Individuals | Breeding Scheme | Region of Genome for Which π Was Evaluated | Approximate Number of Bases Surveyed | Citation |
|---|---|---|---|---|---|---|
| 0.0059 | 24 | Outbred colony | Near autosomal restriction sites (GBS) | 2,200,000 | This article | |
| 0.0289 | 69 | Collaborative Cross | Whole genome sequencing | 2,300,000,000 | ||
| 0.0055 | 8 | Eight inbred strains | Autosomal genes (Sanger) | 14,000 | ||
| 0.0102 | 64 | Wild caught | Autosomal genes (Sanger) | 15,000 | ||
| 0.0054 | 26 | Wild caught | Autosomal genes (Sanger) | 15,000 | ||
| 0.0006 | 13 | Outbred colony | Near restriction sites (ddRAD) | 400,000 | ||
| 0.0010 | 1 | Outbred colony | Near restriction sites (ddRAD) | 400,000 | ||
| 0.0006 | 14 | Outbred colony | Near restriction sites (ddRAD) | 1,300,000 | J. Good, personal communication | |
| 0.0002 | 11 | Outbred colony | Near restriction sites (ddRAD) | 1,400,000 | J. Good, personal communication | |
| 0.0015–0.0029 | 96 | 96 inbred strains | CEL I-based SNP detection | 5,800,000 | ||
| 0.0022 | 58 | Wild caught | Autosomal genes (Sanger) | 10,000 |
Based on mean value of 69 Collaborative Cross individuals. Data from the column titled “% het (autosomes) in sequenced sample” in table S2 of Srivastava . Approximate number of bases surveyed is based on a genome size of 2.7 GB divided by the mean “% coverage at 15×.”
The Peromyscus animals evaluated here are the BW and PO strains originating with the Peromyscus stock center, bred at Harvard University in the Hoekstra laboratory, and sequenced by Weber .
The Phodopus animals evaluated here are from the Good laboratory at the University of Montana described in Brekke and Good (2014).