| Literature DB >> 31725724 |
Jingtao Lilue1,2, Anu Shivalikanjli1, David J Adams3, Thomas M Keane1,4.
Abstract
For over a century, mice have been used to model human disease, leading to many fundamental discoveries about mammalian biology and the development of new therapies. Mouse genetics research has been further catalysed by a plethora of genomic resources developed in the last 20 years, including the genome sequence of C57BL/6J and more recently the first draft reference genomes for 16 additional laboratory strains. Collectively, the comparison of these genomes highlights the extreme diversity that exists at loci associated with the immune system, pathogen response, and key sensory functions, which form the foundation for dissecting phenotypic traits in vivo. We review the current status of the mouse genome across the diversity of the mouse lineage and discuss the value of mice to understanding human disease.Entities:
Mesh:
Year: 2019 PMID: 31725724 PMCID: PMC6855407 DOI: 10.1371/journal.pgen.1008446
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Discovery of SSDRs in the mouse genome.
Loci for chromosome 1 and 6 are shown. Li and colleagues [18] defined segmental polymorphisms between 129 and C57BL/6J (blue). Cutler and colleagues [19] cataloged copy number variations in 41 inbred strains (green). Locke and colleagues [20] identified genome regions with high copy number variation calls in 351 different mouse strains and wild-caught mice (yellow). Morgan and colleagues [21] used a combination of wild and inbred mouse strains to define copy number variable regions (orange). Lilue and colleagues [16] used de novo assembly of 16 mouse strains (red). The gene families supported by multiple studies are named above. N/A indicates no protein coding genes in the region. SSDR, strain-specific diversity region.
Fig 2A summary of SSDRs in 16 mouse strains.
(A) Proportion of sequence and coding genes in SSDRs for the classical and wild-derived inbred mouse strains. (B) Summary of annotated genes encoded in SSDRs. For gene (families) with known function, only 3.1% have complete sequences (introns and intergenic regions) for multiple mouse strains (green), 9.8% have coding regions from multiple mouse strains (yellow), and all others are based on a single mouse strain (red). (C) Top 10 PANTHER protein classes overrepresented in mouse SSDRs. X-axis indicates times underrepresentation or overrepresentation. Numbers after each protein class indicate corrected FDR(log10 value). CDS, coding sequence; FDR, false discovery rate; SSDR, strain-specific diversity region.