| Literature DB >> 35317627 |
George Powell1,2, Helen Long1,2, Louisa Zolkiewski2,3, Rebecca Dumbell4, Ann-Marie Mallon2, Cecilia M Lindgren1,5,4,6, Michelle M Simon2.
Abstract
Understanding the genetic aetiology of loci associated with a disease is crucial for developing preventative measures and effective treatments. Mouse models are used extensively to understand human pathobiology and mechanistic functions of disease-associated loci. However, the utility of mouse models is limited in part by evolutionary divergence in transcription regulation for pathways of interest. Here, we summarize the alignment of genomic (exonic and multi-cell regulatory) annotations alongside Mendelian and complex disease-associated variant sites between humans and mice. Our results highlight the importance of understanding evolutionary divergence in transcription regulation when interpreting functional studies using mice as models for human disease variants.Entities:
Keywords: Mendelian disease; alignment; annotation; complex disease; conservation; orthologue
Mesh:
Year: 2022 PMID: 35317627 PMCID: PMC8941414 DOI: 10.1098/rsbl.2021.0630
Source DB: PubMed Journal: Biol Lett ISSN: 1744-9561 Impact factor: 3.703
Figure 1Alignment of genomic annotations between humans and mice. Bars represent the percentage of human bases that align with the mouse genome. Coloured bars represent the percentage of bases that align with a common annotation in the mouse (i.e. the same annotation in each species). Black bars represent the percentage of bases that align to a different annotation in the mouse (i.e. do not have a common annotation). The dashed blue line represents the genome-wide percentage of human bases that align with the mouse genome. The genomic coverage for each human annotation is labelled in brackets on the Y-axis. The sum of coverage is greater than 100% due to the overlap of annotations (electronic supplementary material, table S2). Human protein-coding sequences show the greatest alignment to the mouse genome (95.5%). The fraction of human annotation that aligns to the same annotation in mice is highest for protein-coding sequences (88.2%), proximal intronic sequences (56.0%), untranslated regions (UTRs; 40.8%) and promoters (38.5%), and lowest for distal enhancers (3.1%), topologically associated domain (TAD) boundaries (4.8%) and miscellaneous (2.6%). CTCF, CCCTC‐binding factor, which is encoded by the CTCF gene; CDS, coding DNA sequence.
Figure 2Alignment of human SNV sites associated with complex disease (GWAS Catalog) and Mendelian disease (ClinVar) between humans and mice. Bars represent the percentage of human variant sites that align with the mouse genome. Coloured bars represent the percentage of variant sites that aign with a common annotation in the mouse (i.e. the same annotation in each species). Black bars represent the percentage of variant sites that align with a different annotation in the mouse (i.e. do not have a common annotation). The dashed blue line represents the total percentage of variant sites that align with the mouse genome. Variant sites associated with human Mendelian disease are more conserved between the species than variant sites associated with human complex disease. However, annotation of non-exonic regulatory features (excluding the promoter) is poorly conserved, suggesting functional divergence between the species.