| Literature DB >> 20011118 |
Melissa K Boles1, Bonney M Wilkinson, Laurens G Wilming, Bin Liu, Frank J Probst, Jennifer Harrow, Darren Grafham, Kathryn E Hentges, Lanette P Woodward, Andrea Maxwell, Karen Mitchell, Michael D Risley, Randy Johnson, Karen Hirschi, James R Lupski, Yosuke Funato, Hiroaki Miki, Pablo Marin-Garcia, Lucy Matthews, Alison J Coffey, Anne Parker, Tim J Hubbard, Jane Rogers, Allan Bradley, David J Adams, Monica J Justice.
Abstract
An accurate and precisely annotated genome assembly is a fundamental requirement for functional genomic analysis. Here, the complete DNA sequence and gene annotation of mouse Chromosome 11 was used to test the efficacy of large-scale sequencing for mutation identification. We re-sequenced the 14,000 annotated exons and boundaries from over 900 genes in 41 recessive mutant mouse lines that were isolated in an N-ethyl-N-nitrosourea (ENU) mutation screen targeted to mouse Chromosome 11. Fifty-nine sequence variants were identified in 55 genes from 31 mutant lines. 39% of the lesions lie in coding sequences and create primarily missense mutations. The other 61% lie in noncoding regions, many of them in highly conserved sequences. A lesion in the perinatal lethal line l11Jus13 alters a consensus splice site of nucleoredoxin (Nxn), inserting 10 amino acids into the resulting protein. We conclude that point mutations can be accurately and sensitively recovered by large-scale sequencing, and that conserved noncoding regions should be included for disease mutation identification. Only seven of the candidate genes we report have been previously targeted by mutation in mice or rats, showing that despite ongoing efforts to functionally annotate genes in the mammalian genome, an enormous gap remains between phenotype and function. Our data show that the classical positional mapping approach of disease mutation identification can be extended to large target regions using high-throughput sequencing.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20011118 PMCID: PMC2782131 DOI: 10.1371/journal.pgen.1000759
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Synteny map of mouse Chromosome 11.
Mouse Chromosome 11 is aligned with human chromosomes 22, 7, 2, 5, 1, and 17. Syntenic regions are represented in the same color and lines indicate breaks of synteny. The mouse Chromosome 11 ENU mutagenesis screen was targeted to the Trp53-Wnt3 (69 Mb–103.5 Mb) interval within the region most conserved with human Chromosome 17, indicated by the black lines.
Confirmed ENU–induced mutations that are potentially causative for the abnormal phenotype in each line.
| Mutant Line | Phenotype of Mutant Line | Gene name | Base Change | Lesion | Classification |
|
| abnormal craniofacial morphology |
| G653Ae | G11R | Non-Neutral 3/78% |
|
| abnormal craniofacial morphology |
| A49310Ge | K663E | Non-Neutral 3/78% |
|
| A1219Ge | I207V | Neutral 2/69% | ||
|
| abnormal craniofacial morphology |
| T10645Ae | V414E | Non-Neutral 2/70% |
|
| C87859A3U | noncoding | |||
|
| A4968Ci | noncoding | |||
|
| T5105Ai | noncoding | |||
|
| abnormal craniofacial morphology |
| A4098Ge | K55R | Neutral 3/78% |
|
| abnormal craniofacial morphology |
| T33214Ae | V516E | Non-Neutral 2/70% |
|
| decreased body size |
| T8151Ge | F453V | Non-Neutral 2/78% |
|
| T5526Ai | noncoding | |||
|
| A29970Gi | noncoding | |||
|
| decreased body size |
| G51912Ti | noncoding | |
| decreased circulating cholesterol |
| A29323Gi | noncoding | ||
|
| T51696Ci | noncoding | |||
|
| T13811Ai | noncoding | |||
|
| decreased body size |
| C14382Ae | R354R | |
|
| decreased body size |
| C106442Ai | noncoding | |
|
| female infertility |
| T10900A3U | noncoding | |
|
| female infertility |
| T18282Ce | S209P | Neutral 1/60% |
|
| T33177Cd | noncoding | |||
|
| G4223Ai | noncoding | |||
|
| embryonic lethality (5.5–8.5 dpc3) |
| A10267Ci | noncoding | |
|
| embryonic lethality (9.5–12.5 dpc3) |
| T87481A3U | noncoding | |
|
| embryonic lethality (9.5–12.5 dpc3) |
| C8461Ae | C632stop | |
|
| T5212Ci | noncoding | |||
|
| embryonic lethality (9.5–12.5 dpc3) |
| T95354Ai | noncoding | |
|
| embryonic lethality (5.5–8.5 dpc3) |
| A46941Te | M908L | Non-Neutral 2/70% |
|
| T39151Ci | noncoding | |||
|
| T92972Ce | D568D | |||
|
| perinatal lethality |
| T136735Ai | Splice site | |
|
| T4599C3U | noncoding | |||
|
| C1086Ai | noncoding | |||
|
| A17277Gi | noncoding | |||
|
| T91284Ad | noncoding | |||
|
| embryonic lethality (9.5–12.5 dpc3) |
| G43695Ti | noncoding | |
|
| embryonic lethality (13.5–18.5 dpc3) |
| C4228Te | Q106stop | |
|
| A2624G5U | noncoding | |||
|
| perinatal lethality |
| T20167Ce | V237A | Neutral 0/53% |
|
| T22219Ci | noncoding | |||
|
| embryonic lethality (9.5–12.5 dpc3) |
| T107950Ci | noncoding | |
|
| T5507Ce | S364S | |||
|
| embryonic lethality (5.5–8.5 dpc3) |
| A1728Ge | K22E | Non-Neutral 3/78% |
|
| embryonic lethality (5.5–8.5 dpc3) |
| T165383Ci | noncoding | |
|
| postnatal lethality with anemia |
| A9600Te | E541V | Non-Neutral 3/78% |
|
| postnatal lethality |
| A1955Ge | Y340C | Non-Neutral 4/82% |
|
| A53039Ce | Q341P | Non-Neutral 3/78% | ||
|
| A630T5U | noncoding | |||
|
| T1610Ai | noncoding | |||
|
| T31367Ci | noncoding | |||
|
| postnatal lethality |
| G2447Ae | C504Y | Non-Neutral 4/82% |
|
| postnatal lethality |
| A5492G3U | noncoding | |
|
| embryonic lethality (after 12.5dpc3) |
| C17008Ti | noncoding | |
|
| G1465Ae | K36K | |||
|
| decreased body size, |
| C13844Te | Q193stop | |
| late onset tremors |
| A14611G3U | noncoding | ||
|
| T7179Ae | D916E | Neutral 4/85% | ||
|
| hyperactive, seizures, craniofacial |
| T4392Ai | noncoding | |
|
| hyperactive, jerky, hearing loss |
| T134653A3U | noncoding |
The average amount of sequence obtained for each mutant was 80% of the 7.8Mb (one-fourth) of the 34 Mb region containing exons of annotated genes. The mouse gene symbol is shown. Location of lesion indicated by e (exon), i (intron), 5U (5′ UTR), 3U (3′ UTR), u (upstream), or d (downstream). No lesions were found in the mutant lines crf05, inf4, gro40, l11Jus38, l11Jus45, nur01, nur05, and skc1. Lesions were identified in the mutant lines crf06 and l11Jus39 that confirmed in some, but not all of the samples, indicating that the lesion was not causative of the phenotype, so these mutants are not included in the table.
*: Number refers to nucleotide position within entire genomic sequence from Ensembl v52.
**: Number refers to amino acid position within the first protein coding transcript in Ensembl v52.
†: SNAP analysis was used to determine the likelihood of an amino acid change being deleterious to the protein. The resulting classification is shown as Neutral or non-Neutral, along with the reliability index on a scale from 0 to 9, with 9 being the most reliable prediction, and finally, the predicted accuracy, shown as a percentage. This analysis is based on human sequence, so may not be as reliable for the mouse.
Kile et al. [5].
Clark et al. [56].
Hentges et al. [16].
Stat3 is not the causative lesion because l11Jus14 maps to the Mpo-Chad interval and Stat3 is outside this interval.
Associated with mutation in human gene MPDU1, OMIM 6040410.
Associated with mouse mutant MGI 106613.
Associated with mutation in human gene PLEKHM1, OMIM 611466, and a rat mutation, MGI 2443207.
Associated with mouse mutant MGI 1858204.
Associated with mutations in human gene STAT3, OMIM 147060, and mouse mutant MGI 103038.
Associated with mouse mutation MGI 2135679.
Associated with mutations in human gene SLC4A1, OMIM109240, and mouse mutant, MGI 109393.
Associated with mutation in human gene AIPL1 OMIM 604393.
Associated with mutation of human gene NF1, OMIM 601321, OMIM 607785, and mouse mutant MGI 97306.
Associated with mutation in human gene ASPA, OMIM 271900.
Figure 2Conservation analysis of sequence surrounding ENU–induced lesions.
A graph of match scores shows that exons and some non-coding elements are highly conserved. The match score is based on a 100 base pair comparison across seven vertebrates: mouse, human, rat, Rhesus monkey, horse, dog, and chicken. Red = lesion occurred in an exon, blue = lesion occurred in a 5′ or 3′ UTR, green = lesion occurred in an intron, and yellow = lesion occurred downstream of a gene.
Figure 3Mutation of nucleoredoxin in l11Jus13 (Nxn.
(A) Haplotype map of 239 unaffected and 83 affected mice used for meiotic mapping. The location of each marker and Nxn is displayed (Ensembl v52). The mutation lies in a 6Mb region located between rs3702197 and rs13481117. (B) The Nxn locus is depicted with introns 6 and 7 boxed. (C) The boxed region is expanded to illustrate the consequence of splicing in the wild type and mutant. A transversion (T to A) abolishes a consensus splice donor site leading to aberrant RNA splicing. The six base pair cryptic splice site used in the mutant is underlined. (D) Aberrant splicing in Nxn predicts an in frame insertion of 10 amino acids, GMELEGKWKA, (white), which occurs within the acidic region (black) of Nxn. (E) RT–PCR using primers flanking exon 6–7 from pools of three E14.5 heads of wild-type, heterozygous, and homozygous mutants demonstrates aberrant splicing in the mutant allele. (F) Western blots of Nxn and Actin from wild-type, heterozygous and homozygous mutants at E12.5 show reduced protein in the homozygous mutant. A reduction in protein was also observed at E15.5 and E18.5, and a polyclonal antibody against the N-terminus gave similar results (data not shown).
Figure 4Nxn mutants have cleft palates and small mandibles.
Homozygous Nxn embryos were compared to Nxn+/+Inv control littermates. (A) Gross morphology at E18.5. Nxn mutants (right) have a shortened snout compared to control littermates (left). (B) The palate (inside the dotted lines) was examined at E18.5, and a cleft palate (yellow arrow) occurred in Nxn embryos (right) but not in control littermates (left). (C) Skeletal preparations were carried out at E18.5. The mandible is shorter in length in Nxn embryos (bottom) than in control littermates (top). (D) The mean and standard error are plotted to show the difference in mandible length. Mutant mandibles are significantly (p<0.001) shorter than controls (n = 10 mandibles per genotype).