| Literature DB >> 32138667 |
Yousuf A Khan1,2, Irwin Jungreis3,4, James C Wright5, Jonathan M Mudge6, Jyoti S Choudhary7, Andrew E Firth7, Manolis Kellis8,7.
Abstract
BACKGROUND: POLG, located on nuclear chromosome 15, encodes the DNA polymerase γ(Pol γ). Pol γ is responsible for the replication and repair of mitochondrial DNA (mtDNA). Pol γ is the only DNA polymerase found in mitochondria for most animal cells. Mutations in POLG are the most common single-gene cause of diseases of mitochondria and have been mapped over the coding region of the POLG ORF.Entities:
Keywords: CUG; Initiation; Mitochondria; POLG; PhyloCSF; Polymerase; Ribosome; Synonymous site conservation; synplot2
Mesh:
Substances:
Year: 2020 PMID: 32138667 PMCID: PMC7059407 DOI: 10.1186/s12863-020-0828-7
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Fig. 1Architecture of the human POLG transcript. a. Diagram of the primary transcript for POLG. The dashed lines represent exon boundaries (not to scale). The protein product Pol γ contains a mitochondrial targeting sequence at the N-terminus and the rest of the protein consists of several domains that make up the DNA polymerase super-domain. b. UCSC Genome Browser [4] image of (from top to bottom) ATG codons (green) and stop codons (red) in the three theoretical reading frames on the minus strand of chromosome 15; first three exons of previously-annotated POLG transcripts ENST00000268124.10 and ENST00000442287.6; incomplete novel transcript ENST00000650303.1;Synonymous Constraint track showing regions with enhanced synonymous conservation; PhyloCSF tracks for the three minus strand frames; and PhyloCSF Candidate Coding Region (PCCR) track. The cluster of PCCRs suggests coding in some previously unannotated frame. The PhyloCSF signals suggest translation in chromosomal frame 3 in exon 2 and frame 2 in exon 3 (purple rectangles), terminating at a well-conserved stop codon in exon 3. There are no ATG codons in this frame in the 5′ portion of exon 2 or in any frame in exon 1 (dark red rectangles), suggesting that the initiation codon is not ATG. The coding region of ENST00000650303, ORF-Y, begins at a well-conserved CTG codon. The ATG and stop codon of a likely regulatory ORF, ORF-Z are also indicated (black rectangle)
Fig. 2Synonymous site conservation in the POLG coding region for the major vertebrate clades. Clades shown are a. mammals, b. amphibians, c. sauropsids, and d. teleost fish. In each subfigure, the top panel shows the position of 0-frame stop codons in each sequence in the alignment. The following panels show the positions of stop codons in the + 1 and + 2 frames. The blue dots represent stop codons and the grey regions represent alignment gaps. The bottom two panels show the synonymous site conservation analysis, with the brown line showing the ratio of the observed number of synonymous substitutions within a given window to the number expected under a null model of neutral evolution at synonymous sites, and the red line showing the corresponding p-value. The horizontal grey dashed line indicates a p = 0.05 threshold after an approximate correction for multiple testing (namely scaling by [sliding window size]/[POLG ORF length]). All subfigures use a 25-codon sliding window. The stop codon of ORF-Y in mammals is indicated with a black arrow
Fig. 3Ribosome profiling analysis of ORF-Y. Aggregated ribosome profiling data for all studies available on GWIPs-viz (subfigure a) and Trips-Viz (subfigures b-c). a. Ribosome profiling coverage of POLG exon 2. The top panel in blue shows the aggregate of initiating ribosome profiling experiments (samples treated with harringtonine or lactimidomycin) and the bottom panel in red shows the aggregate of elongating ribosome profiling experiments. b. Ribosome profiling coverage of part of exon 3 containing the ORF-Y ‘UGA’ stop codon (box). c. Read counts by frame for the regions covering ORF-Y only, the ORF-Y/Main ORF overlap, the Main ORF only, and then all of ORF-Y and the Main ORF
Fig. 4Initiation context of ORF-Y. a. Weblogo of initiation context sequences extracted from all mammalian POLG mRNA sequences that contain ORF-Y. The start codon is underlined. b. Representation of the consensus downstream RNA secondary structure for mammalian POLG mRNA sequences that contain ORF-Y. The structure was determined with RNAalifold. The arrow is pointing at the + 14 nucleotide, where the ‘G’ in ‘CUG’ is nucleotide 0
Fig. 5Mass spectrometry evidence for translation of ORF-Y. a. Predicted translation of human ORF-Y. The CUG initiation codon is presumed to translate to methionine. The two peptides detected by mass spectrometry are colored in blue and red. b. Spectra for the first (red) peptide. c. Spectra for the second (blue) peptide. The sequences of the fragmented ions and their abundances are shown in both b and c
Fig. 6POLG contains a further upstream ORF-Z. a. Schematic of where ORF-Z is located relative to the architecture of POLG. b. Ribosome profiling data mined from GWIPs-viz. The top panel in blue represents initiating ribosomes while the bottom panel in red represents elongating ribosomes. Arrows indicate positions of the initiation codons of all three ORFs, which exactly match peaks in initiating ribosome coverage. c. Weblogo of ORF-Z initiation contexts extracted from mammalian POLG mRNA sequences that contain ORF-Y and at least 150 nucleotides of 5′ UTR. The start codon is underlined
Variants in ORF-Y and ORF-Z. Variants that are synonymous when translated in the reading frame of the main POLG ORF or that are listed as UTR variants, with their predicted effects on the translation product of ORF-Y or ORF-Z
| rsID | Ref > Alt | Position (Anchor) | ORF-Y | ORF-Z |
|---|---|---|---|---|
| G>A | chr15:89333152 (GRCh38.p12) | no change | not in ORF | |
| C>T | chr15:89330213 (GRCh38.p12) | G259S | not in ORF | |
| G>A | chr15:89333227 (GRCh38.p12) | P194S | not in ORF | |
| G>C | chr15:89333233 (GRCh38.p12) | L192V | not in ORF | |
| C>T | chr15:89333239 (GRCh38.p12) | G190R | not in ORF | |
| C>T | chr15:89333254 (GRCh38.p12) | G185S | not in ORF | |
| G>A | chr15:89333266 (GRCh38.p12) | P181S | not in ORF | |
| G>A/C | chr15:89333271 (GRCh38.p12) | A179G | not in ORF | |
| A>G | chr15:89333283 (GRCh38.p12) | V175A | not in ORF | |
| C>T | chr15:89333287 (GRCh38.p12) | A174T | not in ORF | |
| C>T | chr15:89333302 (GRCh38.p12) | G169R | not in ORF | |
| G>A | chr15:89333332 (GRCh38.p12) | P159S | not in ORF | |
| C>A/G/T | chr15:89333371 (GRCh38.p12) | A146T/A146S/A146P | not in ORF | |
| C>A | chr15:89333374 (GRCh38.p12) | A145S | not in ORF | |
| G>C | chr15:89333419 (GRCh38.p12) | L130V | not in ORF | |
| C>A | chr15:89333422 (GRCh38.p12) | A129S | not in ORF | |
| G>A | chr15:89333425 (GRCh38.p12) | R128W | not in ORF | |
| G>A/C | chr15:89333491 (GRCh38.p12) | R106G/R106W | not in ORF | |
| G>A/C/T | chr15:89333518 (GRCh38.p12) | L97I/L97V/L97F | not in ORF | |
| A>G | chr15:89333535 (GRCh38.p12) | I91T | not in ORF | |
| C>A/G/T | chr15:89333545 (GRCh38.p12) | A88T/A88P/A88S | not in ORF | |
| T>C | chr15:89333569 (GRCh38.p12) | I80V | not in ORF | |
| T>C | chr15:89333575 (GRCh38.p12) | S78G | not in ORF | |
| C>A | chr15:89333578 (GRCh38.p12) | A77S | not in ORF | |
| C>T | chr15:89333593 (GRCh38.p12) | A72T | not in ORF | |
| T>A/C | chr15:89333596 (GRCh38.p12) | T71A/T71S | not in ORF | |
| C>G/T | chr15:89333599 (GRCh38.p12) | A70T/A70P | not in ORF | |
| C>T | chr15:89333602 (GRCh38.p12) | A69T | not in ORF | |
| C>T | chr15:89333605 (GRCh38.p12) | A68T | not in ORF | |
| C>T | chr15:89333626 (GRCh38.p12) | A61T | not in ORF | |
| C>A | chr15:89333641 (GRCh38.p12) | A56S | not in ORF | |
| G>A | chr15:89333668 (GRCh38.p12) | R47C | not in ORF | |
| T>C/G | chr15:89333695 (GRCh38.p12) | S38R/S38G | not in ORF | |
| C>G | chr15:89333701 (GRCh38.p12) | G36R | not in ORF | |
| G>A/C | chr15:89333716 (GRCh38.p12) | R31G/R31C | not in ORF | |
| G>A | chr15:89333722 (GRCh38.p12) | R29C | not in ORF | |
| G>A | chr15:89333725 (GRCh38.p12) | R28W | not in ORF | |
| C>T | chr15:89333734 (GRCh38.p12) | E25K | not in ORF | |
| G>A/C | chr15:89333740 (GRCh38.p12) | L23V/L23L | not in ORF | |
| C>A | chr15:89333775 (GRCh38.p12) | R11L | not in ORF | |
| A>C | chr15:89333782 (GRCh38.p12) | S9A | not in ORF | |
| G>A | chr15:89333802 (GRCh38.p12) | P2L | not in ORF | |
| T>C | chr15:89333821 (GRCh38.p12) | not in ORF | no change | |
| G>A | chr15:89333834 (GRCh38.p12) | not in ORF | S20F |
Fig. 7Model of translation. Schematic of how the translation of all three ORF’s is regulated. The yellow oval represents the small subunit and the large purple oval represents the large subunit. The red, pink, brown, and green portions of the mRNA correspond to ORF-Z, ORF-Y no overlap, ORF-Y/main CDS overlap, and main CDS only portions retrospectively. The thin black lines represent the UTR’s. The thin blue arrow represents the small subunit remaining attached to the mRNA after termination at the ORF-Z stop codon and reinitiating at the main CDS start codon