Literature DB >> 32290485

Comparative Analysis of CpG Sites and Islands Distributed in Mitochondrial DNA of Model Organisms.

Krzysztof Kowal1, Angelika Tkaczyk1, Tomasz Ząbek2, Mariusz Pierzchała3, Brygida Ślaska1.   

Abstract

: The information about mtDNA methylation is still limited, thus epigenetic modification remains unclear. The lack of comprehensive information on the comparative epigenomics of mtDNA prompts comprehensive investigations of the epigenomic modification of mtDNA in different species. This is the first study in which the theoretical CpG localization in the mtDNA reference sequences from various species (12) was compared. The aim of the study was to determine the localization of CpG sites and islands in mtDNA of model organisms and to compare their distribution. The results are suitable for further investigations of mtDNA methylation. The analysis involved both strands of mtDNA sequences of animal model organisms representing different taxonomic groups of invertebrates and vertebrates. For each sequence, such parameters as the number, length, and localization of CpG islands were determined with the use of EMBOSS (European Molecular Biology Open Software Suite) software. The number of CpG sites for each sequence was indicated using the newcpgseek algorithm. The results showed that methylation of mtDNA in the analysed species involved mitochondrial gene expression. Our analyses showed that the CpG sites were commonly present in genomic regions including the D-loop, CYTB, ND6, ND5, ND4, ND3, ND2, ND1, COX3, COX2, COX1, ATP6, 16s rRNA, and 12s rRNA. The CpG distribution in animals from different species was diversified. Generally, the number of observed CpG sites of the mitochondrial genome was higher in the vertebrates than in the invertebrates. However, there was no relationship between the frequency of the CpG sites in the mitochondrial genome and the complexity of the analysed organisms. Interestingly, the distribution of the CpG sites for tRNA coding genes was usually cumulated in a larger CpG region in vertebrates. This paper may be a starting point for further research, since the collected information indicates possible methylation regions localized in mtDNA among different species including invertebrates and vertebrates.

Entities:  

Keywords:  CpG sites; model organisms; mtDNA

Year:  2020        PMID: 32290485      PMCID: PMC7222804          DOI: 10.3390/ani10040665

Source DB:  PubMed          Journal:  Animals (Basel)        ISSN: 2076-2615            Impact factor:   2.752


1. Introduction

Nearly all animal mitochondrial genomes are about 16. 5 kbp (kilo base pairs) in length, whereas plant mitochondrial genomes range between 200 and 2000 kbp [1]. Generally, the mammalian mitochondrial genome is a circular double-stranded DNA (dsDNA) molecule containing 13 protein-coding genes, 22 transfer RNAs (tRNAs), two ribosomal RNAs (rRNAs) genes, and one non-coding control region (D-loop region) [2]. The exception is the mtDNA genome of Caenorhabditis elegans, which lacks the ATP8 gene [3] and the non-coding AT region. The non-coding region of mtDNA contains an origin of replication and three promoters: one for the light strand (LSP) and two for the heavy strand (HSP1 and HSP2). Transcription begins from promoters: LSP and HSP2 encode 13 protein-coding genes involved in the oxidative phosphorylation (OXPHOS) and 22 tRNAs, whereas HSP1 generates a short transcript containing rRNA genes [4]. MtDNA is packed into structures called nucleoids or mitochromosomes. The major part of the nucleoid constitutes transcription factor A (TFAM), which contributes to mtDNA packing. Hence, any alterations in the TFAM content influence the mitochromosome and, consequently, mtDNA is exposed to DNA methyltransferases (DNMTs) [5]. The activity of methyltransferase (DNMT) was first detected in mitochondria isolated from loach embryos in 1970s [6]. Next, 5-methylcytosine (m5C) was found in beef heart mitochondria [7]. In 1977, the specificity of nuclear and mitochondrial DNMT was demonstrated. The mitochondrial and nuclear enzymes are specific to monopyrimidines and di- and tripyrimidines, respectively [8]. Reis and Goldstein [9] and Pollack et al. [10] conducted a study on mitochondria from human and mouse fibroblasts. Their results indicated that methylation in mtDNA occurred with a frequency of 2–5% only in CpG dinucleotides. Currently, the methylation frequency in mtDNA ranges from 2 to 8%, but its pattern is unknown [11]. Scientists found that nearly 25% of all methylations identified in embryonic stem cells were non-CpG methylations (CpA, CpT, and CpC). In normal somatic cells, the non-CpG methylation level is relatively low, with enrichment mainly in the coding regions of active genes [12,13]. Little is known about mitochondrial epigenetic modifications, as studies on mtDNA methylation are not as common as studies on the methylation of nDNA. So far, it has been observed that hypomethylation may occur in mtDNA methylation [4,14,15]. During methylation, DNA undergoes covalent modification, usually at cytosine residues within CpG dinucleotides, and is catalysed by DNA methyltransferase (DNMT) in the presence of the methyl donor S-adenosyl-L-methionine (SAM) [16]. Clusters of CpG sites form GC-rich islands that have a CpG located approximately every 10 base pairs [17,18]. In recent years, the methylation of mammalian mtDNA has gained interest. There are papers rejecting the existence of mtDNA methylation [19]. For instance, Hong et al. [11] used the bisulfite genomic sequencing method to determine CpG methylation in a human colon cancer cell line and primary human cells. Additionally, next-generation sequencing was used for total DNA. As a result, no CpG methylation was found in mtDNA [11]. In turn, Bellizzi et al. [20] detected methylated cytosines in the D-loop region of mtDNA isolated from blood and cultured cells from humans and mice. To address the controversy of the existence of mtDNA methylation, an interesting study on the methylation of the D-loop was conducted by Liu et al. [21]. They confirmed the existence of methylation with varying frequency in different human tissues. The analysis of 6 CpG sites in human blood samples indicated that the methylation level varied from 2% to 34% but was almost undetectable in saliva. Generally, the estimated average frequency of mtDNA methylation was lower than 2%. Moreover, it was found that the form of mtDNA had an impact on its level. It has been found that the circular structure affects the bisulfite conversion efficiency, hence mtDNA methylation is overestimated [21]. The evidence supporting mtDNA methylation is related to mtDNA abnormalities including amyotrophic lateral sclerosis [22], Down’s syndrome (DS) [14], glioblastoma [23], or nonalcoholic fatty liver disease [24]. For instance, cultured amniocytes from DS patients showed TFAM downregulation [5]. In turn, Infantino et al. [14] detected hypomethylation in DS cells in which the mtDNA content was increased. Despite new evidence, the information about mtDNA methylation is still limited. Therefore, this epigenetic modification is a controversial issue. The lack of comprehensive information on the comparative epigenomics of mtDNA suggests that there is a need to conduct comprehensive investigations of the epigenomic modification of mtDNA in different species. The aim of the study was to determine the localization of CpG sites and islands in mtDNA of model organisms and to compare its distribution. The results are suitable for further investigations of mtDNA methylation.

2. Materials and Methods

Reference sequences of twelve animal mtDNAs obtained from GenBank were analysed. The study was carried out on sequences of animal model organisms representing different taxonomic groups of invertebrates and vertebrates (Table 1).
Table 1

MtDNA reference sequences of analysed model organisms.

OrganismAccession Number of Reference Sequence *Length of MtDNA (bp **)
invertebrates
Caenorhabditis elegans NC_001328.113,794
Drosophila melanogaster NC_024511.219,524
Daphnia magna NC_026914.114,948
vertebrates
Latimeria chalumnae NC_001804.116,407
Danio rerio NC_002333.216,596
Ambystoma mexicanum NC_005797.116,369
Gallus gallus NC_040970.116,785
Mus musculus NC_005089.116,299
Canis lupus familiaris NC_002008.416,727
Crocodylus porosus NC_008143.116,916
Pan troglodytes ellioti KM679417.116,559
Homo sapiens NC_012920.116,569

* NC, KM—nucleotide accession prefixes. ** bp—base pair.

In order to analyse both strands of mtDNA, sequences from GenBank were rewritten in the EMBOSS revseq algorithm to obtain complementary sequences representing the H-strand of mtDNA [25]. Regions with frequency of CG dinucleotides that were higher than expected were identified in each of the 24 analysed sequences from the 12 species. Two EMBOSS algorithms were used. The Cpgplot uses a sliding window within which the observed/expected ratio of CpG is calculated [26]. For a sequence region reported as a CpG island, the following constraints were established: the observed/expected ratio >0.6, %C + %G > 50%, and the sequence length should exceed 200 bp. The newcpgseek uses a running sum calculated from all positions in the sequence rather than a window to produce a score. If there is a missing CG dinucleotide at a position, the score is decremented; if there is a CG dinucleotide, the score is incremented by a constant (user-defined) value. When the score for a region in the sequence is higher than the threshold (17 at the moment), a putative island is declared. Sequence regions scoring above the threshold are searched for recursively. This method overpredicts islands but finds smaller ones around primary exons. The newcpgseek displays the actual CpG count, the %C + %G sum, and the observed/expected ratio in a region where the score is above the threshold [25]. For each sequence, such parameters as the number, length, and localization of CpG islands were determined. Using the newcpgseek algorithm, the number of the CpG sites for each sequence was indicated.

3. Results

3.1. CpG Islands in mtDNA

The positions of the CpG islands in the mtDNA of 12 organisms are presented in Table 2 and Table 3. There were no CpG islands on the strands of the mtDNA genomes of Caenorhabditis elegans, Daphnia magna, and Drosophila melanogaster, i.e. all invertebrates analysed in the study. In the analysed animal models, the length of the CpG islands varied from 202 bp to 313 bp in the L-strand and from 200 bp to 632 bp in the H-strand. The results of the L-strand showed that one CpG island was located in the COX2 gene (Homo sapiens and Gallus gallus), and two CpG islands were found in the sequence from Danio rerio (Table 2). The longest CpG island among all the tested animal models was detected in the canine mtDNA genome located in the D-loop region in the position of the VNTR: 5′-GTACACGT(A/G)C-′3 region.
Table 2

Positions of CpG islands in the mitochondrial genomes of the analysed animals on the light strand.

OrganismGenome Length (bp *)% GC **Positions of CpG Islands ***Genome RegionLength of CpG Islands (bp)Sum of C+G ****%C + %GObs/Exp *****
Danio rerio 16,596 0.403281..3531 16s rRNA 251 12650.200.95
6205..6432rep_origin, TRNY, COX1228 12052.630.91
Gallus gallus 16,785 0.468703..8925 COX2 223 11852.910.97
Canis lupus familiaris 16,727 0.4016,137..16,449D-loop(VNTR)313 17054.312.71
Pan troglodytes ellioti 16,559 0.4414,246..14,447 CYTB 202 10350.991.27
Homo sapiens 16,569 0.447764..8036 COX2 273 13750.181.13

* bp—base pair. ** guanine–cytosine (GC) base pairs. *** guanine-cytosine-rich regions (CpG islands). **** cytosine (C), guanine (G). ***** the observed/expected ratio.

Table 3

Positions of CpG islands in the mtDNA of the analysed animals on the H strand *.

OrganismGenome Length (bp **)% GC ***Start and Stop of MtDNA Sequence ****MtDNA RegionLength of CpG Islands (bp) *****Sum of C+G%C + %GObs/Exp ******
Danio rerio 16,596 0.40981..1180 TRNI, 12s rRNA 200 10552.501.31
6205..6432TRNN *, TRNY *, COX1228 12052.631.17
Latimeria chalumnae 16,407 0.42145..370 12s rRNA 226 113 50.00 0.80
Crocodylus porosus 16,916 0.4351..311 12s rRNA 261 133 50.96 1.61
12,371..12,699 ND5 329 171 51.98 1.38
Gallus gallus 16,785 0.461784..1992 12s rRNA 209 108 51.57 1.23
6901..7108 COX1 208 11153.371.25
9456..9794 ATP6 339 17451.331.25
9920..10,551 COX3 632 32351.110.99
13,647..13,925 ND5 279 143 51.25 1.23
14,984..15,210 CYTB 227 11952.421.20
16,297..16,508 ND6 212 11051.890.99
Canis lupus familiaris 16,727 0.4016,179..16,449D-loop VNTR (16,130..16,430)271 14954.980.83
Pan troglodytes ellioti 16,559 0.442848..3136 ND1 289 14650.521.26
5572..5779 COX1 208 11253.851.17
12,379..12,642 ND5 264 140 53.03 1.41
14,246..14,447 CYTB 202 10350.991.27
Homo sapiens 16,569 0.441123..1352 12s rRNA 230 115 50.00 1.15
3382..3717 ND1 336 17852.981.26
12,907..13,115 ND5 209 109 52.15 1.29
14,804..15,044 CYTB 241 12652.281.33

* genes in which CpG sites are frequently distributed among species were marked with bold font (genes encoded on the L strand). ** bp—base pair. *** guanine–cytosine (GC) base pairs. **** mitochondrial DNA (mtDNA). ***** guanine-cytosine-rich regions (CpG islands). ****** the observed/expected ratio.

The present study showed an increased number of CpG islands on the H-strand of mtDNA, compared to the L-strand (Table 2, Table 3). It is worth noting that the mtDNA of Gallus gallus (7), Crocodylus porosus (4), and Homo sapiens (4) had the highest numbers of CpG islands. CpG islands were found frequently in genomic regions covering loci of 12s rRNA (71%), CYTB (43%), ND5 (57%), and COX1 (43%). Interestingly, two CpG islands were observed in the 12s rRNA and ND5 genes from the mtDNA genome of Crocodylus porosus and the COX1 gene from Gallus gallus (Table 3). Moreover, only in the Danio rerio genome was the CpG island located in tRNA-coding genes, which were also encoded on the L strand. The analysis of Canis lupus familiaris mtDNA showed the presence two CpG islands on both strands occupying the VNTR sequence in the D-loop region in the same localization (Table 3).

3.2. Strongly Enriched CpG Regions in mtDNA

The distribution of CpG sites in the mtDNA genomes of the analysed animals and the total number of CpG sites for each animal model are presented in Table 4. The analyses showed that the CpG sites were commonly detected in genomic regions, including the D-loop, CYTB, ND6, ND5, ND4, ND3, ND2, ND1, COX3, COX2, COX1, ATP6, 16s rRNA, and 12s rRNA. The CpG distribution in animals varies. Generally, the number of the CpG sites of the mitochondrial genome was higher in the vertebrates than in the invertebrates. However, there was no relationship between the frequency of the CpG sites in the mitochondrial genome and the complexity of the analysed organism. CG-rich regions were mainly observed in genes encoding proteins or rRNA molecules; however, CpG dinucleotides were also found in non-coding sequences such as the AT region in the mtDNA of Caenorhabditis elegans and the D-loop in the vertebrates. Noteworthy, in some of the analysed species, e.g. Homo sapiens, Pan troglodytes ellioti, Ambystoma mexicanum, and Crocodylus porosus, CpG sites were found in intergenic areas (Table 5). The CpG sites were not commonly located in the tRNA coding genes. For example, no CpGs were observed in the locus of the TRNQ gene in any of the analysed species. It should be emphasized that CpG sites are distributed in a cluster overlapping many tRNA coding genes, such as TRNW, TRNA, TRNN, TRNC, and TRNY (Table 5). The in silico analysis revealed diverse distribution of the CpG sites in the replication origin region between both the species and the strands of the analysed vertebrates. Mammalian species share a structurally identifiable replication origin at a fixed mitochondrial genome location (between TRNC and TRNN), in contrast to avian and crocodilian species [27]. There were no CpG dinucleotides on the mtDNA of Crocodylus porosus and Gallus gallus in a location analogous to the region of the replication origin in the other vertebrates (Table 5).
Table 4

Distribution of CpG sites in the mtDNA of the analysed animals including the L- strand and the H- strand.

Caenorhabditis elegans Daphnia magna Drosophila melanogaster Latimeria chalumnae Danio rerio Ambystoma mexicanum Crocodylus porosus Gallus gallus Mus musculus Canis lupus familiaris Pan troglodytes ellioti Homo sapiens
StrandLHLHLHLHLHLHLHLHLHLHLHLH
Genomic region
TRNF 25
12s rRNA 2 2 6 5 5 5 17 31 17 14 18 37 27 61 17 10 11 10 32 17 43 15 36
16s rRNA 8 11 12 5 15 37 30 48 13 26 42 62 29 25 18 29 9 28
TRNV 2 4
TRNL1 4 6 5 2
ND1 2 5 4 18 4 17 20 14 21 3 10 23 58 22 29 14 15 15 27 14 34 25 37
TRNI 222 2
TRNM 323 23 32 2323
ND2 2 2 4 2 2 21 6 22 6 13 13 7 16 3 6 4 15 4 25 8 27
TRNW 2 23
TRNN 2
TRNC 3 2
TRNY 3 18 3 3 3
COX1 2 5 12 29 2 14 41 6 21 19 37 22 41 17 32 10 16 19 20 28 16 36 16
TRNS1 5 3 3
TRND 4
COX2 8 10 6 9 11 17 2 9 6 9 15 15 5 4 6 15 12 18 10
TRNK 4
ATP8 22
ATP6 7 21 3 11 6 5 7 13 8 23 2 4 11 4 9 3 21 7 30
COX3 12 18 2 6 11 4 21 5 10 11 22 11 9 2 6 7 14 8 10 9 27
TRNG 3
ND3 2 9 4 2 6 2 13 31 5 8 4 3 8 7 9 11 4
TRNR 3 2
ND4L 2152 3847 1826 26 10
ND4 12 23 2 8 17 12 34 8 20 11 35 12 39 7 19 7 23 15 39 13 39
TRNH 2
TRNS2 2 42 7 4
TRNL2 2 2
ND5 23 2 2 16 57 8 33 2 5 20 78 5 51 14 42 20 64 12 46 16 59
ND6 4 18 7 22 12 3 14 9 16 3 2 13 5 5
AT-REGION 64
TRNE 3
CYTB 2 10 5 9 2 2 14 10 16 22 8 13 23 21 13 28 13 11 16 27 20 22 19 33
TRNP 2
TRNT 2
D-LOOP 3 10 10 9 11 4 9 15 17 9 11 77 26 18 35 14 20
sum of all CpG sites/strand ** 144079174162814330816332898226247492183326110201192317170332196356

* genes in which CpG sites are frequently distributed among species were marked with bold font. ** guanine-cytosine-rich sequences.

Table 5

Distribution of CpG sites in regions overlapping more than one gene in mtDNA. *

SpeciesStrandStart and Stop of MtDNA Sequence CpG CountGenes/Replication Origin Region
Caenorhabditis elegans L3341..33563 TRNL1, TRNS1
Daphnia magna L 1302..1323 3 TRNY *, COX1
H 1293..1319 4 TRNY *, COX1
Latimeria chalumnae L2762..27884 16s rRNA, TRNL1, ND1
H1106..11344 TRNV, 16s rRNA
H2693..281912 16s rRNA, TRNL1, ND1
H 5279..5466 14 TRNN *, TRNC *, TRNY *
H7857..79086 COX2, TRNK
H8526..886125 ATP6, COX3
H15,468...15,5236 CYTB, TRNW
Danio rerio L 6225..6412 14 TRNN *, rep_origin *, TRNY *
L11,558..11,5793 ND4L, ND4
H951..140236 TRNI, 12s rRNA
H3727..387312 TRNL1, ND1
H 6219..6414 18 rep_origin *, TRNY *
H8802..88454 COX2, TRNK
H9538..982923 ATP6, COX3
H10,883..11,25326 ND3, TRNR, ND4L
Ambystoma mexicanum L 5153..5198 5 rep_origin *
L15,333..15,34615,446..15,46322 intergenic region
H2606..26495 16s rRNA, TRNL1
H 5051..5179 10 TRNA *, TRNN *, rep_origin *
H15,336..15,35515,439..15,46433 intergenic region
Crocodylus porosus L11,619..11,6794 TRNS2, intergenic region
L13,688..13,7134ND5, ND6 *
H3624..372011 ND1, TRNI
H4664..502324 ND2, TRNW
H7648..793121 COX2, TRNK
H9918..99352 TRNR, ND4L
H11,590..11,6174 intergenic region
H11,822..12,01115 TRNL2, ND5
Gallus gallus H1199..272698 D-loop, TRNP, 12s rRNA, 16s rRNA, TRNV
H4971..50408 ND1, TRNI
H6404..652310TRNA *, TRNN *
H9542..10,09737 ATP6, COX3
Mus musculus L 5167..5187 3 rep_origin
H 5168..5186 18 rep_origin
Canis lupus familiaris L 5187..5226 7 rep_origin *, TRNC *
L7969..79913 ATP8, ATP6
H2652..26925 16s rRNA, TRNL1
H4983..518312TRNW, TRNA *, TRNN *, rep_origin *, TRNC *
H7982..79953 ATP8, ATP6
Pan troglodytes ellioti L5156..51874intergenic region, TRNC *
L7951..80035 ATP8, ATP6
H 4946..5783 57 TRNW, TRNA *, TRNN *, TRNC *, TRNY *, COX1
H7964..80046 ATP8, ATP6
H8558..872012 ATP6, COX3
Homo sapiens L5737..57685intergenic region, TRNC *
H 5540..6268 50 TRNW, TRNA *, TRNN *, TRNC *, TRNY *, COX1

* CpG sites that are frequently repeated in the overlapping replication origin region, tRNA encoding genes, and COX1 gene were marked with bold font (genes encoded on the L-strand).

4. Discussion

The methylation of the mtDNA is still a matter of debate [21]. The present study indicated plausible sites of methylation as epigenetic modification of mtDNA and demonstrated different levels of the distribution of CpG sites and islands in various animal model species. No CpG islands were detected in the invertebrates, whereas CpG sites were found in both the invertebrates and the vertebrates, but they occurred frequently in the more complex organisms. This is the first study presenting the theoretical CpG localization on both strands of the mtDNA reference sequences in various species. As demonstrated by available literature, Caenorhabditis elegans does not have a DNMT; hence, no methylation is detected [28]. However, invertebrates with very low or undetectable methylation of CpG, e.g. Drosophila melanogaster or Caenorhabditis elegans, are a minority, as reported by Suzuki et al. [17]. In most invertebrates, mosaic nDNA methylation takes place, but it is not clearly known whether it occurs in mtDNA [17]. A low level of methylation was observed in the case of essential genes, including CYTB, COX1, and 12s rRNA. Moreover, despite the lack of CpG islands in Caenorhabditis elegans, single CpG sites were observed in the non-coding AT-region (Table 4), which is located between the tRNAala and tRNApro genes previously described by Okimoto et al. [29]. The low occurrence of CpG sites and islands in the mtDNA genomes of invertebrates may be related to the different modes of epigenetic control of replication and expression, such as non-CpG (CpA, CpT, and CpC) methylation. The co-existence of non-CpG sites was also observed within nDNA in human specific cell types such as stem cells, oocytes, neurons, and glial cells [30]. Yet, most CpG sites were indicated in protein-coding genes and rRNA-coding genes. The occurrence of CG nucleotides might be correlated with the length of the sequence: the longer the sequence, the greater the likelihood of a multitude of CpG sites. It should be emphasized that the theoretical presence of CpG sites and islands in the mtDNA genes of the analysed animals does not indicate methylation of these genes. However, the possibility of OXPHOS gene methylation within specific cells of the analysed animal species should not be excluded in certain circumstances. The oxidative phosphorylation system (OXPHOS) is a biochemical pathway located in the mitochondrial inner membrane responsible for energy production, apoptosis, and cell differentiation [31]. A proper OXPHOS function is important for cellular homeostasis, tissue dynamics, and health status of individuals [32]. Another non-coding region is the D-loop, where CpG sites observed in the region may be related to its regulative function of mtDNA in terms of replication and expression. Liu et al. [21] found that DNA methylation took place in the main non-coding region, which contains regulatory regions for the heavy (HSP1/2) and light strands (LSP) and an initiation site for heavy strand replication. First, replication is initiated at a specific site on the H-strand (called OH). After replication of two-thirds of the H-strand, the replication of the L-strand starts and proceeds in the opposite direction [2]. Except for Canis lupus familiaris, a higher number of CpG sites were found on the H-strand of mtDNA (Table 4). It is worth noting that four genes i.e. CYTB, COX1, ND1, and 12s rRNA, were rich in CpG sites in all the analysed sequences. CYTB called cytochrome c reductase and COX1 encoding cytochrome C oxidase subunit 1 belong to respiratory chain complexes III and IV. They are involved in the electron transport chain of mitochondrial oxidative phosphorylation (OXPHOS) and are essential for ATP synthesis [33]. Additionally, methylation observed in the sequence of Homo sapiens including CYTB, COX1, D-loop, and 12s rRNA has been reported by Liu et al. [21]. The LPS promoter is an important component of the non-coding region contributing to the expression of the OXPHOS complex I subunit ND6 [21]. The methylation of the ND6 gene was reported in many studies [24,34,35]. For instance, Pirola et al. [24] analysed the methylation of ND6, COX1, and the D-loop region with the use of quantitative methylation specific-PCR in the context of non-alcoholic fatty liver disease in humans [24]. The authors found a significant association between the condition of non-alcoholic steatohepatitis (NASH) and the methylation of the ND6 gene, which inversely correlated with ND6 transcription and protein expression in the liver affected by NASH [24]. The results reported in this paper showed the presence of CpG sites in the ND6 gene in all the analysed vertebrates and the number of CpG sites varying from 3 to 22. Noteworthy, all NADH dehydrogenase subunits (ND1, ND2, ND3, ND4, ND5, ND6) were rich in CpG sites in the mtDNA of the different species (Table 4). However, the ND3 and ND6 genes were rich in CpG sites only in the vertebrates. The regulation of single subunits of NADH dehydrogenase is not completely understood. In the case of oxidative stress, DNMT is upregulated and suppresses the expression of the ND6 gene through methylation. In turn, the downregulation of ND6 contributes to upregulation of ND1 [5]. ROS (reactive oxygen species) are targeted at mitochondria; hence, it has been proposed that the increased level of DNMT1 reflects adaption to oxidative stress. MTERF1 (mitochondrial terminator factor 1) probably interacts with m5C in CpG dinucleotides or with mtDNA and, consequently, DNMT1 is bound [5]. These results indicate that all the analysed sequences (with the exception of the L-strand of Drosophila melanogaster) have CpG sites in the ND1 gene; however, the CpG distribution in the ND6 gene is mainly limited to the H-strand of chordates. Moreover, the number of the CpG sites in ND1 was higher (from 2 to 58) than in ND6 (from 3 to 22), especially in Latimeria chalumnae, Ambystoma mexicanum, Crocodylus porosus, Gallus gallus, Mus musculus, Canis lupus familiaris, Pan troglodytes ellioti, and Homo sapiens (Table 4). The transcription of the H-strand of mtDNA starts at two initiation sites (H1, H2) within the control region. The produced transcript from H1 terminates at the 3′ end of the 16S rRNA gene and processes the two rRNAs and two tRNAs, whereas H2 terminates at the 5′ end of the 12S rRNA gene and generates a polycistronic molecule contributing to the mRNAs and most of the tRNAs encoded in the H strand [36]. The mitochondrial transcription termination factor (mTERF) has been associated with the H1 and H2 binding sites, but Martin et al. [36] evidenced only mTERF-bound by H1. The distribution of methylation of CpG sites in these regions may influence transcription regulation. Martin et al. [36] demonstrated that 12s rRNA is commonly methylated in the mitochondrial regions in animals. The CpG sites were widely distributed in the genomes of all the animals analysed in our study, but higher numbers were found in the vertebrates (Table 4). Methylation of the 16s rRNA gene was recognized as an emerging resistance mechanism against aminoglycosides and was evidenced in microorganisms that are often multidrug resistant. Methylation of 16s rRNA disturbs translation [37,38]. The present results showed the presence of CpG-rich regions of 16s rRNA genes in both the invertebrates and vertebrates but not in the carnivores (Canis lupus familiaris) and primates (Pan troglodytes ellioti and Homo sapiens). This may be caused by the 1-methyladenosine (m1A) modification in 16S rRNA catalysed by tRNA methyltransferase 61B (TRMT61B) [39]. In the analysis of mtDNA methylation, several challenges that can affect the correct detection of the levels of mtDNA methylation have to be overcome. The first one is the high mtDNA copy number in cells; it naturally varies from hundreds to thousands of copies depending on the cell type. Application of super-resolution microscopy provides more details. Currently, the number of mtDNA molecules per nucleoid in human cells is estimated at 1.4 [40]. Another problem is the presence of nuclear mitochondrial sequences (Numts), first denoted as “NUMT” in the cat [41], which refer to a DNA segment transferred from mtDNA to nDNA. This phenomenon was observed in various eukaryotes including plants (Arabiopsis thaliana, Oryza sativa), invertebrates (Caenorhabditis elegans, Drosophila melanogaster), and vertebrates (Mus musculus, Rattus norvegicus, and Homo sapiens) [42]. Therefore, the issue whether low levels of CpG methylation occur in mtDNA or whether it is caused by contamination by methylated NUMTs is being questioned [43]. Moreover, the circular structure of mtDNA influences bisulfite conversion and causes overestimation of mtDNA methylation [21].

5. Conclusions

The theoretical study on the distribution of CpG sites and islands in the mitochondrial genome of twelve model animal species provides interesting information about the localization of CpG-rich regions that can be methylated in specific cells in certain conditions. The CpG methylation in mtDNA exerts an impact on various molecular processes, including replication, translation, and gene expression. Since methylation in mtDNA, in comparison to methylation in nDNA, is still not sufficiently understood, research in this area is advisable.
  41 in total

1.  EMBOSS: the European Molecular Biology Open Software Suite.

Authors:  P Rice; I Longden; A Bleasby
Journal:  Trends Genet       Date:  2000-06       Impact factor: 11.639

2.  Impairment of methyl cycle affects mitochondrial methyl availability and glutathione level in Down's syndrome.

Authors:  Vittoria Infantino; Alessandra Castegna; Francesco Iacobazzi; Iolanda Spera; Iris Scala; Generoso Andria; Vito Iacobazzi
Journal:  Mol Genet Metab       Date:  2010-12-09       Impact factor: 4.797

Review 3.  Mitochondrial DNA mutations in human disease.

Authors:  Robert W Taylor; Doug M Turnbull
Journal:  Nat Rev Genet       Date:  2005-05       Impact factor: 53.242

4.  Methylation profiling by bisulfite sequencing analysis of the mtDNA Non-Coding Region in replicative and senescent Endothelial Cells.

Authors:  Valentina Bianchessi; Maria Cristina Vinci; Patrizia Nigro; Valeria Rizzi; Floriana Farina; Maurizio C Capogrossi; Giulio Pompilio; Valentina Gualdi; Andrea Lauri
Journal:  Mitochondrion       Date:  2016-02-22       Impact factor: 4.160

Review 5.  Prenatal exposure to oxidative phosphorylation xenobiotics and late-onset Parkinson disease.

Authors:  Eldris Iglesias; Alba Pesini; Nuria Garrido-Pérez; Patricia Meade; M Pilar Bayona-Bafaluy; Julio Montoya; Eduardo Ruiz-Pesini
Journal:  Ageing Res Rev       Date:  2018-04-22       Impact factor: 10.895

Review 6.  Biogenesis of the bc1 Complex of the Mitochondrial Respiratory Chain.

Authors:  Mama Ndi; Lorena Marin-Buera; Roger Salvatori; Abeer Prakash Singh; Martin Ott
Journal:  J Mol Biol       Date:  2018-05-04       Impact factor: 5.469

7.  Dynamic changes in the human methylome during differentiation.

Authors:  Louise Laurent; Eleanor Wong; Guoliang Li; Tien Huynh; Aristotelis Tsirigos; Chin Thing Ong; Hwee Meng Low; Ken Wing Kin Sung; Isidore Rigoutsos; Jeanne Loring; Chia-Lin Wei
Journal:  Genome Res       Date:  2010-02-04       Impact factor: 9.043

Review 8.  The role of mitochondria in the life of the nematode, Caenorhabditis elegans.

Authors:  William Y Tsang; Bernard D Lemire
Journal:  Biochim Biophys Acta       Date:  2003-07-14

9.  Mitochondrial DNA in mortal and immortal human cells. Genome number, integrity, and methylation.

Authors:  R J Shmookler Reis; S Goldstein
Journal:  J Biol Chem       Date:  1983-08-10       Impact factor: 5.157

10.  Mitochondrial 16S rRNA Is Methylated by tRNA Methyltransferase TRMT61B in All Vertebrates.

Authors:  Dan Bar-Yaacov; Idan Frumkin; Yuka Yashiro; Takeshi Chujo; Yuma Ishigami; Yonatan Chemla; Amit Blumberg; Orr Schlesinger; Philipp Bieri; Basil Greber; Nenad Ban; Raz Zarivach; Lital Alfonta; Yitzhak Pilpel; Tsutomu Suzuki; Dan Mishmar
Journal:  PLoS Biol       Date:  2016-09-15       Impact factor: 8.029

View more
  4 in total

1.  DNA methylation differs extensively between strains of the same geographical origin and changes with age in Daphnia magna.

Authors:  Jack Hearn; Fiona Plenderleith; Tom J Little
Journal:  Epigenetics Chromatin       Date:  2021-01-06       Impact factor: 4.954

Review 2.  The Trinity of cGAS, TLR9, and ALRs Guardians of the Cellular Galaxy Against Host-Derived Self-DNA.

Authors:  Vijay Kumar
Journal:  Front Immunol       Date:  2021-02-11       Impact factor: 7.561

Review 3.  Mitochondrial Short-Term Plastic Responses and Long-Term Evolutionary Dynamics in Animal Species.

Authors:  Sophie Breton; Fabrizio Ghiselli; Liliana Milani
Journal:  Genome Biol Evol       Date:  2021-07-06       Impact factor: 3.416

4.  Characterisation of the Complete Mitochondrial Genome of Critically Endangered Mustela lutreola (Carnivora: Mustelidae) and Its Phylogenetic and Conservation Implications.

Authors:  Jakub Skorupski
Journal:  Genes (Basel)       Date:  2022-01-10       Impact factor: 4.096

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.