Literature DB >> 19997590

Comparative sequence analysis of the non-protein-coding mitochondrial DNA of inbred rat strains.

Avinash Abhyankar1, Hee-Bok Park, Giancarlo Tonolo, Holger Luthman.   

Abstract

The proper function of mammalian mitochondria necessitates a coordinated expression of both nuclear and mitochondrial genes, most likely due to the co-evolution of nuclear and mitochondrial genomes. The non-protein coding regions of mitochondrial DNA (mtDNA) including the D-loop, tRNA and rRNA genes form a major component of this regulated expression unit. Here we present comparative analyses of the non-protein-coding regions from 27 Rattus norvegicus mtDNA sequences. There were two variable positions in 12S rRNA, 20 in 16S rRNA, eight within the tRNA genes and 13 in the D-loop. Only one of the three neutrality tests used demonstrated statistically significant evidence for selection in 16S rRNA and tRNA-Cys. Based on our analyses of conserved sequences, we propose that some of the variable nucleotide positions identified in 16S rRNA and tRNA-Cys, and the D-loop might be important for mitochondrial function and its regulation.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19997590      PMCID: PMC2781161          DOI: 10.1371/journal.pone.0008148

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Mitochondria are the major energy producers in eukaryotic cells. Over millions of years of coexistence and coevolution, mitochondria have lost a considerable amount of their genome to the eukaryotic nuclear DNA [1], [2]. The mammalian mitochondrial DNA (mtDNA) encodes 37 genes, 13 of which form essential subunits of four mitochondrial respiratory chain complexes. The remaining genes for these complexes are encoded by the nuclear genome. Consequently, mitochondrial biogenesis, and hence function, needs an elaborate coordination of nuclear and mitochondrial gene expression [3], [4]. Apart from several ultra-short intergenic non-coding regions, mtDNA possesses a large non-coding D-loop that harbors regulatory regions for transcription and replication. The D-loop regulates mitochondrial replication and transcription in accordance with the energy demands, while the mitochondrial rRNAs and tRNAs ensure fulfillment of this task. Having its own genetic code different from the nuclear genetic code, mitochondria need their own protein biosynthesis system in the form of the mitochondrial ribosome (mitoribosome) built around 12S rRNA and 16S rRNA. The mitoribosome is responsible for the biosynthesis of the 13 proteins coded by the mtDNA and translates them with the help of 22 tRNAs also encoded by mtDNA. The non-protein-coding regions of the mtDNA are indispensable for cellular energy homeostasis, and genetic variation in these regions could have metabolic and fitness consequences. Since the protein-coding and the non-protein-coding regions of mtDNA serve different purposes – function and regulation of function – the variation pattern and the evolutionary pressures are expected to be different. Furthermore, the relative significance of coding sequence variation compared to the regulatory sequence variation, from an evolutionary perspective, remains poorly understood [5]. For this reason we investigated the protein-coding and the non-protein-coding regions separately. Here we present a molecular evolutionary analysis of the RNA genes and the D-loop of the rat mitochondrial genome. Information from 27 complete Rattus norvegicus mtDNA sequences was used.

Results

Ribosomal RNA Genes

The mitoribosome is composed of a small subunit consisting of 12S rRNA and 29 proteins and a large subunit consisting of 16S rRNA and 58 proteins [6]. Comparison of the 27 rat mtDNA sequences (Table S1) revealed seven variable positions in 12S rRNA, five of them unique to the wild rats (Table S2). Excluding the five variant positions unique to the wild rats, only positions 935 and 942 were considered for further analysis. None of these two variable positions alter the predicted 12S rRNA secondary structure or the free energy estimates. Mapping of these two sites on the consensus secondary structure for mammalian mitochondrial 12S rRNA showed that they are located in the 3′ minor domain [7]. However, we could not find any conservation at these two positions when compared to nine different mammalian species (data not shown). In 16S rRNA there were 23 variable positions, 20 of those were found among the inbred strains, while three variable positions were unique to the wild rats (Table S2). Within 16S rRNA, we noted a poly-C tract starting at position 1131 varying between five and eleven cytosines. Six of the variant positions were located in this poly-C tract. Taken together in haplotypes, the variant positions within 16S rRNA affect the topology and free energy estimates of the predicted secondary structures. We also assessed the conservation pattern for these variants using multiple alignments of nine different mammalian mitochondrial sequences. Of the variable positions in 16S rRNA only position 2170 was conserved among mammalian species; this C to T substitution is located in a 28-nucleotide long conserved sequence in close proximity to the L1-binding domain (Figure 1).
Figure 1

Location of variable position 2170 in the predicted secondary structure of the mammalian mitochondrial 16S rRNA.

In the enlarged L1 binding domain, position 2170 is encircled and highlighted by a red arrow. L1-BD denotes the L1 binding domain. I, II, III, IV, V and VI represent the rRNA domains, while black arrows represent the predicted tertiary interactions. Blue color represents regions predicted from comparative sequence analysis, orange colour represents predictions by Mfold software, while green colour in domain III represents alternative secondary structure predicted using Alifold software. The figure has been modified from reference [6].

Location of variable position 2170 in the predicted secondary structure of the mammalian mitochondrial 16S rRNA.

In the enlarged L1 binding domain, position 2170 is encircled and highlighted by a red arrow. L1-BD denotes the L1 binding domain. I, II, III, IV, V and VI represent the rRNA domains, while black arrows represent the predicted tertiary interactions. Blue color represents regions predicted from comparative sequence analysis, orange colour represents predictions by Mfold software, while green colour in domain III represents alternative secondary structure predicted using Alifold software. The figure has been modified from reference [6].

Transfer RNA Genes

The comparative analysis of the 22 tRNAs in mtDNA revealed a high degree of conservation. Only five of the 22 tRNAs had variable sites occurring in more than one strain (Table S2). All singletons were attributed to the wild rat sequences, except one at position 15350 that was unique to the WKY/NCrl strain. Three variable sites were observed in tRNA-Cys and two in tRNA-Pro, while tRNA-Tyr, tRNA-Asp and tRNA-Thr had one variable site each. There was a clear grouping pattern of the Wistar-derived and non-Wistar derived strains of the three variable positions in tRNA-Cys (positions 5200, 5202 and 5237). All strains originating from the Wistar rat (Table S2) shared the same allele at all these three positions indicating inheritance of an ancestral haplotype. At position 5202 the ‘Wistar’ allele was also shared by three wild rats – Wild/Cop, Wild/Tku and Wild/Mcwi. A similar Wistar-specific grouping was seen for the remaining four variable tRNA genes (tRNA-Tyr, tRNA-Asp, tRNA-Thr and tRNA-Pro). To assess the structural implications of these variants, we modeled their secondary structures based on consensus secondary structures of the mammalian mitochondrial tRNAs [8]. Figure 2 shows our secondary structure models for all the five tRNA genes.
Figure 2

Inferred secondary structures for (A) tRNA-Cys, (B) tRNA-Tyr, (C) tRNA-Asp, (D) tRNA-Thr and (E) tRNA-Pro.

Variable sites are highlighted in grey. The anticodon loop appears at the bottom of each model.

Inferred secondary structures for (A) tRNA-Cys, (B) tRNA-Tyr, (C) tRNA-Asp, (D) tRNA-Thr and (E) tRNA-Pro.

Variable sites are highlighted in grey. The anticodon loop appears at the bottom of each model.

The D-Loop

The only major non-coding region of the mitochondrial DNA is the D-loop. A total of 13 variable sites were found in the D-loop of inbred strains – eleven substitutions and two insertion/deletions. We mapped the known D-loop functional sites to the rat mitochondrial sequence [9], [10], [11], [12] (Table 1). Six substitutions were located in the termination associated sequences (TAS, ETAS), one substitution in the central block (CB), while the conserved sequence block 2 (MT-CSB2) had one insertion/deletion. Position 15460, located in ETAS1 deserves special attention since it is not only conserved between various Rattus species (R.rattus, R.exulans, R.tiomanicus, R.hoffmanni, R.tanezumi, and R.sordidus) but also in nine different mammalian species. The last nucleotide of the D-loop was also variable.
Table 1

Functional sites in the D-loop of rat mtDNA.

LocusDescriptionStart (bp)End (bp)Variants
ETAS1 Termination-associated sequence15446155031 substitution
TAS-D Termination associated sequence15497155111 substitution
TAS-C Termination associated sequence15520155310
TAS-B Termination associated sequence15541155541 substitution
TAS-A Termination associated sequence15571155841 substitution
ETAS2 Termination-associated sequence15511155722 substitutions
CB Central Block15673159791 substitution
MT-OHR H-strand origin16026160260
MT-CSB1 Conserved sequence block 116027160520
MT-CSB2 Conserved sequence block 216083160991 insertion/deletion
MT-CSB3 Conserved sequence block 316116161330
MT-LSP L-strand transcript initiation site16193161930
MT-TFL Tfam binding site16212162260
MT-TFH Tfam binding site16267162860
MT-HSP1 H-strand transcript initiation site16298162980

Nucleotide numbering represents position in the BN/SsNHsdMCW mtDNA sequence (Acc. No. NC_001665).

Nucleotide numbering represents position in the BN/SsNHsdMCW mtDNA sequence (Acc. No. NC_001665).

Tests for Selection

The Tajima's D test and the Fu & Li's D and F tests were performed on all the RNA genes and the D-loop to assess any deviation from neutrality. Since the results of these tests would also be influenced by population size changes, we estimated the FS and R2 statistics but did not find support for such population changes (data not shown). We found evidence for selection in 16S rRNA and tRNA-Cys based on Tajima's D test, whereas Fu and Li's D and F tests did not provide any evidence for selection in the RNA genes or in the D-loop (Table 2).
Table 2

Summary statistics for selection analyses in the non-protein-coding mtDNA.

FeatureNo. variable sitesNo. haplotypesDiversity per siteTajima's DFu & Li's DFu & Li's F
12S rRNA 320.0012.0370.8581.35
16S rRNA 2080.0032.56** 0.7651.48
tRNA-Cys 330.0222.323* 0.621.00
tRNA-Tyr 120.0071.5660.6421.01
tRNA-Asp 1201.5660.621.00
tRNA-Thr 120.0071.5050.6420.99
tRNA-Pro 230.003−1.164−1.558−1.66
D-loop 1260.0031.3341.1051.10

All estimates exclude singletons observed in the sequences from wild rats.

Tajima's D = Results of Tajima's D test.

Fu & Li's D = Results of Fu & Li's D test with outgroup.

Fu & Li's F = Results of Fu & Li's F test with outgroup.

 = P<0.05.

 = P<0.01.

All estimates exclude singletons observed in the sequences from wild rats. Tajima's D = Results of Tajima's D test. Fu & Li's D = Results of Fu & Li's D test with outgroup. Fu & Li's F = Results of Fu & Li's F test with outgroup. = P<0.05. = P<0.01.

Discussion

Mitochondrial DNA encodes few but essential components of the respiratory chain complexes I, III, IV and V. The two ribosomal RNAs provide a scaffold for the mitochondrial ribosomal proteins (MRPs). The mammalian mitoribosome has significantly reduced RNA content as compared to its bacterial counterpart; this reduction is accounted for by an increase in the number of MRPs [13], [14]. This reduction exerts strict structural constraints on the ribosomal RNAs for efficient and accurate function. In bacteria and archea the ribosomal protein L1 has a dual function as a ribosomal protein binding 23S rRNA and as a translational repressor by binding mRNA [15], [16]. The L1 binding domain in the mammalian mitochondrial 16S rRNA was found to be highly conserved [17]. According to our analysis, only one of the observed variable positions in the rat 16S rRNA (position 2170) is highly conserved and might be of functional importance due to its close proximity to the L1 binding domain (Figure 1). Out of the 22 tRNA genes only five had variant positions among the 27 investigated rat sequences. According to our prediction, tRNA-Cys variant A5202G could potentially have a destabilizing effect on its secondary structure and compromise the efficiency of cystein incorporation in a growing peptide chain. Stem-loop structures in the vicinity of the L-strand origin are also important for accurate and efficient replication of mtDNA [18], [19], [20]. Two of the three tRNA-Cys variants (positions 5200 and 5202) are located in these loop structures. Taken together, the observed variation in the rat mitochondrial tRNA-Cys might not only affect the role as a tRNA but also affect priming of L-strand replication. Mitochondria have an unusually high capacity for initiation of DNA replication, higher than needed for maintenance of mtDNA copy number. However, almost 95 percent of the replication events terminate prematurely resulting in formation of the 7S DNA [21]. Specific conserved short sequences have been identified that are associated with this premature termination event and are referred to as TAS and ETAS (extended TAS) elements [11], [22]. It has been shown that this replication termination might regulate the mtDNA copy number [23], [24]. The levels of mtDNA within a cell change according to the oxidative needs and, coupled with transcription, defines the oxidative capacity of the cell. Eight variant nucleotide positions within the D-loop were located in known functional sites. However, analysis of mitochondrial D-loop sequences from 27 mammalian species revealed a length variation in the ETAS sequences [25]. Moreover, in the human mtDNA two regions, HV1 and HV2, have been shown to be hypervariable [26], [27]. HV1 in human mtDNA corresponds to positions 15284−15643 in the rat and hence, the variant nucleotide positions at these locations might not lead to major functional changes (Table 1). However, the D-loop variations located within the central block (CB) and conserved sequence block 2 (MT-CSB2) might affect mitochondrial biogenesis, since they are located outside the two hypervariable regions. The results of the neutrality tests did not provide obvious evidence for selection in any of the non-protein-coding regions. Only Tajima's D test provided evidence for selection in 16S rRNA and tRNA-Cys. The disagreement between the tests is likely caused by the different approaches employed to identify deviation from neutrality. The two Fu & Li's tests consider the genealogy of the sequences used to estimate the statistics, while the Tajima's test is genealogy independent. Considering the different sensitivities of the neutrality tests to the number of variable sites, these results must be interpreted with caution. Moreover, due to high mutation rate in mtDNA, especially in the D-loop, it is not possible to account for reverse-mutations, and hence we cannot completely rule out selection with the methods used. It should also be considered that the results presented here are based on analysis of 23 inbred strains and only four sequences from wild rats. In conclusion, we have identified a few sites in the RNA genes and the D-loop that might play a role in mitochondrial biogenesis and maintenance.

Materials and Methods

Sequences and Analysis

Twenty seven complete Rattus norvegicus mtDNA sequences available in public databases were used, 13 of which have been sequenced in our laboratory (Table S1). The wild rats included in the study were caught at different geographical locations – Wild/Swe (Malmö, Sweden), Wild/Mcwi (Milwaukie, USA), Wild/Cop (Copenhagen, Denmark), and Wild/Tku (Tokyo, Japan). Total genomic DNA extracted from rat tail was used to PCR amplify mtDNA with 32 overlapping primer pairs (Table S3). PCR products were cleaned with ExoSAP-IT (USB Corporation). Cycle sequencing was performed using BigDye (Applied Biosystems) followed by ethanol-EDTA precipitation and separation on ABI3730 DNA Sequencer (RSKC-Malmö core facility). The sequences were processed with Phred [28], [29] to assign quality values to each base call and assembled with the STADEN software [30].

Comparative Sequence Analysis of the Non-Coding Mitochondrial DNA

Multiple sequence alignments were computed using ClustalX [31] and visually inspected. DnaSP v. 4.50.3 [32] was used to estimate the nucleotide statistics (segregating sites, haplotypes, nucleotide diversity). Since no crystal structure data are available for mammalian mitochondrial ribosomal RNAs, we referred to the predicted models for the mammalian mitoribosome [6], [33], [34]. Selected rat mitochondrial tRNA secondary structures were modeled on the predicted mammalian mitochondrial tRNA structures [8]. To assess the impact of variations in the RNA genes Mfold web server was used to compute the minimum free energy structures [35].

Tests to Identify Selection

Tajima's D test [36], Fu & Li's D and F tests with outgroup [37] were performed using DnaSP v. 4.50.3 [32]. For the Fu & Li's tests, we used the mouse reference mtDNA sequence (NC_005089) as outgroup. All these tests assess whether the DNA sequence is evolving randomly (neutrally) or by a non-random process. Non-random processes stand for either directional selection or balancing selection. However, non-random events might also be due to changes in the population size. To assess the effect of population changes we estimated the FS [38] and R2 [39] statistics using the DnaSP program. Fu's FS test estimates population changes by considering the number of different haplotypes in the sample, while the R2 compares the difference between the number of singleton mutations and the average number of nucleotide differences. In lieu of coalescence based permutation tests, the selection tests were assessed for sensitivity to the number of segregating sites. Both the Fu & Li's tests were considerably less sensitive to the number of segregating sites modeled on the dataset. Since the wild rat sequences were quiet divergent from the inbred population we did not include these wild rat sequences in the neutrality tests. Rat strains and mtDNA sequences (0.04 MB DOC) Click here for additional data file. Variable positions in the rat non-protein-coding mtDNA (0.06 MB XLS) Click here for additional data file. Primer pairs used for PCR and DNA sequence analysis (0.05 MB DOC) Click here for additional data file.
  42 in total

Review 1.  The Staden sequence analysis package.

Authors:  R Staden
Journal:  Mol Biotechnol       Date:  1996-06       Impact factor: 2.695

2.  Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection.

Authors:  Y X Fu
Journal:  Genetics       Date:  1997-10       Impact factor: 4.562

3.  Mammalian mitochondrial D-loop region structural analysis: identification of new conserved sequences and their functional and evolutionary implications.

Authors:  E Sbisà; F Tanzariello; A Reyes; G Pesole; C Saccone
Journal:  Gene       Date:  1997-12-31       Impact factor: 3.688

4.  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Authors:  F Tajima
Journal:  Genetics       Date:  1989-11       Impact factor: 4.562

5.  Statistical tests of neutrality of mutations.

Authors:  Y X Fu; W H Li
Journal:  Genetics       Date:  1993-03       Impact factor: 4.562

6.  Elongation of displacement-loop strands in human and mouse mitochondrial DNA is arrested near specific template sequences.

Authors:  J N Doda; C T Wright; D A Clayton
Journal:  Proc Natl Acad Sci U S A       Date:  1981-10       Impact factor: 11.205

7.  The complete nucleotide sequence of the Rattus norvegicus mitochondrial genome: cryptic signals revealed by comparative analysis between vertebrates.

Authors:  G Gadaleta; G Pepe; G De Candia; C Quagliariello; E Sbisà; C Saccone
Journal:  J Mol Evol       Date:  1989-06       Impact factor: 2.395

8.  Identification by in Organello footprinting of protein contact sites and of single-stranded DNA sequences in the regulatory region of rat mitochondrial DNA. Protein binding sites and single-stranded DNA regions in isolated rat liver mitochondria.

Authors:  P Cantatore; L Daddabbo; F Fracasso; M N Gadaleta
Journal:  J Biol Chem       Date:  1995-10-20       Impact factor: 5.157

9.  Secondary structure and patterns of evolution among mammalian mitochondrial 12S rRNA molecules.

Authors:  M S Springer; E Douzery
Journal:  J Mol Evol       Date:  1996-10       Impact factor: 2.395

10.  Mechanism of replication of human mitochondrial DNA. Localization of the 5' ends of nascent daughter strands.

Authors:  D P Tapper; D A Clayton
Journal:  J Biol Chem       Date:  1981-05-25       Impact factor: 5.157

View more
  8 in total

1.  Mitochondrial polymorphisms in rat genetic models of hypertension.

Authors:  Sivarajan Kumarasamy; Kathirvel Gopalakrishnan; Asher Shafton; Jeremy Nixon; Jayakumar Thangavel; Phyllis Farms; Bina Joe
Journal:  Mamm Genome       Date:  2010-05-05       Impact factor: 2.957

2.  Genetic monitoring detects an overlooked cryptic species and reveals the diversity and distribution of three invasive Rattus congeners in South Africa.

Authors:  Armanda D Bastos; Deenadayalan Nair; Peter J Taylor; Helene Brettschneider; Frikkie Kirsten; Elmarie Mostert; Emil von Maltitz; Jennifer M Lamb; Pim van Hooft; Steven R Belmain; Giancarlo Contrafatto; Sarah Downs; Christian T Chimimba
Journal:  BMC Genet       Date:  2011-02-16       Impact factor: 2.797

3.  Identification of candidate gene regions in the rat by co-localization of QTLs for bone density, size, structure and strength.

Authors:  Sofia Lagerholm; Hee-Bok Park; Holger Luthman; Marc Grynpas; Fiona McGuigan; Maria Swanberg; Kristina Åkesson
Journal:  PLoS One       Date:  2011-07-27       Impact factor: 3.240

4.  Polyamines in the life of Arabidopsis: profiling the expression of S-adenosylmethionine decarboxylase (SAMDC) gene family during its life cycle.

Authors:  Rajtilak Majumdar; Lin Shao; Swathi A Turlapati; Subhash C Minocha
Journal:  BMC Plant Biol       Date:  2017-12-28       Impact factor: 4.215

5.  Genomic analyses identify multiple Asian origins and deeply diverged mitochondrial clades in inbred brown rats (Rattus norvegicus).

Authors:  Emily E Puckett; Olivia Micci-Smith; Jason Munshi-South
Journal:  Evol Appl       Date:  2017-12-07       Impact factor: 5.183

6.  Droplet digital PCR shows the D-Loop to be an error prone locus for mitochondrial DNA copy number determination.

Authors:  Brian Li; Sonal Kaushik; Pola Kalinowski; BaRun Kim; Cynthia Gershome; Joyce Ching; Damon Poburko
Journal:  Sci Rep       Date:  2018-07-30       Impact factor: 4.379

7.  Mitogenome Sequencing in the Genus Camelus Reveals Evidence for Purifying Selection and Long-term Divergence between Wild and Domestic Bactrian Camels.

Authors:  Elmira Mohandesan; Robert R Fitak; Jukka Corander; Adiya Yadamsuren; Battsetseg Chuluunbat; Omer Abdelhadi; Abdul Raziq; Peter Nagy; Gabrielle Stalder; Chris Walzer; Bernard Faye; Pamela A Burger
Journal:  Sci Rep       Date:  2017-08-30       Impact factor: 4.379

8.  Characterisation of the Complete Mitochondrial Genome of Critically Endangered Mustela lutreola (Carnivora: Mustelidae) and Its Phylogenetic and Conservation Implications.

Authors:  Jakub Skorupski
Journal:  Genes (Basel)       Date:  2022-01-10       Impact factor: 4.096

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.