Literature DB >> 33855227

Molecular assessment of proteins encoded by the mitochondrial genome of Clarias batrachus and Clarias gariepinus.

Gyanendra Bahadur Chand1, Sushant Kumar1, Gajendra Kumar Azad1.   

Abstract

The population of catfish, Clarias batrachus has substantially diminished in various countries and studies show that another related species Clarias gariepinus is replacing it. The better adaptability and survivability of C. gariepinus over C. batrachus could be attributed to the metabolic differences between these two species, which is primarily regulated by mitochondrial activities. To understand the reasons behind this phenomenon, we performed in silico analyses to decipher the differences between the proteins encoded by the mitochondrial genome of these two related species. Our analysis revealed that out of thirteen, twelve proteins encoded by the mitochondrial genome of these two species have substantial variations between them. We characterised these variations by analysing their effect on secondary structure, intrinsic disorder predisposition, and functional impact on protein and stability parameters. Our data show that most of the parameters are changing between these two closely related species. Altogether, we demonstrate the molecular insights into the mitochondrial genome-encoded proteins of these two species and predict their effect on protein function and stability that might be helping C. gariepinus to gain survivability better than the C. batrachus.
© 2021 The Authors.

Entities:  

Keywords:  Clarias batrachus; Clarias gariepinus; Mitochondrial genome; Protein stability; Variations

Year:  2021        PMID: 33855227      PMCID: PMC8024883          DOI: 10.1016/j.bbrep.2021.100985

Source DB:  PubMed          Journal:  Biochem Biophys Rep        ISSN: 2405-5808


Introduction

The popularity of Clarias batrachus (Linnaeus, 1758) has been attributed to its taste, high protein and low fat content, and its medicinal values [1,2]. Among Asian countries, the most prominent species of Clarias (family Clariidae) used for aquaculture are C. batrachus in India, C. fuscus (Taiwan and Hawaii) and C. macrocephalus (South-East Asia) [3]. Recently, the population of C. batrachus significantly declined due to the anthropogenic activities and the introduction of morphologically similar exotic catfish, C. gariepinus (Burchell,1822) [3,4]. Subsequently, C. gariepinus has further increased the vulnerability of C. batrachus [5,6] that has led to listing of C. batrachus in ‘‘Critically Endangered’’ category of the IUCN (IUCN, 2007) red list of threatened species [4]. Similar observations were also reported by Na-Nakorn and Brummett [7] who emphasized that the wild type population must be conserved from C. gariepinus, which may endanger the purity and viability of wild populations. Since, C. batrachus is slowly moving toward the extinction; therefore, it is rational to understand the genetic makeup of this species so that proper conservational strategies can be implemented. A study by Khedkar et al. [8] revealed the lack of genetic diversity in C. batrachus populations found in major riverine system of India that can contribute to inbreeding and disease susceptibility. A comparative study on C. batrachus and C. gariepinus from India revealed up to 99% substitution by C. gariepinus in fish markets [3]. At genetic level the C. gariepinus and C. batrachus have distinct characteristics. Karyotyping of these two species revealed that the C. gariepinus has 56 chromosomes (2n = 28) while C. batrachus has 104 chromosomes (2n = 52) [9]. Recently, C. batrachus genome has been sequenced and its annotation revealed the presence of 22,914 genes [10]. Similar study on C. gariepinus is required to better compare the overall genomic variations present between these two species. The better adaptation of the fish populations has been shown to be associated with positive selection pressure [11,12]. The molecular mechanisms of these adaptations are not properly understood; however, few studies have linked to the key processes of physiology, metabolism, better antioxidant and DNA repair mechanisms [[13], [14], [15]]. One of the most promising players required for better adaptation is the enhancement of energy metabolism. At cellular levels, the mitochondria regulate energy metabolism events and play an indispensable role in the species diversity and differentiation [16]. Therefore, mitochondrial genome (mitogenome) has been widely studied to understand the genetic diversity of different populations in fish species. Furthermore, the small genetic material of the mitogenome enables researchers to comprehensibly analyse the relatively rapid substitution rate as well as the maternal inheritance [17,18]. The variations in mitochondrial DNA (mtDNA) are influenced by environmental conditions affecting metabolic processes, which eventually led to implications in fitness, such as, caviomorph rodents related to anoxic subterranean environments [19], monkeys to altitude [20], killer whales (Orcinus orca) [21] and Pacific salmon (Oncorhynchus sp.) [22] to latitudinal clines. Altogether, studies on the regulation of mtDNA variations are keys for understanding the selection events. This study aimed to understand the molecular determinants of better survivability and adaptability of C. gariepinus over C. batrachus. Here, we analysed the variations present between the proteins encoded by the mitochondrial genome of C. gariepinus and C. batrachus to dissect the possible mechanisms of C. gariepinus adaptability. Our data revealed that there are many variations present in C. gariepinus protein that alters protein disorder parameters, secondary structure, and stability that might interfere in protein functioning. Recently, various algorithms have been developed to predict several aspects of protein structure, such as the prediction of secondary-structure [23], state of protein disorder, solvent accessibility of its amino acid residues [24], alteration in stability and function due to an amino acid substitution, sites for protein interactions [25], etc. Among these, predicting protein disorder has gained considerable interest in the last decade [26]. In eukaryotic proteins, the intrinsically disordered regions (IDRs) are abundant and are associated with various cellular functions [27,28]. IDRs are part of the protein that do not possess distinct three-dimensional structure, but are nevertheless functional [29]. Here, we used several bioinformatics tools to characterise the variations among the mitochondrial proteins of C. gariepinus and C. batrachus.

Material and methods

Sequence retrieval

As of 30th August 2020, the NCBI-genome-database has seven complete mitochondrial genome sequences of C. batrachus and C. gariepinus as shown in Table 1. Among these, three sequences were reported from both India (KC572134.1, NC_023923.1, and KM259918.1) and China (KY767672.1, KT001082.1, and NC_027661.1), while one from Hungary (KT809508.1). All three Indian sequences were for C. batrachus and the Hungarian sequence was for C. gariepinus. Of the three sequences reported from China, two were for C. gariepinus (KT001082.1 and NC_027661.1) and one was for C. batrachus (KY767672.1). Since, only China has reported both sequences of C. batrachus and C. gariepinus; therefore, we decided to perform a comparative study of the proteins encoded by the mitochondrial genome of these two species from China. Further, the two C. gariepinus sequence reported from China was similar (KT001082.1 and NC_027661.1), there were no variations present between them; therefore, we used one of the C. gariepinus sequence (NC_027661.1) for further analysis. We extracted the proteins encoded by the mitochondrial genome of KY767672.1 (C. batrachus) and NC_027661.1 (C. gariepinus) and used for analysis (Table 2). The protein identifier accession numbers of the proteins encoded by the mitochondrial genome of above two mentioned species are listed in Table 2.
Table 1

List of complete mitochondrial genome sequences of C. batrachus and C. gariepinus reported worldwide till August 2020.

S. No.SpeciesAccessionNumberSubmission dateReported fromAuthorsInstitute
1Clarias batrachusKY76767214-MAR-2017ChinaMa,A. et al.,College of Fisheries, Henan normal University, Henan, 453007, China
2Clarias gariepinusNC_02766101-JUN-2015ChinaHan,C. et al.,College of Life Science, Sun Yat-Sen University, Guangdong 510000, China
3Clarias gariepinusKT00108201-JUN-2015ChinaHan,C. et al.,College of Life Science, Sun Yat-Sen University, Guangdong 510000, China
4Clarias gariepinusKT80950822-SEP-2015HungaryKovacs, B. et al.,Department of Aquaculture, Szent Istvan University, Godollo 2100, Hungary
5Clarias batrachusKM25991801-AUG-2014IndiaKushwaha,B. et al.,Molecular Biology and Biotechnology,National Bureau of Fish Genetic Resources,UP, 226002, India
6Clarias batrachusKC57213401-FEB-2013IndiaMohindra,V. et al.,Fish Conservation Division, National Bureau of Fish Genetic Resources, UP, 226002, India
7Clarias batrachusNC_02392301-FEB-2013IndiaMohindra,V. et al.,Fish Conservation Division, National Bureau of Fish Genetic Resources, UP 226002, India
Table 2

List of protein Accession Number used in this study.

S. No.ProteinClarias batrachus (Accession number)Clarias gariepinus (Accession number)
1NADH dehydrogenase subunit 1AUX80750.1YP_009160664.1
2NADH dehydrogenase subunit 2AUX80751.1YP_009160665.1
3cytochrome c oxidase subunit 1AUX80752.1YP_009160666.1
4cytochrome c oxidase subunit 2AUX80753.1YP_009160667.1
5ATP synthase F0 subunit 8AUX80754.1YP_009160668.1
6ATP synthase F0 subunit 6AUX80755.1YP_009160669.1
7cytochrome c oxidase subunit 3AUX80756.1YP_009160670.1
8NADH dehydrogenase subunit 3AUX80757.1YP_009160671.1
9NADH dehydrogenase subunit 4LAUX80758.1YP_009160672.1
10NADH dehydrogenase subunit 4AUX80759.1YP_009160673.1
11NADH dehydrogenase subunit 5AUX80760.1YP_009160674.1
12NADH dehydrogenase subunit 6AUX80761.1YP_009160675.1
13cytochrome bAUX80762.1YP_009160676.1
List of complete mitochondrial genome sequences of C. batrachus and C. gariepinus reported worldwide till August 2020. List of protein Accession Number used in this study.

Multiple sequence alignments

The CLUSTAL Omega webserver developed at EMBL’s European Bioinformatics Institute, UK [30] was used for multiple sequence alignment with each mitochondrial genome encoded protein from C. batrachus and C. gariepinus.

Secondary structure predictions

To obtain a basic idea about the probable secondary structure, we compared the protein from C. batrachus and C. gariepinus using CFSSP (Chou and Fasman secondary structure prediction) [30] tool. This webserver predicts the most appropriate secondary structure, including, α-helix, β-sheet, and turns from the input peptide sequence.

Evaluating the protein intrinsic disorders

The intrinsic disorder distribution of each individual residues of the target polypeptide chain was evaluated by PONDR-VSL2 webserver [31]. The per-residue disorder predisposition scores have been designated on a scale from 0 to 1. The value of ‘0’ depicts fully ordered residues and ‘1’ represents fully disordered residues. Value ‘0.5’ is considered threshold above which is considered disordered while, below ‘0.5’ is considered ordered.

Prediction of impact of variation on protein function and stability

To predict the impact of variations on the function of protein, PROVEAN (Protein Variation Effect Analyzer) was used [32]. This tool predicts the implication of variation on protein function. The predefined threshold score of PROVEAN is -2.5. The variation is considered ‘deleterious’, if the score is equal to or below -2.5. Similarly, the variation is considered ‘neutral’, if the score is above the threshold. To analyse the effect of variations on protein stability I-mutant suite webserver was used [33]. I-mutant suite predicts the difference in free energy (ΔΔG) between the wild-type and mutant sequence. The positive value represents an increase in stability, while negative ΔΔG represents destabilisation.

Results

Identification of variations between the proteins encoded by the mitochondrial genome of Clarias batrachus and Clarias gariepinus

We downloaded (NCBI-genome database) the two mitochondrial sequences reported from China as discussed in methods and extracted the thirteen polypeptide sequences encoded by its mitogenome (Table 2). All these protein sequences obtained from C. batrachus and C. gariepinus were aligned using CLUSTAL Omega tool to check for similarities or variations. The C. batrachus mitochondrial proteins were used as wild-type sequences for this analysis. Using this algorithm, we identified the variations present between C.batrachus and C. gariepinus listed in Table 3. The most frequent variations were observed for NADH dehydrogenase subunit 3 that has 7.75% residues mutated (Table 3). Interestingly, we did not observe any change in amino acids for NADH dehydrogenase subunit 4L that means this protein is 100% conserved between these two species. The most number of amino acid variations at 29 sites was observed for NADH dehydrogenase subunit 5 (Table 3). Further, the location of the variation site for each protein is shown as schematics. Fig. 1 (A-F) shows the variations observed in NADH dehydrogenase subunits (subunit 1, 2, 3, 4, 5 and 6). The Fig. 2 (A-F) shows the variations observed for Cytochrome C Oxidase subunits (subunit 1, 2 and 3), ATP synthase F0 subunits (subunit 6 and 8) and Cytochrome b.
Table 3

Summary of variations observed between C. batrachus and C. gariepinus. The data was obtained by comparing the mitochondrial protein sequences of C. gariepinus and C. batrachus.

S. NoMitochondrial genome encoded ProteinsTotal Protein lengthNumber of point mutations observed in C. gariepinus (C. batrachus used as wild type)% mutant residues in C. gariepinus
1NADH dehydrogenase subunit 1324103.08
2NADH dehydrogenase subunit 2348226.32
3Cytochrome c oxidase subunit I51650.96
4Cytochrome c oxidase subunit II23052.12
5ATP synthase F0 subunit 85535.45
6ATP synthase F0 subunit 622720.88
7Cytochrome c oxidase subunit III26110.38
8NADH dehydrogenase subunit 311697.75
9NADH dehydrogenase subunit 4L98No mutation0
10NADH dehydrogenase subunit 4460112.39
11NADH dehydrogenase subunit 5608294.7
12NADH dehydrogenase subunit 6172Five point mutations and multiple mutations from residue 112-141
13Cytochrome b37971.84
Fig. 1

The sequence alignment of the NADH Dehydrogenase subunits encoded by mitochondrial genome of C. batrachus and C. gariepinus reported from China. (A–F) The amino acid substitutions are highlighted in the schematics of individual proteins.

Fig. 2

The sequence alignment of the Cytochrome C Oxidase subunits, ATP Synthase F0 subunits and Cytochrome b encoded by mitochondrial genome of C. batrachus and C. gariepinus reported from China. (A–F) The amino acid substitutions are highlighted in the schematics of individual proteins.

Summary of variations observed between C. batrachus and C. gariepinus. The data was obtained by comparing the mitochondrial protein sequences of C. gariepinus and C. batrachus. The sequence alignment of the NADH Dehydrogenase subunits encoded by mitochondrial genome of C. batrachus and C. gariepinus reported from China. (A–F) The amino acid substitutions are highlighted in the schematics of individual proteins. The sequence alignment of the Cytochrome C Oxidase subunits, ATP Synthase F0 subunits and Cytochrome b encoded by mitochondrial genome of C. batrachus and C. gariepinus reported from China. (A–F) The amino acid substitutions are highlighted in the schematics of individual proteins.

The secondary structures of proteins are altered due to the variations

Next, we studied the effect of these variations on the secondary structure of individual proteins. We used CFSSP webserver to predict the secondary structure of the polypeptide sequences, as explained in the methods section. Fig. 3 demonstrates the secondary structures of NADH dehydrogenase subunits (subunit 1, 2, 3, 5 and 6). The location of variations in secondary structure has been marked by an asterisk in each panel. The detailed analysis revealed that the NADH dehydrogenase subunit 1 has acquired changes in secondary structure at three positions (Fig. 3A, marked by asterisk). Among those, one position is leading to gain in the beta-sheet structure while rest two positions are leading to gain in alpha-helix (Fig. 3A, compare panel i and ii, location is marked by an asterisk). Similar analysis was performed for other subunits as well and our data show that NADH dehydrogenase subunit 2, 3, 5 and 6 have variations in secondary structures at 8, 2, 7 and 5 locations, respectively (Fig. 3B, C, E and F). The Cytochrome C Oxidase subunit 2 showed variation in secondary structure at three locations (Fig. 3D). We performed similar secondary structure predictions with other proteins that include Cytochrome C Oxidase subunits (subunit 1 and 3), ATP synthase F0 subunits (subunit 6 and 8) and Cytochrome b (supplementary fig. 1). Detailed analysis revealed that the change in the secondary structure was observed for Cytochrome C Oxidase subunit 1 at one location only (suppl. fig 1A), Cytochrome C Oxidase subunit 3 did not show any variation (suppl. fig. 1C). Similarly, ATP synthase F0 subunit 6 and 8 shows variation at one location each (suppl. Fig. 1D and E). The Cytochrome b also shows variation at one location only (suppl. fig. 1F). Altogether, our secondary structure prediction revealed alteration in secondary structure due to the variations in amino acid sequences between C. batrachus and C. gariepinus.
Fig. 3

Effect of amino acid substitutions on the secondary structure of proteins encoded by mitochondrial genome of C. batrachus and C. gariepinus. Panel (i) represents sequence of C. batrachus and panel (ii) represents sequence of C. gariepinus. (A–F) represents the individual proteins and asterisk show the locations of the variation in the secondary structure of C. gariepinus protein.

Effect of amino acid substitutions on the secondary structure of proteins encoded by mitochondrial genome of C. batrachus and C. gariepinus. Panel (i) represents sequence of C. batrachus and panel (ii) represents sequence of C. gariepinus. (A–F) represents the individual proteins and asterisk show the locations of the variation in the secondary structure of C. gariepinus protein.

The variations alter the intrinsic disorder parameters

The differences between the proteins encoded by the mitogenome of C. batrachus and C. gariepinus can be additionally demonstrated via the analysis of the per-residue intrinsic disorder predispositions of these proteins. Results of this analysis are shown in Fig. 4 and supplementary fig. 2, which illustrates the intrinsic disorder propensity of the proteins of the two species. The detailed analysis revealed that many amino acid substitutions do not lead to alteration in local disorder propensity; however, some of them cause noticeable alterations in the disorder predisposition (Fig. 4). For instance, NADH dehydrogenase subunit 1 local disorder predisposition in the vicinity of residue 100 and 175 was decreased in the C. gariepinus (Fig. 4A). The NADH dehydrogenase subunit 2 shows the loss of a peak at the end of the protein sequence in C. gariepinus (Fig. 4B). Similar analysis was performed using the other NADH dehydrogenase subunits and minor variations were observed for subunit 3, 4 and 5 (supplementary fig. 2). A new noticeable prominent peak was observed for NADH dehydrogenase subunit 6 of C. gariepinus (Fig. 4D) in the vicinity of residue 120, whereas the C. batrachus lacks it (Fig. 4D). Furthermore, the Cytochrome C Oxidase subunit 1, 2 and ATP synthase F0 subunit 6 show minor variations in disorder propensity (supplementary fig. 2). Cytochrome C Oxidase subunit 3 and Cytochrome b do not show any variation (supplementary fig. 2). The ATP synthase F0 subunit 8 shows an overall increase in protein disorder propensity (Fig. 4C). Altogether, our data indicate a change in intrinsic disorder propensity among few proteins encoded by mtDNA of these two species.
Fig. 4

Comparisons of the intrinsic disorder predisposition of the C. batrachus and C. gariepinus. A disorder threshold is indicated at score = 0.5, residues/regions with the disorder scores >0.5 are considered as disordered. (A–D) Each panel represents the disorder parameters of individual proteins including NADH dehydrogenase subunit 1, 2, 6 and ATP Synthase F0 subunit 8.

Comparisons of the intrinsic disorder predisposition of the C. batrachus and C. gariepinus. A disorder threshold is indicated at score = 0.5, residues/regions with the disorder scores >0.5 are considered as disordered. (A–D) Each panel represents the disorder parameters of individual proteins including NADH dehydrogenase subunit 1, 2, 6 and ATP Synthase F0 subunit 8.

The variations in amino acid also alter the protein functions and stability parameters

Subsequently, to understand the functional impact of the variations, we performed additional predictions that include protein function prediction (PROVEAN score) and change in stability (ΔΔG). We performed these two analyses for all variations observed in this study as shown in Table 4 and supplementary table 1 (complete list). The functional impact was demonstrated in terms of ‘neutral and deleterious’. Our analysis revealed that most of the variations were having no effect on protein function (neutral). However, few of them were deleterious to protein function (Table 4). The P92Q substitution in NADH dehydrogenase subunit 2 was observed as deleterious (Table 4). Furthermore, T26I of NADH dehydrogenase subunit 4, I192N of NADH dehydrogenase subunit 5 and M41G of Cytochrome b was found to be deleterious (Table 4 and supplementary table 1). Similarly, the stability predictions revealed that most of the variations decrease protein stability ΔΔG more than (-1.5 Kcal/mol) that includes I75F of NADH dehydrogenase subunit 1, I7T of NADH dehydrogenase subunit 3, I192 N and I548T of NADH dehydrogenase subunit 5. However, only one substitution S172N of NADH dehydrogenase subunit 1 (supplementary table 1) led to increase in stability with high reliability index (more than 7). Altogether, our analysis revealed that various amino acid substitutions alter protein functions and stability parameters.
Table 4

Protein structural analysis was performed by PROVEAN and I-mutant suite. The effect of mutation on protein function and stability was analysed using these two webservers.

NADH dehydrogenase subunit 1
VariantPROVEAN scorePrediction (cutoff = -2.5)ΔΔG value (Kcal/mol)StabilityReliability index
I75F−0.166Neutral−1.64Decrease8
M257T
−1.562
Neutral
−0.98
Decrease
7
NADH dehydrogenase subunit 2
Variant
PROVEAN score
Prediction (cutoff = -2.5)
ΔΔG value (Kcal/mol)
Stability
Reliability index
P92Q−6.87Deleterious−0.99Decrease8
I155T1.306Neutral−2Decrease8
L179M−1.156Neutral−0.99Decrease5
A223G−0.754Neutral−1.17Decrease7
T224A−0.061Neutral−1.07Decrease9
T327A
0.926
Neutral
−0.91
Decrease
7
NADH dehydrogenase subunit 3
Variant
PROVEAN score
Prediction (cutoff = -2.5)
ΔΔG value (Kcal/mol)
Stability
Reliability index
I7T
−1.135
Neutral
−1.74
decrease
2
NADH dehydrogenase subunit 4
Variant
PROVEAN score
Prediction (cutoff = -2.5)
ΔΔG value (Kcal/mol)
Stability
Reliability index
T45A−1.572Neutral−1.34decrease8
P184H−1.243Neutral−1.32decrease7
I381M
2.144
Neutral
−1.46
decrease
8
NADH dehydrogenase subunit 5
Variant
PROVEAN score
Prediction (cutoff = -2.5)
ΔΔG value (Kcal/mol)
Stability
Reliability index
L30V−0.729Neutral−1.29decrease7
Y38H2.038Neutral−1.28increase1
V39A−1.142Neutral−1.46decrease9
I45T1.663Neutral−2.2decrease8
A113S1.653Neutral−0.93decrease9
L117M0.532Neutral−1.37decrease7
I192N−4.605Deleterious−1.73decrease7
L207M−1.057Neutral−1.12decrease7
P275Q1.906Neutral−1.02decrease7
I472L−1.473Neutral−0.9decrease7
I548T1.504Neutral−1.66decrease2
V575L1.36Neutral−1.08decrease2
S577A
−1.158
Neutral
−1.03
decrease
7
NADH dehydrogenase subunit 6
Variant
PROVEAN score
Prediction (cutoff = -2.5)
ΔΔG value (Kcal/mol)
Stability
Reliability index
L96F−1.391Neutral−1.02decrease5
V102M
−0.471
Neutral
−1.11
decrease
6
Cytochrome c oxidase subunit 1
Variant
PROVEAN score
Prediction (cutoff = -2.5)
ΔΔG value (Kcal/mol)
Stability
Reliability index
L419V0.253Neutral−1.15decrease6
I460V
−0.294
Neutral
−0.97
decrease
7
Cytochrome c oxidase subunit 2
Variant
PROVEAN score
Prediction (cutoff = -2.5)
ΔΔG value (Kcal/mol)
Stability
Reliability index
T117A−0.316Neutral−1.08decrease5
L172M
0.575
Neutral
−1.35
decrease
7
ATP synthase F0 subunit 8
Variant
PROVEAN score
Prediction (cutoff = -2.5)
ΔΔG value (Kcal/mol)
Effect/stability
Reliability index
L29M
−0.39
Neutral
−1.08
decrease
8
Cytochrome b
Variant
PROVEAN score
Prediction (cutoff = -2.5)
ΔΔG value (Kcal/mol)
Stability
Reliability index
M41G−3.169Deleterious−1.06decrease8
F81L2.572Neutral−1.19decrease5
Protein structural analysis was performed by PROVEAN and I-mutant suite. The effect of mutation on protein function and stability was analysed using these two webservers.

Discussion

All 13 proteins encoded by mtDNA are components of the electron transport chain and play a critical role in the organismal energy metabolism [34]. Therefore, mitochondrial genes are key molecules that connect the mechanisms of the evolution of the energy consumption [35]. Even in higher eukaryotes, the mtDNA polymorphisms have been linked to altered mitochondrial matrix pH and intracellular calcium dynamics [36] and contribute to the generation of reactive oxygen species (ROS) in murine cells [37]. Further, rare functional variants in mtDNA enhances aerobic performance of human athletes [38]. Here, we have comparatively studied the proteins encoded by the mitochondrial genome of C. batrachus and C. gariepinus to understand the impact of variations on protein structure and function. Our data revealed that the NADH dehydrogenase subunit 3 of C. gariepinus displayed the highest variability with 7.75% residues substituted compared with C. batrachus. Other NADH dehydrogenases also showed variations including subunit 2 (6.32%), subunit 5 (4.7%), subunit 4 (2.39%) and subunit 1 (3.08%) indicating that NADH dehydrogenases exhibit high levels of substitution in C. gariepinus. ATP synthase F0 subunit 8 also showed high variation (5.45%). Cytochrome C oxidase subunits 1, 3 showed a low level of variation (<1%). Ample studies revealed a positive association of proteins involved in oxidative phosphorylation with increased energetic demands (reviewed in Ref. [39]). The variations observed between these two species might be favouring the energy metabolism of C. gariepinus. The rapid rate of mutation in mtDNA makes it possible to produce advantageous or disadvantageous phenotypes in a relatively short time. Variations in protein sequences correlate with adaptation, such as a study examined the rhodopsin sequences from 2056 species of fish identified the occurrence of the same missense mutation and its independent selection toward light environments across these fishes [40]. Earlier studies have linked the variations in the protein encoded by mitogenome might contribute to the phenotypic variation in poultry and livestock. For instance, in cattle, mtDNA variation has been linked to economic traits such as increased production of milk, calving rates, increased weight and many others [[41], [42], [43], [44]]. Our result correlates with these studies and it seems that the NADH dehydrogenases and others displayed the variability in the C. gariepinus that might be associated with their positive selection; however, future experimental data are required to validate our current in silico predictions. Furthermore, the NADH enzymes play a vital role in respiratory-chain activities, thus influencing energy supply [45,46]. Foote et al. [21] identified two positively selected amino acid sites in the mitochondrial genes of killer whales (Orcinus orca) that influences its metabolic performance. Similarly, positive selection of mitochondrial genes were also observed in migratory Pacific salmon [22]. Subsequently, we show that variations between the mitochondrial proteins among these two species translate to an alteration in secondary structure, intrinsic protein disorder, protein stability that can impact protein function. Our data revealed a considerable alteration in protein disorder parameters of NADH dehydrogenase subunit 1, 2, and 6 (Fig. 4) among these two species that can have potential impact on protein function. Several IDRs are involved in folding of their interacting partners with variable degrees, thereby the resultant complexes exhibit structural and functional heterogeneity [47,48]. Proteins with IDRs retain extreme thermal and acid stability as well as remain functional under these adverse conditions. Furthermore, IDRs provide advantages to its carriers, at the molecular, supra-molecular, and organismal levels [49]. Studies on IDRs of human proteome have revealed that a considerable fraction of disease-associated mutations reside within the IDRs and the substitutions of residues have significant functional impact [50,51]. Whether the high variation rate of the proteins observed here could be related to phenotypic variation between C. batrachus and C. gariepinus, needs further investigation. The information reported in this study could be useful for future studies on mitogenome variations between two closely related fish species and their role in better adaptation to the environmental conditions.

CRediT authorship contribution statement

Gyanendra Bahadur Chand: Methodology, Validation, Writing - original draft and & editing. Sushant Kumar: Methodology, Validation, Visualization, Writing - original draft and & editing. Gajendra Kumar Azad: Conceptualization, Supervision, Methodology, Validation, Visualization, Writing - original draft and & editing.

Declaration of competing interest

Authors declare no conflict of interests.
  42 in total

1.  ISIS: interaction sites identified from sequence.

Authors:  Yanay Ofran; Burkhard Rost
Journal:  Bioinformatics       Date:  2007-01-15       Impact factor: 6.937

2.  Polymorphisms of mitochondrial ATPase 8/6 genes and association with milk production traits in Holstein cows.

Authors:  Ying-He Qin; Shi-Yi Chen; Song-Jia Lai
Journal:  Anim Biotechnol       Date:  2012       Impact factor: 2.282

3.  Phylogenetic performance of mitochondrial protein-coding genes in resolving relationships among vertebrates.

Authors:  R Zardoya; A Meyer
Journal:  Mol Biol Evol       Date:  1996-09       Impact factor: 16.240

Review 4.  Molecular genetics of succinate:quinone oxidoreductase in eukaryotes.

Authors:  I E Scheffler
Journal:  Prog Nucleic Acid Res Mol Biol       Date:  1998

5.  High rates of substitution of the native catfish Clarias batrachus by Clarias gariepinus in India.

Authors:  Gulab D Khedkar; Anita D Tiknaik; Rushidkumar N Shinde; Amol D Kalyankar; Tetsuzan Benny Ron; David Haymer
Journal:  Mitochondrial DNA A DNA Mapp Seq Anal       Date:  2015-09-04       Impact factor: 1.514

6.  Predicting the functional effect of amino acid substitutions and indels.

Authors:  Yongwook Choi; Gregory E Sims; Sean Murphy; Jason R Miller; Agnes P Chan
Journal:  PLoS One       Date:  2012-10-08       Impact factor: 3.240

7.  Positive Darwinian selection in the piston that powers proton pumps in complex I of the mitochondria of Pacific salmon.

Authors:  Michael R Garvin; Joseph P Bielawski; Anthony J Gharrett
Journal:  PLoS One       Date:  2011-09-28       Impact factor: 3.240

8.  Identification of mitochondrial DNA polymorphisms that alter mitochondrial matrix pH and intracellular calcium dynamics.

Authors:  An-a Kazuno; Kae Munakata; Takeharu Nagai; Satoshi Shimozono; Masashi Tanaka; Makoto Yoneda; Nobumasa Kato; Atsushi Miyawaki; Tadafumi Kato
Journal:  PLoS Genet       Date:  2006-08       Impact factor: 5.917

9.  The EMBL-EBI search and sequence analysis tools APIs in 2019.

Authors:  Fábio Madeira; Young Mi Park; Joon Lee; Nicola Buso; Tamer Gur; Nandana Madhusoodanan; Prasad Basutkar; Adrian R N Tivey; Simon C Potter; Robert D Finn; Rodrigo Lopez
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

10.  Analysis of functional variants in mitochondrial DNA of Finnish athletes.

Authors:  Jukka Kiiskilä; Jukka S Moilanen; Laura Kytövuori; Anna-Kaisa Niemi; Kari Majamaa
Journal:  BMC Genomics       Date:  2019-10-29       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.