| Literature DB >> 28149042 |
Tariq Abdullah1, Muniba Faiza1, Prashant Pant2, Mohd Rayyan Akhtar3, Pratibha Pant4.
Abstract
BACKGROUND: Single nucleotide substitutions (SNS) in genetic codon are of prime importance due to their ability to alter an amino acid sequence as a result. Given the nature of genetic code, any SNS is expected to change the protein sequence randomly into any of the 64 possible codons. In this paper, we present a theoretical analysis of how single nucleotide substitutions in genetic codon may affect resulting amino acid residue and what is the most likely amino acid that will get selected as a result.Entities:
Keywords: Mutation; codon; genetic code; point mutation; single base substitutions or single nucleotide substitutions
Year: 2016 PMID: 28149042 PMCID: PMC5267951 DOI: 10.6026/97320630012098
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1A program code to find out all possible codons due to single nucleodite substitution in genetic code.
An amino acid replacement probability matrix (ARPM) calculated with the help of Perl program, showing the probability of a replacement event (raw data) of an amino acid by another due to single base replacement (SBR) mutation. Color coding of the cells was done to show the nature/group to which the amino acid belongs as shown in the table below.
| G | A | V | L | I | P | F | Y | W | S | T | C | M | N | Q | K | R | H | D | E | ST | |
| G | 12/36 | 4/36 | 4/36 | 0/36 | 0/36 | 0/36 | 0/36 | 0/36 | 1/36 | 2/36 | 0/36 | 2/36 | 0/36 | 0/36 | 0/36 | 0/36 | 6/36 | 0/36 | 2/36 | 2/36 | 1/36 |
| A | 4/36 | 12/36 | 4/36 | 0/36 | 0/36 | 4/36 | 0/36 | 0/36 | 0/36 | 4/36 | 4/36 | 0/36 | 0/36 | 0/36 | 0/36 | 0/36 | 0/36 | 0/36 | 2/36 | 2/36 | 0/36 |
| V | 4/36 | 4/36 | 12/36 | 6/36 | 3/36 | 0/36 | 2/36 | 0/36 | 0/36 | 0/36 | 0/36 | 0/36 | 1/36 | 0/36 | 0/36 | 0/36 | 0/36 | 0/36 | 2/36 | 2/36 | 0/36 |
| L | 0/54 | 0/54 | 6/54 | 19/54 | 4/54 | 4/54 | 6/54 | 0/54 | 1/54 | 2/54 | 0/54 | 0/54 | 2/54 | 0/54 | 2/54 | 0/54 | 4/54 | 2/54 | 0/54 | 0/54 | 3/54 |
| I | 0/27 | 0/27 | 3/27 | 4/27 | 6/27 | 0/27 | 2/27 | 0/27 | 0/27 | 2/27 | 3/27 | 0/27 | 3/27 | 2/27 | 0/27 | 1/27 | 1/27 | 0/27 | 0/27 | 0/27 | 0/27 |
| P | 0/36 | 4/36 | 0/36 | 4/36 | 0/36 | 12/36 | 0/36 | 0/36 | 0/36 | 4/36 | 4/36 | 0/36 | 0/36 | 0/36 | 2/36 | 0/36 | 4/36 | 2/36 | 0/36 | 0/36 | 0/36 |
| F | 0/18 | 0/18 | 2/18 | 6/18 | 2/18 | 0/18 | 2/18 | 2/18 | 0/18 | 2/18 | 0/18 | 2/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 |
| Y | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 2/18 | 0/18 | 2/18 | 0/18 | 2/18 | 0/18 | 2/18 | 0/18 | 0/18 | 0/18 | 2/18 | 2/18 | 0/18 | 4/18 |
| W | 1/9 | 0/9 | 0/9 | 1/9 | 0/9 | 0/9 | 0/9 | 0/9 | 0/9 | 1/9 | 0/9 | 2/9 | 0/9 | 0/9 | 0/9 | 0/9 | 2/9 | 0/9 | 0/9 | 0/9 | 2/9 |
| S | 2/54 | 4/54 | 0/54 | 2/54 | 2/54 | 4/54 | 2/54 | 2/54 | 1/54 | 14/54 | 6/54 | 4/54 | 0/54 | 2/54 | 0/54 | 0/54 | 6/54 | 0/54 | 0/54 | 0/54 | 3/54 |
| T | 0/36 | 1/36 | 0/36 | 0/36 | 3/36 | 1/36 | 0/36 | 1/36 | 0/36 | 3/36 | 6/36 | 0/36 | 1/36 | 6/36 | 2/36 | 2/36 | 2/36 | 1/36 | 1/36 | 2/36 | 2/36 |
| C | 2/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 2/18 | 2/18 | 4/18 | 0/18 | 2/18 | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 0/18 | 0/18 | 0/18 | 2/18 |
| M | 0/9 | 0/9 | 1/9 | 2/9 | 3/9 | 0/9 | 0/9 | 0/9 | 0/9 | 0/9 | 1/9 | 0/9 | 0/9 | 0/9 | 0/9 | 1/9 | 1/9 | 0/9 | 0/9 | 0/9 | 0/9 |
| N | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 0/18 | 0/18 | 2/18 | 0/18 | 2/18 | 2/18 | 0/18 | 0/18 | 2/18 | 0/18 | 4/18 | 0/18 | 2/18 | 2/18 | 0/18 | 0/18 |
| Q | 0/18 | 0/18 | 0/18 | 2/18 | 0/18 | 2/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 2/18 | 2/18 | 4/18 | 0/18 | 2/18 | 2/18 |
| K | 0/18 | 0/18 | 0/18 | 0/18 | 1/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 0/18 | 1/18 | 4/18 | 2/18 | 2/18 | 2/18 | 0/18 | 0/18 | 2/18 | 2/18 |
| R | 6/54 | 0/54 | 0/54 | 4/54 | 1/54 | 4/54 | 0/54 | 0/54 | 2/54 | 6/54 | 2/54 | 2/54 | 1/54 | 0/54 | 2/54 | 2/54 | 18/54 | 2/54 | 0/54 | 0/54 | 2/54 |
| H | 0/18 | 0/18 | 0/18 | 2/18 | 0/18 | 2/18 | 0/18 | 2/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 4/18 | 0/18 | 2/18 | 2/18 | 2/18 | 0/18 | 0/18 |
| D | 2/18 | 2/18 | 2/18 | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 0/18 | 0/18 | 0/18 | 2/18 | 2/18 | 4/18 | 0/18 |
| E | 2/18 | 2/18 | 2/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 0/18 | 2/18 | 2/18 | 0/18 | 0/18 | 4/18 | 2/18 | 2/18 |
| ST | 1/27 | 0/27 | 0/27 | 3/27 | 0/27 | 0/27 | 0/27 | 4/27 | 2/27 | 3/27 | 0/27 | 2/27 | 0/27 | 0/27 | 2/27 | 2/27 | 2/27 | 0/27 | 0/27 | 2/27 | 4/27 |
Refined ARPM table with probability values in two decimal places. Replacement event with no (zero) probabilities were left blank. Color coding of the cells was done to show the nature/group to which the amino acid belong.
| G | A | V | L | I | P | F | Y | W | S | T | C | M | N | Q | K | R | H | D | E | STOP | |
| G | 0.33 | 0.11 | 0.11 | 0.03 | 0.06 | 0.06 | 0.06 | 0.06 | 0.03 | ||||||||||||
| A | 0.11 | 0.33 | 0.11 | 0.11 | 0.11 | 0.11 | 0.06 | 0.06 | |||||||||||||
| V | 0.11 | 0.11 | 0.33 | 0.17 | 0.08 | 0.06 | 0.03 | 0.06 | 0.06 | ||||||||||||
| L | 0.11 | 0.35 | 0.07 | 0.07 | 0.11 | 0.02 | 0.04 | 0.04 | 0.04 | 0.07 | 0.04 | 0.06 | |||||||||
| I | 0.11 | 0.15 | 0.22 | 0.07 | 0.07 | 0.11 | 0.11 | 0.07 | 0.04 | 0.04 | |||||||||||
| P | 0.11 | 0.11 | 0.33 | 0.11 | 0.11 | 0.06 | 0.11 | 0.06 | |||||||||||||
| F | 0.33 | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | |||||||||||||||
| Y | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.22 | |||||||||||||
| W | 0.11 | 0.11 | 0.11 | 0.22 | 0.22 | 0.22 | |||||||||||||||
| S | 0.04 | 0.07 | 0.04 | 0.04 | 0.07 | 0.04 | 0.04 | 0.02 | 0.26 | 0.11 | 0.07 | 0.04 | 0.11 | 0.06 | |||||||
| T | 0.03 | 0.08 | 0.03 | 0.03 | 0.08 | 0.17 | 0.03 | 0.17 | 0.06 | 0.06 | 0.06 | 0.03 | 0.03 | 0.06 | 0.06 | ||||||
| C | 0.11 | 0.11 | 0.11 | 0.11 | 0.22 | 0.11 | 0.11 | 0.11 | |||||||||||||
| M | 0.11 | 0.22 | 0.33 | 0.11 | 0.11 | 0.11 | |||||||||||||||
| N | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.22 | 0.11 | 0.11 | |||||||||||||
| Q | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.22 | 0.11 | 0.11 | |||||||||||||
| K | 0.06 | 0.11 | 0.06 | 0.22 | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | ||||||||||||
| R | 0.11 | 0.07 | 0.02 | 0.07 | 0.04 | 0.11 | 0.04 | 0.04 | 0.02 | 0.04 | 0.04 | 0.33 | 0.04 | 0.04 | |||||||
| H | 0.11 | 0.11 | 0.11 | 0.11 | 0.22 | 0.11 | 0.11 | 0.11 | |||||||||||||
| D | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.22 | |||||||||||||
| E | 0.11 | 0.11 | 0.11 | 0.11 | 0.11 | 0.22 | 0.11 | 0.11 | |||||||||||||
| STOP | 0.04 | 0.11 | 0.15 | 0.07 | 0.11 | 0.07 | 0.07 | 0.07 | 0.07 | 0.07 | 0.15 |
Ratio of Possible and observed replacements tabulated group-wise. Values in this table were calculated after counting the total number of possible one-group-to-other transitions in Table 2 and dividing it by the total number of cells containing non-zero values. For example, there are 36 possible non polar to non polar amino acid transition (color coded yellow) in Table 2 yet only 21 cells have a non zero value, Hence we get 21/36 = 0.58 .
| NPA | ARM | PUH | SC | AMDC | B+ | A- | STP | |
| NPA | 0.58 | 0.28 | 0.67 | 0.33 | 0.25 | 0.39 | 0.50 | 0.33 |
| ARM | 0.28 | 0.44 | 0.50 | 0.50 | 0.25 | 0.22 | 0.17 | 0.67 |
| PUH | 0.67 | 0.67 | 1.00 | 0.50 | 0.75 | 0.67 | 0.50 | 1.00 |
| SC | 0.33 | 0.50 | 0.50 | 0.25 | 0.00 | 0.50 | 0.00 | 0.50 |
| AMDC | 0.25 | 0.17 | 0.50 | 0.00 | 0.50 | 0.83 | 0.50 | 0.50 |
| B+ | 0.58 | 0.22 | 0.50 | 0.50 | 0.83 | 0.78 | 0.33 | 0.67 |
| A- | 0.50 | 0.17 | 0.00 | 0.00 | 0.50 | 0.33 | 1.00 | 0.50 |
| STP | 0.33 | 0.67 | 0.50 | 0.50 | 0.50 | 0.67 | 0.50 | 1.00 |
Figure 2Multiple sequence Alignment (MSA) of the nucleotide of AC4 and ACI ORFs of Papaya Leaf Curl Virus Single Nucleotode Substitutions observed in AV4 and other ORFs (data not shown) showed similar amino acid substitutions (synonymous as well as non-synonymous) at the protein sequence level as depicted in Singh-Pant et al. (2012).