| Literature DB >> 18230160 |
Pere Puigbò1, Ignacio G Bravo, Santiago Garcia-Vallvé.
Abstract
BACKGROUND: The Codon Adaptation Index (CAI) is a measure of the synonymous codon usage bias for a DNA or RNA sequence. It quantifies the similarity between the synonymous codon usage of a gene and the synonymous codon frequency of a reference set. Extreme values in the nucleotide or in the amino acid composition have a large impact on differential preference for synonymous codons. It is thence essential to define the limits for the expected value of CAI on the basis of sequence composition in order to properly interpret the CAI and provide statistical support to CAI analyses. Though several freely available programs calculate the CAI for a given DNA sequence, none of them corrects for compositional biases or provides confidence intervals for CAI values.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18230160 PMCID: PMC2246156 DOI: 10.1186/1471-2105-9-65
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Analysis of human mitochondrial genes that encode a subunit of complexes I-V of the mitochondrial respiratory chain encoded in the nuclear (a) or mitochondrial (b) genome.
| NDUFS1 | 2184 | 0.695 | * | 0.683 | 0.434 | 0.519 | 0.836 | |||
| NDUFS2 | 1392 | 0.765 | ** | 0.734 | 0.391 | 0.500 | 0.782 | |||
| NDUFS3 | 795 | 0.754 | * | 0.750 | 0.402 | 0.488 | 0.824 | |||
| NDUFS7 | 642 | 0.867 | ** | 0.780 | 0.442 | 0.446 | 0.991 | |||
| NDUFS8 | 633 | 0.868 | ** | 0.796 | 0.439 | 0.465 | 0.944 | |||
| NDUFV1 | 1395 | 0.825 | ** | 0.774 | 0.417 | 0.482 | 0.865 | |||
| NDUFV2 | 750 | 0.695 | 0.703 | 0.989 | 0.449 | 0.519 | 0.865 | |||
| SDHC | 510 | 0.699 | * | 0.679 | 0.377 | 0.457 | 0.825 | |||
| SDHD | 480 | 0.663 | * | 0.654 | 0.387 | 0.464 | 0.834 | |||
| SDHA | 1995 | 0.768 | * | 0.750 | 0.423 | 0.496 | 0.853 | |||
| SDHB | 843 | 0.778 | ** | 0.754 | 0.454 | 0.481 | 0.944 | |||
| UQCRFS1 | 825 | 0.711 | * | 0.711 | 0.391 | 0.483 | 0.810 | |||
| CYC1 | 978 | 0.759 | * | 0.750 | 0.379 | 0.449 | 0.844 | |||
| COX10 | 1332 | 0.744 | ** | 0.713 | 0.454 | 0.462 | 0.983 | |||
| COX11 | 831 | 0.738 | * | 0.725 | 0.407 | 0.513 | 0.793 | |||
| COX15 | 1140 | 0.707 | * | 0.688 | 0.411 | 0.472 | 0.871 | |||
| ATP5B | 1590 | 0.714 | * | 0.698 | 0.412 | 0.507 | 0.813 | |||
| ATP5A1 | 1512 | 0.695 | * | 0.684 | 0.409 | 0.519 | 0.788 | |||
| ATP5C1 | 897 | 0.726 | * | 0.705 | 0.463 | 0.509 | 0.910 | |||
| ATP5O | 642 | 0.700 | ** | 0.681 | 0.429 | 0.486 | 0.883 | |||
| ATP5D | 507 | 0.807 | ** | 0.748 | 0.410 | 0.426 | 0.962 | |||
| ATP5G1 | 411 | 0.776 | ** | 0.707 | 0.456 | 0.482 | 0.946 | |||
| ATP5G2 | 474 | 0.752 | ** | 0.686 | 0.472 | * | 0.451 | |||
| ATP5G3 | 429 | 0.720 | ** | 0.678 | 0.430 | 0.510 | 0.843 | |||
| ATP6V1A | 1854 | 0.709 | * | 0.702 | 0.451 | 0.525 | 0.859 | |||
| ATP6V1B1 | 1536 | 0.703 | 0.711 | 0.989 | 0.439 | 0.514 | 0.854 | |||
| ATP6V1D | 744 | 0.676 | 0.697 | 0.970 | 0.430 | 0.522 | 0.824 | |||
| ATP6V1E1 | 681 | 0.721 | * | 0.713 | 0.431 | 0.500 | 0.862 | |||
| ATP6V1E2 | 681 | 0.777 | ** | 0.733 | 0.410 | 0.466 | 0.880 | |||
| TCIRG1 | 2493 | 0.857 | ** | 0.781 | 0.421 | 0.434 | 0.970 | |||
| ATP6V0D2 | 1053 | 0.732 | * | 0.722 | 0.456 | 0.518 | 0.880 | |||
| ATP6V0C | 468 | 0.838 | ** | 0.748 | 0.511 | ** | 0.461 | |||
| ATP6F | 618 | 0.803 | ** | 0.741 | 0.510 | 0.514 | 0.992 | |||
| ATP6V0D1 | 1056 | 0.831 | ** | 0.793 | 0.457 | 0.495 | 0.923 | |||
| ATP6V0A1 | 2496 | 0.758 | * | 0.734 | 0.424 | 0.507 | 0.836 | |||
| ATP6V0A4 | 2523 | 0.770 | ** | 0.735 | 0.458 | 0.494 | 0.927 | |||
| ATP6V0A2 | 2571 | 0.748 | * | 0.728 | 0.450 | 0.491 | 0.916 | |||
| ND1 | 957 | 0.635 | 0.796 | 0.798 | 0.760 | ** | 0.456 | |||
| ND2 | 1044 | 0.616 | 0.774 | 0.796 | 0.677 | ** | 0.457 | |||
| ND3 | 345 | 0.571 | 0.703 | 0.812 | 0.701 | ** | 0.461 | |||
| ND4L | 297 | 0.550 | 0.679 | 0.810 | 0.738 | ** | 0.472 | |||
| ND4 | 1377 | 0.612 | 0.654 | 0.936 | 0.722 | ** | 0.455 | |||
| ND5 | 1812 | 0.651 | 0.750 | 0.868 | 0.723 | ** | 0.471 | |||
| ND6 | 525 | 0.612 | 0.754 | 0.812 | 0.361 | 0.551 | 0.655 | |||
| CYTB | 1134 | 0.655 | 0.711 | 0.921 | 0.758 | ** | 0.481 | |||
| COX1 | 1542 | 0.644 | 0.750 | 0.859 | 0.715 | ** | 0.509 | |||
| COX2 | 684 | 0.641 | 0.713 | 0.899 | 0.664 | ** | 0.503 | |||
| COX3 | 780 | 0.656 | 0.725 | 0.905 | 0.704 | ** | 0.497 | |||
| ATP8 | 207 | 0.606 | 0.688 | 0.881 | 0.633 | ** | 0.452 | |||
| ATP6 | 681 | 0.629 | 0.698 | 0.901 | 0.701 | ** | 0.472 | |||
Expected CAIs (eCAIs) at 95% (p = 0.05) and 99% (p = 0.01) confidence and 99% coverage were calculated using the Poisson method of the E-CAI server. For the sake of clarity, only the eCAI values at p = 0.05 are shown. CAIhm and CAImt mean CAI calculated using the mean nuclear and mitochondrial codon usage as a reference set, respectively. CAI values were calculated using the CAIcal tool .
* and ** mean that the CAI is higher than the eCAI estimated at 95% (*) and 99% (**) confidence and 99% coverage. Normalised CAI values (defined as the quotient between the CAI and its expected value) bigger than one are in bold and must be interpreted as evidence of adaptation to the reference codon usage beyond mere compositional biases.
Figure 1Graphical representation of the normalised CAIhm, defined as the quotient between the CAI of a gene and its expected value, versus G+C content at the third codon positions for the human genes that encode a subunit of a complex of the mitochondrial respiratory chain. Red squares represent mitochondrial genes encoded in the human mitochondrial genome and blue dots represent mitochondrial genes encoded in the human nuclear genome. An expected value of CAI was estimated for each gene with the E-CAI server, using the Poisson method and a 95% interval confidence and a 99% population coverage.