| Literature DB >> 32274012 |
Ali Mostafa Anwar1, Mohamed Soudy2, Radwa Mohamed1.
Abstract
Viruses show noticeable evolution to adapt and reproduce within their hosts. Theoretically, patterns and factors that affect the codon usage of viruses should reflect evolutionary changes that allow them to optimize their codon usage to their hosts. Some software tools can analyze the codon usage of organisms; however, their performance has room for improvement, as these tools do not focus on examining the codon usage co-adaptation between viruses and their hosts. This paper describes the vhcub R package, which is a crucial tool used to analyze the co-adaptation of codon usage between a virus and its host, with several implementations of indices and plots. The tool is available from: https://cran.r-project.org/web/packages/vhcub/. Copyright:Entities:
Keywords: Adaptation; Codon Usage Bias; Evolution; Natural selection; R; RStudio; Viruses
Mesh:
Substances:
Year: 2019 PMID: 32274012 PMCID: PMC7104870 DOI: 10.12688/f1000research.21763.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Functions available in vhcub, and the result returned from each one.
| Function name | Description | Value |
|---|---|---|
| fasta.read | Read fasta formate and convert it to data frame | A list with two data.frames; the first one for
|
| CAI.values | Measure the Codon Adaptation Index (CAI) using
| A data.frame containing the computed CAI
|
| dinuc.base | A measure of statistical dinucleotide over- and
| A data.frame containing the computed
|
| dinuc.
| A measure of statistical dinucleotide over- and
| A data.frame containing the computed
|
| dinuc.
| A measure of statistical dinucleotide over- and
| A data.frame containing the computed
|
| ENc.values | Measure the Effective Number of Codons (ENc)
| A data.frame containing the computed ENc
|
| GC.content | Calculates overall GC content as well as GC at first,
| A data.frame with overall GC content as
|
| RCDI.values | Measure the Relative Codon Deoptimization Index
| A data.frame containing the computed ENc
|
| RSCU.
| Measure the Relative Synonymous Codon Usage
| A data.frame containing the computed
|
| SCUO.
| Measure the Synonymous Codon Usage Eorderliness
| A data.frame containing the computed SCUO
|
| SiD.value | Measure the Similarity Index (SiD) between a virus
| A numeric represent a SiD value. |
| PR2.plot | Make a Parity rule 2 (PR2) plot
[ | A ggplot object. |
| ENc.
| Make an ENc-GC3 scatterplot
[ | A ggplot object. |
Figure 2. ENc-GC3 plot showing the values of the ENc versus the GC3 content for the virus (Escherichia virus T4) CDS, the solid red line represents the expected ENc values if the codon bias is affected by GC3s only.
Figure 3. PR2-plot showing CDS of the virus (Escherichia virus T4), plotted based on their GC bias [G3/(G3 + C3)] and AT bias [A3/(A3 + T3)] in the third codon position, the two solid red lines represent both coordinates (ordinate and abscissa) equal to 0.5, where A = T and G = C.
Figure 1. vhcub workflow, to analyze virus-host codon usage co-adaptation.
The white boxes represent the input fasta files. The red boxes represent three main analysis, each with different measures (the blue boxes), and the orange boxes represent ENc-GC3 plot and PR2-plot.