| Literature DB >> 35629316 |
Federico M Giorgi1, Carmine Ceraolo1,2, Daniele Mercatelli1.
Abstract
The R programming language is approaching its 30th birthday, and in the last three decades it has achieved a prominent role in statistics, bioinformatics, and data science in general. It currently ranks among the top 10 most popular languages worldwide, and its community has produced tens of thousands of extensions and packages, with scopes ranging from machine learning to transcriptome data analysis. In this review, we provide an historical chronicle of how R became what it is today, describing all its current features and capabilities. We also illustrate the major tools of R, such as the current R editors and integrated development environments (IDEs), the R Shiny web server, the R methods for machine learning, and its relationship with other programming languages. We also discuss the role of R in science in general as a driver for reproducibility. Overall, we hope to provide both a complete snapshot of R today and a practical compendium of the major features and applications of this programming language.Entities:
Keywords: CRAN; R; bioinformatics; data science; programming; statistics
Year: 2022 PMID: 35629316 PMCID: PMC9148156 DOI: 10.3390/life12050648
Source DB: PubMed Journal: Life (Basel) ISSN: 2075-1729
Figure 1Timeline of R history with selected milestones.
Figure 2(A) Box plots drawn using the default R boxplot() function in original R (left), R since 4.0.0 (middle) and ggplot2 (right). (B) Density distribution plots for three distributions, combined with the results of the Shapiro–Wilk test. (C) Default R boxplot comparing two distributions and providing the output p-value of the Student’s t-test. (D) Scatter plot indicating the co-expression of two genes, and the Pearson’s correlation coefficient of the joint distribution. (E) Example of overlapping different plot types in R: box plot, beeswarm plot and violin plot (BBV Plot) for three numeric distributions (called Gene 1, Gene 2 and Gene 3).
Figure 3(A) Concept diagram of how Rmarkdown can merge text and code blocks to create documents. (B) Example of an R IDE, RStudio, showing multiple elements to assist the R programmer. (C) Worldwide popularity of search terms “R”, “Python” and “Perl” in the years 2004–2021 (source: Google Trends). The topic of the search term is limited to “Programming Language”.
Selected list of text editors and IDEs for R programmers.
| Text Editor/IDE | Release Year | Web Link |
|---|---|---|
| RStudio | 2011 |
|
| Jupyter Notebook | 2014 |
|
| RKWard | 2002 |
|
| Eclipse StatET | 2010 |
|
| Google Colab | 2017 |
|
| Visual Studio Code | 2015 |
|
| vi/Vim | 1976 |
|
| Emacs ESS | 1997 |
|
| Sublime Text | 2008 |
|
| Notepad++ | 2003 |
|
Most popular programming languages according to the 2021 PYPL index.
| Rank | Language | Share | Trend |
|---|---|---|---|
| 1 | Python | 30.21% | −0.50% |
| 2 | Java | 17.82% | 1.30% |
| 3 | JavaScript | 9.16% | 0.60% |
| 4 | C# | 7.53% | 1.00% |
| 5 | C/C++ | 6.82% | 0.60% |
| 6 | PHP | 5.84% | −0.20% |
| 7 | R | 3.81% | 0.00% |
| 8 | Swift | 2.03% | −0.20% |
| 9 | Objective-C | 2.02% | −1.60% |
| 10 | MATLAB | 1.73% | −0.10% |