The RStudio team contributes code to many R packages and projects. R users are doing some of the most innovative and important work in science, education, and industry. It’s a daily inspiration and challenge to keep up with the community and all it is accomplishing.
The tidyverse is an opinionated collection of R packages designed for data science. All packages share an underlying philosophy and common APIs.
Sparklyr is an R interface to Apache Spark, a fast and general engine for big data processing. This package connects to local and remote Apache Spark clusters, a ‘dplyr’ compatible back-end, and an interface to Spark’s ML algorithms.
tidyr makes it easy to “tidy” your data. Tidy data is data that’s easy to work with: it’s easy to munge (with dplyr), visualise (with ggplot2 or ggvis) and model (with R’s hundreds of modelling packages).
readr makes it easy to read many types of tabular data including; Delimited files withread_delim(), read_csv(), read_tsv(), and read_csv2(), Fixed width files with read_fwf(), and read_table(), and Web log files with read_log().
The readxl package makes it easy to get data out of Excel and into R. readxl has no external dependencies, so it’s easy to install and use on all operating systems. It is designed to work with tabular data.