See RStudio + sparklyr for big data at Strata + Hadoop World

If big data is your thing, you use R, and you’re headed to Strata + Hadoop World in San Jose March 13 & 14th, you can experience in person how easy and practical it is to analyze big data with R and Spark.

In a beginner level talk by RStudio’s Edgar Ruiz and an intermediate level workshop by Win-Vector’s John Mount, we cover the spectrum: What R is, what Spark is, how Sparklyr works, and what is required to set up and tune a Spark cluster. You’ll also learn practical applications including: how to quickly set up a local Spark instance, store big data in Spark and then connect to the data with R, use R to apply machine-learning algorithms to big data stored in Spark, and filter and aggregate big data stored in Spark and then import the results into R for analysis and visualization.

2:40pm–3:20pm Wednesday, March 15, 2017 Sparklyr: An R interface for Apache Spark Edgar Ruiz (RStudio) Primary topic: Spark & beyond Location: LL21 C/D Level: Beginner Secondary topics: R

1:30pm–5:00pm Tuesday, March 14, 2017 Modeling big data with R, sparklyr, and Apache Spark John Mount (Win-Vector LLC) Primary topic: Data science & advanced analytics Location: LL21 C/D Level: Intermediate Secondary topics: R

While you’re at the conference be sure to look us up in the Innovator’s Pavilion - booth number P8 during the Expo Hall hours. We’ll have the latest books from RStudio authors, t-shirts to win, demonstrations of RStudio Connect and RStudio Server Pro and, of course, stickers and cheatsheets. Share with us what you’re doing with RStudio and get your product and company questions answered by RStudio employees.

See you in San Jose! (

More On Products and Technology

Stay Connected

Get updates when there's a new post.