Open Source


A CEO’s View of Open Source Data Science in the Enterprise

July 15, 2021 |

We recently sat down with Art Steinmetz, former CEO of OppenheimerFunds, to get his unique perspective on the value and viability of code-first, open source data science for the enterprise.


Managing Packages for Open Source Data Science

February 17, 2021 |

With over 15,000 R packages on CRAN, over 230,000 on PyPI, and more arriving every day, the task of managing a package environment for data science in R and Python can be daunting.


testthat 3.0.0

August 13, 2020 |

In this webinar, I'll introduce some of the major changes coming in testthat 3.0.0. The biggest new idea in testthat 3.0.0 is the idea of an edition.


Sharing on Short Notice: How to Get Your Materials Online With R Markdown

March 31, 2020 |

Educators create a lot of files for teaching- slides, exercises, solutions, assignments, data, figures - that all ultimately need to be shared with other people.


Scaling Data Science at the EPA

November 8, 2019 |

Join Jeff Hollister & David Smith to learn about two examples of EPA's journey in adopting popular open source tools.


A Gentle Introduction to Tidy Statistics in R

June 12, 2019 |

R is a fantastic language for statistical programming, but making the jump from point and click interfaces to code can be intimidating for individuals new to R.


Convenient analysis with broom

November 30, 2018 |

In this webinar I’ll demonstrate how to use to broom to work with many models at once.


Help me help you. Creating reproducible examples

September 21, 2018 |

What is a reprex? It’s a reproducible example. Making a great reprex is both an art and a science and this webinar will cover both aspects.


How to Work with List Columns

August 11, 2018 |

This webinar breaks down one of the most esoteric concepts in the Tidyverse: list columns.


Thinking inside the box: you can do that inside a data frame?!

April 13, 2018 |

The data frame is a crucial data structure in R and, especially, in the Tidyverse. Working on a column or a variable is a very natural operation, which is great. But what about row-oriented work?


What's new in dplyr 0.7.0

June 22, 2017 |

dplyr provides a “grammar” of data transformation, making it easy and elegant to solve the most common data manipulation challenges.