Back
 

Session Summary:

When people first learn about R’s capabilities to create fully integrated systems, automated visuals, and seamless data pipelines, the reaction can span from disbelief to amazement. R’s expansive capabilities can leave some feeling overwhelmed when tasked with larger projects like data pipelines. This talk invites the participant to hear the perspective of a self-taught R user who used curiosity and patience to create a functional data pipeline in R for a local health department. Specifically, this talk will touch on the following concepts:

  • Surveying Data Landscapes
  • File Structures
  • Saving Yourself with Data Validation
  • Modularizing Code and Connecting R Scripts
  • Thinking about Pipeline Sustainability
  • Remaining Calm in Unfamiliar R Territories

Talk materials are available at https://github.com/Meghansaha/pipelines_in_R.

Session Details

2022-07-27

01:30 PM to 01:50 PM

National Harbor 10+11

Watch Video

FEATURED SPEAKERS:

Meghan S Harris profile pic

Meghan S Harris

PCCTC @ Memorial Sloan Kettering

Meghan Harris is a self-taught R user that is currently a Data Scientist at the PCCTC at the Memorial Sloan Kettering Cancer Center. Meghan’s work allows her to work with data, create custom reports, dashboards, and various solutions using the R programming language daily. Meghan is an advocate for using open-source software when appropriate to encourage the development of programming skills, increased analysis/evaluation autonomy for agencies, and data literacy. In Meghan’s spare time, Meghan creates generative art in the R programming language, called Rtistry.