Workshop: Manipulating and Joining Data in R with dplyr
Description
The dplyr package is a popular R package that people often use to manipulate and join datasets. Attendees will need to have either some basic knowledge about using R or have previously attended Johns Hopkins Data Management Services' Introduction to R for Absolute Beginners workshop in order to take this one.
Attendees will learn to use several functions, including: mutate(), filter(), select(), summarize() and group_by(), in dplyr to manipulate data for the first half of the workshop. For the second part of this workshop, attendees will learn the join functions (e.g. left_join(), right_join(), inner_join(), semi_join(), anti_join(), full_join(), bind_rows() and bind_cols()) and set operations (e.g. union(), intersect() and setdiff()) in dplyr to combine two datasets.
Attendees will have plenty of opportunities to do hands-on activities on their laptops and work on datasets provided by instructors.
Other upcoming Data Management Services workshops about working with R include:
Who can attend?
- Faculty
- Staff
- Students