Tidying Data
Previously recorded session (March 2020)
This workshop works through a series of vignettes, each starting with a messy dataset that must be manipulated before it is ready for analysis. Each example uses one or more of the following: Linux command-line tools (awk, sed, grep, cut, paste, etc.), R, and Python.
Level: Intermediate
Prerequisites: Beginning to intermediate R and Python. Familiarity with standard Linux tools such as awk, sed, and grep.
Objective: To learn a few new tips and tricks that aid in making data more analysis-friendly while providing a conceptual framework for data organization.