Tidying Data

Previously recorded session (March 2020)

This workshop walks through a series of vignettes, each starting with a messy dataset that must be manipulated before it is ready for analysis. Each example uses one or more of the following: Linux command-line tools (awk, sed, grep, cut, paste, etc.), R, and Python.

Level: Intermediate

Prerequisites: Beginning to intermediate R and Python. Familiarity with standard Linux tools such as awk, sed, and grep.

Objective: To learn a few new tips and tricks for making data more analysis-friendly, along with a conceptual framework for data organization.

Upcoming Sessions

  • None currently scheduled

Course Materials