Tidying Data

Previously recorded session (March 2020)

This workshop will review a series of vignettes, each starting with a messy dataset that needs to be manipulated in order to be ready for analysis. We will use one or more of the following in each example: Linux command-line tools (awk/sed/grep/cut/paste, etc.), R, and Python.

Level: Intermediate

Prerequisites: Beginning to intermediate R and Python. Familiarity with standard linux tools such as awk, sed, grep.

Objective: To learn a few new tips and tricks that aid in making data more analysis-friendly while providing a conceptual framework for data organization.

Course Materials

Workshop website

Yale

Center for Research Computing

Tidying Data

Course Materials

See all the upcoming training HERE.

Search form

You are here

Tidying Data

Course Materials

See all the upcoming training HERE.