Data Cleaning, Preparation and Analysis in R (2019)

November 8, 2018
posts   R

Introduction

This workshop is geared to researchers wanting to use R for basic data manipulation and analysis. It will introduce participants to the basics of data tidying and manipulation (notably using tidyr & dplyr), how to set up data processing pipelines, and briefly cover working with a database. We will be using a genomics dataset for this course. This workshop is designed for novices, but we would like you to have some experience with R or have attended the Intro to R course on 11/8. (A laptop is required for the course.)

Date

November 8, 2018 (9:00 AM - 12:00 PM)

Instructors

Audience

All graduate students and researchers. Some familiarity with R is expected as a requirement to attend this course. You can attend the Intro to R course we provide or take this online tutorial https://www.datacamp.com/courses/free-introduction-to-r from DataCamp.

Data (csvs)

  1. Gapminder, you can get that by running this code in R:
gapminder <- read.csv("https://goo.gl/BtBnPg", header = T)
download.file('https://raw.githubusercontent.com/swcarpentry/r-novice-gapminder/gh-pages/_episodes_rmd/data/gapminder_wide.csv', 'data/gapminder_wide.csv')
  1. A zip file of several csv’s:

https://github.com/ucsdlib/workshops/raw/gh-pages/notebooks/data.zip

Unzip those into your ‘data’ folder inside your workshop project folder.

Resources