Find us on GitHub

A Data Carpentry Workshop

National Society of Black Engineers Professional Development Conference

September 28, 2017

8:00 am - 5:00 pm

Instructors: Kari Jordan, Christina Koch, Karl Broman

General Information

Data Carpentry workshops are for any researcher or professional who has data they want to analyze, and who wants to learn more about tools and techniques for working with data.

This hands-on workshop will cover how to work more effectively with data and an introduction to programming in R. By the end of the workshop, participants should be able to load data, conduct simple analyses, and produce basic visualizations. They will also be aware of best practices for data management to enable reproducibility.

No prior computational experience is required. Participants should bring their laptops and plan to participate actively.

Who: Participants in the National Society for Black Engineers Professional Development Conference, who want to learn more about (and get hands-on experience with!) data management practices and tools. In order to attend, please register through the eventbrite link above or contact the organizer.

Where: 301 East North Water Street, Chicago, IL. Get directions with OpenStreetMap or Google Maps.

Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating sytem (not a tablet, Chromebook, etc.) that they have administrative privileges on. They should have a few specific software packages installed (listed below). They are also required to abide by Data Carpentry's Code of Conduct.

Contact: Please mail for more information.




Please be sure to complete this survey before the workshop.

Pre-workshop Survey


To participate in a Data Carpentry workshop, you will need working copies of the described software. Please make sure to install everything (or at least to download the installers) before the start of your workshop. Participants should bring and use their own laptops to insure the proper setup of tools for an efficient workflow once you leave the workshop.

Please follow these Setup Instructions.

We maintain a list of common issues that occur during installation as a reference for instructors that may be useful on the Configuration Problems and Solutions wiki page.

Resources and Materials

We will use this Etherpad for chatting, taking notes, and sharing URLs and bits of code.


8:00a - 830a Software Installation Help
8:30a - 9:00a Introduction
9:00a - 10:00a Data Organization in Spreadsheets
10:00a - 10:15a Break
10:15a - 11:00a Data Cleaning in OpenRefine
11:00a - 11:15a Break
11:15a - 12:00p Introduction to R and RStudio [Script from workshop]
12:00p - 1:00p Lunch
1:00p - 2:45p Data Structures & Exploring Data Frames [Script from workshop]
2:45p - 3:00p Break
3:00p - 4:30p Creating Publication Quality Graphics [Script from workshop]
4:30p - 5:00p Workshop Wrap-Up (Post workshop survey: Post-workshop Survey)


We will be using material from these Data Carpentry lessons:

Lesson Website
Data Organization in Spreadsheets Spreadsheet Materials
Data Cleaning with OpenRefine Open Refine Materials
Data Analysis and Visualization in R R Materials
R for Reproducible Scientific Analysis R Materials