Introduction to R for Corrections Analysts
About the Course
This site contains information and materials about Introduction to R for Corrections Analysts, a course in the Advancing Data in Corrections Academy program.
Visit <academy.advancingdataincorrections.org> to enroll in the course.
Visit the Advancing Data in Corrections site for more information about the program.
Description
This course focuses on teaching the basic skills and concepts needed to use the R statistical programming language for working with data and running reproducible analysis. The R language is a powerful and flexible tool used for data analysis, data visualization, and statistics. Further, as a free and open-source tool, software licensing, black box technologies, and opaque cost structures are not a concern. Yet, as with many programming languages, the initial learning curve can be a little steep and somewhat daunting.
The aim of this course is not to make learners experts in R—that path is longer than can be covered in a single course. Rather, the intent with this course is to help get over the initial climb and provide the basic skills and experience to work with the R programming language, ask the right questions, and manipulate data to conduct exploratory data analysis and visualization.
With each lesson, learners will be presented with data, R code, and video instruction on how to use R to analyze data. The course teaches how to ingest, manipulate, summarize, and visualize data. Additionally, the course covers the basics of setting up an R project for reproducibility by emphasizing rules, best practices, and frameworks for combining code with prose commentary.
Intended Audience
This course is for analysts and researchers who are interested in learning R. Learners should have basic data management and analysis knowledge but need not have experience using R or other programming languages. This is not a statistics course, but it draws upon ideas and techniques from the field of statistics. This is also not a computer science course, but it draws on topics, technologies, and practices that are closely aligned with that field.
Learners can work on a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) that they have administrative privileges on. Learners must be able to download and install R, RStudio, and R packages on their computers, all of which are available for free.
Learning Objectives
After completing this course, learners should be able to do the following:
Program and document code in the R language using the integrated development environment RStudio.
Perform fundamental data analysis tasks, including
Importing and managing data from files,
Manipulating, transforming, and cleaning data to prepare for analysis, and
Performing exploratory data analysis.
Visualize, communicate, and document your work systematically in a reproducible framework.
Structure
The course is organized into lessons that are meant to be completed sequentially. Each lesson includes a video introduction to the topic or technique with an example or demo of the topic being implemented, as well as an opportunity to put into practice some or all the material presented in the lesson through a practical exercise.
| Lesson | Lesson Objectives | ||
| 1: Why Use R? |
|
||
| 2: Install R and RStudio |
|
||
| 3: RStudio Orientation |
|
||
| 4: R Packages |
|
||
| 5: Importing Data |
|
||
| 6: Data Visualization I |
|
||
| 7: Programming Basics |
|
||
| 8: Filter and Sort Rows |
|
||
| 9: Select and Rename Columns |
|
||
| 10: Create New Columns |
|
||
| 11: Data Visualization II |
|
||
| 12: Summarize Data |
|
||
| 13: Reshape Data |
|
||
| 14: Join Data |
|
||
| 15: Data Visualization III |
|
||
| 16: Dates and Times |
|
||
| 17: Strings |
|
||
| 18: Strings | |||
| 19: Export Data |
|
||
| 20: Getting Help |
|
||
| 21: Literate Programming |
|
||
| 22: Putting It All Together |
|
||
Resources
- Hadley Wickham, Mine Çetinkaya-Rundel, and Garrett Grolemund, R for Data Science, Second Edition (O’Reilly Media, 2023). [Free E-book]
- Chester Ismay and Albert Y. Kim, Statistical Inference via Data Science: A ModernDive into R and the Tidyverse, Second Edition (Chapman and Hall/CRC, 2025). [Free E-book]
- Jenny Bryan, STAT 545: Data wrangling, exploration, and analysis R (2019). [Free E-book]
Estimated Time to Complete
10 hours
Keywords
R, data analysis, data visualization