Udemy

Exploratory Data Analysis in R

  • 33 Students
  • Updated 1/2022
  • Rating: 4.9 (4 ratings)

Course Information

Registration period: Year-round Recruitment
Duration: 5 hours 17 minutes
Language: English
Taught by: Ray James Hoobler

Course Overview

Four graphical techniques you can use to quickly explore your data

This example-based course introduces exploratory data analysis (EDA) using R. A primary objective is to apply graphical EDA techniques to representative data sets using the RStudio platform.


I have incorporated datasets from the NIST/SEMATECH e-Handbook of Statistical Methods into this course and adopted its fundamental approach to exploratory data analysis.


We use scatter plots to examine relationships between two variables, determine if there is a linear or non-linear relationship, analyze variations of the dependent variable, and determine if there are outliers in the dataset.
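The course itself works in R. Purely as an illustration of the questions a scatter plot answers, here is a minimal sketch in standard-library Python that fits a least-squares line and flags points lying far from it. The data values, the function names `fit_line` and `flag_outliers`, and the cutoff of two residual standard deviations are all assumptions made for this example, not material from the course.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def flag_outliers(xs, ys, threshold=2.0):
    """Return indices whose residual exceeds `threshold` residual standard deviations.

    The threshold of 2.0 is an illustrative assumption, not a fixed rule.
    """
    slope, intercept = fit_line(xs, ys)
    residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
    # Residual standard deviation with n - 2 degrees of freedom.
    s = math.sqrt(sum(r * r for r in residuals) / (len(xs) - 2))
    return [i for i, r in enumerate(residuals) if abs(r) > threshold * s]

# Made-up data: roughly y = 2x + 1, with one deliberately displaced point at index 5.
xs = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
ys = [3.1, 4.9, 7.2, 9.0, 10.8, 25.0, 15.1, 17.0, 18.9, 21.1]

slope, intercept = fit_line(xs, ys)
print(f"fitted line: y = {intercept:.2f} + {slope:.2f} * x")
print("possible outliers at indices:", flag_outliers(xs, ys))
```

A plot remains the better tool for spotting non-linear structure; the residual check only mirrors what the eye does when one point sits far from the trend.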


Of course, we need to remember that causality implies association and that association does NOT imply causality.


We will summarise the distribution of a dataset graphically using histograms. A histogram quickly shows the location and spread of the data and gives a good indication of whether the data follows a normal distribution, is skewed, or has multiple modes or outliers.
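To make the idea concrete, the sketch below bins a small made-up sample into equal-width intervals and prints a text histogram. It is written in standard-library Python rather than the R used in the course, and the helper name `histogram_counts`, the data, and the choice of 13 bins are assumptions for illustration only.

```python
def histogram_counts(data, n_bins):
    """Count observations in `n_bins` equal-width bins spanning min..max."""
    lo, hi = min(data), max(data)
    if hi == lo:
        raise ValueError("all values are identical; cannot form bins")
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for x in data:
        # The maximum value is placed in the last bin rather than a new one.
        i = min(int((x - lo) / width), n_bins - 1)
        counts[i] += 1
    edges = [lo + k * width for k in range(n_bins + 1)]
    return edges, counts

# Made-up sample: a roughly symmetric cluster plus one distant value.
data = [2, 3, 3, 4, 4, 4, 5, 5, 5, 5, 6, 6, 6, 7, 7, 8, 15]
edges, counts = histogram_counts(data, 13)

# Each row shows the left edge of a bin and one '#' per observation.
for left, count in zip(edges, counts):
    print(f"{left:5.1f} | {'#' * count}")
```

In the printed output the main cluster shows the location and spread, and the isolated bar at the far right is the kind of feature that would prompt a closer look for an outlier.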


An underused technique that complements the histogram is the probability plot. We will construct normal probability plots by plotting the sorted data against the corresponding quantiles of a theoretical normal distribution. If the data follows a normal distribution, the points fall approximately on a straight line. We will use these plots to assess whether our examples follow a normal distribution.
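The construction can be sketched numerically. The example below, in standard-library Python rather than the course's R, pairs the sorted data with standard normal quantiles and uses the correlation of those pairs as a rough measure of straightness. The plotting positions (i - 0.5)/n are one common convention chosen here as an assumption, the two samples are synthetic, and the function names are hypothetical.

```python
import math
from statistics import NormalDist

def normal_probability_points(data):
    """Pair sorted data with standard normal quantiles.

    Uses plotting positions (i - 0.5) / n, an assumed convention.
    """
    ordered = sorted(data)
    n = len(ordered)
    standard_normal = NormalDist()
    theoretical = [standard_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return theoretical, ordered

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Synthetic samples built from quantiles so the example is deterministic:
# one shaped like a normal distribution, one right-skewed (log-normal shape).
n = 50
near_normal = [NormalDist(50, 5).inv_cdf(i / (n + 1)) for i in range(1, n + 1)]
skewed = [math.exp(NormalDist().inv_cdf(i / (n + 1))) for i in range(1, n + 1)]

r_normal = pearson_r(*normal_probability_points(near_normal))
r_skewed = pearson_r(*normal_probability_points(skewed))
print(f"straightness (normal-shaped sample): {r_normal:.4f}")
print(f"straightness (skewed sample):        {r_skewed:.4f}")
```

A correlation close to 1 corresponds to points that hug a straight line; the skewed sample yields a visibly lower value, which on a plot appears as curvature.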


Finally, we will use box plots to view the variation between different groups within the data.
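A box plot is built from a five-number summary of each group. As a hedged illustration in standard-library Python (the course uses R), the sketch below computes those summaries for two made-up groups. The group labels, the values, and the `"inclusive"` quartile method are assumptions for this example; other quartile conventions give slightly different boxes.

```python
from statistics import quantiles

def five_number_summary(values):
    """Minimum, quartiles, and maximum: the numbers a box plot draws."""
    # method="inclusive" treats the data as including its own extremes;
    # this choice of quartile convention is an assumption of the sketch.
    q1, median, q3 = quantiles(values, n=4, method="inclusive")
    return {"min": min(values), "q1": q1, "median": median, "q3": q3, "max": max(values)}

# Made-up measurements for two hypothetical groups.
groups = {
    "A": [4, 5, 6, 7, 8],
    "B": [10, 12, 14, 16, 18],
}

for name, values in groups.items():
    summary = five_number_summary(values)
    iqr = summary["q3"] - summary["q1"]
    print(name, summary, "IQR:", iqr)
```

Comparing the medians shows the shift in location between groups, and comparing the interquartile ranges shows the difference in spread, which is what side-by-side boxes display at a glance.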


Aside from scatter plots, most spreadsheet programs do not support these methods, so learning how to carry out these fundamental analyses in R can improve your ability to explore your data.

Course Content

  • 7 sections
  • 28 lectures
  • Section 1 Introduction to EDA in R
  • Section 2 Graphical techniques - scatter plots
  • Section 3 Graphical techniques - histograms
  • Section 4 Graphical techniques - box plots
  • Section 5 Graphical techniques - probability plots
  • Section 6 Conclusion to EDA in R
  • Section 7 Extra materials for EDA in R

What You’ll Learn

  • Develop a fundamental framework to carry out your own Exploratory Data Analysis
  • The use of scatter plots and how to incorporate linear and non-linear models into your graphics
  • How to evaluate if your data is "normal" using histograms and probability plots
  • The power of box plots to compare groups
