Course Information
Course Overview
Gain new knowledge about R programming you wouldn't intuitively imagine - Extensive use of the tidyverse packages
You just downloaded datasets online. They came in a zip file. The first thing you do after downloading it, you extract the data with your favorite unzipping software such as WinZip or WinRar. Then you proceed with reading the data in with your analysis software (hopefully R). OK! Where is he going with this, you may wonder.
You are starting a project in R and realize your files are scattered in different paths on your computer. Your immediate reflex is to open the folders involved (say with windows explorer) and proceed to gather those files in one place before starting R. Wait, what's wrong with that? Hold that thought.
You visit a website frequently. This website is full of data—numbers, downloadable documents, and pictures alike. It may or may not have occurred to you that you can access the data programmatically and visualize it differently. Perhaps you had ideas about it but didn’t know how to get it done. Hold this thought also.
There is nothing wrong with unzipping files with a WinZip or WinRar. Still, it can be beneficial to do unzip files within R. After downloading a dataset or any zip files; you can go directly into R and manage your files there before your analysis. You ever thought about unzipping, copying and pasting, deleting files within R? This course will show you examples of that.
One of the goals of this course is to implant in you the thought of scraping data with ease. I want you to think you can scrape data and visualize it differently and doing so promptly. I will show you the commonly used web scraping techniques in R.
With APIs, you go a step further than scraping. In this course, I teach you how to retrieve data using HTTR and jsonlite packages. Specifically, use the GET function to retrieve data and the POST function to update your account. All this without logging onto your account. I use the peer-to-peer lending platform Lending Club to showcase the use of an API. The API, therefore, allows you to interact with your account programmatically. Combining this with a scheduler can prove highly efficient. A well-thought-out algorithm can be automated and handle repetitive tasks that would otherwise be routine.
This course will also introduce you to the version control system Git. You will learn the power of R Studio combined with Git and GitHub. I teach how to keep different versions of your script with Git and push files, including R scripts, datasets, and other files to the GitHub platform. You will also learn how to revert to previous versions of your code if you make mistakes in later versions. When you master this, you will no longer have to save different versions of your scripts in your directory.
To become an efficient data analyst, you have to be skilled at one or more programming languages. Why not R? This course should also serve as a barometer. If you feel comfortable with the material in this course, you should understand most R scripts you will encounter.
This course will not teach you how to hack into servers. The intent here is not to sway you towards criminal activities.
Course Content
- 5 section(s)
- 37 lecture(s)
- Section 1 Introduction
- Section 2 Install R, R Studio, Git, and GitHub
- Section 3 File Handling and Maps Functional Loops
- Section 4 Web Scraping Using the rvest Package
- Section 5 APIs with httr and jsonlite Packages
What You’ll Learn
- Up to date R programming
- How to loop with the maps family of functions
- How to do file management operation like copying and pasting files, deleting them, creating new directories and more and combining these operations with loops
- How to use Git and Github combined with R Studio
- Create a repository in github and push scripts to it directly from R Studio
- How to use the rvest package to scrape valuable data from websites
- Download files with R
- APIs with R
- Get data with a REST API using R
- Post data with a REST API using R
- Use R to AUTOMATE repetitive task such as scraping dynamic data at a set frequency
Reviews
-
WWesley Williams
Valuable
-
CCatherine Fralich
Nice Hacks
-
DDrew Rivers
Some useful tips but most sections need to be updated. For example, connecting to GitHub is more complex than it was 4 years ago; several functions have deprecated parameters (e.g., write_csv, map_chr); rvest functions have changed a bit, not to mention the webscraping sites (imbd and zillow) have substantially changed their html/css code which made it difficult to follow along with the course; taskscheduleR did not run on windows11 (could be my system but a Google search found this to be a common issue); and the section on APIs would have been better if instructor used a public API the students could follow along with instead of watching the instructor use a private API. Overall, this was probably a good course in 2021 (and maybe 2022), but now its more frustrating than beneficial. Udemy should really close down any R courses that have not been updated in more than 2 years.
-
MMartin Dunz
Everything was explained with useful examples, even Web-Scraping and REST API was done with examples.