Course Information
Course Overview
Get data from the web directly into R
In this course you will learn:
the basics of web scraping (using tags, classes and ids)
two great R packages: Rvest and Selenium
to scrape server-side and client-side rendered pages
to get data from json API’s
If you have ever wanted to collect:
text from the internet for NLP!
numbers from websites to visualize!
tables of data from the internet to put into your model!
or anything else involving getting data from the internet.
Then this course is for you!
At the end of the course you will be able to:
Extract data from structured tables on the internet
Collect text data from a website (like reviews) and structure it in a tidy data set
Collect a diverse set of data points from a website (like products) and structure it in a tidy data set
Collect structured data from an API - ready for analysis
Scrape websites that are normally very hard to extract data from.
Course Content
- 6 section(s)
- 30 lecture(s)
- Section 1 Introduction
- Section 2 Setup and resources
- Section 3 Scrape your first (server-side rendered) website
- Section 4 Scrabe a client-side rendered webpage
- Section 5 Getting data from an API
- Section 6 Wrap up
What You’ll Learn
- Web scraping with R
- The difference between server-side and client-side rendered websites
- Using Selenium and R together to scrape client side rendered websites
- Learning how to use the Rvest and RSelenium R-libraries
- Learn the basics of web scraping (using tags, classes and ids)
- Learn to get data from json API’s
Skills covered in this course
Reviews
-
MMartin Dunz
Really well described what must be done and very practical approach. In Video i cannot see the mouse pointer, so it makes it difficult to follow at which position the mouse id clicking or pointing, which is needed on this course. I am also missing a good description on how to use the Docker files, which to download, which commands to be called (a file with the commands is helping a lot), so within 10 Minutes a lot of the Docker setup used in this course would help a lot.
-
AAlessandro Noseda
The course exceeded expectations: in just a few hours it teaches many useful things.
-
SStefano Cruz
More than I spected.
-
MMarcus Jones
Mikkel is a gifted teacher. However, the course is in need of some updating. Two of the web sites and one of the packages used in the course have changed. I was still able to complete the course but needed extra time to figure out what was broken and adjust for the changes. Still there's lots to learn and the information is still reliable (hence 3 stars). I even learned one or two data munging tricks along the way. The XHR api trick was also a real eye-opener.