Course Information
Course Overview
Learn NLP in R with our easy to understand videos and free textbook!
Working with text data does not need to be difficult!
Follow along as we explain complex topics for a beginner audience. By the end of this course, you will be able to read in data from websites like twitter and wikipedia, clean it, and perform analysis.
We keep it easy.
This course is designed for a data analyst who is familiar with the R language but has absolutely no background in natural language processing or even statistics in general.
We break our course into three main sections: text mining, preparing and exploring text data, and analyzing text data.
Text Mining
Like with every other form of analytics, before any real work can be done, the data must exist (obviously) and be in a working format.
What’s Covered: APIs, Twitter Data, Webscraping, Wikipedia Data
Preparing and Exploring Text Data
Once the data has been properly gathered and mined, it needs to be put into a usable format. The following tutorials cover how to clean and explore text data.
What’s Covered: Regex, stringr package, tidytext package, tm package
Analyzing Text Data
After exploratory data analysis has been performed, we can do further analysis of the relationships and meaning in text.
What’s Covered: TF-IDF, Sentiment Analysis, Topic Modeling, Parts of Speech Tagging, Name Entity Recognition, Word Embeddings
So dive in and see what insights are hiding in your text data!
Course Content
- 10 section(s)
- 70 lecture(s)
- Section 1 Introduction
- Section 2 APIs with jsonlite
- Section 3 Twitter Data with rtweet
- Section 4 Web Scraping with rvest
- Section 5 Getting Wikipedia Data with getwiki
- Section 6 Regex and Stringr
- Section 7 Preparing Text Data with Tidytext
- Section 8 Visualize Text Data
- Section 9 Working in tm
- Section 10 Term Frequency - Inverse Document Frequency (TF-IDF)
What You’ll Learn
- Access Text Data from APIs with jsonlite
- Scrape the Web Using rvest
- Import Data from Twitter and Wikipedia
- Find Patterns using Regex
- Manipulate and Clean Data Using tidytext and tm
- Measure Emotion with Sentiment Analysis
- Surface Meaning with Topic Modeling
- Provide Context with Parts of Speech Tagging and Named Entity Recognition
- Quantify Relationships with Word Embeddings
Skills covered in this course
Reviews
-
MMeltem Yumsek
Some sections were too fast, and not enough background was provided. I can read through the book myself. The purpose of the lectures should be to make things more intuitive.
-
AAkash Dash
Course good on R techniques. What it could improve upon a bit is the INTERPRETATIONS OF THE OUTPUTS (especially POS tags, Word Vectors) as well as some examples of real-life applications.
-
JJedrzej.Wydra
Podoba mi się. Ładnie omówione zagadnienia. Ale! Niektóre zadania są prawie niemożliwe do rozwiązania bez spojrzenia do odpowiedzi. Niektóre zagadnienia są przestarzałe, np. Twitter zmienił API i wiadomości z kursu są nieprzydatne.
-
MMegan K Ebner
Great instructors! Really engaging material. I am not very technical and could figure out how to do everything (so far)