Course Information
Course Overview
A gentle yet thorough introduction to Data Science, Statistics and R using real life examples
Taught by a Stanford-educated, ex-Googler and an IIT, IIM - educated ex-Flipkart lead analyst. This team has decades of practical experience in quant trading, analytics and e-commerce.
This course is a gentle yet thorough introduction to Data Science, Statistics and R using real life examples.
Let’s parse that.
Gentle, yet thorough: This course does not require a prior quantitative or mathematics background. It starts by introducing basic concepts such as the mean, median etc and eventually covers all aspects of an analytics (or) data science career from analysing and preparing raw data to visualising your findings.
Data Science, Statistics and R: This course is an introduction to Data Science and Statistics using the R programming language. It covers both the theoretical aspects of Statistical concepts and the practical implementation using R.
Real life examples: Every concept is explained with the help of examples, case studies and source code in R wherever necessary. The examples cover a wide array of topics and range from A/B testing in an Internet company context to the Capital Asset Pricing Model in a quant finance context.
What's Covered:
Data Analysis with R: Datatypes and Data structures in R, Vectors, Arrays, Matrices, Lists, Data Frames, Reading data from files, Aggregating, Sorting & Merging Data Frames
Linear Regression: Regression, Simple Linear Regression in Excel, Simple Linear Regression in R, Multiple Linear Regression in R, Categorical variables in regression, Robust regression, Parsing regression diagnostic plots
Data Visualization in R: Line plot, Scatter plot, Bar plot, Histogram, Scatterplot matrix, Heat map, Packages for Data Visualisation : Rcolorbrewer, ggplot2
Descriptive Statistics: Mean, Median, Mode, IQR, Standard Deviation, Frequency Distributions, Histograms, Boxplots
Inferential Statistics: Random Variables, Probability Distributions, Uniform Distribution, Normal Distribution, Sampling, Sampling Distribution, Hypothesis testing, Test statistic, Test of significance
Course Content
- 14 section(s)
- 82 lecture(s)
- Section 1 Introduction
- Section 2 The 10 second answer : Descriptive Statistics
- Section 3 Inferential Statistics
- Section 4 Case studies in Inferential Statistics
- Section 5 Diving into R
- Section 6 Vectors
- Section 7 Arrays
- Section 8 Matrices
- Section 9 Factors
- Section 10 Lists and Data Frames
- Section 11 Regression quantifies relationships between variables
- Section 12 Linear Regression in Excel
- Section 13 Linear Regression in R
- Section 14 Data Visualization in R
What You’ll Learn
- Harness R and R packages to read, process and visualize data, Understand linear regression and use it confidently to build models , Understand the intricacies of all the different data structures in R, Use Linear regression in R to overcome the difficulties of LINEST() in Excel, Draw inferences from data and support them using tests of significance, Use descriptive statistics to perform a quick study of some data and present results
Skills covered in this course
Reviews
-
MMarkus Dammann
I needed to refresh my theoretical knowledge about statistics from scratch and learn "R" from the beginning on. From all the online courses I browsed this one seemed the best fit for my needs. And it definitely delivered what it promised and seem to be one of the few if not currently the only one that teaches R and Statistics together for beginners to both with real-life examples. I loved how they used examples from the financial market. What could be improved: use one example through all stages, from descriptive to regression analysis. This would help to apply the content of the course to your own example. It would also help to integrate better the parts of the two different lecturers, which seem relatively separate so far. The course could also be styled in a more engaging way adding images instead of mostly only text and vary the background jingle that gets really boring after 82 lectures. E.g. you could use one new jingle per section to give the student an acoustic feeling of advancing, similar to jingles of levels in old computer games. Overall good and necessary work! The criticism is only about details. Thank you very much, Loonycorn!
-
OOwen Knight
Enjoyed this a lot, as a complete newcomer to R the pace was about right. I'd have welcomed some exercises to try to test what I'd learned as well as fuller explanations of some concepts/functions. Otherwise, really interesting and I think will be useful.
-
JJorge Eduardo Ardila
Looks like I can practice what I need and learn some key concepts. Exercises in R should be more practical. Statisticals concepts were very good.
-
MMatthew Leo
Covers the very basics of using R to perform statistics and explains some simple statistical concepts. The course seems quite mathematically sound as far as it goes. You can follow along with the lectures, which are well put-together from a graphical standpoint, but the course would benefit from student exercises, especially in the statistical inferences case studies. It's one thing to see it done and follow along, it's another to try it out independently. Probably the best overall section of the course is linear regression. Toward the end the course content seems to thin and peter out. The sections on data visualization is cursory at best and leaves a lot of basic questions unanswered (e.g. how do I get my histogram to represent sales rather than counts?). The pace is somewhat slow, but with the course animations it isn't hard to follow along at double speed (although if you're not used to Indian English you might need to work your way up). You have to be realistic; a course like this isn't a replacement for a university level course in statistics (although it would be a good refresher for someone who's had one). Thoughtful students will have questions that simply can't be answered at this level of instruction (e.g., "What would be a good value for R-squared in my linear model?" A: it depends.) So a course like this can't really prepare you to actually work as a data scientist, this one does give you a taste for how it is done.