Course Information
Course Overview
Using practical real-world datasets to showcase how to visualize and analyze data with Python Pandas, scipy and numpy
This course is designed to teach analysts, students interested in data science, statisticians, data scientists how to analyze real world data by creating professional looking charts and using numerical descriptive statistics techniques in Python 3. You will learn how to use charting libraries in Python 3 to analyze real-world data about corruption perception, infant mortality rate, life expectancy, the Ebola virus, alcohol and liver disease data, World literacy rate, violent crime in the USA, soccer World Cup,
migrants deaths, etc.
You will also learn how to effectively use the various statistical libraries in Python 3 such as numpy, scipy.stats, pandas and statistics to create all descriptive statistics summaries that are necessary for analyzing real world data.
In this course, you will understand how each library handles missing values and you will learn how to compute the various statistics properly when missing values are present in the data.
The course will teach you all that you need to know in order to analyze hands on real world data using Python 3. You will be able to appropriately create the visualizations using seaborn, matplotlib or pandas libraries in Python 3.
Using a wide variety of world datasets, we will analyze each one of the data using these tools within pandas, matplotlib and seaborn:
Correlation plots
Box-plots for comparing groups distributions
Time series and lines plots
Side by side comparative pie charts
Areas charts
Stacked bar charts
Histograms of continuous data
Bar charts
Regression plots
Statistical measures of the center of the data
Statistical measures of spread in the data
Statistical measures of relative standing in the data
Calculating Correlation coefficients
Ranking and relative standing in data
Determining outliers in datasets
Binning data in terciles, quartiles, quintiles, deciles, etc.
The course is taught using Anaconda Jupyter notebook, in order to achieve a reproducible research goal, where we use markdowns to clearly
document the codes in order to make them easily understandable and shareable.
This is what some students are saying:
"I really like the tips that you share in every unit in the course sections. This was a well delivered course."
"I am a Data Scientist with many years using Python /Big Data. The content of this course provides a rich resource to students interested in learning hands on data visualization in Python and the analysis of descriptive statistics. I will recommend this course anyone trying to come into this domain."
Course Content
- 9 section(s)
- 34 lecture(s)
- Section 1 Getting started with the course
- Section 2 Exploratory data analysis using Python 3 graphical libraries.
- Section 3 Projects and hands on applications
- Section 4 Analyzing descriptive statistics using Pandas libraries in Python 3
- Section 5 Computing descriptive statistics in Python Scipy library
- Section 6 Computing Descriptive Statistics using the Statistics library in Python
- Section 7 Computing Descriptive Statistics using the Numpy library in Python
- Section 8 Hands on analysis of Descriptive statistics data in Python 3
- Section 9 Conclusion for the course
What You’ll Learn
- Create effective data visualizations, including histograms, boxplots, scatterplots, bar charts, and pie charts using Python libraries
- Learn how to explore datasets, identify patterns, and gain insights through both statistical measures and visual tools.
- Understand how the different Python libraries treat missing values in calculating descriptive statistics
- Be able to use effectively Python statistical libraries to compute descriptive statistics
- Create professional charts with real world data using Python 3
- Understand how and why some charting types are used to explore data in data science and Python
Skills covered in this course
Reviews
-
SSergio A Torres Varela
el curso es interesante, falta un poco de fuidéz y planeación a las lecciones.
-
TTejaswini Kalra
I needed a python course focused on data visualization and working with datasets and this course helped me in this regard.
-
KKarl Oskar Mattsson
As a Python developer needing a refresher to statistics this was a good course.
-
VVarun Sharma
Excellent course