Udemy

Complete Python Machine Learning & Data Science for Dummies

Enroll Now
  • 2,358 Students
  • Updated 7/2021
4.4
(74 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
10 Hour(s) 23 Minute(s)
Language
English
Taught by
Abhilash Nelson
Rating
4.4
(74 Ratings)
5 views

Course Overview

Complete Python Machine Learning & Data Science for Dummies

Machine Learning and Data Science for programming beginners using python with scikit-learn, SciPy, Matplotlib & Pandas


Hi.. Hello and welcome to my new course, Machine Learning with Python for Dummies. We will discuss about the overview of the course and the contents included in this course.


Artificial Intelligence, Machine Learning and Deep Learning Neural Networks are the most used terms now a days in the technology world. Its also the most mis-understood and confused terms too.


Artificial Intelligence is a broad spectrum of science which tries to make machines intelligent like humans. Machine Learning and Neural Networks are two subsets that comes under this vast machine learning platform


Lets check what's machine learning now. Just like we human babies, we were actually in our learning phase then. We learned how to crawl, stand, walk, then speak words, then make simple sentences.. We learned from our experiences. We had many trials and errors before we learned how to walk and talk. The best trials for walking and talking which gave positive results were kept in our memory and made use later. This process is highly compared to a Machine Learning Mechanism


Then we grew young and started thinking logically about many things, had emotional feelings, etc. We kept on thinking and found solutions to problems in our daily life. That's what the Deep Learning Neural Network Scientists are trying to achieve. A thinking machine.


But in this course we are focusing mainly in Machine Learning. Throughout this course, we are preparing our machine to make it ready for a prediction test. Its Just like how you prepare for your Mathematics Test in school or college. We learn and train ourselves by solving the most possible number of similar mathematical problems. Lets call these sample data of similar problems and their solutions as the 'Training Input' and 'Training Output' Respectively. And then the day comes when we have the actual test. We will be given new set of problems to solve, but very similar to the problems we learned, and based on the previous practice and learning experiences, we have to solve them. We can call those problems as 'Testing Input' and our answers as 'Predicted Output'. Later, our professor will evaluate these answers and compare it with its actual answers, we call the actual answers as 'Test Output'. Then a mark will be given on basis of the correct answers. We call this mark as our 'Accuracy'. The life of a machine learning engineer and a data-scientist is dedicated to make this accuracy as good as possible through different techniques and evaluation measures.


Here are the major topics that are included in this course. We are using Python as our programming language. Python is a great tool for the development of programs which perform data analysis and prediction. It has tons of classes and features which perform the complex mathematical analysis and give solutions in simple one or two lines of code so that we don't have to be a statistic genius or mathematical Nerd to learn data science and machine learning. Python really makes things easy.


These are the main topics that are included in our course


System and Environment preparation

-----------------------------------

Installing Python and Required Libraries (Anaconda)


Basics of python and sci-py

---------------------------

Python, Numpy , Matplotlib and Pandas Quick Courses


Load data set from csv / url

-----------------------------

Load CSV data with Python, NumPY and Pandas


Summarize data with description

--------------------------------

Peeking data, Data Dimensions, Data Types, Statistics, Class Distribution, Attribute Correlations, Univariate Skew


Summarize data with visualization

-----------------------------------

Univariate, Multivariate Plots


Prepare data

-------------

Data Transforms, Rescaling, Standardizing, Normalizing and Binarization


Feature selection – Automatic selection techniques

-----------------------------------

Univariate Selection, Recursive Feature Elimination, Principle Component Analysis and Feature Importance


Machine Learning Algorithm Evaluation

-----------------------------------

Train and Test Sets, K-fold Cross Validation, Leave One Out Cross Validation, Repeated Random Test-Train Splits.


Algorithm Evaluation Metrics

-----------------------------

Classification Metrics - Classification Accuracy, Logarithmic Loss, Area Under ROC Curve, Confusion Matrix, Classification Report.

Regression Metrics - Mean Absolute Error, Mean Squared Error, R 2.


Spot-Checking Classification Algorithms

-----------------------------------

Linear Algorithms - Logistic Regression, Linear Discriminant Analysis.

Non-Linear Algorithms - k-Nearest Neighbours, Naive Bayes, Classification and Regression Trees, Support Vector Machines.


Spot-Checking Regression Algorithms

-----------------------------------

Linear Algorithms - Linear Regression, Ridge Regression, LASSO Linear Regression and Elastic Net Regression.

Non-Linear Algorithms - k-Nearest Neighbours, Classification and Regression Trees, Support Vector Machines.


Choose The Best Machine Learning Model

-----------------------------------

Compare Logistic Regression, Linear Discriminant Analysis, k-Nearest Neighbours, Classification and Regression Trees, Naive Bayes, Support Vector Machines.


Automate and Combine Workflows with Pipeline

-----------------------------------

Data Preparation and Modelling Pipeline

Feature Extraction and Modelling Pipeline


Performance Improvement with Ensembles

-----------------------------------

Voting Ensemble

Bagging: Bagged Decision Trees, Random Forest, Extra Trees

Boosting: AdaBoost, Gradient Boosting


Performance Improvement with Algorithm Parameter Tuning

--------------------------------------------------------

Grid Search Parameter

Random Search Parameter Tuning


Save and Load (serialize and deserialize) Machine Learning Models

-----------------------------------

Using pickle

Using Joblib


finalize a machine learning project

-----------------------------------

steps For Finalizing classification models - pima indian dataset

Dealing with imbalanced class problem

steps For Finalizing multi class models - iris flower dataset

steps For Finalizing regression models - boston housing dataset


Predictions and Case Studies

----------------------------

Case study 1: predictions using the Pima Indian Diabetes Dataset

Case study: Iris Flower Multi Class Dataset

Case study 2: the Boston Housing cost Dataset


Machine Learning and Data Science is the most lucrative job in the technology arena now a days. Learning this course will make you equipped to compete in this area.


Best wishes with your learning. Se you soon in the class room.
















Course Content

  • 90 section(s)
  • 90 lecture(s)
  • Section 1 Course Overview & Table of Contents
  • Section 2 Introduction to Machine Learning - Part 1 - Concepts , Definitions and Types
  • Section 3 Introduction to Machine Learning - Part 2 - Classifications and Applications
  • Section 4 System and Environment preparation - Part 1
  • Section 5 System and Environment preparation - Part 2
  • Section 6 Learn Basics of python - Assignment
  • Section 7 Learn Basics of python - Flow Control
  • Section 8 Learn Basics of python - Functions
  • Section 9 Learn Basics of python - Data Structures
  • Section 10 Learn Basics of NumPy - NumPy Array
  • Section 11 Learn Basics of NumPy - NumPy Data
  • Section 12 Learn Basics of NumPy - NumPy Arithmetic
  • Section 13 Learn Basics of Matplotlib
  • Section 14 Learn Basics of Pandas - Part 1
  • Section 15 Learn Basics of Pandas - Part 2
  • Section 16 Understanding the CSV data file
  • Section 17 Load and Read CSV data file using Python Standard Library
  • Section 18 Load and Read CSV data file using NumPy
  • Section 19 Load and Read CSV data file using Pandas
  • Section 20 Dataset Summary - Peek, Dimensions and Data Types
  • Section 21 Dataset Summary - Class Distribution and Data Summary
  • Section 22 Dataset Summary - Explaining Correlation
  • Section 23 Dataset Summary - Explaining Skewness - Gaussian and Normal Curve
  • Section 24 Dataset Visualization - Using Histograms
  • Section 25 Dataset Visualization - Using Density Plots
  • Section 26 Dataset Visualization - Box and Whisker Plots
  • Section 27 Multivariate Dataset Visualization - Correlation Plots
  • Section 28 Multivariate Dataset Visualization - Scatter Plots
  • Section 29 Data Preparation (Pre-Processing) - Introduction
  • Section 30 Data Preparation - Re-scaling Data - Part 1
  • Section 31 Data Preparation - Re-scaling Data - Part 2
  • Section 32 Data Preparation - Standardizing Data - Part 1
  • Section 33 Data Preparation - Standardizing Data - Part 2
  • Section 34 Data Preparation - Normalizing Data
  • Section 35 Data Preparation - Binarizing Data
  • Section 36 Feature Selection - Introduction
  • Section 37 Feature Selection - Uni-variate Part 1 - Chi-Squared Test
  • Section 38 Feature Selection - Uni-variate Part 2 - Chi-Squared Test
  • Section 39 Feature Selection - Recursive Feature Elimination
  • Section 40 Feature Selection - Principal Component Analysis (PCA)
  • Section 41 Feature Selection - Feature Importance
  • Section 42 Refresher Session - The Mechanism of Re-sampling, Training and Testing
  • Section 43 Algorithm Evaluation Techniques - Introduction
  • Section 44 Algorithm Evaluation Techniques - Train and Test Set
  • Section 45 Algorithm Evaluation Techniques - K-Fold Cross Validation
  • Section 46 Algorithm Evaluation Techniques - Leave One Out Cross Validation
  • Section 47 Algorithm Evaluation Techniques - Repeated Random Test-Train Splits
  • Section 48 Algorithm Evaluation Metrics - Introduction
  • Section 49 Algorithm Evaluation Metrics - Classification Accuracy
  • Section 50 Algorithm Evaluation Metrics - Log Loss
  • Section 51 Algorithm Evaluation Metrics - Area Under ROC Curve
  • Section 52 Algorithm Evaluation Metrics - Confusion Matrix
  • Section 53 Algorithm Evaluation Metrics - Classification Report
  • Section 54 Algorithm Evaluation Metrics - Mean Absolute Error - Dataset Introduction
  • Section 55 Algorithm Evaluation Metrics - Mean Absolute Error
  • Section 56 Algorithm Evaluation Metrics - Mean Square Error
  • Section 57 Algorithm Evaluation Metrics - R Squared
  • Section 58 Classification Algorithm Spot Check - Logistic Regression
  • Section 59 Classification Algorithm Spot Check - Linear Discriminant Analysis
  • Section 60 Classification Algorithm Spot Check - K-Nearest Neighbors
  • Section 61 Classification Algorithm Spot Check - Naive Bayes
  • Section 62 Classification Algorithm Spot Check - CART
  • Section 63 Classification Algorithm Spot Check - Support Vector Machines
  • Section 64 Regression Algorithm Spot Check - Linear Regression
  • Section 65 Regression Algorithm Spot Check - Ridge Regression
  • Section 66 Regression Algorithm Spot Check - LASSO Linear Regression
  • Section 67 Regression Algorithm Spot Check - Elastic Net Regression
  • Section 68 Regression Algorithm Spot Check - K-Nearest Neighbors
  • Section 69 Regression Algorithm Spot Check - CART
  • Section 70 Regression Algorithm Spot Check - Support Vector Machines (SVM)
  • Section 71 Compare Algorithms - Part 1 : Choosing the best Machine Learning Model
  • Section 72 Compare Algorithms - Part 2 : Choosing the best Machine Learning Model
  • Section 73 Pipelines : Data Preparation and Data Modelling
  • Section 74 Pipelines : Feature Selection and Data Modelling
  • Section 75 Performance Improvement: Ensembles - Voting
  • Section 76 Performance Improvement: Ensembles - Bagging
  • Section 77 Performance Improvement: Ensembles - Boosting
  • Section 78 Performance Improvement: Parameter Tuning using Grid Search
  • Section 79 Performance Improvement: Parameter Tuning using Random Search
  • Section 80 Export, Save and Load Machine Learning Models : Pickle
  • Section 81 Export, Save and Load Machine Learning Models : Joblib
  • Section 82 Finalizing a Model - Introduction and Steps
  • Section 83 Finalizing a Classification Model - The Pima Indian Diabetes Dataset
  • Section 84 Quick Session: Imbalanced Data Set - Issue Overview and Steps
  • Section 85 Iris Dataset : Finalizing Multi-Class Dataset
  • Section 86 Finalizing a Regression Model - The Boston Housing Price Dataset
  • Section 87 Real-time Predictions: Using the Pima Indian Diabetes Classification Model
  • Section 88 Real-time Predictions: Using Iris Flowers Multi-Class Classification Dataset
  • Section 89 Real-time Predictions: Using the Boston Housing Regression Model
  • Section 90 SOURCE CODE AND FILES ATTACHED

What You’ll Learn

  • Machine Learning and Data Science using Python for Beginners


Reviews

  • Y
    Yury Rokash
    5.0

    This course really gives you an understanding and a general overview of the ML process, starting with preparing a dataset and ending with a complete model for predictions. I definitely recommend it to beginners.

  • N
    Nelly Diaz
    5.0

    I am happy with the context and the breakdown in small parts to help me understand them.

  • Y
    Yhasreen Abrahim
    3.5

    No data visualisation (graphs, charts) for the results produce. (just figures)

  • D
    Denzil Phillips
    1.5

    I cannot download the resources on the course

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed