Udemy

Data Cleaning & Preprocessing in Python for Machine Learning

Enroll Now
  • 161 Students
  • Updated 7/2022
4.4
(34 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
1 Hour(s) 34 Minute(s)
Language
English
Taught by
Ajatshatru Mishra
Rating
4.4
(34 Ratings)

Course Overview

Data Cleaning & Preprocessing in Python for Machine Learning

Learn how to resolve Data Quality issues in Machine Learning & Data Science using Data Cleaning in Python Pandas.

More often than not, real world data is messy and can rarely be used directly. It needs a lot of cleaning and preprocessing before it can be used in Analytics, Machine Learning or other application. Data Cleaning be a dirty job, which often requires lots of effort and advanced technical skills like familiarity with Pandas and other libraries.

For most of the data cleaning, all you need is data manipulation skills in Python. In this course you will learn just that. This course has lectures, quizzes and Jupyter notebooks, which will teach you to deal with real world raw data. The course contains tutorials on a range of data cleaning techniques, like imputing missing values, feature scaling and fixing data types issues etc.

In this you course you will learn:

  • How to detect and deal with missing values in the data.

  • How to detect and rectify incorrect data types.

  • How to deal with Categorical Columns.

  • How to detect and replace incorrect values with correct ones.

  • How to use Apply Lambda method for using advanced cleaning functions.

  • How to group the dataset by a particular column.

  • How to detect and remove outliers.

  • How to perform feature scaling.

  • How to clean and preprocess textual data for NLP.


Course Content

  • 4 section(s)
  • 31 lecture(s)
  • Section 1 Introduction and Setup
  • Section 2 Detecting Data Quality issues
  • Section 3 Data Cleaning and Preprocessing
  • Section 4 Data Cleaning and Preprocessing for NLP

What You’ll Learn

  • You will learn how to detect and impute missing values in the data.
  • How to detect and rectify incorrect data types.
  • How to deal with Categorical Columns.
  • How to detect and replace incorrect values with correct ones.
  • How to use Apply Lambda method for using advanced cleaning functions.
  • How to group the dataset by a particular column.
  • How to detect and remove outliers.
  • How to perform feature scaling.
  • How to clean and preprocess textual data for NLP.


Reviews

  • J
    José de Jesús Velázquez Hernández
    4.5

    I really appreciate the fact that you suggested the Anaconda environment. It makes everything easier

  • D
    Dino Arla
    5.0

    Thank you so much, very practical

  • S
    Sarowar Ahmed
    5.0

    So far.. Good

  • L
    Lefu Ntoa
    5.0

    Simple and very Practical

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed