Udemy

Data Science 2023: Data Preprocessing & Feature Engineering

Enroll Now
  • 306 Students
  • Updated 2/2023
4.3
(40 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
6 Hour(s) 14 Minute(s)
Language
English
Taught by
Anupam Khare
Rating
4.3
(40 Ratings)
4 views

Course Overview

Data Science 2023:  Data Preprocessing & Feature Engineering

Become expert in Data Cleaning and Feature Engineering for Machine Learning using Pandas & Scikit learn

Real-life data are dirty. This is the reason why preprocessing tasks take approximately 70% of the time in the ML modeling process. Moreover, there is a lack of dedicated courses which deal with this challenging task

Introducing,  "Data Science Course: Data Cleaning & Feature Engineering" a hardcore completely dedicated course to the most tedious tasks of Machine Learning modeling - "Data preprocessing".

if you want to enhance your data preprocessing skills to get better high-performing ML models, then this course is for you!

This course has been designed by experienced Data Scientists who will help you to understand the WHYs and HOWs of preprocessing.

I will walk you step-by-step into the process of data preprocessing. With every tutorial, you will develop new skills and improve your understanding of preprocessing  challenging ways to overcome this challenge

It is structured the following way:

Part 1- EDA (exploratory Data Analysis): Get insights into your dataset

Part 2 - Data Cleaning: Clean your data based on insights

Part 3 - Data Manipulation: Generating features, subsetting, working with dates, etc.

Part 4 - Feature Engineering- Get the data ready for modeling

Part 5 - Function writing with Pandas Darframe

Bonus Section: A few Interview preparation tips and strategies for data science enthusiasts in the job hunt

Who this course is for:

  1. Anyone who is interested in becoming efficient in data preprocessing

  2. People who are learning data scientists and want better to understand the various nuances of data and its treatment

  3. Budding data scientists who want to improve data preprocessing skills

  4. Anyone who is interested in preprocessing part of data science

This course is not for people who want to learn machine learning algorithms


Course Content

  • 10 section(s)
  • 36 lecture(s)
  • Section 1 Introduction and Way Forward
  • Section 2 Install Anaconda and Jupyter notebook and Resources
  • Section 3 Working with Large Dataset
  • Section 4 Understand data with EDA (Exploratory data Analysis)
  • Section 5 Data Cleaning:
  • Section 6 Data Manipulation
  • Section 7 Feature Engineering
  • Section 8 Writing Functions in Python
  • Section 9 Writing Data frames and Analytical output to Text or Excel workbooks
  • Section 10 Understanding And Debugging Common Errors

What You’ll Learn

  • Preprocessing the data takes 60%-70% of time. The course provides the entire toolbox to you to convert your raw data to model ready data
  • Become Expert in Python Pandas and scikit-learn for data manipulation and feature engineering
  • Become efficient in pre-processing data using various python packages such as pandas_profiling, catagory-encoders etc.
  • Learn feature Engineering techniques like encoding, imputation scaling etc. using Scikit-learn
  • Learn Scikit-learn Pipeline, Column tranformers to make the code readable and efficient
  • Learn to Write Python Functions which wraps various pandas functionalities to automate tasks
  • Export Analysis Output to Text file or Excel (export multiple dataframes to different sheets and multiple dataframes to same sheet in a workbook programatically
  • Bonus Lecture to help you strategise in interview preparations


Reviews

  • W
    Wilfred Borges
    5.0

    good course

  • S
    Syed Salahuddin Quadri
    1.0

    audio is too low, even i have to switch on my fan to hear the voice

  • P
    Priyanka Gavade
    4.5

    Yes, i could learn many things from here

  • M
    Masiyandaita Majange
    4.0

    Highly useful content that can get you started on relativey complex datasets. On negatives are that there some viddeos that appear to end abruptly and in one or two cases the instructor didn't pick up on errors in their own code. This however doesn't take away from the usefulness of this course! Recommended

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed