Udemy

Improving data quality in data analytics & machine learning

Enroll Now
  • 3,866 Students
  • Updated 12/2025
4.6
(576 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
5 Hour(s) 23 Minute(s)
Language
English
Taught by
Mike X Cohen
Rating
4.6
(576 Ratings)
2 views

Course Overview

Improving data quality in data analytics & machine learning

Learn why, when, and how to maximize the quality of your data to optimize data-based decisions

All of our decisions are based on data. Our sense organs gather data, our memories are data, and our gut-instincts are data. If you want to make good decisions, you need to have high-quality data.


This course is about data quality: What it means, why it's important, and how you can increase the quality of your data.


In this course, you will learn:

  1. High-level strategies for ensuring high data quality, including terminology, data documentation and management, and the different research phases in which you can check and increase data quality.

  2. Qualitative and quantitative methods for evaluating data quality, including visual inspection, error rates, and outliers. Python code is provided to see how to implement these visualizations and scoring methods using pandas, numpy, seaborn, and matplotlib.

  3. Specific data methods and algorithms for cleaning data and rejecting bad or unusual data. As above, Python code is provided to see how to implement these procedures using pandas, numpy, seaborn, and matplotlib.


This course is for

  1. Data practitioners who want to understand both the high-level strategies and the low-level procedures for evaluating and improving data quality.

  2. Managers, clients, and collaborators who want to understand the importance of data quality, even if they are not working directly with data.

Course Content

  • 9 section(s)
  • 45 lecture(s)
  • Section 1 Introduction
  • Section 2 Download course materials (Python code)
  • Section 3 Why data quality matters
  • Section 4 Ensuring high data quality
  • Section 5 Assessing data quality
  • Section 6 Data transformations
  • Section 7 Outliers and missing data
  • Section 8 Be a high-quality data scientist
  • Section 9 Bonus

What You’ll Learn

  • Strategies for increasing data quality
  • Ways to assess data quality
  • Interpreting data visualizations
  • How to spot problems in data


Reviews

  • P
    Pedro Aughusto Simões
    5.0

    Ótimo

  • P
    Paulo Rogerio de Freitas
    5.0

    Perfeito

  • O
    Oscar Alberto Sanchez Tobar
    5.0

    Very insightful content, with clear explanation in statistical themes.

  • L
    Leandro Donizete de Souza
    3.0

    O audio poderia ser em portugues mesmo que feito via IA ou Speech

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed