Udemy

Scikit-learn in Python: 100+ Data Science Exercises

Enroll Now
  • 40,299 Students
  • Updated 9/2024
  • Certificate Available
4.6
(99 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
0 Hour(s) 49 Minute(s)
Language
English
Taught by
Paweł Krakowiak
Certificate
  • Available
  • *The delivery and distribution of the certificate are subject to the policies and arrangements of the course provider.
Rating
4.6
(99 Ratings)

Course Overview

Scikit-learn in Python: 100+ Data Science Exercises

Master Machine Learning - Unleash the Power of Data Science for Predictive Modeling!

This course is a comprehensive, hands-on guide to one of the most essential libraries for machine learning in Python, Scikit-learn. This course employs a practical, exercise-driven approach that helps learners understand and apply various machine learning algorithms and techniques.

The course is organized into different sections, each devoted to a specific aspect of the Scikit-learn library. It covers everything from data preprocessing, including feature extraction and selection, to various machine learning models such as linear regression, decision trees, support vector machines, and ensemble methods, to model evaluation and hyperparameter tuning.

Each section is packed with carefully designed exercises that reinforce each concept and give you the chance to apply what you've learned. You will solve real-world problems that mirror the challenges faced by data scientists in the field. Detailed solutions accompany each exercise, enabling you to compare your work and gain a better understanding of how to best use Scikit-learn for machine learning tasks.

This course is perfect for anyone interested in expanding their data science toolkit. Whether you're a beginner looking to dive into machine learning, or a seasoned data scientist wanting to refine your skills, this course offers an enriching learning experience.


Scikit-learn - Unleash the Power of Machine Learning!

Scikit-learn is a versatile machine learning library in Python that provides a wide range of algorithms and tools for building and implementing machine learning models. It is widely used by data scientists, researchers, and developers to solve complex problems through classification, regression, clustering, and more. With Scikit-learn, you can efficiently preprocess data, select appropriate features, train and evaluate models, and perform model selection and hyperparameter tuning. It offers a consistent API, making it easy to experiment with different algorithms and techniques. Scikit-learn also provides useful utilities for data preprocessing, model evaluation, and model persistence. Its user-friendly interface and extensive documentation make it a go-to choice for machine learning practitioners looking to leverage the power of Python for their projects.


Topics you will find in this course:

  • preparing data to machine learning models

  • working with missing values, SimpleImputer class

  • classification, regression, clustering

  • discretization

  • feature extraction

  • PolynomialFeatures class

  • LabelEncoder class

  • OneHotEncoder class

  • StandardScaler class

  • dummy encoding

  • splitting data into train and test set

  • LogisticRegression class

  • confusion matrix

  • classification report

  • LinearRegression class

  • MAE - Mean Absolute Error

  • MSE - Mean Squared Error

  • sigmoid() function

  • entorpy

  • accuracy score

  • DecisionTreeClassifier class

  • GridSearchCV class

  • RandomForestClassifier class

  • CountVectorizer class

  • TfidfVectorizer class

  • KMeans class

  • AgglomerativeClustering class

  • HierarchicalClustering class

  • DBSCAN class

  • dimensionality reduction, PCA analysis

  • Association Rules

  • LocalOutlierFactor class

  • IsolationForest class

  • KNeighborsClassifier class

  • MultinomialNB class

  • GradientBoostingRegressor class

Course Content

  • 15 section(s)
  • 113 lecture(s)
  • Section 1 Tips
  • Section 2 Starter
  • Section 3 Exercises 1-10
  • Section 4 Exercises 11-20
  • Section 5 Exercises 21-30
  • Section 6 Exercises 31-40
  • Section 7 Exercises 41-50
  • Section 8 Exercises 51-60
  • Section 9 Exercises 61-70
  • Section 10 Exercises 71-80
  • Section 11 Exercises 81-90
  • Section 12 Exercises 91-100
  • Section 13 Exercises 100+
  • Section 14 Configuration (optional)
  • Section 15 Bonus

What You’ll Learn

  • solve over 100 exercises in numpy, pandas and scikit-learn
  • deal with real programming problems in data science
  • work with documentation and Stack Overflow
  • guaranteed instructor support


Reviews

  • R
    Rene Beyreiss
    4.0

    Gut geeignet zur Auffrischung. Während der Laufzeit bekam ich öfters einen Fehler - meistens ging es dann aber beim zweiten Mal.

  • R
    Richard Hall
    2.0

    Disappointing, the exercises vary so much in scope it's obviously been put together to arbitrarily hit 100. You will have a dozen exercises that are change one parameter and print a dataframe then one in the middle that's 'here's 5 steps that I found on the internet that are needed to complete a PCA- do them all'. There's no logical flow, most of the time you're learning how parameter are parsed in python rather than any data science. It would've been far better with 10 focused exercises than what it actually is. That's the minor problem though, the bigger one is the range of errors within the exercises. The PCA is a great example, the 'expected result' part of the instruction misses out the key element of the exercise, that you're meant to be reducing the dimensionality of the data. So if you follow each of the steps correctly you'll never hit what you're told the target is. Further, there's other similar errors, some exercises (25 from memory) that won't run in the form given, there's an import of tensorflow for no reason that makes it fall over. There's many exercises where the 'expected results' don't match the actual results. Finally, some of the api calls to scipy are now so old that the parameters have been deprecated and removed. It needs checking and updating before anyone should be paying for this.

  • R
    Ramesh C
    5.0

    very nice

  • M
    Manish Kumar Verma
    4.0

    Fascinating! Great insightful section of codes. Overall good experience.

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed