CTgoodjobs - Data Mining - Unsupervised Learning

儲存課程比較

課程資料

報名日期

全年招生

課程級別

短期課程

學習模式

線上教學

修業期

10 小時 42 分鐘

教學語言

英語

授課導師

AISPRY TUTOR

評分

4.1

(112 個評分)

7次瀏覽

課程簡介

Data Mining - Unsupervised Learning

The Data Mining - Unsupervised Learning course is designed to provide students with a comprehensive understanding of unsupervised learning techniques within the field of data mining. Unsupervised learning is a category of machine learning where algorithms are applied to unlabelled data to discover patterns, structures, and relationships without prior knowledge or guidance.

Throughout the course, students will explore various unsupervised learning algorithms and their applications in uncovering hidden insights from large datasets. The emphasis will be on understanding the principles, methodologies, and practical implementation of these algorithms rather than focusing on mathematical derivations.

The course will begin with an introduction to unsupervised learning, covering the basic concepts and goals. Students will learn how unsupervised learning differs from supervised learning and semi-supervised learning, and the advantages and limitations of unsupervised techniques. The importance of pre-processing and data preparation will also be discussed to ensure quality results.

The first major topic of the course will be clustering techniques. Students will dive into different clustering algorithms such as hierarchical clustering, k-means clustering, density-based clustering (e.g., DBSCAN), and expectation-maximization (EM) clustering. They will learn how to apply these algorithms to group similar data points together and identify underlying patterns and structures. The challenges and considerations in selecting appropriate clustering methods for different scenarios will be explored.

The course will then move on to dimensionality reduction, which aims to reduce the number of features or variables in a dataset while retaining relevant information. Students will explore techniques such as principal component analysis (PCA), singular value decomposition (SVD), and t-distributed stochastic neighbour embedding (t-SNE). They will understand how these methods can be used to visualize high-dimensional data and extract meaningful representations that facilitate analysis and interpretation.

Association rule mining will be another key topic covered in the course. Students will learn about the popular Apriori algorithm and FP-growth algorithm, which are used to discover interesting relationships and associations among items in transactional datasets. They will gain insights into evaluating and interpreting association rules, including support, confidence, and lift measures, and their practical applications in market basket analysis and recommendation systems.

The course will also address outlier detection, a critical task in unsupervised learning. Students will explore statistical approaches such as z-score and modified z-score, as well as distance-based approaches like the Local Outlier Factor and Isolation Forest. They will understand how to identify anomalies in data, which can provide valuable insights into potential fraud detection, network intrusion detection, or system failure prediction.

Evaluation and validation of unsupervised learning models will be an essential aspect of the course. Students will learn about internal and external evaluation measures, including silhouette coefficient, purity, and Rand index. They will gain skills in assessing the quality of clustering results and measuring the performance of dimensionality reduction techniques.

Throughout the course, students will be exposed to various real-world applications of unsupervised learning. They will discover how market segmentation can be achieved through clustering, enabling businesses to target specific customer segments effectively. They will also explore image and text clustering, which has applications in image recognition, document organization, and recommendation systems. The course will highlight anomaly detection, which plays a crucial role in identifying fraudulent transactions, network intrusions, or manufacturing defects. Lastly, students will learn how unsupervised learning powers recommender systems, providing personalized recommendations based on user behaviour and preferences.

Hands-on experience will be a significant component of the course. Students will work on practical exercises and projects, applying unsupervised learning algorithms to real-world datasets using popular data mining tools and programming libraries such as Python's scikit-learn or R's caret package. They will gain proficiency in pre-processing data, selecting appropriate algorithms, fine-tuning parameters, and interpreting and visualizing the results.

By the end of the course, students will have a solid understanding of unsupervised learning techniques, their practical applications, and the ability to leverage these methods to discover valuable insights and patterns from unlabelled data.

瀏覽更多詳情

課程章節

10 個章節
57 堂課

第 1 章 Introduction
第 2 章 About Analytics
第 3 章 Business Understanding Phase
第 4 章 Data Understanding Phase - Data Types
第 5 章 Data Understanding Phase - Data Collection
第 6 章 Understanding Basic Statistics
第 7 章 Data Preparation Phase - Exploratory Data Analysis (EDA)
第 8 章 Python Installation and Setup
第 9 章 Data Preparation Phase | Data Cleansing- Type Casting
第 10 章 Data Preparation Phase | Data Cleansing- Handling Duplicates

瀏覽更多詳情

課程內容

In Clustering or Segmentation, we reduce the number of rows. We have Hierarchical Clustering, Non-Hierarchical, Density-Based Clustering, Grid-based Clustering
In Dimension Reduction, we reduce the number of columns. Linear Patterns are handled by Linear Discriminant Analysis, Non-negative Matrix Factorization.
There is Collaborative Filtering in Recommendation System. Traditional Collaborative Filtering, Search-based Method, and Item-Item Collaborative Filtering.
In Unsupervised Learning has 6 divisions which include Clustering, Dimension Reduction, Association Rules, Recommendation Syst

瀏覽更多詳情

此課程所涵蓋的技能

評價

J
Johnstone Musyoka
5.0
it was enlightening and Indepth understanding
U
UĞUR ATAMAN
2.5
Speakers pronounciation is awful!
A
ATAKAN
3.0
The instructor's accent is too heavy to understand
D
Dipanshi pandey
3.0
good 👍

立即關注瀏覽更多

Udemy
關注
資訊科技
 關注

舉報

立即關注瀏覽更多

Udemy
關注
資訊科技
 關注

Udemy

Data Mining - Unsupervised Learning

課程資料

課程簡介

課程章節

課程內容

此課程所涵蓋的技能

評價

立即關注瀏覽更多

立即關注瀏覽更多

你可能感興趣的課程

進修攻略

媒體報道

Udemy

Data Mining - Unsupervised Learning

課程資料

課程簡介

課程章節

課程內容

此課程所涵蓋的技能

相關學習範疇

評價

立即關注瀏覽更多

立即關注瀏覽更多

你可能感興趣的課程

進修攻略

媒體報道

Udemy

Data Mining - Unsupervised Learning

免費會員專訊