Course Information
Course Overview
Learn to use the GUI-based comprehensive Data Miner data mining software suite implemented as the rattle package in R
Data Mining with Rattle is a unique course that instructs with respect to both the concepts of data mining, as well as to the "hands-on" use of a popular, contemporary data mining software tool, "Data Miner," also known as the 'Rattle' package in R software. Rattle is a popular GUI-based software tool which 'fits on top of' R software. The course focuses on life-cycle issues, processes, and tasks related to supporting a 'cradle-to-grave' data mining project. These include: data exploration and visualization; testing data for random variable family characteristics and distributional assumptions; transforming data by scale or by data type; performing cluster analyses; creating, analyzing and interpreting association rules; and creating and evaluating predictive models that may utilize: regression; generalized linear modeling (GLMs); decision trees; recursive partitioning; random forests; boosting; and/or support vector machine (SVM) paradigms. It is both a conceptual and a practical course as it teaches and instructs about data mining, and provides ample demonstrations of conducting data mining tasks using the Rattle R package. The course is ideal for undergraduate students seeking to master additional 'in-demand' analytical job skills to offer a prospective employer. The course is also suitable for graduate students seeking to learn a variety of techniques useful to analyze research data. Finally, the course is useful for practicing quantitative analysis professionals who seek to acquire and master a wider set of useful job skills and knowledge. The course topics are scheduled in 10 distinct topics, each of which should be the focus of study for a course participant in a separate week per section topic.
Course Content
- 10 section(s)
- 82 lecture(s)
- Section 1 Introduction, Orientation, and Demos
- Section 2 Rattle Interface Tabs and Introductory Script Demonstrations
- Section 3 Loading and Exploring Data
- Section 4 Data Visualizations with Ggobi and Data Transformation in Rattle
- Section 5 Cluster Analysis
- Section 6 Association Analysis
- Section 7 Decision Trees and Recursive Partitioning
- Section 8 Random Forests
- Section 9 Boosting
- Section 10 Support Vector Machines
What You’ll Learn
- Perform and support life-cycle data mining tasks and activities using the popular Data Miner ("Rattle") software suite., Understand the functionalities implicit in the data, explore, test, transform, cluster, associate, model, evaluate, and log tabs in the Data Miner ("Rattle") GUI software platform., Know how to explore, visualize, transform, and summarize data sets in Rattle., Know how to create advanced, interactive Ggobi visualizations of data., Know how to use, estimate and interpret: cluster analyses
- association analyses mining rules
- decision trees
- random forests
- boosting
- and support vector machines using Rattle.
Skills covered in this course
Reviews
-
DDean V Neubauer
Too many blurry slides. For a course entitled Data Mining with Rattle, most of the time was spent running R scripts. Useful information but made it feel like more of an R course than a Rattle course.
-
RRokas Klydzia
I'm afraid that it might be a bit outdated (from 2012), but I might be wrong. Otherwise it seems like very good material.
-
CClaudiu Papasteri
This is the only Rattle online course out there (even on Graham Williams' website). As allways, prof. Hubona provides very good reading materials and code resources. Some aspects of Rattle remain unexplored and some additional information about ML algorithms seemed unnecessary for the scope of this course. Nevertheless, the course does its job well.
-
MMatt Wills
Because im learning, baby! Yeah