STAT318-14S2 (C) Semester Two 2014

Data Mining

15 points

Details:
Start Date: Monday, 14 July 2014
End Date: Sunday, 16 November 2014
Withdrawal Dates
Last Day to withdraw from this course:
  • Without financial penalty (full fee refund): Sunday, 27 July 2014
  • Without academic penalty (including no fee refund): Sunday, 12 October 2014

Description

Parametric and non-parametric statistical methodologies and algorithms for data mining.

STAT318 and STAT462 are courses in Data Mining, suited to anyone with an interest in analysing data.  In these courses we introduce you to the statistical analysis of large datasets for both classification and association purposes.  
We cover analysis of both numeric and qualitative data and make use of the professional software package MATLAB.
In these courses we will show you how to use the package, enter, manipulate and analyze data in MATLAB.

The Courses will:
•  introduce data mining.
• introduce advanced data analysis techniques including classification and regression trees, ROC curves and FP-growth algorithm.
• introduce the use of the statistics computer package MATLAB.

Learning Outcomes

  • describe and conduct appropriate statistical modeling techniques for large datasets
  • be able to interpret the model results in such a way that a non-user of statistics can understand
  • use MATLAB competently
  • write a scientific and technical report

Prerequisites

i) 15 points from STAT200 to STAT299 and ii) a further 15 points from STAT200 to STAT299 or COSC200-299 or any other relevant subject with Head of School approval.

Course Coordinator

Marco Reale

Lecturer

Patrick W Saart

Assessment

Assessment Due Date Percentage 
Assignments 40%
Project presentation 20%
Final Examination 40%


Assignments give you practice in analysing data and presenting results in a written report.
The project will give the opportunity to acquire presentation skills.

The lectures are complemented by computer labs where you will be guided in conducting approriate analysis and modelling.

Textbooks / Resources

Textbook
Recommended reading:
Tan, Steinbach and Kumar 2006. Introduction to Data Mining. 769pp.

This is on a restricted loan in the Library.

Indicative Fees

Domestic fee $672.00

International fee $3,388.00

* All fees are inclusive of NZ GST or any equivalent overseas tax, and do not include any programme level discount or additional course-related expenses.

For further information see Mathematics and Statistics .

All STAT318 Occurrences

  • STAT318-14S2 (C) Semester Two 2014