STAT462-25S2 (C) Semester Two 2025

Data Mining

15 points

Details:
Start Date: Monday, 14 July 2025
End Date: Sunday, 9 November 2025
Withdrawal Dates
Last Day to withdraw from this course:
  • Without financial penalty (full fee refund): Sunday, 27 July 2025
  • Without academic penalty (including no fee refund): Sunday, 28 September 2025

Description

Data Mining

STAT462 is a course in statistical learning and data mining, suited to anyone with an interest in analysing large datasets. This course will introduce a variety of statistical learning and data mining techniques for classification, regression, clustering and association purposes. Possible topics include, classification and regression trees, random forests, Apriori algorithm, FP-growth algorithm and support vector machines. The lectures will be supplemented with laboratory sessions using the statistical software package R.

Learning Outcomes

  • The courses will:
  • introduce statistical learning and data mining
  • introduce advanced data analysis techniques for classification, regression, cluster analysis and association analysis
  • introduce the use of the statistics software package R

    You will be able to:
  • describe and conduct appropriate statistical modeling techniques
  • be able to interpret the analysis results in such a way that a non-user of statistics can understand
  • Use R competently
  • Write a scientific and technical report
    • University Graduate Attributes

      This course will provide students with an opportunity to develop the Graduate Attributes specified below:

      Employable, innovative and enterprising

      Students will develop key skills and attributes sought by employers that can be used in a range of applications.

      Biculturally competent and confident

      Students will be aware of and understand the nature of biculturalism in Aotearoa New Zealand, and its relevance to their area of study and/or their degree.

      Globally aware

      Students will comprehend the influence of global conditions on their discipline and will be competent in engaging with global and multi-cultural contexts.

Prerequisites

Subject to approval of the Head of School.

Timetable 2025

Students must attend one activity from each section.

Lecture A
Activity Day Time Location Weeks
01 Monday 16:00 - 17:00 E16 Lecture Theatre
14 Jul - 24 Aug
8 Sep - 19 Oct
Lecture B
Activity Day Time Location Weeks
01 Wednesday 11:00 - 12:00 A5 Lecture Theatre
14 Jul - 24 Aug
8 Sep - 19 Oct
Computer Lab A
Activity Day Time Location Weeks
01 Friday 09:00 - 10:00 Jack Erskine 010 Computer Lab
14 Jul - 24 Aug
8 Sep - 19 Oct
02 Thursday 13:00 - 14:00 Jack Erskine 010 Computer Lab
14 Jul - 24 Aug
8 Sep - 19 Oct
03 Tuesday 14:00 - 15:00 Jack Erskine 010 Computer Lab
14 Jul - 24 Aug
8 Sep - 19 Oct
Tutorial A
Activity Day Time Location Weeks
01 Tuesday 11:00 - 12:00 A4 Lecture Theatre
14 Jul - 24 Aug
8 Sep - 19 Oct

Course Coordinator

Philipp Wacker

Textbooks / Resources

Recommended Reading

Hastie, Trevor. , Tibshirani, Robert., Friedman, J. H; The elements of statistical learning : data mining, inference, and prediction ; 2nd ed; Springer, 2009.

James, Gareth; An introduction to statistical learning : with applications in R ; Springer, 2013.

G. James, D. Witten, T. Hastie and R. Tibshirani, An Introduction to Statistical Learning with Applications in R. (2014) Springer

Recommended reading:
T. Hastie, R. Tibshirani and J. Friedman, The Elements of Statistical Learning, Data Mining, Inference, and Prediction. (2013) Springer.

Indicative Fees

Domestic fee $1,138.00

* All fees are inclusive of NZ GST or any equivalent overseas tax, and do not include any programme level discount or additional course-related expenses.

For further information see Mathematics and Statistics .

All STAT462 Occurrences