STAT462-23S1 (C) Semester One 2023

Data Mining

15 points

Start Date: Monday, 20 February 2023
End Date: Sunday, 25 June 2023
Withdrawal Dates
Last Day to withdraw from this course:
  • Without financial penalty (full fee refund): Sunday, 5 March 2023
  • Without academic penalty (including no fee refund): Sunday, 14 May 2023


Data Mining

STAT318 and STAT462 are courses in statistical learning and data mining, suited to anyone with an interest in analysing large datasets. The courses will introduce a variety of statistical learning and data mining techniques for classification, regression, clustering and association purposes. Possible topics include, classification and regression trees, random forests, Apriori algorithm, FP-growth algorithm and support vector machines. The lectures will be supplemented with laboratory sessions using the statistical software package R.

Learning Outcomes

  • The courses will:
  • introduce statistical learning and data mining
  • introduce advanced data analysis techniques for classification, regression, cluster analysis and association analysis
  • introduce the use of the statistics software package R

    You will be able to:
  • describe and conduct appropriate statistical modeling techniques
  • be able to interpret the analysis results in such a way that a non-user of statistics can understand
  • Use R competently
  • Write a scientific and technical report


Subject to approval of the Head of School.

Course Coordinator

Heyang (Thomas) Li


Philipp Wacker

Textbooks / Resources

Recommended Reading

Hastie, Trevor. , Tibshirani, Robert., Friedman, J. H; The elements of statistical learning : data mining, inference, and prediction ; 2nd ed; Springer, 2009.

James, Gareth; An introduction to statistical learning : with applications in R ; Springer, 2013.

G. James, D. Witten, T. Hastie and R. Tibshirani, An Introduction to Statistical Learning with Applications in R. (2014) Springer

Recommended reading:
T. Hastie, R. Tibshirani and J. Friedman, The Elements of Statistical Learning, Data Mining, Inference, and Prediction. (2013) Springer.

Indicative Fees

Domestic fee $1,045.00

* All fees are inclusive of NZ GST or any equivalent overseas tax, and do not include any programme level discount or additional course-related expenses.

For further information see Mathematics and Statistics .

All STAT462 Occurrences