Use the Tab and Up, Down arrow keys to select menu items.
Parametric and non-parametric statistical methodologies and algorithms for data mining.
STAT318 and STAT462 are courses in statistical learning and data mining, suited to anyone with an interest in analysing large datasets. The courses will introduce a variety of statistical learning and data mining techniques for classification, regression, clustering and association purposes. Possible topics include, classification and regression trees, random forests, Apriori algorithm, FP-growth algorithm and support vector machines. The lectures will be supplemented with laboratory sessions using the statistical software package R.
The courses will:introduce statistical learning and data miningintroduce advanced data analysis techniques for classification, regression, cluster analysis and association analysisintroduce the use of the statistics software package RYou will be able to:describe and conduct appropriate statistical modeling techniquesbe able to interpret the analysis results in such a way that a non-user of statistics can understandUse R competentlyWrite a scientific and technical report
i) 15 points from STAT200 to STAT299 and ii) a further 15 points from STAT200 to STAT299 or COSC200-299 or any other relevant subject with Head of School approval.
Blair Robertson
G. James, D. Witten, T. Hastie and R. Tibshirani, An Introduction to Statistical Learning with Applications in R. (2014) SpringerRecommended reading:T. Hastie, R. Tibshirani and J. Friedman, The Elements of Statistical Learning, Data Mining, Inference, and Prediction. (2013) Springer.
STAT318 homepage General information for students Library portal LEARN
Domestic fee $720.00
International fee $3,450.00
* All fees are inclusive of NZ GST or any equivalent overseas tax, and do not include any programme level discount or additional course-related expenses.
For further information see Mathematics and Statistics .