FIT5142

Faculty of Information Technology

Monash University

Postgraduate - Unit

This unit entry is for students who completed this unit in 2015 only. Students planning to study the unit should refer to the unit indexes in the current edition of the Handbook. If you have any queries, contact the managing faculty for your course or area of study.

6 points, SCA Band 2, 0.125 EFTSL

Refer to the specific census and withdrawal dates for the semester(s) in which this unit is offered.

Level: Postgraduate
Faculty: Faculty of Information Technology
Offered: Caulfield Second semester 2015 (Day)

Synopsis

Advanced methods of discovering patterns in large-scale multi-dimensional databases are discussed. The unit covers solving classification, clustering, association rule analysis and regression problems on different kinds of data. Data pre-processing methods for dealing with noisy and missing data in the context of Big Data are reviewed. Evaluation and analysis of data mining models are emphasised. Hands-on case studies in building data mining models are performed using popular modern software packages.

Outcomes

On successful completion of this unit, students should be able to:

  • explain the kinds of data from which knowledge can be mined, the way each data type can be presented to a data mining algorithm, and the kinds of patterns that can be mined from each data type;
  • evaluate the quality of data mining models;
  • perform pre-processing of large-scale multi-dimensional datasets in preparation for data mining experiments;
  • perform pre-processing for data containing outliers and for incomplete and noisy data;
  • compare the various learning algorithms and effectively apply suitable algorithms to mine frequent patterns and associations from data, and to perform data classification, data clustering and regression analysis;
  • use modern data mining tools to solve non-trivial data mining problems;
  • research the current trends in data mining applications;
  • work in a team to extract knowledge from a common dataset using various data mining methods and techniques.

Assessment

Examination (3 hours): 60%; In-semester assessment: 40%

Workload requirements

Minimum total expected workload equals 12 hours per week comprising:

(a.) Contact hours for on-campus students:

  • Two hours of lectures
  • One 2-hour laboratory

(b.) Additional requirements (all students):

  • A minimum of 8 hours of independent study per week for completing lab and project work, private study and revision.

See also Unit timetable information

Chief examiner(s)

Prerequisites

FIT5047 or FIT5045 or equivalent
Sound fundamental knowledge in maths and statistics; database and computer programming knowledge.

Additional information on this unit is available from the faculty at: