COT5230

Data mining

R Redpath

6 points
* 4 hours per week
* Second semester
* Caulfield
* Prerequisites: COT4230 and COT4300 or COT4330 or equivalent level of knowledge (While not a formal prerequisite, CSC3200 would provide a useful theoretical background.)

Objectives To develop student knowledge of the techniques and methods for data exploration in large databases, both those currently being used and those which are presently being researched. For students to become familiar with the currently available techniques for the extraction of information from large databases.

Synopsis This subject will study the application of database and semantics, information filtering and pattern recognition techniques for the exploration of data in databases. Tools such as Kohonen filtering, minimum message length classification and genetic algorithms will be examined. Statistical methods such as moment measures, multiple regression, significance testing and harmonic analysis will be studied. Data quality will be considered. The expression of rules using a logic representation language will be coupled with a study of basic linguistic semantics in order to quantify user research dialogues. The overall thrust of this subject is a practical one drawing on current theory. Extensive practical work using the above techniques will be undertaken. Visualisation of database relationships and of retrieved information will complete the study.

Assessment Five assignments (each 20%): 100%

Back to the Information Technology Handbook, 1998
Handbook Contents | University Handbooks | Monash University


Published by Monash University, Australia
Maintained by wwwdev@monash.edu.au
Approved by M Rambert, Faculty of Information Technology
Copyright © Monash University 1997 - All Rights Reserved - Caution