EPM5005 - Data management and statistical computing
6 points, SCA Band 2, 0.125 EFTSL
Offered
Alfred Hospital First semester 2007 (Off-campus)
Synopsis
This unit will describe and demonstrate the complexity of data management and statistical computing methods. It will enable students to communicate effectively about the issues in storing and retrieving information, and in assessing the quality and limitations of data repositories. It uses examples from real data sets to give students practical skills in design, data management, assessment of data quality and handling of large volumes of data.
Objectives
On completion of this unit students should be able to demonstrate:
- Understanding of different sources and methods of data storage such as unit records, matrix files, longitudinal data, relational databases;
- Understanding of relational database concepts and design, and other data structures;
- Proficiency in the handling and analysis of large data sets;
- Skills in data manipulation and management using the major statistical software packages;
- Skills in linking files through unique and non-unique identifiers;
- Understanding of data quality control and data entry methods, and experience in applying validation checks to data;
- Skills in data cleaning, identification of outliers and data trimming using appropriate statistical methods;
- Understanding of processes leading to finalisation of data sets prior to analysis;
- Ability to communicate with researchers in data-related issues of design, conduct and analysis of studies.
Assessment
Three written assignments
final examination
Co-requisites
MPH1040