6 points, SCA Band 2, 0.125 EFTSL
Postgraduate - Unit
Refer to the specific census and withdrawal dates for the semester(s) in which this unit is offered.
Faculty
Unit guides
Offered
- First semester 2017 (Day)
- Second semester 2017 (Day)
- Teaching Period 3 2017 (Online)
- Teaching Period 6 2017 (Online)
Notes
Monash Online offerings are only available to students enrolled in the Graduate Diploma in Data ScienceGraduate Diploma in Data Science (http://online.monash.edu/course/graduate-diploma-data-science/?Access_Code=MON-GDDS-SEO2&utm_source=seo2&utm_medium=referral&utm_campaign=MON-GDDS-SEO2) via Monash Online.
Synopsis
This unit introduces tools and techniques for data wrangling. It will cover the problems that prevent raw data from being effectively used in analysis and the data cleansing and pre-processing tasks that prepare it for analytics. These include, for example, the handling of bad and missing data, data integration and initial feature selection. It will also introduce text mining and web analytics. Python and the Pandas environment will be used for implementation.
Outcomes
At the completion of this unit, students should be able to:
- parse data in the required format;
- assess the quality of data for problem identification;
- resolve data quality issues ready for the data analysis process;
- integrate data sources for data enrichment;
- document the wrangling process for professional reporting;
- write program scripts for data wrangling processes.
Assessment
In-semester assessment: 100%
Workload requirements
Minimum total expected workload equals 144 hours per semester comprising:
- Contact hours for on-campus students:
- Two hours/week lectures
- Two hours/week laboratories
- Contact hours for Monash Online students:
- Two hours/week online group sessions
- Online students generally do not attend lecture, tutorial and laboratory sessions, however should plan to spend equivalent time working through resources and participating in discussions.
- Additional requirements (all students):
- A minimum of 8 hours per week of personal study (22 hours per week for Monash Online students) for completing lab/tutorial activities, assignments, private study and revision, and for online students, participating in discussions.
See also Unit timetable information